java.lang.Object
  edu.stanford.nlp.pipeline.TokenizerAnnotator
    edu.stanford.nlp.pipeline.WhitespaceTokenizerAnnotator
public class WhitespaceTokenizerAnnotator extends TokenizerAnnotator
This annotator uses a WhitespaceTokenizer to split TextAnnotations
into TokensAnnotations.

If either the property EOL_PROPERTY or the property
NEWLINE_SPLITTER_PROPERTY defined in StanfordCoreNLP is present
and set to true, newlines are returned as tokens. In practice,
with either property set, the newline tokens are later removed by the
sentence splitter.
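
The behavior described above can be illustrated with a small plain-Java sketch that does not depend on CoreNLP. It is not the library's implementation. The class name `WhitespaceSplitSketch`, the helper names, the two property key strings, and the choice to represent a kept newline as the literal token `"\n"` are all assumptions made for illustration; this page names the constants but does not give their values.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

// Hypothetical illustration only; not part of edu.stanford.nlp.pipeline.
final class WhitespaceSplitSketch {

    // Assumed key names: the documentation above does not state the values
    // of EOL_PROPERTY or NEWLINE_SPLITTER_PROPERTY.
    static final String EOL_KEY = "tokenize.keepeol";
    static final String NEWLINE_SPLITTER_KEY = "ssplit.eolonly";

    /** Returns true if either property is present and set to "true". */
    static boolean keepNewlines(Properties props) {
        return Boolean.parseBoolean(props.getProperty(EOL_KEY, "false"))
            || Boolean.parseBoolean(props.getProperty(NEWLINE_SPLITTER_KEY, "false"));
    }

    /**
     * Splits text on whitespace. When keepNewlines is true, each '\n'
     * is also emitted as its own token (assumed representation: "\n").
     */
    static List<String> tokenize(String text, boolean keepNewlines) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isWhitespace(c)) {
                // Whitespace ends the token being built, if any.
                if (current.length() > 0) {
                    tokens.add(current.toString());
                    current.setLength(0);
                }
                if (c == '\n' && keepNewlines) {
                    tokens.add("\n");
                }
            } else {
                current.append(c);
            }
        }
        // Flush the final token when the text does not end in whitespace.
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        Properties props = new Properties();
        props.setProperty(EOL_KEY, "true");
        List<String> tokens = tokenize("one two\nthree", keepNewlines(props));
        System.out.println(tokens.size()); // prints 4: one, two, \n, three
    }
}
```

With neither property set, the same input yields only the three word tokens, which matches the plain whitespace-splitting case described in the first sentence.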

| Field Summary | |
|---|---|
| static java.lang.String | EOL_PROPERTY |

| Constructor Summary |
|---|
| WhitespaceTokenizerAnnotator(java.util.Properties props) |

| Method Summary |
|---|

| Methods inherited from class edu.stanford.nlp.pipeline.TokenizerAnnotator |
|---|
| annotate |

| Methods inherited from class java.lang.Object |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |

| Field Detail |
|---|
| public static final java.lang.String EOL_PROPERTY |

| Constructor Detail |
|---|
| public WhitespaceTokenizerAnnotator(java.util.Properties props) |