|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectedu.stanford.nlp.pipeline.TokenizerAnnotator
public abstract class TokenizerAnnotator
This is an abstract base class for any annotator class that uses a
Tokenizer to split TextAnnotation into TokensAnnotation.
The only method the subclass needs to define is a method
that produces a Tokenizer of CoreLabels, which is then
used to split the TextAnnotation of the given Annotation
into CoreLabels.
In order to maintain thread safety, getTokenizer should return
a thread-safe tokenizer. In the case of tokenizers built from
.flex files, that will mean new tokenizers for each call.
| Constructor Summary | |
|---|---|
TokenizerAnnotator(boolean verbose)
|
|
| Method Summary | |
|---|---|
void |
annotate(Annotation annotation)
Does the actual work of splitting TextAnnotation into CoreLabels, which are then attached to the TokensAnnotation. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public TokenizerAnnotator(boolean verbose)
| Method Detail |
|---|
public void annotate(Annotation annotation)
annotate in interface Annotator
|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||