public class TokensRegexAnnotator extends Object implements Annotator
Uses TokensRegex patterns to annotate tokens.
Configuration:
rules - Name of file containing extraction rules
(see CoreMapExpressionExtractor and SequenceMatchRulesoptions.xxx = ...)
setTokenOffsets - whether to explicit set the token offsets of individual tokens (needed to token sequence matches to work)extractWithTokens - whether to return unmatched tokens as wellflatten - whether to flatten matched expressions into individual tokensmatchedExpressionsAnnotationKey - Annotation key where matched expressions are stored as a listMultiple TokensRegexAnnotator can be configured using the same properties file by specifying
difference prefix for the TokensRegexAnnotator
Annotator.RequirementBINARIZED_TREES_REQUIREMENT, CLEAN_XML_REQUIREMENT, COLUMN_DATA_CLASSIFIER, DETERMINISTIC_COREF_REQUIREMENT, GENDER_REQUIREMENT, GUTIME_REQUIREMENT, HEIDELTIME_REQUIREMENT, LEMMA_REQUIREMENT, NATLOG_REQUIREMENT, NER_REQUIREMENT, NUMBER_REQUIREMENT, PARSE_AND_TAG, PARSE_REQUIREMENT, PARSE_TAG_BINARIZED_TREES, POS_REQUIREMENT, QUANTIFIABLE_ENTITY_NORMALIZATION_REQUIREMENT, QUOTE_REQUIREMENT, RELATION_EXTRACTOR_REQUIREMENT, SSPLIT_REQUIREMENT, STANFORD_CLEAN_XML, STANFORD_COLUMN_DATA_CLASSIFIER, STANFORD_DEPENDENCIES, STANFORD_DETERMINISTIC_COREF, STANFORD_ENTITY_MENTIONS, STANFORD_GENDER, STANFORD_LEMMA, STANFORD_NATLOG, STANFORD_NER, STANFORD_PARSE, STANFORD_POS, STANFORD_QUOTE, STANFORD_REGEXNER, STANFORD_RELATION, STANFORD_SENTIMENT, STANFORD_SSPLIT, STANFORD_TOKENIZE, STANFORD_TRUECASE, STEM_REQUIREMENT, SUTIME_REQUIREMENT, TIME_WORDS_REQUIREMENT, TOKENIZE_AND_SSPLIT, TOKENIZE_REQUIREMENT, TOKENIZE_SSPLIT_NER, TOKENIZE_SSPLIT_PARSE, TOKENIZE_SSPLIT_PARSE_NER, TOKENIZE_SSPLIT_POS, TOKENIZE_SSPLIT_POS_LEMMA, TRUECASE_REQUIREMENT| Constructor and Description |
|---|
TokensRegexAnnotator(Properties props) |
TokensRegexAnnotator(String... files) |
TokensRegexAnnotator(String name,
Properties props) |
| Modifier and Type | Method and Description |
|---|---|
void |
annotate(Annotation annotation)
Given an Annotation, perform a task on this Annotation.
|
Set<Annotator.Requirement> |
requirementsSatisfied()
Returns a set of requirements for which tasks this annotator can
provide.
|
Set<Annotator.Requirement> |
requires()
Returns the set of tasks which this annotator requires in order
to perform.
|
public TokensRegexAnnotator(String... files)
public TokensRegexAnnotator(String name, Properties props)
public TokensRegexAnnotator(Properties props)
public void annotate(Annotation annotation)
Annotatorpublic Set<Annotator.Requirement> requires()
Annotatorpublic Set<Annotator.Requirement> requirementsSatisfied()
AnnotatorrequirementsSatisfied in interface Annotator