public class AnnotatorImplementations extends Object
StanfordCoreNLP will automatically load the new annotator instead.

| Constructor and Description |
|---|
| AnnotatorImplementations() |


| Modifier and Type | Method and Description |
|---|---|
| `CleanXmlAnnotator` | `cleanXML(Properties properties, String xmlTagsToRemove, String sentenceEndingTags, String dateTags, boolean allowFlawedXml)`<br>Clean XML input |
| `Annotator` | `coref(Properties properties)`<br>Annotate for coreference |
| `Annotator` | `custom(Properties properties, String property)` |
| `Annotator` | `dependencies(Properties properties)`<br>Annotate dependency relations in sentences |
| `Annotator` | `gender(Properties properties, boolean verbose)`<br>Annotate for gender of tokens |
| `Annotator` | `mentions(Properties properties, String name)`<br>Annotate mentions |
| `Annotator` | `morpha(Properties properties, boolean verbose)`<br>Annotate lemmas |
| `Annotator` | `natlog(Properties properties)`<br>Annotate operators (e.g., quantifiers) and polarity of tokens in a sentence |
| `Annotator` | `ner(Properties properties)`<br>Annotate for named entities -- note that this combines multiple NER tag sets, and some auxiliary things (like temporal tagging) |
| `Annotator` | `parse(Properties properties)`<br>Annotate parse trees |
| `Annotator` | `posTagger(Properties properties)`<br>Part of speech tag |
| `Annotator` | `quote(Properties properties)`<br>Annotate quotes and extract them like sentences |
| `Annotator` | `relations(Properties properties)`<br>Annotate for relations expressed in sentences |
| `Annotator` | `sentiment(Properties properties, String name)`<br>Annotate for sentiment in sentences |
| `Annotator` | `tokenizer(Properties properties, boolean verbose, String options)`<br>Tokenize, emulating the Penn Treebank |
| `Annotator` | `tokensRegexNER(Properties properties, String name)`<br>Run RegexNER -- rule-based NER based on a deterministic mapping file |
| `Annotator` | `trueCase(Properties properties, String modelLoc, String classBias, String mixedCaseFileName, boolean verbose)`<br>Infer the original casing of tokens |
| `Annotator` | `wordToSentences(Properties properties, boolean verbose, String boundaryTokenRegex, Set<String> boundaryToDiscard, Set<String> htmlElementsToDiscard, String newlineIsSentenceBreak, String boundaryMultiTokenRegex, Set<String> tokenRegexesToDiscard)`<br>Sentence split, in addition to a bunch of other things in this annotator (be careful to check the implementation!) |

public Annotator tokenizer(Properties properties, boolean verbose, String options)
public CleanXmlAnnotator cleanXML(Properties properties, String xmlTagsToRemove, String sentenceEndingTags, String dateTags, boolean allowFlawedXml)
public Annotator wordToSentences(Properties properties, boolean verbose, String boundaryTokenRegex, Set<String> boundaryToDiscard, Set<String> htmlElementsToDiscard, String newlineIsSentenceBreak, String boundaryMultiTokenRegex, Set<String> tokenRegexesToDiscard)
public Annotator posTagger(Properties properties)
public Annotator morpha(Properties properties, boolean verbose)
public Annotator ner(Properties properties) throws FileNotFoundException
Throws: FileNotFoundException
public Annotator tokensRegexNER(Properties properties, String name)
public Annotator mentions(Properties properties, String name)
public Annotator gender(Properties properties, boolean verbose)
public Annotator parse(Properties properties)
public Annotator custom(Properties properties, String property)
public Annotator trueCase(Properties properties, String modelLoc, String classBias, String mixedCaseFileName, boolean verbose)
public Annotator coref(Properties properties)
public Annotator relations(Properties properties)
public Annotator sentiment(Properties properties, String name)
public Annotator dependencies(Properties properties)
public Annotator natlog(Properties properties)
public Annotator quote(Properties properties)
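
The class description says that StanfordCoreNLP will load a replacement annotator when one is supplied. A minimal, self-contained sketch of that factory-override pattern follows. It deliberately does not use the real CoreNLP classes: `Annotator`, `Implementations`, `CustomImplementations`, `runPipeline`, and the `custom.tag` property are hypothetical stand-ins introduced only for illustration, and the real `Annotator` interface and pipeline wiring differ.

```java
import java.util.Properties;

final class OverrideSketch {

    /** Hypothetical, simplified stand-in for an annotator (not the CoreNLP interface). */
    interface Annotator {
        String annotate(String text);
    }

    /** Stand-in for a factory class with one method per annotator. */
    static class Implementations {
        public Annotator posTagger(Properties properties) {
            return text -> "default-pos(" + text + ")";
        }
    }

    /** Subclass that returns a different annotator from the same factory method. */
    static class CustomImplementations extends Implementations {
        @Override
        public Annotator posTagger(Properties properties) {
            // "custom.tag" is an invented property name used only in this sketch.
            String tag = properties.getProperty("custom.tag", "X");
            return text -> "custom-pos[" + tag + "](" + text + ")";
        }
    }

    /** Stand-in for a pipeline: it asks the factory for an annotator and runs it. */
    static String runPipeline(Implementations impls, Properties props, String text) {
        return impls.posTagger(props).annotate(text);
    }

    public static void main(String[] args) {
        Properties props = new Properties();
        props.setProperty("custom.tag", "NN");
        System.out.println(runPipeline(new Implementations(), props, "hi"));
        System.out.println(runPipeline(new CustomImplementations(), props, "hi"));
    }
}
```

Because the pipeline stand-in only calls the factory method, passing the subclass is enough to swap the annotator; no pipeline code changes. With the real library, the analogous step would presumably be subclassing `AnnotatorImplementations` and overriding one of the methods listed above, but how the subclass is handed to `StanfordCoreNLP` should be checked against the version in use.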