edu.stanford.nlp.wordseg
Class CorpusDictionary
java.lang.Object
edu.stanford.nlp.wordseg.CorpusDictionary
public class CorpusDictionary
- extends java.lang.Object
Check if a bigram exists in bakeoff corpora.
The dictionaries that this class reads have to be in UTF-8.
- Author:
- Huihsin Tseng, Pichuan Chang
|
Method Summary |
boolean |
contains(java.lang.String word)
|
java.util.Set<java.lang.String> |
getTable()
|
java.lang.String |
getW(java.lang.String a1)
|
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
CorpusDictionary
public CorpusDictionary(java.lang.String filename)
- Load a dictionary of words.
- Parameters:
filename - A file of words, one per line. It must be in UTF-8.
CorpusDictionary
public CorpusDictionary(java.lang.String filename,
boolean normalize)
getTable
public java.util.Set<java.lang.String> getTable()
contains
public boolean contains(java.lang.String word)
getW
public java.lang.String getW(java.lang.String a1)
Stanford NLP Group