edu.stanford.nlp.wordseg
Class CorpusDictionary

java.lang.Object
  extended by edu.stanford.nlp.wordseg.CorpusDictionary

public class CorpusDictionary
extends java.lang.Object

Checks whether a bigram exists in the bakeoff corpora. The dictionaries that this class reads must be encoded in UTF-8.

Author:
Huihsin Tseng, Pichuan Chang

Constructor Summary
CorpusDictionary(java.lang.String filename)
          Load a dictionary of words.
CorpusDictionary(java.lang.String filename, boolean normalize)
           
 
Method Summary
 boolean contains(java.lang.String word)
           
 java.util.Set<java.lang.String> getTable()
           
 java.lang.String getW(java.lang.String a1)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

CorpusDictionary

public CorpusDictionary(java.lang.String filename)
Load a dictionary of words.

Parameters:
filename - A file of words, one per line. It must be in UTF-8.
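
The file format described above (one word per line, UTF-8) can be illustrated with a minimal sketch. WordListSketch below is a hypothetical stand-in written for this example, not the Stanford implementation; it only assumes that each non-empty line becomes one entry in a set, and that blank lines and surrounding whitespace are ignored.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.HashSet;
import java.util.Set;

/** Hypothetical stand-in for illustration only; not edu.stanford.nlp.wordseg.CorpusDictionary. */
class WordListSketch {
    private final Set<String> table = new HashSet<>();

    /** Reads a UTF-8 file with one word per line into a set. */
    WordListSketch(String filename) throws IOException {
        for (String line : Files.readAllLines(Paths.get(filename), StandardCharsets.UTF_8)) {
            String word = line.trim();
            // Assumption: blank lines carry no entry and are skipped.
            if (!word.isEmpty()) {
                table.add(word);
            }
        }
    }

    /** Returns true if the word was present in the loaded file. */
    boolean contains(String word) {
        return table.contains(word);
    }

    /** Exposes the loaded entries. */
    Set<String> getTable() {
        return table;
    }
}
```

The method names mirror contains and getTable from the summary above so the sketch reads alongside this page; the actual class may normalize or store entries differently.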

CorpusDictionary

public CorpusDictionary(java.lang.String filename,
                        boolean normalize)

Method Detail

getTable

public java.util.Set<java.lang.String> getTable()

contains

public boolean contains(java.lang.String word)

getW

public java.lang.String getW(java.lang.String a1)


Stanford NLP Group