edu.stanford.nlp.international.arabic.process
Class IOBUtils

java.lang.Object
  extended by edu.stanford.nlp.international.arabic.process.IOBUtils

public class IOBUtils
extends java.lang.Object

A class for converting strings to input suitable for processing by an IOB sequence model.

Author:
Spence Green

Method Summary
static java.lang.String getBoundaryCharacter()
           
static java.lang.String IOBToString(java.util.List<CoreLabel> labeledSequence, java.lang.String prefixMarker, java.lang.String suffixMarker)
          Convert a list of labeled characters to a String.
static java.util.List<CoreLabel> StringToIOB(java.util.List<CoreLabel> tokenList, java.lang.Character segMarker, boolean applyRewriteRules)
          Convert a list of tokens to a list of characters suitable for labeling in an IOB segmentation model.
static java.util.List<CoreLabel> StringToIOB(java.lang.String string)
          This version turns an unsegmented string into IOB input, i.e., it is for processing raw text.
static java.util.List<CoreLabel> StringToIOB(java.lang.String str, java.lang.Character segMarker)
           
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Method Detail

getBoundaryCharacter

public static java.lang.String getBoundaryCharacter()

StringToIOB

public static java.util.List<CoreLabel> StringToIOB(java.util.List<CoreLabel> tokenList,
                                                    java.lang.Character segMarker,
                                                    boolean applyRewriteRules)
Convert a list of tokens to a list of characters suitable for labeling in an IOB segmentation model.

Parameters:
tokenList -
segMarker -
applyRewriteRules - add rewrite labels (for training data)
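
As an illustration of the general idea behind character-level IOB input, the following is a minimal sketch and not the IOBUtils implementation. It assumes two hypothetical label names (BEGIN and CONT), uses a plain stand-in class (LabeledChar) instead of CoreLabel, and treats the segmentation marker as a single character that separates segments inside a token. Rewrite labels are not modeled.

```java
import java.util.ArrayList;
import java.util.List;

final class IobLabelSketch {
    // Hypothetical label names chosen for this sketch; not taken from IOBUtils.
    static final String BEGIN = "BEGIN";
    static final String CONT = "CONT";

    /** Stand-in for CoreLabel: one character paired with its label. */
    static final class LabeledChar {
        final char ch;
        final String label;

        LabeledChar(char ch, String label) {
            this.ch = ch;
            this.label = label;
        }
    }

    /**
     * Labels each character of a token. The first character of every
     * segment gets BEGIN, the rest get CONT. The marker itself is dropped.
     */
    static List<LabeledChar> toIob(String token, char segMarker) {
        List<LabeledChar> out = new ArrayList<>();
        boolean startOfSegment = true;
        for (int i = 0; i < token.length(); i++) {
            char c = token.charAt(i);
            if (c == segMarker) {
                // The next character opens a new segment.
                startOfSegment = true;
                continue;
            }
            out.add(new LabeledChar(c, startOfSegment ? BEGIN : CONT));
            startOfSegment = false;
        }
        return out;
    }

    public static void main(String[] args) {
        // "ab-cd" is an arbitrary illustrative token with one marked boundary.
        for (LabeledChar lc : toIob("ab-cd", '-')) {
            System.out.println(lc.ch + "\t" + lc.label);
        }
    }
}
```

In this sketch, a token with no marker yields one BEGIN followed by CONT labels, which corresponds to the raw-text case where the segmentation is not yet known.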

StringToIOB

public static java.util.List<CoreLabel> StringToIOB(java.lang.String string)
This version is for turning an unsegmented string to an IOB input, i.e., for processing raw text.

Parameters:
string -

StringToIOB

public static java.util.List<CoreLabel> StringToIOB(java.lang.String str,
                                                    java.lang.Character segMarker)

IOBToString

public static java.lang.String IOBToString(java.util.List<CoreLabel> labeledSequence,
                                           java.lang.String prefixMarker,
                                           java.lang.String suffixMarker)
Convert a list of labeled characters to a String. Include segmentation markers in the string.

Parameters:
labeledSequence -
prefixMarker -
suffixMarker -
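
The reverse direction can be sketched in the same spirit. This is a simplified illustration under stated assumptions, not the IOBUtils implementation: it reuses the hypothetical BEGIN and CONT labels and a stand-in LabeledChar class, and it inserts one generic marker at every segment boundary. How IOBToString distinguishes prefixMarker from suffixMarker is not documented here, so that distinction is left out.

```java
import java.util.Arrays;
import java.util.List;

final class IobJoinSketch {
    // Hypothetical label names chosen for this sketch; not taken from IOBUtils.
    static final String BEGIN = "BEGIN";
    static final String CONT = "CONT";

    /** Stand-in for CoreLabel: one character paired with its label. */
    static final class LabeledChar {
        final char ch;
        final String label;

        LabeledChar(char ch, String label) {
            this.ch = ch;
            this.label = label;
        }
    }

    /**
     * Rebuilds a string from labeled characters, writing the marker
     * before every BEGIN character except the very first one.
     */
    static String fromIob(List<LabeledChar> seq, String marker) {
        StringBuilder sb = new StringBuilder();
        for (LabeledChar lc : seq) {
            if (lc.label.equals(BEGIN) && sb.length() > 0) {
                sb.append(marker);
            }
            sb.append(lc.ch);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<LabeledChar> seq = Arrays.asList(
                new LabeledChar('a', BEGIN),
                new LabeledChar('b', CONT),
                new LabeledChar('c', BEGIN),
                new LabeledChar('d', CONT));
        System.out.println(fromIob(seq, "+")); // prints "ab+cd"
    }
}
```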


Stanford NLP Group