public class IOBUtils
extends java.lang.Object
| Modifier and Type | Method and Description |
|---|---|
| static java.lang.String | getBoundaryCharacter() |
| static java.lang.String | IOBToString(java.util.List<CoreLabel> labeledSequence, java.lang.String prefixMarker, java.lang.String suffixMarker) Convert a list of labeled characters to a String. |
| static java.util.List<CoreLabel> | StringToIOB(java.util.List<CoreLabel> tokenList, java.lang.Character segMarker, boolean applyRewriteRules) Convert a String to a list of characters suitable for labeling in an IOB segmentation model. |
| static java.util.List<CoreLabel> | StringToIOB(java.lang.String string) This version turns an unsegmented string into IOB input, i.e., it is for processing raw text. |
| static java.util.List<CoreLabel> | StringToIOB(java.lang.String str, java.lang.Character segMarker) |
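
The StringToIOB methods above turn text into a per-character sequence ready for IOB labeling. The following is a minimal, library-independent sketch of that idea, not the IOBUtils implementation. The class `IobLabelSketch`, the `LabeledChar` holder, the `B`/`I` label names, and the rule that whitespace and the segment marker both start a new segment are all assumptions made for illustration; the real class works on `CoreLabel` objects and may use a different label set.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of character-level IOB labeling; not part of IOBUtils.
class IobLabelSketch {

    // One character together with its (assumed) IOB label.
    static final class LabeledChar {
        final char ch;
        final String label;

        LabeledChar(char ch, String label) {
            this.ch = ch;
            this.label = label;
        }

        @Override
        public String toString() {
            return ch + "/" + label;
        }
    }

    // Labels the first character of every segment "B" and the rest "I".
    // Whitespace and segMarker are treated as segment breaks and are not emitted.
    static List<LabeledChar> toIob(String text, char segMarker) {
        List<LabeledChar> out = new ArrayList<>();
        boolean newSegment = true;
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isWhitespace(c) || c == segMarker) {
                newSegment = true;   // the next emitted character begins a segment
                continue;
            }
            out.add(new LabeledChar(c, newSegment ? "B" : "I"));
            newSegment = false;
        }
        return out;
    }

    public static void main(String[] args) {
        // "ab+c d" has three segments: "ab", "c", "d"
        System.out.println(toIob("ab+c d", '+'));
    }
}
```

Under these assumptions, a marked-up string can serve as training data (the marker tells the labeler where segments begin), while raw text without markers yields only whitespace-derived boundaries.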
public static java.lang.String getBoundaryCharacter()

public static java.util.List<CoreLabel> StringToIOB(java.util.List<CoreLabel> tokenList, java.lang.Character segMarker, boolean applyRewriteRules)
tokenList -
segMarker -
applyRewriteRules - add rewrite labels (for training data)

public static java.util.List<CoreLabel> StringToIOB(java.lang.String string)
string -

public static java.util.List<CoreLabel> StringToIOB(java.lang.String str, java.lang.Character segMarker)

public static java.lang.String IOBToString(java.util.List<CoreLabel> labeledSequence, java.lang.String prefixMarker, java.lang.String suffixMarker)
labeledSequence -
prefixMarker -
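
Going the other way, IOBToString converts a labeled character sequence back into a String. The sketch below shows only the general idea of that reverse step and is not the library's implementation. The class `IobJoinSketch`, the `B`/`I` labels, and the single `separator` parameter are hypothetical; in particular, it does not model `prefixMarker` or `suffixMarker`, whose semantics this page leaves undescribed.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration of rebuilding a string from IOB labels; not part of IOBUtils.
class IobJoinSketch {

    // Concatenates characters, inserting the separator before every
    // segment-initial ("B") character except the very first one.
    static String fromIob(List<String> chars, List<String> labels, String separator) {
        if (chars.size() != labels.size()) {
            throw new IllegalArgumentException("chars and labels must have the same length");
        }
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < chars.size(); i++) {
            if ("B".equals(labels.get(i)) && sb.length() > 0) {
                sb.append(separator);
            }
            sb.append(chars.get(i));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<String> chars = Arrays.asList("a", "b", "c", "d");
        List<String> labels = Arrays.asList("B", "I", "B", "I");
        System.out.println(fromIob(chars, labels, " ")); // prints "ab cd"
    }
}
```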