public class CrossProductTokenization extends WordTokenization
SymbolTokenization.TokenType
Annotatable.AnnotationForwarder
CHARACTER, FIXEDWIDTH, SEPARATED, UNKNOWN
ANNOTATION
Constructor and Description |
---|
CrossProductTokenization(Alphabet alpha) |
CrossProductTokenization(Alphabet alpha,
List tokenizers) |
Modifier and Type | Method and Description |
---|---|
Symbol |
parseToken(String token)
Returns the symbol for a single token.
|
String |
tokenizeSymbol(Symbol s)
Return a token representing a single symbol.
|
getAlphabet, getAnnotation, getTokenType, parseStream, parseString, splitString, tokenizeSymbolList
addChangeListener, addChangeListener, addForwarder, getForwarders, getListeners, isUnchanging, removeChangeListener, removeChangeListener, removeForwarder
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
addChangeListener, addChangeListener, isUnchanging, removeChangeListener, removeChangeListener
public CrossProductTokenization(Alphabet alpha) throws BioException
BioException
public CrossProductTokenization(Alphabet alpha, List tokenizers)
public Symbol parseToken(String token) throws IllegalSymbolException
SymbolTokenization
The Symbol will be a member of the alphabet. If the token is not recognized as mapping to a symbol, an exception will be thrown.
token
- the token to retrieve a Symbol forIllegalSymbolException
- if there is no Symbol for the tokenpublic String tokenizeSymbol(Symbol s) throws IllegalSymbolException
SymbolTokenization
s
- The symbolIllegalSymbolException
- if the symbol isn't recognized.Copyright © 2020 BioJava. All rights reserved.