public class SoftMaskedAlphabet.CaseSensitiveTokenization extends Unchangeable implements SymbolTokenization
SymbolTokenizer
works with a delegate to softmask
symbol tokenization as appropriate. It should only be used in combination
with a SoftMaskedAlphabet.
You will never instantiate one of these yourself.
Copyright (c) 2004 Novartis Institute for Tropical Diseases
SymbolTokenization.TokenType
Annotatable.AnnotationForwarder
CHARACTER, FIXEDWIDTH, SEPARATED, UNKNOWN
ANNOTATION
Modifier and Type | Method and Description |
---|---|
Alphabet |
getAlphabet()
The alphabet to which this tokenization applies.
|
Annotation |
getAnnotation()
Should return the associated annotation object.
|
SymbolTokenization.TokenType |
getTokenType()
Determine the style of tokenization represented by this object.
|
StreamParser |
parseStream(SeqIOListener l)
The current implementation only supports character parsing.
|
Symbol |
parseToken(String token)
Returns the symbol for a single token.
|
String |
tokenizeSymbol(Symbol s)
Return a token representing a single symbol.
|
String |
tokenizeSymbolList(SymbolList sl)
Return a string representation of a list of symbols.
|
addChangeListener, addChangeListener, addForwarder, getForwarders, getListeners, isUnchanging, removeChangeListener, removeChangeListener, removeForwarder
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
addChangeListener, addChangeListener, isUnchanging, removeChangeListener, removeChangeListener
public Annotation getAnnotation()
Annotatable
getAnnotation
in interface Annotatable
public Alphabet getAlphabet()
SymbolTokenization
getAlphabet
in interface SymbolTokenization
public SymbolTokenization.TokenType getTokenType()
SymbolTokenization
getTokenType
in interface SymbolTokenization
public Symbol parseToken(String token) throws IllegalSymbolException
SymbolTokenization
The Symbol will be a member of the alphabet. If the token is not recognized as mapping to a symbol, an exception will be thrown.
parseToken
in interface SymbolTokenization
token
- the token to retrieve a Symbol forIllegalSymbolException
- if there is no Symbol for the tokenpublic String tokenizeSymbolList(SymbolList sl) throws IllegalSymbolException
SymbolTokenization
tokenizeSymbolList
in interface SymbolTokenization
sl
- A SymbolListIllegalSymbolException
public StreamParser parseStream(SeqIOListener l)
parseStream
in interface SymbolTokenization
l
- the SeqIOListener
to callback to.StreamParser
that the SeqIOListener
talks to.public String tokenizeSymbol(Symbol s) throws IllegalSymbolException
SymbolTokenization
tokenizeSymbol
in interface SymbolTokenization
s
- The symbolIllegalSymbolException
- if the symbol isn't recognized.Copyright © 2020 BioJava. All rights reserved.