public class UniProtXMLFormat extends RichSequenceFormat.BasicFormat
| Modifier and Type | Class and Description |
|---|---|
| static class | UniProtXMLFormat.Terms: Implements some UniProtXML-specific terms. |

Nested classes inherited from interface RichSequenceFormat: RichSequenceFormat.BasicFormat, RichSequenceFormat.HeaderlessFormat

| Constructor and Description |
|---|
| UniProtXMLFormat() |

| Modifier and Type | Method and Description |
|---|---|
| void | beginWriting(): Informs the writer that we want to start writing. |
| boolean | canRead(BufferedInputStream stream): Check to see if a given stream is in our format. |
| boolean | canRead(File file): Check to see if a given file is in our format. |
| void | finishWriting(): Informs the writer that we are done writing. |
| String | getDefaultFormat(): getDefaultFormat returns the String identifier for the default sub-format written by a SequenceFormat implementation. |
| SymbolTokenization | guessSymbolTokenization(BufferedInputStream stream): On the assumption that the stream is readable by this format (not checked), attempt to guess which symbol tokenization we should use to read it. |
| SymbolTokenization | guessSymbolTokenization(File file): On the assumption that the file is readable by this format (not checked), attempt to guess which symbol tokenization we should use to read it. |
| boolean | readRichSequence(BufferedReader reader, SymbolTokenization symParser, RichSeqIOListener rlistener, Namespace ns): Reads a sequence from the given buffered reader using the given tokenizer to parse sequence symbols. |
| boolean | readSequence(BufferedReader reader, SymbolTokenization symParser, SeqIOListener listener): Read a sequence and pass data on to a SeqIOListener. |
| void | writeSequence(Sequence seq, Namespace ns): Writes a sequence out to the output stream given by beginWriting() using the default format of the implementing class. |
| void | writeSequence(Sequence seq, PrintStream os): writeSequence writes a sequence to the specified PrintStream, using the default format. |
| void | writeSequence(Sequence seq, String format, PrintStream os): writeSequence writes a sequence to the specified PrintStream, using the specified format. |

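
The canRead(BufferedInputStream) entry above checks whether a stream is in this format, and a caller will normally want to parse the same stream afterwards, so the check should not consume any input. The following JDK-only sketch shows one way to write such a non-consuming check with mark() and reset(). The class name FormatSniffer, the 2000-byte look-ahead limit, and the test for a "<uniprot" root element are all assumptions made for illustration; this is not BioJava's actual implementation.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper for illustration only; not part of BioJava. */
public class FormatSniffer {
    // Assumed look-ahead limit: the most bytes we inspect before calling reset().
    private static final int LOOKAHEAD = 2000;

    /**
     * Returns true if the first LOOKAHEAD bytes contain the (assumed) root tag
     * "<uniprot". The stream is rewound afterwards, so nothing is consumed.
     */
    public static boolean looksLikeUniProtXml(BufferedInputStream stream) throws IOException {
        stream.mark(LOOKAHEAD);
        try {
            byte[] head = new byte[LOOKAHEAD];
            int total = 0;
            int n;
            // Read until the buffer is full or the stream ends.
            while (total < LOOKAHEAD
                    && (n = stream.read(head, total, LOOKAHEAD - total)) > 0) {
                total += n;
            }
            // ISO-8859-1 maps every byte to one char, which is enough to find an ASCII tag.
            String text = new String(head, 0, total, StandardCharsets.ISO_8859_1);
            return text.contains("<uniprot");
        } finally {
            // Rewind to the marked position so a later parser sees the full input.
            stream.reset();
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] data = "<?xml version=\"1.0\"?>\n<uniprot>".getBytes(StandardCharsets.UTF_8);
        BufferedInputStream in = new BufferedInputStream(new ByteArrayInputStream(data));
        System.out.println(looksLikeUniProtXml(in));
    }
}
```

Because reset() is called in a finally block, the stream is rewound whether or not the tag is found, which is what lets a format check be followed by a read on the same stream.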
Methods inherited from class RichSequenceFormat.BasicFormat: getElideComments, getElideFeatures, getElideReferences, getElideSymbols, getLineWidth, getPrintStream, setElideComments, setElideFeatures, setElideReferences, setElideSymbols, setLineWidth, setPrintStream

public static final String UNIPROTXML_FORMAT
protected static final String ENTRY_GROUP_TAG
protected static final String ENTRY_TAG
protected static final String ENTRY_VERSION_ATTR
protected static final String ENTRY_NAMESPACE_ATTR
protected static final String ENTRY_CREATED_ATTR
protected static final String ENTRY_UPDATED_ATTR
protected static final String COPYRIGHT_TAG
protected static final String ACCESSION_TAG
protected static final String NAME_TAG
protected static final String TEXT_TAG
protected static final String REF_ATTR
protected static final String TYPE_ATTR
protected static final String KEY_ATTR
protected static final String ID_ATTR
protected static final String EVIDENCE_ATTR
protected static final String VALUE_ATTR
protected static final String STATUS_ATTR
protected static final String NAME_ATTR
protected static final String PROTEIN_TAG
protected static final String PROTEIN_TYPE_ATTR
protected static final String DOMAIN_TAG
protected static final String COMPONENT_TAG
protected static final String GENE_TAG
protected static final String ORGANISM_TAG
protected static final String DBXREF_TAG
protected static final String PROPERTY_TAG
protected static final String LINEAGE_TAG
protected static final String TAXON_TAG
protected static final String GENELOCATION_TAG
protected static final String GENELOCATION_NAME_TAG
protected static final String REFERENCE_TAG
protected static final String CITATION_TAG
protected static final String TITLE_TAG
protected static final String EDITOR_LIST_TAG
protected static final String AUTHOR_LIST_TAG
protected static final String PERSON_TAG
protected static final String CONSORTIUM_TAG
protected static final String LOCATOR_TAG
protected static final String RP_LINE_TAG
protected static final String RC_LINE_TAG
protected static final String RC_SPECIES_TAG
protected static final String RC_TISSUE_TAG
protected static final String RC_TRANSP_TAG
protected static final String RC_STRAIN_TAG
protected static final String RC_PLASMID_TAG
protected static final String COMMENT_TAG
protected static final String COMMENT_MASS_ATTR
protected static final String COMMENT_ERROR_ATTR
protected static final String COMMENT_METHOD_ATTR
protected static final String COMMENT_LOCTYPE_ATTR
protected static final String COMMENT_ABSORPTION_TAG
protected static final String COMMENT_ABS_MAX_TAG
protected static final String COMMENT_KINETICS_TAG
protected static final String COMMENT_KIN_KM_TAG
protected static final String COMMENT_KIN_VMAX_TAG
protected static final String COMMENT_PH_TAG
protected static final String COMMENT_REDOX_TAG
protected static final String COMMENT_TEMPERATURE_TAG
protected static final String COMMENT_LINK_TAG
protected static final String COMMENT_LINK_URI_ATTR
protected static final String COMMENT_EVENT_TAG
protected static final String COMMENT_ISOFORM_TAG
protected static final String COMMENT_INTERACTANT_TAG
protected static final String COMMENT_INTERACT_INTACT_ATTR
protected static final String COMMENT_INTERACT_LABEL_TAG
protected static final String COMMENT_ORGANISMS_TAG
protected static final String COMMENT_EXPERIMENTS_TAG
protected static final String NOTE_TAG
protected static final String KEYWORD_TAG
protected static final String PROTEIN_EXISTS_TAG
protected static final String ID_TAG
protected static final String FEATURE_TAG
protected static final String FEATURE_DESC_ATTR
protected static final String FEATURE_ORIGINAL_TAG
protected static final String FEATURE_VARIATION_TAG
protected static final String EVIDENCE_TAG
protected static final String EVIDENCE_CATEGORY_ATTR
protected static final String EVIDENCE_ATTRIBUTE_ATTR
protected static final String EVIDENCE_DATE_ATTR
protected static final String LOCATION_TAG
protected static final String LOCATION_SEQ_ATTR
protected static final String LOCATION_BEGIN_TAG
protected static final String LOCATION_END_TAG
protected static final String LOCATION_POSITION_ATTR
protected static final String LOCATION_POSITION_TAG
protected static final String SEQUENCE_TAG
protected static final String SEQUENCE_VERSION_ATTR
protected static final String SEQUENCE_LENGTH_ATTR
protected static final String SEQUENCE_MASS_ATTR
protected static final String SEQUENCE_CHECKSUM_ATTR
protected static final String SEQUENCE_MODIFIED_ATTR
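
The constants above name the XML elements and attributes that the format handles; their string values are not shown in this listing. As a rough illustration of how tag-name constants of this kind can drive a parser, here is a JDK-only SAX sketch. The class TagDrivenParser and the literal values "accession", "sequence" and "length" are assumptions based on the public UniProt XML layout, not values taken from this class, and the code is not BioJava's parser.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

/** Hypothetical example class; not part of BioJava. */
public class TagDrivenParser {
    // Assumed values: the real constants' strings are not given in the listing above.
    static final String ACCESSION_TAG = "accession";
    static final String SEQUENCE_TAG = "sequence";
    static final String SEQUENCE_LENGTH_ATTR = "length";

    /** Extracts the first accession, the sequence text, and its length attribute. */
    public static Map<String, String> parse(String xml) throws Exception {
        final Map<String, String> out = new LinkedHashMap<>();
        DefaultHandler handler = new DefaultHandler() {
            private final StringBuilder text = new StringBuilder();

            @Override
            public void startElement(String uri, String localName, String qName, Attributes attrs) {
                text.setLength(0); // start collecting text for this element
                if (SEQUENCE_TAG.equals(qName)) {
                    out.put(SEQUENCE_LENGTH_ATTR, attrs.getValue(SEQUENCE_LENGTH_ATTR));
                }
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                text.append(ch, start, length);
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                if (ACCESSION_TAG.equals(qName) && !out.containsKey(ACCESSION_TAG)) {
                    out.put(ACCESSION_TAG, text.toString().trim());
                } else if (SEQUENCE_TAG.equals(qName)) {
                    // Sequence text may be wrapped over several lines; drop the whitespace.
                    out.put(SEQUENCE_TAG, text.toString().replaceAll("\\s+", ""));
                }
            }
        };
        // The default factory is not namespace-aware, so qName is the tag as written.
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)), handler);
        return out;
    }
}
```

Keeping the tag names in constants rather than scattering string literals through the handler means a schema rename touches one line, which is presumably why the class declares so many of them.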
public UniProtXMLFormat()
public boolean canRead(File file) throws IOException
Specified by: canRead in interface RichSequenceFormat
Overrides: canRead in class RichSequenceFormat.BasicFormat
Parameters:
  file - the File to check.
Throws:
  IOException - in case the file is inaccessible.

public SymbolTokenization guessSymbolTokenization(File file) throws IOException
Specified by: guessSymbolTokenization in interface RichSequenceFormat
Overrides: guessSymbolTokenization in class RichSequenceFormat.BasicFormat
Parameters:
  file - the File object to guess the format of.
Returns: a SymbolTokenization to read the file with.
Throws:
  IOException - if the file is unrecognisable or inaccessible.

public boolean canRead(BufferedInputStream stream) throws IOException
Parameters:
  stream - the BufferedInputStream to check.
Throws:
  IOException - in case the stream is inaccessible.

public SymbolTokenization guessSymbolTokenization(BufferedInputStream stream) throws IOException
Parameters:
  stream - the BufferedInputStream object to guess the format of.
Returns: a SymbolTokenization to read the stream with.
Throws:
  IOException - if the stream is unrecognisable or inaccessible.

public boolean readSequence(BufferedReader reader, SymbolTokenization symParser, SeqIOListener listener) throws IllegalSymbolException, IOException, ParseException
Parameters:
  reader - The stream of data to parse.
  symParser - A SymbolParser defining a mapping from character data to Symbols.
  listener - A listener to notify when data is extracted from the stream.
Throws:
  IllegalSymbolException - if it is not possible to translate character data from the stream into valid BioJava symbols.
  IOException - if an error occurs while reading from the stream.
  ParseException

public boolean readRichSequence(BufferedReader reader, SymbolTokenization symParser, RichSeqIOListener rlistener, Namespace ns) throws IllegalSymbolException, IOException, ParseException
Parameters:
  reader - the input source
  symParser - the tokenizer which understands the sequence being read
  rlistener - the listener to send sequence events to
  ns - the namespace to read sequences into.
Throws:
  IllegalSymbolException - if the tokenizer couldn't understand one of the sequence symbols in the file.
  IOException - if there was a read error.
  ParseException

public void beginWriting() throws IOException
Throws:
  IOException - if writing fails.

public void finishWriting() throws IOException
Throws:
  IOException - if writing fails.

public void writeSequence(Sequence seq, PrintStream os) throws IOException
writeSequence writes a sequence to the specified PrintStream, using the default format.
Parameters:
  seq - the sequence to write out.
  os - the printstream to write to.
Throws:
  IOException

public void writeSequence(Sequence seq, String format, PrintStream os) throws IOException
writeSequence writes a sequence to the specified PrintStream, using the specified format.
Parameters:
  seq - a Sequence to write out.
  format - a String indicating which sub-format of those available from a particular SequenceFormat implementation to use when writing.
  os - a PrintStream object.
Throws:
  IOException - if an error occurs.

public void writeSequence(Sequence seq, Namespace ns) throws IOException
Parameters:
  seq - the sequence to write
  ns - the namespace to write it with
Throws:
  IOException - in case it couldn't write something

public String getDefaultFormat()
getDefaultFormat returns the String identifier for the default sub-format written by a SequenceFormat implementation.
Returns: a String.

Copyright © 2020 BioJava. All rights reserved.