Package org.biojava.nbio.core.sequence
Class TranscriptSequence
java.lang.Object
org.biojava.nbio.core.sequence.template.AbstractSequence<NucleotideCompound>
org.biojava.nbio.core.sequence.DNASequence
org.biojava.nbio.core.sequence.TranscriptSequence
- All Implemented Interfaces:
Iterable<NucleotideCompound>
,Accessioned
,Sequence<NucleotideCompound>
This is the sequence if you want to go from a gene sequence to a protein sequence. Need to start with a
ChromosomeSequence then getting a GeneSequence and then a TranscriptSequence
- Author:
- Scooter Willis
-
Nested Class Summary
Nested classes/interfaces inherited from class org.biojava.nbio.core.sequence.DNASequence
DNASequence.DNAType
Nested classes/interfaces inherited from class org.biojava.nbio.core.sequence.template.AbstractSequence
AbstractSequence.AnnotationType
-
Constructor Summary
ConstructorDescriptionTranscriptSequence
(GeneSequence parentDNASequence, int begin, int end) Deprecated.TranscriptSequence
(GeneSequence parentDNASequence, AccessionID accessionID, int begin, int end) -
Method Summary
Modifier and TypeMethodDescriptionaddCDS
(AccessionID accession, int begin, int end, int phase) Add a Coding Sequence region with phase to the transcript sequencevoid
addStartCodonSequence
(AccessionID accession, int begin, int end) Sets the start codon sequence at given begin / end location.void
addStopCodonSequence
(AccessionID accession, int begin, int end) Sets the stop codon sequence at given begin / end location.Get the CDS sequences that have been added to the TranscriptSequencesGet the stitched together CDS sequences then maps to the cDNAint
Returns the length of the SequenceReturn a list of protein sequences based on each CDS sequence where the phase shift between two CDS sequences is assigned to the CDS sequence that starts the triplet.Get the protein sequenceGet the protein sequence with user defined TranscriptEngineRemove a CDS or coding sequence from the transcript sequenceMethods inherited from class org.biojava.nbio.core.sequence.DNASequence
getComplement, getDNAType, getGCCount, getReverse, getReverseComplement, getRNASequence, getRNASequence, getRNASequence, getRNASequence, setDNAType
Methods inherited from class org.biojava.nbio.core.sequence.template.AbstractSequence
addFeature, addFeature, addNote, countCompounds, equals, getAccession, getAnnotationType, getAsList, getBioBegin, getBioEnd, getComments, getCompoundAt, getCompoundSet, getDatabaseReferences, getDescription, getFeatureRetriever, getFeatures, getFeatures, getFeatures, getFeaturesByType, getFeaturesKeyWord, getIndexOf, getInverse, getLastIndexOf, getNotesList, getOriginalHeader, getParentSequence, getProxySequenceReader, getReferences, getSequenceAsString, getSequenceAsString, getSequenceScore, getSource, getSubSequence, getTaxonomy, getUserCollection, hashCode, initSequenceStorage, iterator, removeFeature, removeNote, setAccession, setAnnotationType, setBioBegin, setBioEnd, setComments, setCompoundSet, setDatabaseReferences, setDescription, setFeatureRetriever, setFeaturesKeyWord, setNotesList, setOriginalHeader, setParentSequence, setProxySequenceReader, setReferences, setSequenceScore, setSource, setTaxonomy, setUserCollection, toString
Methods inherited from class java.lang.Object
clone, finalize, getClass, notify, notifyAll, wait, wait, wait
Methods inherited from interface java.lang.Iterable
forEach, spliterator
-
Constructor Details
-
TranscriptSequence
Deprecated.Use public TranscriptSequence(GeneSequence parentDNASequence, AccessionID accessionID, int begin, int end) that requires an explicit accessionID -
TranscriptSequence
public TranscriptSequence(GeneSequence parentDNASequence, AccessionID accessionID, int begin, int end) - Parameters:
parentDNASequence
-accessionID
-begin
-end
- inclusive of end- Throws:
IllegalArgumentException
- if the parentDNASequence is incompatible with DNACompoundSet
-
-
Method Details
-
getLength
Description copied from interface:Sequence
Returns the length of the Sequence- Specified by:
getLength
in interfaceSequence<NucleotideCompound>
- Overrides:
getLength
in classAbstractSequence<NucleotideCompound>
- Returns:
-
getStrand
- Returns:
- the strand
-
removeCDS
Remove a CDS or coding sequence from the transcript sequence- Parameters:
accession
-- Returns:
-
getCDSSequences
Get the CDS sequences that have been added to the TranscriptSequences- Returns:
-
addCDS
Add a Coding Sequence region with phase to the transcript sequence- Parameters:
accession
-begin
-end
-phase
- 0,1,2- Returns:
- Throws:
Exception
-
getProteinCDSSequences
Return a list of protein sequences based on each CDS sequence where the phase shift between two CDS sequences is assigned to the CDS sequence that starts the triplet. This can be used to map a CDS/exon region of a protein sequence back to the DNA sequence If you have a protein sequence and a predicted gene you can take the predict CDS protein sequences and align back to the protein sequence. If you have errors in mapping the predicted protein CDS regions to an the known protein sequence then you can identify possible errors in the prediction- Returns:
-
getDNACodingSequence
Get the stitched together CDS sequences then maps to the cDNA- Returns:
-
getProteinSequence
Get the protein sequence- Returns:
-
getProteinSequence
Get the protein sequence with user defined TranscriptEngine- Parameters:
engine
-- Returns:
-
getStartCodonSequence
- Returns:
- the startCodonSequence
-
addStartCodonSequence
Sets the start codon sequence at given begin / end location. Note that calling this method multiple times will replace any existing value.- Parameters:
accession
-begin
-end
-
-
getStopCodonSequence
- Returns:
- the stopCodonSequence
-
addStopCodonSequence
Sets the stop codon sequence at given begin / end location. Note that calling this method multiple times will replace any existing value.- Parameters:
accession
-begin
-end
-
-