Class TranscriptSequence

All Implemented Interfaces:
Iterable<NucleotideCompound>, Accessioned, Sequence<NucleotideCompound>

public class TranscriptSequence extends DNASequence
This is the sequence if you want to go from a gene sequence to a protein sequence. Need to start with a ChromosomeSequence then getting a GeneSequence and then a TranscriptSequence
Author:
Scooter Willis
  • Constructor Details

    • TranscriptSequence

      public TranscriptSequence(GeneSequence parentDNASequence, int begin, int end)
      Deprecated.
      Use public TranscriptSequence(GeneSequence parentDNASequence, AccessionID accessionID, int begin, int end) that requires an explicit accessionID
    • TranscriptSequence

      public TranscriptSequence(GeneSequence parentDNASequence, AccessionID accessionID, int begin, int end)
      Parameters:
      parentDNASequence -
      accessionID -
      begin -
      end - inclusive of end
      Throws:
      IllegalArgumentException - if the parentDNASequence is incompatible with DNACompoundSet
  • Method Details

    • getLength

      public int getLength()
      Description copied from interface: Sequence
      Returns the length of the Sequence
      Specified by:
      getLength in interface Sequence<NucleotideCompound>
      Overrides:
      getLength in class AbstractSequence<NucleotideCompound>
      Returns:
    • getStrand

      public Strand getStrand()
      Returns:
      the strand
    • removeCDS

      public CDSSequence removeCDS(String accession)
      Remove a CDS or coding sequence from the transcript sequence
      Parameters:
      accession -
      Returns:
    • getCDSSequences

      Get the CDS sequences that have been added to the TranscriptSequences
      Returns:
    • addCDS

      public CDSSequence addCDS(AccessionID accession, int begin, int end, int phase) throws Exception
      Add a Coding Sequence region with phase to the transcript sequence
      Parameters:
      accession -
      begin -
      end -
      phase - 0,1,2
      Returns:
      Throws:
      Exception
    • getProteinCDSSequences

      Return a list of protein sequences based on each CDS sequence where the phase shift between two CDS sequences is assigned to the CDS sequence that starts the triplet. This can be used to map a CDS/exon region of a protein sequence back to the DNA sequence If you have a protein sequence and a predicted gene you can take the predict CDS protein sequences and align back to the protein sequence. If you have errors in mapping the predicted protein CDS regions to an the known protein sequence then you can identify possible errors in the prediction
      Returns:
    • getDNACodingSequence

      Get the stitched together CDS sequences then maps to the cDNA
      Returns:
    • getProteinSequence

      Get the protein sequence
      Returns:
    • getProteinSequence

      Get the protein sequence with user defined TranscriptEngine
      Parameters:
      engine -
      Returns:
    • getStartCodonSequence

      Returns:
      the startCodonSequence
    • addStartCodonSequence

      public void addStartCodonSequence(AccessionID accession, int begin, int end)
      Sets the start codon sequence at given begin / end location. Note that calling this method multiple times will replace any existing value.
      Parameters:
      accession -
      begin -
      end -
    • getStopCodonSequence

      Returns:
      the stopCodonSequence
    • addStopCodonSequence

      public void addStopCodonSequence(AccessionID accession, int begin, int end)
      Sets the stop codon sequence at given begin / end location. Note that calling this method multiple times will replace any existing value.
      Parameters:
      accession -
      begin -
      end -