Class SeqIOConstants


  • public final class SeqIOConstants
    extends Object
    SeqIOConstants contains constants used to identify sequence formats, alphabets etc, in the context of reading and writing sequences.

    An int used to specify symbol alphabet and sequence format type is derived thus:

    • The two least significant bytes are reserved for format types such as RAW, FASTA, EMBL etc.
    • The two most significant bytes are reserved for alphabet and symbol information such as AMBIGUOUS, DNA, RNA, AA etc.
    • Bitwise OR combinations of each component int are used to specify combinations of format type and symbol information. To derive an int identifier for DNA with ambiguity codes in Fasta format, bitwise OR the AMBIGUOUS, DNA and FASTA values.
    Author:
    Keith James
    • Field Summary

      Fields 
      Modifier and Type Field Description
      static int AA
      AA indicates that a sequence contains AA (amino acid) symbols.
      static int AMBIGUOUS
      AMBIGUOUS indicates that a sequence contains ambiguity symbols.
      static int DNA
      DNA indicates that a sequence contains DNA (deoxyribonucleic acid) symbols.
      static int EMBL
      EMBL indicates that the sequence format is EMBL.
      static int EMBL_AA
      EMBL_AA premade EMBL | AA.
      static int EMBL_DNA
      EMBL_DNA premade EMBL | DNA.
      static int EMBL_RNA
      EMBL_RNA premade EMBL | RNA.
      static int FASTA
      FASTA indicates that the sequence format is Fasta.
      static int FASTA_AA
      FASTA_AA premade FASTA | AA.
      static int FASTA_DNA
      FASTA_DNA premade FASTA | DNA.
      static int FASTA_RNA
      FASTA_RNA premade FASTA | RNA.
      static int GCG
      GCG indicates that the sequence format is GCG.
      static int GENBANK
      GENBANK indicates that the sequence format is GENBANK.
      static int GENBANK_AA
      GENBANK_DNA premade GENBANK | AA.
      static int GENBANK_DNA
      GENBANK_DNA premade GENBANK | DNA.
      static int GENBANK_RNA
      GENBANK_DNA premade GENBANK | RNA.
      static int GENPEPT
      GENPEPT indicates that the sequence format is GENPEPT.
      static int GFF
      GFF indicates that the sequence format is GFF.
      static int IG
      IG indicates that the sequence format is IG.
      static int INTEGER
      INTEGER indicates that a sequence contains integer alphabet symbols, such as used to describe sequence quality data.
      static LifeScienceIdentifier LSID_EMBL_AA
      LSID_EMBL_AA sequence format LSID for EMBL AA.
      static LifeScienceIdentifier LSID_EMBL_DNA
      LSID_EMBL_DNA sequence format LSID for EMBL DNA.
      static LifeScienceIdentifier LSID_EMBL_RNA
      LSID_EMBL_RNA sequence format LSID for EMBL RNA.
      static LifeScienceIdentifier LSID_FASTA_AA
      LSID_FASTA_AA sequence format LSID for Fasta AA.
      static LifeScienceIdentifier LSID_FASTA_DNA
      LSID_FASTA_DNA sequence format LSID for Fasta DNA.
      static LifeScienceIdentifier LSID_FASTA_RNA
      LSID_FASTA_RNA sequence format LSID for Fasta RNA.
      static LifeScienceIdentifier LSID_GENBANK_AA
      LSID_GENBANK_AA sequence format LSID for Genbank AA.
      static LifeScienceIdentifier LSID_GENBANK_DNA
      LSID_GENBANK_DNA sequence format LSID for Genbank DNA.
      static LifeScienceIdentifier LSID_GENBANK_RNA
      LSID_GENBANK_RNA sequence format LSID for Genbank RNA.
      static LifeScienceIdentifier LSID_SWISSPROT
      LSID_SWISSPROT sequence format LSID for Swissprot.
      static int NBRF
      NBRF indicates that the sequence format is NBRF.
      static int PDB
      PDB indicates that the sequence format is PDB.
      static int PHRED
      PHRED indicates that the sequence format is PHRED.
      static int RAW
      RAW indicates that the sequence format is raw (symbols only).
      static int REFSEQ
      REFSEQ indicates that the sequence format is REFSEQ.
      static int REFSEQ_AA
      REFSEQ_AA premade REFSEQ | AA.
      static int REFSEQ_DNA
      REFSEQ_DNA premade REFSEQ | DNA.
      static int REFSEQ_RNA
      REFSEQ_RNA premade REFSEQ | RNA.
      static int RNA
      RNA indicates that a sequence contains RNA (ribonucleic acid) symbols.
      static int SWISSPROT
      SWISSPROT indicates that the sequence format is SWISSPROT.
      static int UNKNOWN
      UNKNOWN indicates that the sequence format is unknown.
    • Constructor Summary

      Constructors 
      Constructor Description
      SeqIOConstants()