Class EmblRecord


  • public class EmblRecord
    extends Object
    this class contains the parsed data of embl file
    Since:
    5.0.0
    Author:
    Noor Aldeen Al Mbaidin
    • Method Detail

      • getEmblId

        public EmblId getEmblId()
        The ID (IDentification) line The tokens represent: 1. Primary accession number 2. Sequence version number 3. Topology: 'circular' or 'linear' 4. Molecule type 5. Data class 6. Taxonomic division 7. Sequence length
        Returns:
        EmblId
      • getEmblReference

        public List<EmblReferencegetEmblReference()
        The Reference (RN, RC, RP, RX, RG, RA, RT, RL) Lines These lines comprise the literature citations within the database. The citations provide access to the papers from which the data has been abstracted.
        Returns:
        EmblReference
      • getAccessionNumber

        public List<StringgetAccessionNumber()
        The AC (Accession number) line lists the accession numbers associated with the entry.
        Returns:
        List
      • getOrGanelle

        public String getOrGanelle()
        The OG (OrGanelle) linetype indicates the sub-cellular location of non-nuclear sequences.
        Returns:
        String
      • getCreatedDate

        public String getCreatedDate()
        The DT line shows when an entry first appeared in the database
        Returns:
        String
      • getFeatureHeader

        public String getFeatureHeader()
        The FH (Feature Header) lines are present only to improve readability of an entry when it is printed or displayed on a terminal screen.
        Returns:
        String
      • getFeatureTable

        public String getFeatureTable()
        The FT (Feature Table) lines provide a mechanism for the annotation of the sequence data. Regions or sites in the sequence which are of interest are listed in the table.
        Returns:
        String
      • getLastUpdatedDate

        public String getLastUpdatedDate()
        The DT (DaTe) line shows when an entry was last updated in the database.
        Returns:
        String
      • getSequenceDescription

        public String getSequenceDescription()
        The DE (Description) lines contain general descriptive information about the sequence stored. This may include the designations of genes for which the sequence codes, the region of the genome from which it is derived, or other information which helps to identify the sequence.
        Returns:
        String
      • getKeyword

        public List<StringgetKeyword()
        The KW (KeyWord) lines provide information which can be used to generate cross-reference indexes of the sequence entries based on functional, structural, or other categories deemed important.
        Returns:
        List
      • getOrganismSpecies

        public String getOrganismSpecies()
        The OS (Organism Species) line specifies the preferred scientific name of the organism which was the source of the stored sequence. In most cases this is done by giving the Latin genus and species designations, followed (in parentheses) by the preferred common name in English where known.
        Returns:
        String
      • getOrganismClassification

        public String getOrganismClassification()
        The OC (Organism Classification) lines contain the taxonomic classification Of the source organism
        Returns:
        String
      • getDatabaseCrossReference

        public String getDatabaseCrossReference()
        The DR (Database Cross-reference) line cross-references other databases which contain information related to the entry in which the DR line appears.
        Returns:
        String
      • getAssemblyHeader

        public String getAssemblyHeader()
        The AH (Assembly Header) line provides column headings for the assembly information.
        Returns:
        String
      • getAssemblyInformation

        public String getAssemblyInformation()
        The AS (Assembly Information) lines provide information on the composition of a TPA or TSA sequence.
        Returns:
        String
      • getConstructedSequence

        public String getConstructedSequence()
        Con(structed) sequences in the CON data classes represent complete chromosomes, genomes and other long sequences constructed from segment entries.
        Returns:
        String
      • getSequenceHeader

        public String getSequenceHeader()
        The SQ (SeQuence header) line marks the beginning of the sequence data and Gives a summary of its content.
        Returns:
        String