Class EmblRecord
java.lang.Object
org.biojava.nbio.core.sequence.io.embl.EmblRecord
this class contains the parsed data of embl file
- Since:
- 5.0.0
- Author:
- Noor Aldeen Al Mbaidin
-
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionThe AC (Accession number) line lists the accession numbers associated with the entry.The AH (Assembly Header) line provides column headings for the assembly information.The AS (Assembly Information) lines provide information on the composition of a TPA or TSA sequence.Con(structed) sequences in the CON data classes represent complete chromosomes, genomes and other long sequences constructed from segment entries.The DT line shows when an entry first appeared in the databaseThe DR (Database Cross-reference) line cross-references other databases which contain information related to the entry in which the DR line appears.The ID (IDentification) line The tokens represent: 1.The Reference (RN, RC, RP, RX, RG, RA, RT, RL) Lines These lines comprise the literature citations within the database.The FH (Feature Header) lines are present only to improve readability of an entry when it is printed or displayed on a terminal screen.The FT (Feature Table) lines provide a mechanism for the annotation of the sequence data.The KW (KeyWord) lines provide information which can be used to generate cross-reference indexes of the sequence entries based on functional, structural, or other categories deemed important.The DT (DaTe) line shows when an entry was last updated in the database.The OG (OrGanelle) linetype indicates the sub-cellular location of non-nuclear sequences.The OC (Organism Classification) lines contain the taxonomic classification Of the source organismThe OS (Organism Species) line specifies the preferred scientific name of the organism which was the source of the stored sequence.The Sequence Data LineThe DE (Description) lines contain general descriptive information about the sequence stored.The SQ (SeQuence header) line marks the beginning of the sequence data and Gives a summary of its content.void
setAccessionNumber
(List<String> accessionNumber) void
setAssemblyHeader
(String assemblyHeader) void
setAssemblyInformation
(String assemblyInformation) void
setConstructedSequence
(String constructedSequence) void
setCreatedDate
(String createdDate) void
setDatabaseCrossReference
(String databaseCrossReference) void
void
setEmblReference
(List<EmblReference> emblReference) void
setFeatureHeader
(String featureHeader) void
setFeatureTable
(String featureTable) void
setKeyword
(List<String> keyword) void
setLastUpdatedDate
(String lastUpdatedDate) void
setOrGanelle
(String orGanelle) void
setOrganismClassification
(String organismClassification) void
setOrganismSpecies
(String organismSpecies) void
setProjectIdentifier
(String projectIdentifier) void
setSequence
(String sequence) void
setSequenceDescription
(String sequenceDescription) void
setSequenceHeader
(String sequenceHeader)
-
Constructor Details
-
EmblRecord
public EmblRecord()
-
-
Method Details
-
getEmblId
The ID (IDentification) line The tokens represent: 1. Primary accession number 2. Sequence version number 3. Topology: 'circular' or 'linear' 4. Molecule type 5. Data class 6. Taxonomic division 7. Sequence length- Returns:
-
setEmblId
-
getEmblReference
The Reference (RN, RC, RP, RX, RG, RA, RT, RL) Lines These lines comprise the literature citations within the database. The citations provide access to the papers from which the data has been abstracted.- Returns:
-
setEmblReference
-
getAccessionNumber
The AC (Accession number) line lists the accession numbers associated with the entry.- Returns:
-
setAccessionNumber
-
getProjectIdentifier
- Returns:
- String
-
setProjectIdentifier
-
getOrGanelle
The OG (OrGanelle) linetype indicates the sub-cellular location of non-nuclear sequences.- Returns:
-
setOrGanelle
-
getCreatedDate
The DT line shows when an entry first appeared in the database- Returns:
-
setCreatedDate
-
getFeatureHeader
The FH (Feature Header) lines are present only to improve readability of an entry when it is printed or displayed on a terminal screen.- Returns:
-
setFeatureHeader
-
getFeatureTable
The FT (Feature Table) lines provide a mechanism for the annotation of the sequence data. Regions or sites in the sequence which are of interest are listed in the table.- Returns:
-
setFeatureTable
-
getLastUpdatedDate
The DT (DaTe) line shows when an entry was last updated in the database.- Returns:
- String
-
setLastUpdatedDate
-
getSequenceDescription
The DE (Description) lines contain general descriptive information about the sequence stored. This may include the designations of genes for which the sequence codes, the region of the genome from which it is derived, or other information which helps to identify the sequence.- Returns:
-
setSequenceDescription
-
getKeyword
The KW (KeyWord) lines provide information which can be used to generate cross-reference indexes of the sequence entries based on functional, structural, or other categories deemed important.- Returns:
-
setKeyword
-
getOrganismSpecies
The OS (Organism Species) line specifies the preferred scientific name of the organism which was the source of the stored sequence. In most cases this is done by giving the Latin genus and species designations, followed (in parentheses) by the preferred common name in English where known.- Returns:
- String
-
setOrganismSpecies
-
getOrganismClassification
The OC (Organism Classification) lines contain the taxonomic classification Of the source organism- Returns:
-
setOrganismClassification
-
getDatabaseCrossReference
The DR (Database Cross-reference) line cross-references other databases which contain information related to the entry in which the DR line appears.- Returns:
-
setDatabaseCrossReference
-
getAssemblyHeader
The AH (Assembly Header) line provides column headings for the assembly information.- Returns:
-
setAssemblyHeader
-
getAssemblyInformation
The AS (Assembly Information) lines provide information on the composition of a TPA or TSA sequence.- Returns:
- String
-
setAssemblyInformation
-
getConstructedSequence
Con(structed) sequences in the CON data classes represent complete chromosomes, genomes and other long sequences constructed from segment entries.- Returns:
-
setConstructedSequence
-
getSequenceHeader
The SQ (SeQuence header) line marks the beginning of the sequence data and Gives a summary of its content.- Returns:
-
setSequenceHeader
-
getSequence
The Sequence Data Line- Returns:
-
setSequence
-