All Classes and Interfaces
Class
Description
Factory class to get Providers for substitution matrices that are provided by
the AAINDEX database.
Title: ABITrace
An Abstract Class to generalize the visualization of AFP and
MultipleAlignment structure alignments in Jmol.
A utility class to ease the problem of implementing an Annotation to that of
providing an appropriate implementation of Map.
Convert a BioJava object to a CifFile.
Wrapped atoms represent individual atoms enriched with model- and chain-level information.
The details of a Compound
A feature is currently any descriptive item that can be associated with one or more sequence positions.
A feature has a type and a source, which is currently a string to allow flexibility for the user.
Ideally, well-defined features should have a class to describe the attributes of that feature.
Base abstraction of a location which encodes for the majority of important
features about a location such as the start, end and strand
Implements common code for an Aligner which builds a score matrix during computation.
Implements common code for algorithms which compute a score.
Abstract implementation of the ScoresCache with the shared code used
in all objects with a variables cache.
The base class for DNA, RNA and Protein sequences.
Abstract implementation of term
This provides basic change-forwarding functionality from
the annotation and ontology properties.
Base class for a new structure alignment CLI.
A location which is bound to an AccessionID.
Indicates an entity is accessioned
Used in Sequences as the unique identifier.
A class to represent a FATCAT aligned fragment pair (AFP)
A class that performs calculations on AFPChains
A bean to contain the core of a structure alignment.
A class to chain AFPs to an alignment
A class to convert the data in an AfpChain object to various String outputs.
Demo displaying a structural alignment from a FASTA file using FastaAFPChainConverter.
Does post-processing after alignment chaining.
Defines a data structure for a Sequence within an alignment.
Defines an alignment step in order to pass alignment information from an Aligner to a constructor.
Static utility to construct alignment routines from a common library of methods.
Compounds in query and target sequences that must align
Defines a 'cut' row for divide-and-conquer alignment in which a new anchor is found.
Define a traceback pointer for the three edit operations: substitution (match/replacement of a query compound
with a target compound), deletion (removal of a query compound leaving a gap in the target sequence), and
insertion (addition of a target compound opening a gap in the query sequence).
Alignment subproblem.
A class that obtains two structures via DAS and aligns them
This is done in a separate thread.
A class that obtains two structures via DAS and aligns them
This is done in a separate thread.
A JFrame that allows triggering a pairwise structure alignment,
either from files in a directory,
or after manual upload.
Get an instance of the two alignment GUIs.
A class to listen to progress of the structure alignment calculations
A class to track the alignment results in a flat file
Static utility to easily run alignment routines.
List of implemented routines for scoring a sequence pair within a profile.
List of implemented pairwise sequence alignment routines.
List of implemented pairwise sequence scoring routines.
List of implemented profile-profile alignment routines.
List of implemented profile refinement routines.
Methods for analyzing and manipulating AFPChains and for
other pairwise alignment utilities.
A Map<K,V> can be viewed as a function from K to V.
Low level helper methods for CE and FATCAT algorithms.
A JPanel that can display an AFPChain in a nice way and interact with Jmol.
A ChemComp provider that downloads and caches the components.cif file from the wwPDB site.
Thrown to indicate that a term or triple can't be added to an ontology
because it is already present.
A comparator to sort AlternativeAlignments based on their number of equivalent residues
and RMSD.
Implements a class which handles one possible (alternative) solution.
A frame showing the alternative alignments, which are the result of a structure superimposition
Ambiguity set for hybrid DNA/RNA sequences.
A Group that represents an AminoAcid.
Used to describe an Amino Acid.
Set of proteinogenic amino acids.
AminoAcid inherits most from Hetatom.
This class provides the protein properties at the level of individual amino acids.
Guesses an order of rotational symmetry from the angle.
Indicates that an object has an associated annotation.
Arbitrary annotation associated with one or more objects.
Hello world!
Hello world!
Stores a Sequence as a collection of compounds in an ArrayList
Class to calculate Accessible Surface Areas based on
the rolling ball algorithm by Shrake and Rupley.
An unchecked exception representing an Assertion failure.
Provides programmatic access to ASTRAL representative sets.
An ASTRAL sequence-identity cutoff with an identifier such as:
A simple interface for an Atom.
A utility class that provides easy access to Structure objects.
A pair of atoms that are in contact
A set of atom-atom contacts to hold the results of intra and inter-chain contact calculations
Implementation of an Atom of a PDB file.
This class uniquely describes an atom
An iterator over all atoms of a structure / group.
A map from ResidueNumbers to ATOM record positions in a PDB file.
Used as a Predicate to indicate whether a particular Atom should be mapped.
Created by douglas on 1/23/15.
Describes author attributes for author information in a PDB file.
A class that provides auto-completion suggestions for JAutoSuggest
This module calculates the el Hassan-Calladine Base Pairing and Base-pair Step Parameters for any nucleic
acid containing structure that has the information about the core base-pair rings.
Bare bones version of the Sequence object to be used sparingly.
Parse binary Cif files and provide capabilities to store them locally.
SCOPe:
The Structural Classification of Proteins (extended) at Berkeley Lab and UC Berkeley
(http://scop.berkeley.edu/).
Container that represents a beta Bridge between two residues.
Representation of a Biological Assembly annotation as provided by the PDB.
Created by andreas on 8/9/14.
A class that provides a simple GUI for Jmol
Wrapper for the BioJava Structure Alignment Implementation
Reconstructs the quaternary structure of a protein from an asymmetric unit
The transformation needed for generation of biological assemblies
from the contents of a PDB/mmCIF file.
An implementation of the popular bit encodings.
The logic of working with a bit has been separated out into this class
to help developers create the bit data structures without having to
put the code into an intermediate format and to also use the format
without the need to copy this code.
Alignment request parameters accepted by QBlast service.
Not all are mandatory.
Designed by Paolo Pavan.
Information about QBlast search job
Enum representing matrices supported by QBlast
Enum representing available output alignment types.
Enum representing available output formats.
Output parameters accepted by QBlast service.
Enum representing available blast programs.
This class models a Blast/Blast plus result.
Designed by Paolo Pavan.
Re-designed by Paolo Pavan on the footprint of:
org.biojava.nbio.genome.query.BlastXMLQuery by Scooter Willis
You may want to find my contacts on Github and LinkedIn for code info
or discuss major changes.
A Block is a Data Structure that stores aligned positions of a MultipleAlignment
with the condition that residues are in a sequential order.
General implementation of a Block that supports any type of
sequential alignment with gaps.
A BlockSet is a Data Structure to store a flexible alignment part of a
multiple alignment.
A general implementation of a BlockSet to store a flexible part of a multiple
alignment.
A simple bond -- it stores information about two atoms as well as information
about its bond order.
A simple bond -- it stores information about two atoms as well as information
about its bond order.
Adds polymer bonds for peptides and nucleotides based on distance cutoffs and
intra-group (residue) bonds based on data from the Chemical Component Dictionary
to the Structure object.
Work in progress - NOT final!
A bounding box for short cutting some geometrical calculations.
An enum to represent the 7 Bravais lattices
A bridge is formed by two non-overlapping stretches of three residues each
(i-1,i,i+1) and (j-1,j,j+1), where i<j.
Need to keep track of actual bytes read and take advantage of buffered reader
performance.
Converts full atom representations to Calpha only ones.
Utility operations on Atoms, AminoAcids, Matrices, Point3d, etc.
Utility operations on Point3d.
Implements a concurrency wrapper for a PairwiseSequenceAligner.
Implements a concurrency wrapper for a PairwiseSequenceScorer.
Implements a concurrency wrapper for a ProfileProfileAligner.
Simple Callable Class that calculates a pairwise alignment in a different
thread, so that multiple pairwise alignments can be run in parallel
(examples: all-to-all alignments, DB search alignments).
A cartesian product between two lists A and B is the set of all ordered pairs
of the elements of both sets.
Set of proteinogenic amino acids.
Attempts to wrap compounds so it is possible to view them
in a case insensitive manner
A sequence creator which preserves the case of its input string in
the user collection of the returned ProteinSequence.
The categories found within CATH.
General API for interacting with CATH.
A class which represents a single CATH domain.
Controls global CathDatabases being used.
Represents a node in the CATH hierarchy.
Represents an exon or coding sequence in a gene.
This is based on the original Combinatorial Extension (CE) source code from 2003 or 2004 (CE version 2.3),
as has been originally developed by I.
This is based on the original Combinatorial Extension (CE) source code from 2003 or 2004 (CE version 2.3),
as has been originally developed by I.
A wrapper for CeMain which sets default parameters to be appropriate for finding
circular permutations.
Tiny wrapper for the disallowed regions of an alignment.
Provides parameters to CeCPMain.
The main class of the Java implementation of the Combinatorial Extension Algorithm (CE),
as has been originally developed by I.
Contains the parameters that can be sent to CE
Identify the symmetries in a structure by running an alignment of the
structure against itself disabling the diagonal of the identity alignment.
Iterative version of CeSymm that aims at identifying all symmetry axes of a
structure.
Provides parameters to CeSymm.
The internal symmetry detection can be divided into two types: CLOSE:
includes the circular and dihedral symmetries, and OPEN: includes the
helical and protein repeats symmetries.
This Class stores all the relevant information of an internal symmetry result
obtained with CeSymm.
Process the arguments from command line
Defines the interface for a Chain.
A Chain in a PDB file.
A class to add appropriate charge information to a structure.
Properties of a chemical component.
Properties of an atom of a chemical component.
Properties of a bond in a chemical component.
Create the ChemicalComponentDictionary from CIF data.
Consumes a CCD file to create the ChemicalComponentDictionary.
Convert CifFiles to chem comps.
Properties of the chemical component descriptor.
Interface that is implemented by all classes that can provide ChemComp definitions.
A representation of the Chemical Component Dictionary.
Cholesky Decomposition.
Ask the user to provide a directory containing PDB files.
A class that can map chromosomal positions to mRNA (coding sequence) positions.
A ChromosomeSequence is a DNASequence but keeps track of geneSequences
Created by ap3 on 27/10/2014.
Flag for BioJava beans that resemble categories defined by the mmCIF schema.
Convert a chain to a CifFile.
Defines a rather generic interface which allows populating some data structure with data parsed from a CIF file.
Parse text Cif files and provide capabilities to store them locally.
Create a CifFile instance for a given container of structure data.
Defines the categories to consume during CIF parsing.
An implementation of a CifFileConsumer for BioJava.
Convert BioJava structures to CifFiles and vice versa.
Convert a structure to a CifFile.
This object represents a classpath resource on the local system.
Utilities for autoconfiguring javabeans based on command line arguments.
A class that clusters alternative alignments according to their
similarity.
Define a codon
Utilities for working with collections.
http://www.javamex.com/tutorials/memory/ascii_charsequence.shtml
This class provides static methods for the calculation of the percentage of
identity between two aligned sequences.
For a given sequence this class will create a view over the top of it
and for every request the code will return the complement of the underlying
base e.g. base A will become base T
Contains information about a certain Component.
Static utility to easily share a thread pool for concurrent/parallel/lazy execution.
XML content handler for serialisation of RegistryConfiguration class
This class is used to support the implementation of properties stated in IPeptideProperties.
A simple class to store contacts in the form of pairs of indices and a distance associated to them.
Provides a mapping between real numbers and Colors.
Maps colors by performing a transform of the input data and then passing
the transformed value to a ContinuousColorMapper for rendering.
A class that manages the conversion of sequence coordinate system to
JPanel drawing coordinates
Superimposes the core aligned residues of every structure in a MultipleAlignment
onto a reference structure.
The CoxHelper class is provided to start with a tab delimited file in a similar process in R and return the results as a CoxInfo class.
Holds the results of a cox analysis where calling dump(), toString() will give an output similar to R
This is a port of the R survival code used for doing Cox Regression.
Utility class that calculates a CRC64 checksum on a stream of bytes.
A class containing methods to find interfaces in a given crystallographic Structure by
reconstructing the crystal lattice through application of symmetry operators
A crystal cell's parameters.
Representation of a transformation in a crystal:
- a transformation id (each of the transformations in a space group, 0 to m)
- a crystal translation
The transformation matrix in crystal basis is stored, representing the basic
transformation together with the crystal translation.
Parses the cytoband (karyotype) file from UCSC.
Represents revision records for use by PDBHeader.
If a SequenceProxyReader implements this interface then that external source
has a list of cross reference id(s)
PhosphoSitePlus® is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License and is freely available for non-commercial purposes from
http://www.phosphosite.org/staticDownloads.do
Please acknowledge "PhosphoSitePlus®, www.phosphosite.org" at appropriate locations.
GenBank                           gi|gi-number|gb|accession|locus
ENA Data Library                  gi|gi-number|emb|accession|locus
DDBJ, DNA Database of Japan       gi|gi-number|dbj|accession|locus
NBRF PIR                          pir||entry
Protein Research Foundation       prf||name
SWISS-PROT UNIPROT                sp|accession|name
Brookhaven Protein Data Bank (1)  pdb|entry|chain
Brookhaven Protein Data Bank (2)  entry:chain|PDBID|CHAIN|SEQUENCE
Patents                           pat|country|number
GenInfo Backbone Id               bbs|number
General database identifier       gnl|database|identifier
NCBI Reference Sequence           ref|accession|locus
Local Sequence identifier         lcl|identifier
A class to represent database cross references.
If you have a uniprot ID then it is possible to get a collection
of other id(s) that the protein is known by.
The default provider for AAINDEX loads substitution matrices from the AAINDEX file in the resources directory
Color Mapper which mimics the default coloring of JMatrixPanel pixels.
Default implementation of OntologyOps.
Demo of how to use the FastaStructureParser class to read protein
structures from a FASTA file.
Example of how to load PDB files using the AtomCache class.
A demo for how to use the Berkeley version of SCOP instead of the default UK-SCOP
An example for how to access CATH data.
Example of how to run a structure alignment using the CE algorithm.
Quick demo of how to call CE-Symm programmatically.
This demo shows how to use an alternative ChemCompProvider.
This demo contains the CookBook example to create a phylogenetic tree from a
multiple sequence alignment (MSA).
Demonstration of how to load a Structure with the SS information, either from
the PDB file annotation (Author's assignment) or from the DSSP file in the
PDB servers (DSSP assignment).
Example for how to load protein structures (from PDB files).
An example of how to read mmCIF files
An example of how to convert mmCIF file to PDB file
Demo for running the MultipleMC Algorithm on a protein family.
Demo on how to use QsAlign programmatically for the alignment of
quaternary structures.
This demo shows how to display the QuatSymmetryResults of a structure.
A demo for how to use RotationAxis to display the rotation for an alignment.
A class demonstrating the use of the SCOP parsing tools
Demonstration on how to use the Secondary Structure Prediction (DSSP)
implementation in BioJava and obtain different SS representations and
outputs.
Created by andreas on 8/10/15.
Demo of how to use the FastaStructureParser class to read protein
structures from a FASTA file.
A demo on how to use the quaternary symmetry detection algorithms.
A utility class for visualization of structure alignments
The DistanceMatrixCalculator methods generate a DistanceMatrix from a
MultipleSequenceAlignment or other indirect distance information (RMSD).
Check the accuracy of a Distance Tree by least squares error (LSE) of the
Tree branch lengths and the original Distance Matrix.
This class should model the attributes associated with a DNA sequence
The type of DNA sequence
A helper class that allows different ways to read a string and create a DNA sequence.
Performs the first stage of transcription by going from DNA to RNA.
Decomposes a structure from the PDB into representative domains
Displays the dot plot trace for an alignment.
This provider of chemical components can download and cache chemical component definition files from the RCSB PDB web
site.
Class to parse a DSSP file (output of the DSSP program),
that contains the secondary structure assignment of a structure.
General API for interacting with ECOD.
An EcodDomain contains all the information of the ECOD database: id,
classification groups (from higher to lower in the tree: X,H,T,F), PDB code,
chain, residue ranges and status (manual or automatic classification).
Controls global EcodDatabases being used.
Provides access to the Evolutionary Classification of Protein Domains (ECOD).
Interface for carrying out edit operations on a Sequence.
Abstract class which defines all edit operations as a call to discover
what 5' and 3' ends of an editing Sequence should be joined together
with a target Sequence.
Implementation which allows for the deletion of bases from a Sequence
Edit implementation which allows us to insert a base at any position
in a Sequence.
Allows for the substitution of bases into an existing Sequence.
Eigenvalues and eigenvectors of a real matrix.
One way to model the elements
Element is an enumeration of the elements of the periodic table.
ElementType is an enumeration of the types of elements found in the periodic table.
This class contains the processed data of embl file
Primary accession number
Sequence version number
Topology: 'circular' or 'linear'
Molecule type
Data class
Taxonomic division
Sequence length
This class should process the data of embl file
this class contains the parsed data of embl file
This class contains the processed data of embl file that
contains the referenceNumber, referenceComment, referencePosition
referenceCrossReference, referenceGroup, referenceAuthor
referenceTitle, referenceLocation
Heuristical finding of Entities (called Compounds in legacy PDB format)
in a given Structure.
An object to contain the info from the PDB header for a Molecule.
A set of helper methods which return true if the two parameters are
equal to each other.
Sort Exon where it is a little confusing if exons should always be ordered left to right
where a negative stranded gene should go the other direction.
A gene contains a collection of Exon sequences
An enum to represent the experimental technique of a PDB structure
A collection of static utilities to convert between AFPChains and FastaSequences.
A Gene sequence has a Positive or Negative Strand where we want to write out to a stream the 5' to 3' version.
Use FastaReaderHelper as an example of how to use this class where FastaReaderHelper should be the
primary class used to read Fasta files
A FASTA formatted sequence.
Used to parse a stream of a fasta file to get the sequence
Read from a FASTA file (or gzipped FASTA file) and create a Java stream of ProteinSequence objects
for use in a functional programming paradigm.
Reads a protein sequence from a fasta file and attempts to match it to a
3D structure.
The FastaWriter writes a collection of sequences to an outputStream.
The class that should be used to write out fasta file of a sequence collection
FASTQ formatted sequence.
Fluent builder API for creating FASTQ formatted sequences.
Reader for FASTQ formatted sequences.
Utility methods for FASTQ formatted sequences.
FASTQ sequence format variant.
Writer for FASTQ formatted sequences.
A class that does calculations on an AFPChain
A Feature corresponds to a single row in a GFF file.
It is DBReferenceInfo which implements FeatureInterface.
A feature on a sequence (for example, an exon or a gene), defined by a location
and a set of attributes encoded as key/value pairs.
Interface class to handle describing arbitrary features.
A list of FeatureI objects implemented using a Java ArrayList; corresponds to a GFF file.
If a SequenceProxyReader implements this interface then that external source
has a list of features
Models the keywords that are annotated for a protein sequence at Uniprot.
Methods to convert a structure object into different file formats.
A class that configures parameters that can be sent to the PDB file parsers
FileParsingParameters.setParseCAOnly(boolean) - parse only the Atom records for C-alpha atoms
FileParsingParameters.setParseSecStruc(boolean) - a flag if the secondary structure information from the PDB file (author's assignment) should be parsed.
This class is a good example of using the SequenceCreatorInterface where during parsing of the stream
the sequence and the offset index are passed to create a Protein sequence that will be loaded in lazily.
This class is a good example of using the SequenceCreatorInterface where during parsing of the stream
the sequence and the offset index are passed to create a Protein sequence that will be loaded in lazily.
This class is a good example of using the SequenceCreatorInterface where during parsing of the stream
the sequence and the offset index are passed to create a Protein sequence that will be loaded in lazily.
Provides a cache for storing multiple small files in memory.
This class contains wrapper methods for communication between BioJava and
forester (e.g., Data Structure conversion).
Four bit encoding of the bit formats.
A four bit per compound implementation of the bit array worker code.
Implements an algorithm which computes a score for a sequence alignment pair picked from an alignment Profile.
Implements an algorithm which computes a score for a sequence alignment pair.
Implements an algorithm which computes a score for a sequence alignment pair picked from an alignment Profile.
Implements an algorithm which computes a score for a sequence alignment pair.
Joins the initial Fragments together to larger Fragments
A pair of fragments of two protein structures
Indicates a way of translating a sequence.
Implementation for resolving fuzzy locations.
Defines a data structure for the gap penalties used during a sequence alignment routine.
Defines the possible types of gap penalties.
Use GenbankReaderHelper as an example of how to use this class where GenbankReaderHelper
should be the primary class used to read Genbank files.
For Genbank format file only.
The class that should be used to write out genbank file of a sequence
collection
A parser that parses a file from the UCSC genome browser that contains mapping of gene name to chromosome positions
http://www.bioperl.org/wiki/GTF
Read and write FeatureLists as GFF/GTF formatted files.
http://www.bioperl.org/wiki/GTF
Read and write FeatureLists as GFF/GTF formatted files.
A simple bean that contains gene name information as available from www.genenames.org
Parses a file from the www.genenames.org website that contains a mapping of human gene names to other databases
We store the original header if the sequence is parsed from a fasta file and will use that exact
sequence if we write out the sequences to a fasta file.
The default fasta header parser where some headers are well defined based on the source
database which allows us to set the source of the protein sequence and the identifier
that can be used in future implementations to load features from external sources
If the user has a custom header with local data then they can create their own implementation
of a FastaHeaderParserInterface
http://www.bioperl.org/wiki/GTF
Read and write FeatureLists as GFF/GTF formatted files.
Simple parser for the Gene Ontology (GO) flatfile format.
A class to perform the Gotoh algorithm
Maps a set of real values onto a gradient.
The GraphOrderDetector transforms the self-alignment into a Graph and
extracts its maximally connected Components.
The GraphRefiner transforms the self-alignment into a Graph and extracts its
maximally connected Components.
A grid to be used for calculating atom contacts through a spatial hashing algorithm.
A grid cell to be used in contact calculation via spatial hashing algorithm.
This is the data structure for a single Group of atoms.
A class to store the results of ASA calculations; it can
hold ASA values per atom present in Group
A pair of residues that are in contact
A set of residue-residue contacts.
An iterator over all groups of a structure.
This contains basic categories for Group types.
Implements a data structure for a guide tree used during progressive multiple sequence alignment.
Defines a data structure for the node in a guide tree used during progressive multiple sequence alignment.
A class to wrap some of the structure.gui classes using Reflection
Contains helper methods for generating a HashCode without having to resort to
the commons lang hashcode builders.
Container that represents a hydrogen bond.
A dummy class that does nothing.
Generic Implementation of a Group interface.
Behaviors for how to balance memory vs. performance.
Defines a clustering algorithm that converts a distance matrix into a tree.
This class models a search Hit.
The cookbook recipe for how to request Pfam annotations for a protein sequence using the Hmmer3 service
Provides the details of a domain hit
The results of a Hmmer search for a single sequence
Interface for performing Hmmscans on sequences.
This class models a search Hsp.
Represents an even coverage of quaternion space by 60 points.
Reader for FastqVariant.FASTQ_ILLUMINA formatted sequences.
Writer for FastqVariant.FASTQ_ILLUMINA formatted sequences.
A class that provides an InputStream from a File.
A collection of locations which are used whenever we work with INSDC; some
of which could be deprecated (from INSDC's point of view) yet appear
in records.
Used to represent bond locations equivalent to bond(7,8) or bond(7).
Deprecated in INSDC yet still appears; equivalent to the order()
directive except no 5' to 3' ordering is defined.
Deprecated in INSDC; refers to a set of locations of which one
location could be valid e.g. one-of(location, location, location).
Used to describe a 5' to 3' ordering but no firm assurance it is correct
Parser for working with INSDC style locations.
A class containing methods to find interfaces in a given structure.
Thrown to indicate that an ontology term is not acceptable or
appropriate in a given context
Closure interface used when working with IOUtils.processReader(BufferedReader, ReaderProcessor).
An interface to generate some basic physico-chemical properties of protein sequences.
The following properties could be generated:
Enumeration of the seven different attributes
Enumeration of the distribution for the first, first 25%, first 50%, first 75% and 100% of the grouping
Enumeration of the three different groupings for each attributes
Enumeration of the transition between groupA and groupB
Available translations
1 - UNIVERSAL
2 - VERTEBRATE_MITOCHONDRIAL
3 - YEAST_MITOCHONDRIAL
4 - MOLD_MITOCHONDRIAL
5 - INVERTEBRATE_MITOCHONDRIAL
6 - CILIATE_NUCLEAR
9 - ECHINODERM_MITOCHONDRIAL
10 - EUPLOTID_NUCLEAR
11 - BACTERIAL
12 - ALTERNATIVE_YEAST_NUCLEAR
13 - ASCIDIAN_MITOCHONDRIAL
14 - FLATWORM_MITOCHONDRIAL
15 - BLEPHARISMA_MACRONUCLEAR
16 - CHLOROPHYCEAN_MITOCHONDRIAL
21 - TREMATODE_MITOCHONDRIAL
23 - SCENEDESMUS_MITOCHONDRIAL
Taken from NCBI with slight modification and put into the classpath resource.
Holds the concept of a codon table from the IUPAC format
A JTextField that can make suggestions for auto-complete.
a JPanel that can display a difference of distance matrix and paths that have been
taken for the alignment
This reader actually proxies onto multiple types of sequence in order
to allow a number of sequence objects to act as if they are one sequence.
A utility class that defines which set of atoms are considered
to be on equivalent positions.
PDB-specific
This class gives public API to RONN functions.
Holder for the ranges, contain pointers to starting and ending position
on the sequence which comprises a disordered region.
Subclass of WeakReference which includes an extra field (the key)
which can be used to help cleanup once this reference has been
enqueued.
A Ladder is a set of one or more consecutive bridges of identical type.
Defines a minimal data structure for reading and writing a sequence alignment.
List of output formats.
Superclass for classes which download and interact with the PDB's FTP server,
specifically PDBFileReader and CifFileReader.
Controls when the class should fetch files from the ftp server
Behaviors for when an obsolete structure is requested.
Protein Domain Parser is an algorithm that attempts to assign domains to 3D protein structures.
Classes which implement ScopDatabase in a way which allows them to serve queries
without accessing the internet should implement this interface instead.
Sets of integers used to represent the location of features on sequence.
A location on a sequence.
Helper methods for use with the Location classes.
Helper methods for use with the Location classes.
Move a sliding window over a Location.
Performs a log10 transform on input before passing the values off to another
colorMapper.
LU Decomposition.
Transforms Map to String.
Matrices contains static methods to operate and transform matrices used in 3D
geometry (transformation matrices and rotation matrices).
Jama = Java Matrix class.
Compilation: javac Matrix.java Execution: java Matrix
A bare-bones collection of static methods for manipulating matrices.
Defines an Aligner which builds a score matrix during computation.
Tracks Memory allocated and used, displayed in graph form.
Create the menus for structure alignment GUI windows (JFrames).
Create the menu for BiojavaJmol
Creates a frame to display a DotPlotPanel.
Consume metal bond data.
Created by andreas on 6/9/16.
Created by andreas on 6/6/16.
A bean that contains cutoffs for correctly detecting metal bonds.
This class allows for finding inter-strand base pairs that are not necessarily canonical Watson-Crick pairs.
A class of functions for reading and writing Biojava structures using MMTF
A class to read MMTF files and cache them locally.
A biojava specific structure inflator for MMTF.
Class to take Biojava structure data and convert it to the DataApi for encoding.
Class to store the summary data for a given structure.
A utils class of functions needed for Biojava to read and write to mmtf.
An internal utility class for StructureImpl to make it easier to manage poly and nonpoly chains.
Class that loads data from the model files into ModelLoader.Model objects.
Represents a RONN model.
Represents a Threshold
Defines modification categories.
Conditions of a protein modification, e.g. components and atoms.
Root interface for all modifications in structure.
The moment of inertia, otherwise known as the angular mass or rotational
inertia, of a rigid body determines the torque needed for a desired angular
acceleration about a rotational axis.
A MultipleAlignment is a Data Structure to store the core information of a
multiple structure alignment, as a return type.
A class that obtains structures via DAS and aligns them.
Generalization of the Coordinate Manager to include an arbitrary number of
sequences (lines) for MultipleAlignment visualization.
Utility functions to generalize the visualization of MultipleAlignments in
molecular viewers.
A MultipleAlignmentEnsemble is a collection of MultipleAlignments
that share the same structures (Atoms) and creation properties (algorithm,
version, creation time, etc.).
A general implementation of a MultipleAlignmentEnsemble.
A JFrame for triggering a multiple structure alignment,
either from files in a directory or after manual upload.
A general implementation of a MultipleAlignment.
A class that provides a 3D visualization Frame in Jmol for MultipleAlignments.
Utility Class that provides helper methods for the visualization of MultipleAlignments.
Utility class for calculating common scores of MultipleAlignments.
Utility functions for working with MultipleAlignment.
This class contains functions for the conversion of MultipleAlignment
to various String outputs.
Helper methods to convert all the hierarchy levels of a MultipleAlignment
into an XML format.
Parse an XML file representing a MultipleAlignmentEnsemble, so
that the original alignment can be recovered.
A JPanel that can display the sequence alignment of a MultipleAlignment
in a nice way and interact with Jmol by
selecting the aligned atoms of the sequence selection.
Mouse Motion Listener for the MultipleAligPanel,
which provides methods to obtain positions of the mouse
and connect them to the sequence alignment positions using
the information in MultipleAlignmentCoordManager.
Main class of the Java implementation of the Combinatorial Extension -
Monte Carlo (CEMC) Algorithm,
as it was originally described by C.Guda, E.D.Scheeff, P.E.
This class takes a MultipleAlignment seed previously generated and runs a
Monte Carlo optimization in order to improve the overall score and highlight
common structural motifs.
Contains the parameters to be sent to the MC optimization.
Implements a minimal data structure for reading and writing a sequence alignment.
This class provides information of the selected positions in the MultipleAligPanel.
Interface for the Multiple Structure Alignment Algorithms.
Interface for Multiple Alignment superposition algorithms.
Defines a mutable (editable) data structure for an AlignedSequence.
Defines a mutable (editable) data structure for a Profile.
Defines a mutable (editable) data structure for a ProfilePair.
Defines a mutable (editable) data structure for the results of pairwise sequence alignment.
A class that can change one amino acid to another.
Loads an alignment in an XML format and displays its content in a
new Jmol panel.
Shows the interatomic Distance Matrices of all the Structures aligned in different Frames.
Save an alignment to a specified File by the user.
This class wraps a QBlast search request parameter Map
by adding several convenient parameter addition methods.
This class wraps a QBlast output parameter Map
by adding several convenient parameter addition methods.
Provides a simple way of submitting BLAST request to the QBlast service at NCBI.
A simple demo showing NCBIQBlastService usage.
A nucleotide group is almost the same as a Hetatm group.
A stream that voids (discards) its input.
An interface for events that occur during parsing of .obo files
A file handler for .obo files
A class to parse the content of an OBO file.
Parses an OBO file.
An ontology.
A basic in-memory implementation of an ontology
Thrown to indicate an error in an Ontology object
A factory for Ontology instances.
This is an interface for optimizing ontology operators.
A term in an ontology which identifies another ontology.
Simple in-memory implementation of a remote ontology term.
Tools for manipulating ontologies.
A class to resolve the operators for transformations
A wrapper for CeMain which sets default parameters to be appropriate for finding
circular permutations.
Contains the parameters that can be sent to CE
A method to decide the order of symmetry (number of subunits)
given a structure self-alignment, calculated by CE-Symm.
An ordered pair represents a component of a cartesian product.
Fully re-factored and enhanced version of RONN.
Fully re-factored version of RONN model.
A Pair of objects.
Defines an algorithm which computes a score for a sequence alignment pair picked from an alignment Profile.
Defines an algorithm which computes a score for a pair of sequences.
UI for ConfigStrucAligParams, for the AlignmentGUI.
Created by andreas on 6/17/15.
Low-level event based parser callback.
General abstraction of different parsing errors
Defines an algorithm which computes a new alignment profile by splitting a current alignment and realigning.
A stub StructureIdentifier, representing the full structure in all cases.
Parses REMARK 350 records in a PDB file and creates transformations to
construct the quaternary structure of a protein from an asymmetric unit
A class to hold crystallographic information about a PDB structure.
A class to define where a structure for the alignment is coming from
This class implements the actual PDB file parsing.
The wrapper class for parsing a PDB file.
A class that contains PDB Header information.
A wrapper class for the PDB identifier.
A pair for structure alignment.
An exception during the parsing of a PDB file.
An interface implemented by all classes that represent PDB records.
A class to define where a structure for the alignment is coming from
Methods for getting the status of a PDB file (current, removed, unreleased)
and for accessing different versions of the structure.
Represents a simplified 3 state status of PDB IDs.
Internal use only.
Temporary data storage for LINK records.
A JPanel to upload 2 custom PDB files.
This is an adaptor class which eases the generation of protein properties.
Enumeration of the 20 standard amino acid codes.
This class contains the actual implementation of IPeptideProperties and is wrapped by PeptideProperties for ease of use.
The plain fasta header takes everything in the header as a single entity.
Holds a single point part of a location
Used to resolve a position about a point
Enumerates the classification of polymers.
In mathematics, the power set (or powerset) of any set S, written P(S), is
the set of all subsets of S, including the empty set and S itself.
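As a minimal sketch of the definition above, and not the library's own implementation, the subsets of a small list can be enumerated with a bitmask. The class name PowerSetSketch and the method powerSet are hypothetical names chosen for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PowerSetSketch {

    // Hypothetical helper: returns all 2^n subsets of the input list,
    // including the empty set and the full set.
    // Assumes fewer than 31 elements so the bitmask fits in an int.
    public static <T> List<List<T>> powerSet(List<T> items) {
        int n = items.size();
        List<List<T>> subsets = new ArrayList<>();
        for (int mask = 0; mask < (1 << n); mask++) {
            List<T> subset = new ArrayList<>();
            for (int i = 0; i < n; i++) {
                // Bit i of the mask decides whether element i is in this subset.
                if ((mask & (1 << i)) != 0) {
                    subset.add(items.get(i));
                }
            }
            subsets.add(subset);
        }
        return subsets;
    }

    public static void main(String[] args) {
        // A set of 3 elements has 2^3 = 8 subsets.
        System.out.println(powerSet(Arrays.asList("a", "b", "c")).size());
    }
}
```

The bitmask approach keeps the sketch short but grows exponentially, so it only suits small input sets.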
Implementation of XMLWriter which emits nicely formatted documents
to a PrintWriter.
This is an adaptor class which eases the generation of profeat properties.
Defines a data structure for the results of sequence alignment.
List of output formats.
Defines a data structure for the results of the alignment of a pair of Profiles.
Defines an Aligner for a pair of profiles.
Defines an algorithm which computes a score for a pairing of alignment profiles.
Defines a data structure for a view of sequence alignment.
This interface defines information about a specific protein
modification.
Identify attachment modification in a 3-D structure.
This class contains information about a specific protein
modification.
Uses Builder pattern to build a ProteinModification.
This class serves as an instance registry by maintaining
a pool of ProteinModification instances.
The representation of a ProteinSequence
Used to create a ProteinSequence from a String to allow for details
about the location of the sequence etc.
QR Decomposition.
Quaternary Structure Alignment (QS-Align).
The parameter bean for the QsAlign algorithm.
Result of a Quaternary Structure Alignment QsAlign.
The Quaternary Structure Relation describes the pairwise relation between two
quaternary structures.
DNA Sequences produced by modern sequencers usually have quality information
attached to them.
It is common to have a numerical value or values associated with a feature.
Detects the symmetry (global, pseudo, internal and local) of protein
structures.
The QuatSymmetryParameters specify the options used for the detection of
quaternary symmetry in structures using the QuatSymmetryDetector.
Holds the results of quaternary symmetry perception obtained with QuatSymmetryDetector.
A bean to represent information about the set of Subunits being
considered for symmetry detection.
A utility class that listens to RasMol script commands in the BiojavaJmol class.
Unlike the DownloadChemCompProvider, this ChemCompProvider
does not download any chem comp definitions.
Superimposes each structure in a MultipleAlignment onto a reference structure.
Refinement of the self-alignment failed.
Makes remote calls to the HMMER web service at the EBI web site and returns Pfam domain annotations for an input protein sequence.
RemotePairwiseAlignmentOutputProperties: the simplest representation of an object capable of holding
output formatting information to be fed to a RemotePairwiseAlignmentService-implemented object.
RemotePairwiseAlignmentProperties is an interface that contains the barest of
methods for setting and getting Alignment properties.
This interface specifies minimal information needed to execute a pairwise alignment on a remote service.
A term in another ontology.
Simple in-memory implementation of a remote ontology term.
Defines an algorithm which computes a new alignment profile by rescoring all pairs in an alignment and
realigning.
A ResidueGroup is a set of residues that are part of a maximally connected
component of the self-Alignment Graph in symmetry analysis.
Everything that is needed to uniquely describe a residue position
A chainName, a start residue, and an end residue.
A chain, a start residue, and an end residue.
Enumerates the possible classifications of residues.
A class that manages the Strings that are defined in the spice.properties file.
This class models a search result.
Designed by Paolo Pavan.
For a given sequence this class will return the base at the reversed
position i.e. in a sequence of size 10, if you request base 2 you will get
back the base at position 9.
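The index arithmetic in the description above can be sketched as follows, assuming 1-based biological indexing as in the example given. The class ReverseIndexSketch and its methods are hypothetical illustrations, not the library's API.

```java
public class ReverseIndexSketch {

    // Maps a 1-based position in the reversed view to the
    // corresponding 1-based position in the original sequence.
    public static int reversedIndex(int length, int index) {
        if (index < 1 || index > length) {
            throw new IndexOutOfBoundsException("index " + index + " outside 1.." + length);
        }
        return length - index + 1;
    }

    // Returns the base seen at the given 1-based position of the reversed view.
    public static char baseAt(String sequence, int index) {
        int original = reversedIndex(sequence.length(), index);
        return sequence.charAt(original - 1); // String.charAt is 0-based
    }

    public static void main(String[] args) {
        // For a sequence of size 10, requesting base 2 maps to position 9.
        System.out.println(reversedIndex(10, 2));
        System.out.println(baseAt("ACGTACGTAC", 2));
    }
}
```

Computing the mapping on demand, rather than copying the sequence in reverse, is what lets such a view stay lightweight for long sequences.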
RNASequence where RNACompoundSet are the allowed values
Used to create a RNA sequence
Attempts to do on the fly translation of RNA by not requesting the compounds
until asked.
Takes a Sequence of NucleotideCompound which should represent
an RNA sequence (RNASequence is good for this) and returns a list of
Sequence which hold AminoAcidCompound.
A collection of various constraint values used by RONN
Calculates the rotation axis for an alignment
Reader for FastqVariant.FASTQ_SANGER formatted sequences.
Writer for FastqVariant.FASTQ_SANGER formatted sequences.
A JPanel that can display the underlying distance matrix
data of the protein structure alignment algorithm.
The biojava-alignment module represents substitution matrices with short
values.
The various categories provided by SCOP.
General API how to interact with SCOP
Contains data from dir.des.scop.txt_1.75
Container for the information for a domain.
Controls the global ScopDatabase being used.
This class provides access to the SCOP protein structure classification.
Indicates that an I/O error occurred with SCOP lazy initialization.
Helper class to store paths to the four SCOP files
The string "%s" is replaced with the version number.
Defines an algorithm which computes a score.
Interface for classes which implement a temporary cache for various numeric
scores, e.g.
Designed by Paolo Pavan.
Calculate and assign the secondary structure (SS) to the
Groups of a Structure object.
A secondary structure element (SSE) is an object representing a block of
sequential residues that share the same secondary structure type.
Container for the secondary structure information of a single residue.
This class extends the basic container for secondary structure annotation,
including all the information used in the DSSP algorithm.
This class contains methods for obtaining and converting secondary structure
information from BioJava Structures.
This enum contains all of the secondary structure types found in the DSSP
output.
A Text Panel that allows the user to specify multiple structure
identifiers, space separated.
A Panel that allows user to specify PDB and chain ID, as well as sub-ranges
Created by andreas on 9/11/15.
Created by andreas on 9/11/15.
Aligns the SEQRES residues to the ATOM residues.
Main interface for defining a collection of Compounds and accessing them
using biological indexes
This is a common method that can be used across multiple storage/proxy implementations to
handle Negative strand and other interesting elements of sequence data.
Used to sort sequences in ascending order of bioBegin property.
A sequence display that can show the results of a protein structure alignment.
This class represents the storage container of a sequence stored in a fasta file where,
on the initial parsing of the file, we store the offset and length of the sequence.
Calls Spencer's method for determining order.
Creates a refined alignment with the CE-Symm alternative self-alignment.
A location in a sequence that keeps a reference to its parent sequence
Provides a set of static methods to be used as static imports when needed
across multiple Sequence implementations but inheritance gets in the way.
A basic sequence iterator which iterates over the given Sequence by
biological index.
A mouse listener for the AbstractChainRenderer class;
it listens to all mouse events and triggers appropriate
SequenceListener and FeatureListener events
A static class that provides optimization hints for memory or performance handling of sequence data.
Defines a data structure for the results of pairwise sequence alignment.
A class that draws a Sequence as a rectangle, a scale display over it.
Utility class for operations on sequences
A class that provides all that is necessary to create a Serializable Cache
Demo how to load and display a structure in Jmol
An entry in the chain-level SIFTS mapping between UniProt and the PDB.
A mapping between UniProt entries and PDB chains.
Implements a data structure for a Sequence within an alignment.
Implements a data structure for the gap penalties used during a sequence alignment routine.
Very basic implementation of the Location interface which defines a series
of simple constructors.
Basic implementation of the Point interface.
Implements a data structure for the results of sequence alignment.
Implements a data structure for the results of the alignment of a pair of Profiles.
Implements a data structure for the results of pairwise sequence alignment.
Implements a data structure which holds the score (penalty or bonus) given during alignment for the exchange of one
Compound in a sequence for another.
An implementation of the SequenceReader interface which for every
call will return only 1 compound (given to it during construction; a String
is also valid but will require a CompoundSet).
An implementation of a single linkage clusterer
See http://en.wikipedia.org/wiki/Single-linkage_clustering
Singular Value Decomposition.
Created by ap3 on 31/10/2014.
Holds the data of sites presented in PDB files.
Annotation that is optimized for memory usage.
Lightweight implementation of Map which uses little memory to store a
small number of mappings, at the expense of scalability.
Smith and Waterman defined an algorithm for pairwise local sequence alignments (best match of sections from each Sequence).
Provides a 3D superimposition of two structures based on their sequence
alignment.
An in memory cache using soft references.
Reader for FastqVariant.FASTQ_SOLEXA formatted sequences.
Writer for FastqVariant.FASTQ_SOLEXA formatted sequences.
A crystallographic space group.
A sparse, square matrix, implemented using two arrays of sparse
vectors, one representation for the rows and one for the columns.
A sparse vector, implemented using a symbol table.
Sample possible orientations.
Utility to write each Fasta entry to a unique file
Performs a sqrt transform on input before passing the values off to another
colorMapper.
A simple bean to store disulfide bridge information, the SSBOND records in the PDB files.
A class that provides a set of standard amino acids.
Used to map the start codon feature on a gene
A simple bean that contains the parameters that can get set at startup
Standard array IO.
Stores all the content parsed from the #=GF lines
Stockholm file parser.
For more information about the format, refer to ftp://ftp.sanger.ac.uk/pub/databases/Pfam/current_release/userman.txt, ftp://ftp.sanger.ac.uk/pub/databases/Rfam/CURRENT/USERMAN and http://sonnhammer.sbc.su.se/Stockholm.html.
For more information about the format, refer to ftp://ftp.sanger.ac.uk/pub/databases/Pfam/current_release/userman.txt, ftp://ftp.sanger.ac.uk/pub/databases/Rfam/CURRENT/USERMAN and http://sonnhammer.sbc.su.se/Stockholm.html.
Stores all the content of a Stockholm file.
A utility object that describes Stoichiometry (composition of a protein assembly),
determined via clustering procedure SubunitClusterer,
and implements human-readable representation using various strategies.
What to do when the number of SubunitCluster exceeds the length of the alphabet.
Used to map the stop codon sequence on a gene
Provides a way of representing the strand of a sequence, location
hit or feature.
Information needed to represent a survival curve
Event based parser callback.
A utility class for common String manipulation tasks.
An example of a ProxySequenceReader that is created from a String.
A class that contains all the parameters of the structure alignment algorithm.
Interface for a structure object.
A class that provides a simple GUI for Jmol
Everything that is needed to uniquely describe an atom.
An exception during the parsing of a PDB file.
An enum of supported file formats.
Information of a group (residue or ligand) involved in a modification.
An identifier that uniquely identifies a whole Structure or
arbitrary substructure.
Implementation of a PDB Structure.
An interface between 2 molecules (2 sets of atoms).
A list of interfaces between 2 molecules (2 sets of atoms)
A class that provides static access methods for easy lookup of protein structure related components
StructureIOFile extends StructureProvider with methods specific to
parsing files from the filesystem.
A utility class that makes working with names of structures, domains and ranges easier.
Perform a pairwise protein structure superimposition.
To be implemented by JPanels that are part of the GUI to trigger structure alignments.
A class that can provide a protein structure object from somewhere.
A utility class with methods for matching ProteinSequences with
Structures.
A class that provides some tool methods.
Defines a data structure which holds the score (penalty or bonus) given during alignment for the exchange of one
Compound in a sequence for another.
Static utility to access substitution matrices that come bundled with BioJava.
Scores using a substitution matrix.
This is the canonical way to identify a part of a structure.
A Subunit consists of a set of residues from a Structure, which may
correspond to an entire Chain, a Domain, or any subset or combination of
residues from them.
A SubunitCluster contains a set of equivalent QuatSymmetrySubunits,
the set of equivalent residues (EQR) between Subunit and a Subunit representative.
The SubunitClusterer takes as input a collection of Subunit and
returns a collection of SubunitCluster.
The SubunitClustererMethod enumerates all methods that can be used to
cluster Subunit in the SubunitCluster.
The SubunitClustererParameters specifies the options used for the clustering
of the subunits in structures using the SubunitClusterer.
The SuperPosition interface defines and documents the required methods for
any superposition algorithm implementation, so that the input and expected
output are uniform.
The SuperPositionAbstract contains common code shared by all SuperPosition
algorithm implementations.
Implementation of the Quaternion-Based Characteristic Polynomial algorithm
for RMSD and Superposition calculations.
The SuperPositionQuat implements a quaternion based algorithm to superpose
arrays of Points in 3D.
SuperPositions is a Class that provides static helper methods and an easy
access to the whole family of SuperPosition algorithms.
A class that calculates the superposition between two sets of points using an
SVD Matrix Decomposition.
Contains info for graphing km figures
Ported from survfitKM.S When combining multiple entries with same time not
sure how the weighting adds up
Data class to represent a single sample where time and event/censor status is required
Additionally each variable and data associated with that variable.
Used to work with SurvivalInfo
Not used and probably should be deleted
Sorted symbol table implementation using a java.util.TreeMap.
Data Structure that stores all the symmetry axes that describe
the symmetry of a structure.
Represents an axis of symmetry
Calculates a symmetry analysis and displays the results.
Class that provides visualizations methods for symmetry alignments.
A JFrame that allows to trigger a symmetry analysis, either from files
in a directory or after manual upload
Adapted from the AlignmentGui class in biojava.
Action Listener for the symmetry menu.
Method used for symmetry perception in the QuatSymmetryDetector.
Interface for all symmetry refinement implementations.
Utility methods for symmetry (quaternary and internal) detection and result
manipulation.
Optimizes a symmetry alignment by a Monte Carlo score optimization of the
repeat multiple alignment.
A class containing static methods to parse the symop.lib file from the
CCP4 package.
Parse tab-delimited ontology files into Ontology objects.
Provides a way of separating us from the specific IUPACParser.IUPACTable even
though this is the only implementing class for the interface.
Class used to hold three nucleotides together and allow for equality
to be assessed in a case insensitive manner.
Instance of a Codon which is 3 NucleotideCompounds, its
corresponding AminoAcidCompound and if it is a start or stop codon.
A sequence can be associated with a species or Taxonomy ID
A term in an ontology.
Simple in-memory implementation of an ontology term.
This class also finds the base pairing and base-pair step parameters but has a broader definition
of a base pair so that non-canonical-WC base pairs will be detected and reported.
An implementation of AbstractFeature
A simple timer, calculates the time interval between two events.
Used as a way of encapsulating the data structures required to parse DNA to a
Protein sequence.
This class is the way to create a TranscriptionEngine.
This is the sequence if you want to go from a gene sequence to a protein sequence.
Thrown from AbstractCompundTranslator
The TreeConstructor uses the forester library to build different types of
phylogenetic trees.
The TreeConstructorType specifies the algorithm used to construct the tree
(clustering algorithm).
The TreeType specifies the optimization criteria used to generate the tree.
A triple in an ontology.
Basic in-memory implementation of a Triple in an ontology
This can be used to implement Ontology.createTriple
A facade that makes it easier to work with a 2bit file.
downloaded from http://storage.bioinf.fbb.msu.ru/~roman/TwoBitParser.java
Class is a parser of UCSC Genome Browser file format .2bit used to store
nucleotide sequence information.
Implementation of the 2bit encoding.
Extension of the BitArrayWorker which provides the 2bit implementation
code.
Uncompresses a single tarred or zipped file, writing output to standard out
This class decompresses an input stream containing data compressed with
the unix "compress" utility (LZC, a LZW variant).
Pass in a Uniprot ID and this ProxySequenceReader when passed to a ProteinSequence will get the sequence data and other data elements
associated with the ProteinSequence by Uniprot.
A class to contain the BoundingBoxes of all polymeric molecules in a full unit cell.
UnitQuaternions is a static Class that contains methods for calculating and
using unit quaternions.
A class that takes care of opening URLConnections and sets the proper timeouts
Represents a structure loaded from a URL (including a file URL)
A container to persist config to the file system
This is a utility class that contains utility methods which facilitate the coding of other methods
Map implementation which keeps weak references to values.
A sliding window view of a sequence which does not implement any
interfaces like Sequence because they do not fit how this works.
Need to handle very large spreadsheets of expression data so keep memory
footprint low
FlowLayout subclass that fully supports wrapping of components.
Helper methods to simplify boilerplate XML parsing code for org.w3c.dom XML objects
Simple interface for building XML documents.
This chemical component provider retrieves and caches chemical component definition files from a
zip archive specified in its construction.