public class SequenceMixin extends Object
NucleotideCompound
Sequence
; or to any Sequence e.g. looking for the
getComposition(Sequence)
or getDistribution(Sequence)
for any type of Sequence.
All of these methods assume that you can use the Iterable
interface
offered by the implementations of Sequence
to provide all the
compounds that implementation allows you to see. Since sequence should know
nothing about its backing stores (apart from calling out to it) this should
be true.Modifier and Type | Class and Description |
---|---|
static class |
SequenceMixin.SequenceIterator<C extends Compound>
A basic sequence iterator which iterates over the given Sequence by
biological index.
|
Constructor and Description |
---|
SequenceMixin() |
Modifier and Type | Method and Description |
---|---|
static <C extends Compound> |
checksum(Sequence<C> sequence)
Performs a simple CRC64 checksum on any given sequence.
|
static int |
countAT(Sequence<NucleotideCompound> sequence)
Returns the count of AT in the given sequence
|
static <C extends Compound> |
countCompounds(Sequence<C> sequence,
C... compounds)
For the given vargs of compounds this method counts the number of
times those compounds appear in the given sequence
|
static int |
countGC(Sequence<NucleotideCompound> sequence)
Returns the count of GC in the given sequence
|
static <C extends Compound> |
createIterator(Sequence<C> sequence)
Creates a simple sequence iterator which moves through a sequence going
from 1 to the length of the Sequence.
|
static <C extends Compound> |
createSubSequence(Sequence<C> sequence,
int start,
int end)
Creates a simple sub sequence view delimited by the given start and end.
|
static <C extends Compound> |
getComposition(Sequence<C> sequence)
Does a linear scan over the given Sequence and records the number of
times each base appears.
|
static <C extends Compound> |
getDistribution(Sequence<C> sequence)
Analogous to
getComposition(Sequence) but returns the
distribution of that Compound over the given sequence. |
static <C extends Compound> |
indexOf(Sequence<C> sequence,
C compound)
Performs a linear search of the given Sequence for the given compound.
|
static <C extends Compound> |
inverse(Sequence<C> sequence)
A method which attempts to do the right thing when is comes to a
reverse/reverse complement
|
static <C extends Compound> |
lastIndexOf(Sequence<C> sequence,
C compound)
Performs a reversed linear search of the given Sequence by wrapping
it in a
ReversedSequenceView and passing it into
indexOf(Sequence, Compound) . |
static <C extends Compound> |
nonOverlappingKmers(Sequence<C> sequence,
int kmer)
Produces kmers of the specified size e.g.
|
static <C extends Compound> |
overlappingKmers(Sequence<C> sequence,
int kmer)
Used to generate overlapping k-mers such i.e.
|
static <C extends Compound> |
sequenceEquality(Sequence<C> source,
Sequence<C> target)
A case-sensitive manner of comparing two sequence objects together.
|
static <C extends Compound> |
sequenceEqualityIgnoreCase(Sequence<C> source,
Sequence<C> target)
A case-insensitive manner of comparing two sequence objects together.
|
static <C extends Compound> |
shuffle(Sequence<C> sequence)
Implements sequence shuffling by first materializing the given
Sequence into a List , applying
Collections.shuffle(List) and then returning the shuffled
elements in a new instance of SequenceBackingStore which behaves
as a Sequence . |
static <C extends Compound> |
toList(Sequence<C> sequence)
|
static <C extends Compound> |
toString(Sequence<C> sequence)
Shortcut to
toStringBuilder(org.biojava.nbio.core.sequence.template.Sequence)
which calls toString() on the resulting object. |
static <C extends Compound> |
toStringBuilder(Sequence<C> sequence)
For the given Sequence this will return a
StringBuilder object
filled with the results of Compound#toString() . |
static <C extends Compound> |
write(Appendable appendable,
Sequence<C> sequence)
Used as a way of sending a Sequence to a writer without the cost of
converting to a full length String and then writing the data out
|
public SequenceMixin()
public static <C extends Compound> int countCompounds(Sequence<C> sequence, C... compounds)
C
- The type of compound we are looking forsequence
- The Sequence
to perform the count oncompounds
- The compounds to look forpublic static int countGC(Sequence<NucleotideCompound> sequence)
sequence
- The NucleotideCompound
Sequence
to perform
the GC analysis onpublic static int countAT(Sequence<NucleotideCompound> sequence)
sequence
- The NucleotideCompound
Sequence
to perform
the AT analysis onpublic static <C extends Compound> Map<C,Double> getDistribution(Sequence<C> sequence)
getComposition(Sequence)
but returns the
distribution of that Compound
over the given sequence.C
- The type of compound to look forsequence
- The type of sequence to look overpublic static <C extends Compound> Map<C,Integer> getComposition(Sequence<C> sequence)
C
- The type of compound to look forsequence
- The type of sequence to look overpublic static <C extends Compound> void write(Appendable appendable, Sequence<C> sequence) throws IOException
C
- Type of compoundwriter
- The writer to send data tosequence
- The sequence to write outIOException
- Thrown if we encounter a problempublic static <C extends Compound> StringBuilder toStringBuilder(Sequence<C> sequence)
StringBuilder
object
filled with the results of Compound#toString()
. Does not
used write(java.lang.Appendable, org.biojava.nbio.core.sequence.template.Sequence)
because of its IOException
signature.public static <C extends Compound> String toString(Sequence<C> sequence)
toStringBuilder(org.biojava.nbio.core.sequence.template.Sequence)
which calls toString() on the resulting object.public static <C extends Compound> int indexOf(Sequence<C> sequence, C compound)
public static <C extends Compound> int lastIndexOf(Sequence<C> sequence, C compound)
ReversedSequenceView
and passing it into
indexOf(Sequence, Compound)
. We then inverse the index coming
out of it.public static <C extends Compound> Iterator<C> createIterator(Sequence<C> sequence)
public static <C extends Compound> SequenceView<C> createSubSequence(Sequence<C> sequence, int start, int end)
public static <C extends Compound> Sequence<C> shuffle(Sequence<C> sequence)
Sequence
into a List
, applying
Collections.shuffle(List)
and then returning the shuffled
elements in a new instance of SequenceBackingStore
which behaves
as a Sequence
.public static <C extends Compound> String checksum(Sequence<C> sequence)
public static <C extends Compound> List<SequenceView<C>> nonOverlappingKmers(Sequence<C> sequence, int kmer)
C
- Compound to usesequence
- Sequence to build fromkmer
- Kmer sizepublic static <C extends Compound> List<SequenceView<C>> overlappingKmers(Sequence<C> sequence, int kmer)
C
- Compound to usesequence
- Sequence to build fromkmer
- Kmer sizepublic static <C extends Compound> SequenceView<C> inverse(Sequence<C> sequence)
C
- The type of compoundsequence
- The input sequencepublic static <C extends Compound> boolean sequenceEqualityIgnoreCase(Sequence<C> source, Sequence<C> target)
C
- The type of compoundsource
- Source sequence to assesstarget
- Target sequence to assesspublic static <C extends Compound> boolean sequenceEquality(Sequence<C> source, Sequence<C> target)
C
- The type of compoundsource
- Source sequence to assesstarget
- Target sequence to assessCopyright © 2000–2019 BioJava. All rights reserved.