public class StructureSequenceMatcher extends Object
| Constructor and Description |
|---|
StructureSequenceMatcher() |
| Modifier and Type | Method and Description |
|---|---|
static ProteinSequence |
getProteinSequenceForStructure(Structure struct,
Map<Integer,Group> groupIndexPosition)
Generates a ProteinSequence corresponding to the sequence of struct,
and maintains a mapping from the sequence back to the original groups.
|
static Structure |
getSubstructureMatchingProteinSequence(ProteinSequence sequence,
Structure wholeStructure)
|
static ResidueNumber[] |
matchSequenceToStructure(ProteinSequence seq,
Structure struct)
Given a sequence and the corresponding Structure, get the ResidueNumber
for each residue in the sequence.
|
static ProteinSequence |
removeGaps(ProteinSequence gapped)
Removes all gaps ('-') from a protein sequence
|
static <T> T[][] |
removeGaps(T[][] gapped)
Creates a new list consisting of all columns of gapped where no row
contained a null value.
|
public StructureSequenceMatcher()
public static Structure getSubstructureMatchingProteinSequence(ProteinSequence sequence, Structure wholeStructure)
wholeStructure containing only the Groups that are included in
sequence. The resulting structure will contain only ATOM residues; the SEQ-RES will be empty.
The Chains of the Structure will be new instances (cloned), but the Groups will not.sequence - The input protein sequencewholeStructure - The structure from which to take a substructureStructureException#matchSequenceToStructure(ProteinSequence, Structure)}public static ProteinSequence getProteinSequenceForStructure(Structure struct, Map<Integer,Group> groupIndexPosition)
struct - Input structuregroupIndexPosition - An empty map, which will be populated with
(residue index in returned ProteinSequence) -> (Group within struct)SeqRes2AtomAligner#getFullAtomSequence(List, Map)}, which
does the heavy lifting.public static ResidueNumber[] matchSequenceToStructure(ProteinSequence seq, Structure struct)
Smith-Waterman alignment is used to match the sequences. Residues in the sequence but not the structure or mismatched between sequence and structure will have a null atom, while residues in the structure but not the sequence are ignored with a warning.
seq - The protein sequence. Should match the sequence of struct very
closely.struct - The corresponding protein structurepublic static ProteinSequence removeGaps(ProteinSequence gapped)
gapped - public static <T> T[][] removeGaps(T[][] gapped)
gapped - A rectangular matrix containing null to mark gapsCopyright © 2000–2017 BioJava. All rights reserved.