Class SubunitClustererParameters
- All Implemented Interfaces:
Serializable
SubunitClusterer.- Since:
- 5.0.0
- Author:
- Peter Rose, Aleix Lafita
- See Also:
-
Constructor Summary
ConstructorsConstructorDescriptionInitialize with "local" metrics by default.SubunitClustererParameters(boolean useGlobalMetrics) "Local" metrics are scoring SubunitClustererMethod.SEQUENCE: sequence identity of a local alignment (normalised by the number of aligned residues) sequence coverage of the alignment (normalised by the length of the longer sequence) SubunitClustererMethod.STRUCTURE: RMSD of the aligned substructures and structure coverage of the alignment (normalised by the length of the larger structure) Two thresholds for each method are required. -
Method Summary
Modifier and TypeMethodDescriptionintIf the shortest subunit sequence length is higher or equal the minimumSequenceLengthFraction times the median subunit sequence length, then the minimumSequenceLength is set to shortest subunit sequence length, but not shorter than the absoluteMinimumSequenceLength.Method to cluster subunits.intGet the minimum number of residues of a subunits to be considered in the clusters.doubleIf the shortest subunit sequence length is higher or equal the minimumSequenceLengthFraction times the median subunit sequence length, then the minimumSequenceLength is set to shortest subunit sequence length, but not shorter than the absoluteMinimumSequenceLength.doubleStructure similarity threshold (measured with RMSD) to consider for the structural subunit clustering.doubleThe minimum coverage of the sequence alignment between two subunits to be clustered together.doubleSequence identity threshold to consider for the subunits clustering.doubleThe minimum coverage of the structure alignment between two subunits to be clustered together.Method to superpose subunits (i.e., structural aligner).doubleStructure similarity threshold (measured with TMScore) to consider for the structural subunit clustering.booleanisHighConfidenceScores(double sequenceIdentity, double sequenceCoverage) Whether the subunits can be considered "identical" by sequence alignment.booleanThe internal symmetry option divides eachSubunitof eachSubunitClusterinto its internally symmetric repeats.booleanWhether the alignment algorithm should try its best to optimize the alignment, or we are happy with a quick and dirty result.booleanWhether to use the entity id of subunits to infer that sequences are identical.booleanUse metrics calculated relative to the whole sequence or structure, rather than the aligned part onlybooleanUse RMSD for evaluating structure similaritybooleanUse sequence coverage for evaluating sequence similaritybooleanUse structure coverage for evaluating sequence similaritybooleanUse TMScore for evaluating structure similarityvoidsetAbsoluteMinimumSequenceLength(int absoluteMinimumSequenceLength) If the shortest subunit sequence length is higher or equal the minimumSequenceLengthFraction times the median subunit sequence length, then the minimumSequenceLength is set to shortest subunit sequence length, but not shorter than the absoluteMinimumSequenceLength.voidMethod to cluster subunits.voidsetInternalSymmetry(boolean internalSymmetry) The internal symmetry option divides eachSubunitof eachSubunitClusterinto its internally symmetric repeats.voidsetMinimumSequenceLength(int minimumSequenceLength) Set the minimum number of residues of a subunits to be considered in the clusters.voidsetMinimumSequenceLengthFraction(double minimumSequenceLengthFraction) If the shortest subunit sequence length is higher or equal the minimumSequenceLengthFraction times the median subunit sequence length, then the minimumSequenceLength is set to shortest subunit sequence length, but not shorter than the absoluteMinimumSequenceLength.voidsetOptimizeAlignment(boolean optimizeAlignment) Whether the alignment algorithm should try its best to optimize the alignment, or we are happy with a quick and dirty result.voidsetRMSDThreshold(double rmsdThreshold) Structure similarity threshold (measured with RMSD) to consider for the structural subunit clustering.voidsetSequenceCoverageThreshold(double sequenceCoverageThreshold) The minimum coverage of the sequence alignment between two subunits to be clustered together.voidsetSequenceIdentityThreshold(double sequenceIdentityThreshold) Sequence identity threshold to consider for the sequence subunit clustering.voidsetStructureCoverageThreshold(double structureCoverageThreshold) The minimum coverage of the structure alignment between two subunits to be clustered together.voidsetSuperpositionAlgorithm(String superpositionAlgorithm) Method to cluster subunits.voidsetTMThreshold(double tmThreshold) Structure similarity threshold (measured with TMScore) to consider for the structural subunit clustering.voidsetUseEntityIdForSeqIdentityDetermination(boolean useEntityIdForSeqIdentityDetermination) Whether to use the entity id of subunits to infer that sequences are identical.voidsetUseGlobalMetrics(boolean useGlobalMetrics) Use metrics calculated relative to the whole sequence or structure, rather than the aligned part onlyvoidsetUseRMSD(boolean useRMSD) Use RMSD for evaluating structure similarityvoidsetUseSequenceCoverage(boolean useSequenceCoverage) Use sequence coverage for evaluating sequence similarityvoidsetUseStructureCoverage(boolean useStructureCoverage) Use structure coverage for evaluating sequence similarityvoidsetUseTMScore(boolean useTMScore) Use TMScore for evaluating structure similaritytoString()
-
Constructor Details
-
SubunitClustererParameters
"Local" metrics are scoring SubunitClustererMethod.SEQUENCE: sequence identity of a local alignment (normalised by the number of aligned residues) sequence coverage of the alignment (normalised by the length of the longer sequence) SubunitClustererMethod.STRUCTURE: RMSD of the aligned substructures and structure coverage of the alignment (normalised by the length of the larger structure) Two thresholds for each method are required. "Global" metrics are scoring SubunitClustererMethod.SEQUENCE: sequence identity of a global alignment (normalised by the length of the alignment) SubunitClustererMethod.STRUCTURE: TMScore of the aligned structures (normalised by the length of the larger structure) One threshold for each method is required. -
SubunitClustererParameters
public SubunitClustererParameters()Initialize with "local" metrics by default.
-
-
Method Details
-
getMinimumSequenceLength
Get the minimum number of residues of a subunits to be considered in the clusters.- Returns:
- minimumSequenceLength
-
setMinimumSequenceLength
Set the minimum number of residues of a subunits to be considered in the clusters.- Parameters:
minimumSequenceLength-
-
getAbsoluteMinimumSequenceLength
If the shortest subunit sequence length is higher or equal the minimumSequenceLengthFraction times the median subunit sequence length, then the minimumSequenceLength is set to shortest subunit sequence length, but not shorter than the absoluteMinimumSequenceLength.This adaptive feature allows the consideration of structures mainly constructed by very short chains, such as collagen (1A3I)
- Returns:
- the absoluteMinimumSequenceLength
-
setAbsoluteMinimumSequenceLength
If the shortest subunit sequence length is higher or equal the minimumSequenceLengthFraction times the median subunit sequence length, then the minimumSequenceLength is set to shortest subunit sequence length, but not shorter than the absoluteMinimumSequenceLength.This adaptive feature allows the consideration of structures mainly constructed by very short chains, such as collagen (1A3I)
- Parameters:
absoluteMinimumSequenceLength-
-
getMinimumSequenceLengthFraction
If the shortest subunit sequence length is higher or equal the minimumSequenceLengthFraction times the median subunit sequence length, then the minimumSequenceLength is set to shortest subunit sequence length, but not shorter than the absoluteMinimumSequenceLength.This adaptive feature allows the consideration of structures mainly constructed by very short chains, such as collagen (1A3I)
- Returns:
- the minimumSequenceLengthFraction
-
setMinimumSequenceLengthFraction
If the shortest subunit sequence length is higher or equal the minimumSequenceLengthFraction times the median subunit sequence length, then the minimumSequenceLength is set to shortest subunit sequence length, but not shorter than the absoluteMinimumSequenceLength.This adaptive feature allows the consideration of structures mainly constructed by very short chains, such as collagen (1A3I)
- Parameters:
minimumSequenceLengthFraction-
-
getSequenceIdentityThreshold
Sequence identity threshold to consider for the subunits clustering.Two subunits with sequence identity equal or higher than the threshold will be clustered together.
- Returns:
- sequenceIdentityThreshold
-
setSequenceIdentityThreshold
Sequence identity threshold to consider for the sequence subunit clustering.Two subunits with sequence identity equal or higher than the threshold will be clustered together.
- Parameters:
sequenceIdentityThreshold-
-
getSequenceCoverageThreshold
The minimum coverage of the sequence alignment between two subunits to be clustered together.- Returns:
- sequenceCoverageThreshold
-
setSequenceCoverageThreshold
The minimum coverage of the sequence alignment between two subunits to be clustered together.- Parameters:
sequenceCoverageThreshold-
-
getRMSDThreshold
Structure similarity threshold (measured with RMSD) to consider for the structural subunit clustering.- Returns:
- rmsdThreshold
-
setRMSDThreshold
Structure similarity threshold (measured with RMSD) to consider for the structural subunit clustering.- Parameters:
rmsdThreshold-
-
getTMThreshold
Structure similarity threshold (measured with TMScore) to consider for the structural subunit clustering.- Returns:
- tmThreshold
-
setTMThreshold
Structure similarity threshold (measured with TMScore) to consider for the structural subunit clustering.- Parameters:
tmThreshold-
-
getStructureCoverageThreshold
The minimum coverage of the structure alignment between two subunits to be clustered together.- Returns:
- structureCoverageThreshold
-
setStructureCoverageThreshold
The minimum coverage of the structure alignment between two subunits to be clustered together.- Parameters:
structureCoverageThreshold-
-
getClustererMethod
Method to cluster subunits.- Returns:
- clustererMethod
-
setClustererMethod
Method to cluster subunits.- Parameters:
method-
-
isInternalSymmetry
The internal symmetry option divides eachSubunitof eachSubunitClusterinto its internally symmetric repeats.The
SubunitClustererMethod.STRUCTUREmust be chosen to consider internal symmetry, otherwise this parameter will be ignored.- Returns:
- true if internal symmetry is considered, false otherwise
-
setInternalSymmetry
The internal symmetry option divides eachSubunitof eachSubunitClusterinto its internally symmetric repeats.The
SubunitClustererMethod.STRUCTUREmust be chosen to consider internal symmetry, otherwise this parameter will be ignored.- Parameters:
internalSymmetry- true if internal symmetry is considered, false otherwise
-
toString
-
getSuperpositionAlgorithm
Method to superpose subunits (i.e., structural aligner).- Returns:
- superpositionAlgorithm
-
setSuperpositionAlgorithm
Method to cluster subunits.- Parameters:
superpositionAlgorithm-
-
isOptimizeAlignment
Whether the alignment algorithm should try its best to optimize the alignment, or we are happy with a quick and dirty result. Effect depends on implementation of the specific algorithm's method. *- Returns:
- optimizeAlignment
-
setOptimizeAlignment
Whether the alignment algorithm should try its best to optimize the alignment, or we are happy with a quick and dirty result. Effect depends on implementation of the specific algorithm's method. *- Parameters:
optimizeAlignment-
-
isUseRMSD
Use RMSD for evaluating structure similarity- Returns:
- useRMSD
-
setUseRMSD
Use RMSD for evaluating structure similarity- Parameters:
useRMSD-
-
isUseTMScore
Use TMScore for evaluating structure similarity- Returns:
- useTMScore
-
setUseTMScore
Use TMScore for evaluating structure similarity- Parameters:
useTMScore-
-
isUseSequenceCoverage
Use sequence coverage for evaluating sequence similarity- Returns:
- useSequenceCoverage
-
setUseSequenceCoverage
Use sequence coverage for evaluating sequence similarity- Parameters:
useSequenceCoverage-
-
isUseStructureCoverage
Use structure coverage for evaluating sequence similarity- Returns:
- useStructureCoverage
-
setUseStructureCoverage
Use structure coverage for evaluating sequence similarity- Parameters:
useStructureCoverage-
-
isUseGlobalMetrics
Use metrics calculated relative to the whole sequence or structure, rather than the aligned part only- Returns:
- useGlobalMetrics
-
setUseGlobalMetrics
Use metrics calculated relative to the whole sequence or structure, rather than the aligned part only- Parameters:
useGlobalMetrics-
-
isHighConfidenceScores
Whether the subunits can be considered "identical" by sequence alignment. For local sequence alignment (normalized by the number of aligned pairs) this means 0.95 or higher identity and 0.75 or higher coverage. For global sequence alignment (normalised by the alignment length) this means 0.85 or higher sequence identity.- Parameters:
sequenceIdentity-sequenceCoverage-- Returns:
- true if the sequence alignment scores are equal to or better than the "high confidence" scores, false otherwise.
-
isUseEntityIdForSeqIdentityDetermination
Whether to use the entity id of subunits to infer that sequences are identical. Only applies if theSubunitClustererMethodis a sequence based one.- Returns:
- the flag
- Since:
- 5.4.0
-
setUseEntityIdForSeqIdentityDetermination
public void setUseEntityIdForSeqIdentityDetermination(boolean useEntityIdForSeqIdentityDetermination) Whether to use the entity id of subunits to infer that sequences are identical. Only applies if theSubunitClustererMethodis a sequence based one. Note this requiresFileParsingParameters.setAlignSeqRes(boolean)to be set to true.- Parameters:
useEntityIdForSeqIdentityDetermination- the flag to be set- Since:
- 5.4.0
-