Chimera Commands Index

Usage:
mda   input-sequence  output-folder  options

The command mda (for MultiDomain Assembler) performs several steps toward modeling a multidomain protein:

  1. Obtains the protein sequence specified by input-sequence, which can be either of the following:
  2. With that sequence as the query, searches the Protein Data Bank (PDB) sequences to find known structures of similar sequence, using a BLAST web service hosted by the UCSF Resource for Biocomputing, Visualization, and Informatics (RBVI).

  3. Imports the corresponding PDB structures into Chimera and lays them out from left to right according to sequence matches along the query from N- to C-terminus. Multiple hits to the same or overlapping segments are stacked vertically; gaps in structural coverage are shown as transparent dark gray spheres proportional in volume to the number of missing residues. See the display options.

    Performance depends on the number of structures imported into Chimera, which can be very large especially for runs with low filtering (permissive BLAST score and percent ID cutoffs, little or no winnowing) on queries with many known structures. Repeated uses of mda will reuse any open models that still meet the criteria and close any others. The command can be aborted during structure import by clicking the red stop icon in the status line to cancel the foreground task.

    The center of rotation method is set to independent so that models rotate about their individual centers rather than a single collective center. The initial layout including scale and model orientations is saved as a position named stacked. Positions can be restored from the Rapid Access interface or with the command reset (e.g., reset stacked). Another position named overlay, better suited to the modeling step, is also saved.

  4. In the specified output-folder, writes output files:

  5. Opens the pseudo-multiple sequence alignment from BLAST in Multalign Viewer; this can be suppressed (along with the Modeller dialog) using showAlignment false.

  6. Opens the interface to Modeller comparative (homology) modeling with the query sequence as the target, all of the imported hit structures designated as templates, and other options set as recommended for use with mda, including thorough optimization. The overlay position should be restored before comparative modeling to decrease the likelihood of generating models with knots. Comparative modeling relies on an accurate sequence alignment. If necessary, the sequence alignment can be edited manually or realigned using Multalign Viewer, or the output FASTA file can be edited outside of Chimera and reopened. Note that the BLAST alignment may omit parts of the hit chains (more...).

    On first use, mda also generates three database files (MDA_seqs.db, MDA_blast.db and MDA_uniprot.db) to speed up subsequent uses of the command, but these are not human-readable. These are placed in the user's Chimera download directory, in an MDA subdirectory.

See also: Blast Protein, Fetch by ID, the Chimera-Modeller interface, and
Multidomain Assembler (MDA) generates models of large multidomain proteins. Hertig S, Goddard TD, Johnson GT, Ferrin TE. Biophys J. 2015 May 5;108(9):2097-102.

Examples:
mda Q14896 ~/Desktop/MDA percentId 50 showAlignment false coloring blastscore skip 3CX2
mda Q14896 ~/Desktop/MDA limit 1,20; reset overlay
A smaller example (as of October 2014, imports five structures):
mda p45379 ~/Desktop/MDA percent 60 group false color blast

Options

Option keywords can be truncated to unique strings and their case does not matter. A vertical bar “|” designates mutually exclusive options, and default values are indicated with bold. Synonyms for true: True, 1. Synonyms for false: False, 0.

Initial search and filtering

winnow  max-per-region
Use the BLAST option to winnow the results to no more than max-per-region hits per region, as described in Berman et al., J Comput Biol. 7:293 (2000). Lower values correspond to more aggressive winnowing and fewer hits returned. All other filters (options below) are applied after winnowing. If the option is not specified or set to zero, no winnowing is done.
minScore  score
Keep only BLAST hits with scores of at least score (default 50). The BLOSUM62 scoring matrix is used. Lowering the cutoff may increase structural coverage of the query, but low-scoring hits may be poorly aligned with the query sequence.
percentId  percent
Keep only BLAST hits with percent sequence identity (%ID) of at least percent. %ID is based only on the region of hit-query alignment from BLAST. If the option is not specified, no filtering by %ID is done.
includeNative true | false
Whether to keep all hits (PDB entries) that are annotated by the PDB with the same UniProt ID as the query, regardless of minScore and percentIdThreshold settings.
suppressDoubles true | false
Whether to keep only the highest-scoring hit to a given PDB entry. An entire PDB structure will be imported if any part is kept as a hit, but redundant chains will be hidden and its position in the mda layout will be based on that match to the query. If suppressDoubles is true (default), a PDB entry will not be imported more than once. If false, it is possible for the same PDB entry to be imported more than once and laid out in different places corresponding to different matches along the query. Regardless of this setting, however, only the PDB chain with the most structure residues will be kept as a hit when BLAST finds multiple identical chains from the same PDB entry. The default (true) is recommended for initial assessment of the available structures, whereas false may provide better structural coverage for modeling proteins with repeated domains.
forceBlast true | false
Whether to re-run BLAST instead of using any previously cached results. Even if this option is false, previously cached results will only be used when the query (the uniprotID) and winnowing level are the same as in an earlier use of mda. Whenever mda runs BLAST, the new results will be added to the cache, and if there were earlier results for the same query and winnowing level, they will be overwritten. Using cached results will speed mda execution at the possible cost of missing any newer PDB entries added to the sequence database since the time of caching.

Display

group true | false
Whether to simplify the display when multiple hits cover similar parts of the query. Hits are grouped into sets in which each member overlaps another by at least 25 residues in the alignment according to BLAST, but extends less than 25 residues beyond the alignment region of the member with the longest such region. If this option is true, the members of a set will be superimposed, only the representative structure (that with the longest region of alignment) will be shown, and their models will be grouped in the Model Panel. If this option is false (default), the models in a set will still be laid out in the same “column,” but vertically separated from one another, and they will not be grouped in the Model Panel.
hideSubmodels true | false
Whether to hide submodels except for the first of any multi-model PDB entries, namely NMR ensembles.
hideAltChain true | false
Whether to hide additional chains of a PDB entry that are identical to (same molecular entity as) the BLAST hit chain.
hideComplex true | false
Whether to hide nearby chains that are nonidentical to the BLAST hit. When shown (default), these other chains are colored blue to distinguish them from the hit chains; see coloring. Regardless of this option, however, only ribbons are shown, and thus small molecules without ribbon representations are not displayed. Atoms/bonds can be displayed with the menu and/or commands, for example: display ligand
deleteHidden true | false
Whether to delete submodels or chains hidden by the three preceding options. Deleting hidden parts improves current performance, but can make a subsequent run with some of the same hits slower, as those with deleted parts must be reimported.
coloring  scheme
Coloring allows a quick visual assessment of the query-aligned parts of hit structures, match quality, and potentially interesting binding partners or insertions. The ribbon-coloring scheme can be one of the following:
In addition, three attributes are assigned to residues in the hit structures to allow custom coloring. The first two are quantitative, suitable for showing with Render by Attribute or the rangecolor command: Although the third has numerical values, it is a classification rather than a continuously varying quantity: Residues in each class above can be specified directly in commands, e.g.: color yellow :/mda_alignment=1

Suppressing dialogs (helpful during scripted or repetitive use)

noConfirm true | false
Whether to simply import the hit structures (no matter how many) rather than asking the user for permission when more than ten are found.
showAlignment true | false
Whether to open Multalign Viewer and the Modeller interface. Not opening these dialogs allows faster execution, so false is recommended when just the initial layout of structures is desired, not the sequence alignment or subsequent homology modeling (which requires the sequence alignment).

Further limiting the hits (structures to be used as templates)

limit  min-per-residue[,min-extension]
Limit the number of hits to a specified coverage depth. All hits are considered in order from highest to lowest %ID. A hit is kept if it covers at least min-extension consecutive residues (default 20) where each residue is covered by fewer than min-per-residue already accepted (higher-identity) hits. With this option, a more complete coverage of the target sequence with structural templates can usually be achieved than with the BLAST-provided winnow option, which has not been optimized for template search. Exclusions by limit can be overridden with keepPDB.
keepPDB  pdb1[,pdb2,...]
Retain hits with the specified PDB IDs regardless of any limit settings. This option is useful when a specific PDB structure is desired as a template for homology modeling.
skipPDB  pdb1[,pdb2,...]
Omit hits with the specified PDB IDs. This option is useful when a specific PDB structure should be excluded from the templates for homology modeling. Note that models can also be omitted from the templates using the Modeller interface.
excludeSelected true | false
Whether to omit models with any part currently selected from the hits in the next mda run.