API Reference

property is_backbone: ndarray#: Boolean array indicating backbone atoms.

property is_sidechain: ndarray#: Boolean array indicating sidechain atoms.

property is_aromatic: ndarray#: Boolean array indicating aromatic atoms.

property is_ligand: ndarray#: Boolean array indicating ligand atoms.

property residue_type: ndarray#: String array with residue type classification (PROTEIN/DNA/RNA/LIGAND).

__getitem__(index)[source]#

Get subset of structure by index.

Parameters:: index (Union[int, slice, ndarray]) – Integer, slice, or boolean/integer array for indexing
Return type:: Structure
Returns:: New Structure containing selected atoms

Example

>>> subset = structure[structure.element == "C"]
>>> single_atom = structure[0]
>>> chain_a = structure[structure.chain_id == "A"]

__len__()[source]#

Return number of atoms in structure.

Return type:: int

copy()[source]#

Create deep copy of structure.

Return type:: Structure
Returns:: New Structure with copied data

get_center(weights=None)[source]#

Calculate geometric or mass-weighted center of structure.

Parameters:: weights (Optional[ndarray], default: None) – Optional weights for each atom (e.g., atomic masses)
Return type:: ndarray
Returns:: 3D coordinate of center as numpy array

get_masses()[source]#

Get atomic masses for all atoms.

Return type:: ndarray
Returns:: Array of atomic masses in amu

translate(vector)[source]#

Translate structure by given vector.

Parameters:: vector (ndarray) – 3D translation vector
Return type:: None

center_at_origin(weights=None)[source]#

Center structure at origin.

Parameters:: weights (Optional[ndarray], default: None) – Optional weights for center calculation
Return type:: None

get_neighbors_within(atom_idx, radius)[source]#

Get atom indices within radius of specified atom.

Parameters:

atom_idx (int) – Index of query atom
radius (float) – Search radius in Angstroms

Return type:

Returns:

Array of neighbor atom indices (excluding query atom)

Example

>>> neighbors = structure.get_neighbors_within(100, 5.0)

get_atoms_within_sphere(center, radius)[source]#

Get atoms within spherical region.

Parameters:

center (ndarray) – Center point of sphere (x, y, z)
radius (float) – Radius of sphere in Angstroms

Return type:

Returns:

Array of atom indices within the sphere

Example

>>> center = np.array([10.0, 15.0, 20.0])
>>> atoms = structure.get_atoms_within_sphere(center, 8.0)

get_atoms_within_cog_sphere(selection, radius)[source]#

Get atoms within spherical zone centered at center of geometry of selection.

Parameters:

selection (ndarray) – Boolean mask or indices of atoms to define COG
radius (float) – Radius of spherical zone in Angstroms

Return type:

Returns:

Array of atom indices within the COG sphere

Example

>>> active_site = structure.select("resname HIS")
>>> nearby = structure.get_atoms_within_cog_sphere(active_site, 10.0)

get_neighbors_for_atoms(atom_indices, radius)[source]#

Get neighbors for multiple atoms at once (batch operation).

Parameters:

atom_indices (ndarray) – Array of atom indices to query
radius (float) – Search radius in Angstroms

Return type:

Dict[int, ndarray]

Returns:

Dictionary mapping atom_idx -> array of neighbor indices

Example

>>> ca_atoms = structure.select("name CA")
>>> neighbors = structure.get_neighbors_for_atoms(ca_atoms, 8.0)

get_closest_atoms(query_point, k=1)[source]#

Get k nearest atoms to a query point.

Parameters:

query_point (ndarray) – 3D coordinate to query
k (int, default: 1) – Number of nearest neighbors to return

Return type:

Tuple[ndarray, ndarray]

Returns:

Tuple of (distances, atom_indices) for k nearest atoms

Example

>>> center = np.array([0.0, 0.0, 0.0])
>>> distances, indices = structure.get_closest_atoms(center, k=5)

get_atoms_between_selections(selection1, selection2, max_distance)[source]#

Find atoms from two selections within max_distance of each other.

Parameters:

selection1 (ndarray) – First selection (boolean mask or indices)
selection2 (ndarray) – Second selection (boolean mask or indices)
max_distance (float) – Maximum distance between selections

Return type:

Dict[str, ndarray]

Returns:

Dictionary with ‘selection1_atoms’, ‘selection2_atoms’, ‘distances’

Example

>>> protein = structure.select("protein")
>>> ligand = structure.select("resname LIG")
>>> contacts = structure.get_atoms_between_selections(protein, ligand, 5.0)

has_spatial_index()[source]#

Check if spatial index is available.

Return type:: bool
Returns:: True if scipy is available and spatial indexing is possible

get_bonds_to(other_atoms, max_distance=2.0)[source]#

Find potential bonds to other atoms based on distance.

Parameters:

other_atoms (ndarray) – Indices of other atoms to check
max_distance (float, default: 2.0) – Maximum distance for bond consideration

Return type:

Returns:

Boolean array indicating which atoms have potential bonds

select(selection_string)[source]#

Select atoms using selection language.

Parameters:: selection_string (str) – Selection expression
Return type:: ndarray
Returns:: Boolean array of selected atoms

Examples

>>> mask = structure.select("protein and backbone")
>>> mask = structure.select("resname ALA GLY")
>>> mask = structure.select("chain A and resid 1:50")

Raises:: NotImplementedError – For unsupported selection syntax

get_annotation_info()[source]#

Get information about all annotations in the structure.

Return type:: Dict[str, Dict[str, Any]]
Returns:: Dictionary with annotation info including dtype and whether it’s initialized

__repr__()[source]#

String representation of Structure.

Return type:: str

__str__()[source]#

Detailed string representation.

Return type:: str

classmethod from_pdb(filename)[source]#

Create Structure from PDB file.

Parameters:: filename (str) – Path to PDB file
Return type:: Structure
Returns:: Structure object with all atoms and annotations
Raises:: ValueError – If PDB file contains multiple models

Example

>>> structure = Structure.from_pdb("example.pdb")
>>> print(f"Loaded {structure.n_atoms} atoms")

classmethod from_mmcif(filename)[source]#

Create Structure from mmCIF file.

Parameters:: filename (str) – Path to mmCIF file
Return type:: Structure
Returns:: Structure object with all atoms and annotations
Raises:: ValueError – If mmCIF file contains multiple models

Example

>>> structure = Structure.from_mmcif("example.cif")
>>> print(f"Loaded {structure.n_atoms} atoms")

classmethod from_pdb_string(pdb_content)[source]#

Create Structure from PDB content string.

Parameters:: pdb_content (str) – PDB file content as string
Return type:: Structure
Returns:: Structure object with all atoms and annotations
Raises:: ValueError – If PDB content contains multiple models

Example

>>> pdb_data = "ATOM      1  N   ALA A   1      20.154  16.967  22.478  1.00 10.00           N"
>>> structure = Structure.from_pdb_string(pdb_data)

classmethod from_mmcif_string(mmcif_content)[source]#

Create Structure from mmCIF content string.

Parameters:: mmcif_content (str) – mmCIF file content as string
Return type:: Structure
Returns:: Structure object with all atoms and annotations
Raises:: ValueError – If mmCIF content contains multiple models

Example

>>> mmcif_data = "data_test\nloop_\n_atom_site.group_PDB\n..."
>>> structure = Structure.from_mmcif_string(mmcif_data)

detect_bonds(vdw_factor=0.75, use_file_bonds=True, store_bonds=True)[source]#

Detect bonds using the simplified default detector.

Parameters:

vdw_factor (float, default: 0.75) – Factor for VdW radii in distance detection (0.0 < factor <= 1.0)
use_file_bonds (bool, default: True) – Whether to include file-based bonds (CONECT, mmCIF)
store_bonds (bool, default: True) – Whether to store detected bonds on structure

Return type:

molr.BondList

Returns:

BondList with detected bonds

Example

>>> structure = Structure.from_pdb("protein.pdb")
>>> bonds = structure.detect_bonds()
>>> print(f"Detected {len(bonds)} bonds")

property bonds: TypeAliasForwardRef('molr.BondList') | None#

Get bonds associated with this structure.

Returns:: BondList if bonds have been detected/assigned, None otherwise

has_bonds()[source]#

Check if structure has bond information.

Return type:: bool
Returns:: True if bonds are available

property file_bonds: TypeAliasForwardRef('molr.BondList') | None#

Get bonds loaded from structure files (PDB CONECT, mmCIF bonds).

Returns:: BondList with file-based bonds or None if not available

has_file_bonds()[source]#

Check if structure has file-based bond information.

Return type:: bool
Returns:: True if file bonds are available

The main class for representing molecular structures with spatial indexing capabilities.

Key Methods:

from_pdb() - Load from PDB file
from_mmcif() - Load from mmCIF file
select() - Atom selection using query language
detect_bonds() - Automatic bond detection
get_neighbors_within() - Spatial neighbor queries

StructureEnsemble#

class molr.StructureEnsemble(template, n_frames=0)[source]#

Bases: object

Ensemble of molecular structures representing trajectory data.

This class stores multiple frames of coordinate data while sharing annotations (atom names, elements, etc.) across all frames for memory efficiency. Designed for trajectory analysis and multi-model PDB files.

Memory layout:

coords: (n_frames, n_atoms, 3) array of coordinates
Annotations shared from template Structure
Optional time and box information per frame

Example

>>> ensemble = StructureEnsemble.from_structures([struct1, struct2])
>>> print(f"Trajectory with {ensemble.n_frames} frames")
>>> frame0 = ensemble[0]  # Returns Structure for frame 0

Parameters:

template (Structure)
n_frames (int, default: 0)

__init__(template, n_frames=0)[source]#

Initialize StructureEnsemble from template Structure.

Parameters:

template (Structure) – Template Structure with shared annotations
n_frames (int, default: 0) – Number of frames (default: 0 for dynamic growth)

classmethod from_pdb(filename)[source]#

Create StructureEnsemble from multi-model PDB file.

Parameters:: filename (str) – Path to multi-model PDB file
Return type:: StructureEnsemble
Returns:: StructureEnsemble with all models as frames
Raises:: ValueError – If PDB file contains only single model

classmethod from_pdb_string(pdb_content)[source]#

Create StructureEnsemble from multi-model PDB content string.

Parameters:: pdb_content (str) – PDB content string with multiple models
Return type:: StructureEnsemble
Returns:: StructureEnsemble with all models as frames
Raises:: ValueError – If PDB content contains only single model

classmethod from_mmcif(filename)[source]#

Create StructureEnsemble from multi-model mmCIF file.

Parameters:: filename (str) – Path to multi-model mmCIF file
Return type:: StructureEnsemble
Returns:: StructureEnsemble with all models as frames
Raises:: ValueError – If mmCIF file contains only single model

classmethod from_mmcif_string(mmcif_content)[source]#

Create StructureEnsemble from multi-model mmCIF content string.

Parameters:: mmcif_content (str) – mmCIF content string with multiple models
Return type:: StructureEnsemble
Returns:: StructureEnsemble with all models as frames
Raises:: ValueError – If mmCIF content contains only single model

classmethod from_structures(structures)[source]#

Create StructureEnsemble from list of Structure objects.

Parameters:: structures (List[Structure]) – List of Structure objects with same atoms
Return type:: StructureEnsemble
Returns:: StructureEnsemble with structures as frames
Raises:: ValueError – If structures have different atom counts

add_frame(structure, time=None)[source]#

Add a new frame to the ensemble.

Parameters:

structure (Structure) – Structure to add as new frame
time (Optional[float], default: None) – Optional time value for this frame

Raises:

ValueError – If structure atom count doesn’t match

Return type:

__getitem__(index)[source]#

Get frame(s) from ensemble.

Parameters:: index (Union[int, slice]) – Frame index or slice
Return type:: Union[Structure, StructureEnsemble]
Returns:: Structure for single frame, StructureEnsemble for slice

Examples

>>> frame0 = ensemble[0]  # Single frame as Structure
>>> sub_traj = ensemble[10:20]  # Sub-trajectory as StructureEnsemble

__len__()[source]#

Return number of frames.

Return type:: int

__iter__()[source]#

Iterate over frames as Structure objects.

Return type:: Any

get_frame_coords(frame_index)[source]#

Get coordinates for specific frame.

Parameters:: frame_index (int) – Index of frame
Return type:: ndarray
Returns:: Coordinate array (n_atoms, 3) for the frame

set_frame_coords(frame_index, coords)[source]#

Set coordinates for specific frame.

Parameters:

frame_index (int) – Index of frame
coords (ndarray) – Coordinate array (n_atoms, 3)

Return type:

center_frames(selection=None)[source]#

Center all frames at origin.

Parameters:: selection (Optional[str], default: None) – Optional selection for center calculation (default: all atoms)
Return type:: None

rmsd(reference_frame=0, selection=None)[source]#

Calculate RMSD of each frame relative to reference.

Parameters:

reference_frame (int, default: 0) – Index of reference frame
selection (Optional[str], default: None) – Optional atom selection for RMSD calculation

Return type:

Returns:

Array of RMSD values for each frame

__repr__()[source]#

String representation.

Return type:: str

__str__()[source]#

Detailed string representation.

Return type:: str

Multi-model trajectory representation for handling structural ensembles.

Key Methods:

from_pdb() - Load multi-model PDB
__getitem__() - Access individual models
__len__() - Number of models

BondList#

class molr.BondList(n_bonds=0)[source]#

Bases: object

Efficient storage and manipulation of molecular bonds with smart indexing.

The BondList class stores bonds as pairs of atom indices with additional metadata such as bond order, detection method, and confidence scores. It supports smart indexing that automatically adjusts bond indices when the parent structure is sliced or modified.

Bond storage uses Structure of Arrays (SoA) design:

bonds: (N, 2) array of atom index pairs
bond_order: Bond order (1=single, 2=double, 3=triple, 1.5=aromatic)
bond_type: Bond type classification
detection_method: How the bond was detected
confidence: Confidence score for bond existence

Smart indexing features:

Automatic bond index adjustment when structure is sliced
Efficient bond filtering based on atom selections
Bond validation against structure changes

Example

>>> bond_list = BondList()
>>> bond_list.add_bond(0, 1, bond_order=1.0, bond_type="covalent")
>>> bond_list.add_bonds([(2, 3), (3, 4)], bond_orders=[1.0, 2.0])
>>> subset_bonds = bond_list.filter_by_atoms([0, 1, 2])

Parameters:: n_bonds (int, default: 0)

__init__(n_bonds=0)[source]#

Initialize BondList.

Parameters:: n_bonds (int, default: 0) – Initial number of bonds (default: 0 for dynamic growth)

add_property(name, dtype=<class 'numpy.float32'>, default_value=None)[source]#

Add custom property to bonds.

Parameters:

name (str) – Name of the property
dtype (Any, default: <class 'numpy.float32'>) – NumPy data type for the property
default_value (Any, default: None) – Default value to fill existing bonds

Raises:

ValueError – If property name already exists

Return type:

add_bond(atom1, atom2, bond_order=1.0, bond_type='covalent', **kwargs)[source]#

Add a single bond.

Parameters:

atom1 (int) – Index of first atom
atom2 (int) – Index of second atom
bond_order (float, default: 1.0) – Bond order (1.0=single, 2.0=double, etc.)
bond_type (str, default: 'covalent') – Type of bond
**kwargs (Any) – Additional bond properties

Return type:

int

Returns:

Index of the added bond

Raises:

ValueError – If atoms are the same or invalid

add_bonds(bond_pairs, bond_orders=None, bond_types=None, **kwargs)[source]#

Add multiple bonds efficiently.

Parameters:

bond_pairs (List[Tuple[int, int]]) – List of (atom1, atom2) tuples
bond_orders (Optional[List[float]], default: None) – Optional list of bond orders (default: all 1.0)
bond_types (Optional[List[str]], default: None) – Optional list of bond types (default: all “covalent”)
**kwargs (Any) – Additional properties as lists

Return type:

Union[Structure, StructureEnsemble]

Returns:

Array of bond indices for added bonds

Raises:

ValueError – If list lengths don’t match

remove_bonds(bond_indices)[source]#

Remove bonds by index.

Parameters:: bond_indices (Union[int, List[int], ndarray[Any, Any]]) – Bond index or array of bond indices to remove
Return type:: None

get_bonds_for_atom(atom_index)[source]#

Get all bonds involving a specific atom.

Parameters:: atom_index (int) – Index of the atom
Return type:: ndarray
Returns:: Array of bond indices involving the atom

get_neighbors(atom_index)[source]#

Get neighbor atoms for a specific atom.

Parameters:: atom_index (int) – Index of the atom
Return type:: ndarray
Returns:: Array of neighbor atom indices

filter_by_atoms(atom_indices)[source]#

Create new BondList containing only bonds between specified atoms.

Parameters:: atom_indices (Union[List[int], ndarray]) – List or array of atom indices to keep
Return type:: molr.BondList
Returns:: New BondList with filtered bonds and remapped indices

get_bond_matrix(n_atoms)[source]#

Create bond adjacency matrix.

Parameters:: n_atoms (int) – Total number of atoms in structure
Return type:: ndarray
Returns:: (n_atoms, n_atoms) boolean adjacency matrix

validate_bonds(n_atoms)[source]#

Validate that all bonds reference valid atom indices.

Parameters:: n_atoms (int) – Number of atoms in the structure
Return type:: Tuple[bool, List[int]]
Returns:: Tuple of (all_valid, list_of_invalid_bond_indices)

__len__()[source]#

Return number of bonds.

Return type:: int

__getitem__(index)[source]#

Get bond(s) by index.

Parameters:: index (Union[int, slice, ndarray]) – Integer, slice, or array for indexing
Return type:: Union[Tuple[int, int], molr.BondList]
Returns:: Single bond tuple or new BondList with selected bonds

__repr__()[source]#

String representation of BondList.

Return type:: str

__str__()[source]#

Detailed string representation.

Return type:: str

Efficient storage and manipulation of molecular bonds.

Key Methods:

get_bond() - Get bond between atoms
get_neighbors() - Get bonded neighbors
to_connectivity_matrix() - Convert to adjacency matrix

Bond Detection#

DefaultBondDetector#

class molr.bond_detection.DefaultBondDetector(vdw_factor=0.75)[source]#

Bases: object

Simplified bond detector using templates and distance criteria.

This replaces the complex hierarchical system with a straightforward approach: 1. Apply residue templates (from residue_bonds.py or CCD) 2. Apply distance-based detection as fallback

Parameters:: vdw_factor (float, default: 0.75)

__init__(vdw_factor=0.75)[source]#

Initialize the default bond detector.

Parameters:: vdw_factor (float, default: 0.75) – Factor for Van der Waals radii in distance detection (0.0 < factor <= 1.0). Default 0.75 works well for most cases.

detect_bonds(structure, use_file_bonds=True)[source]#

Detect bonds in a molecular structure.

Parameters:

structure (Structure) – Structure to analyze
use_file_bonds (bool, default: True) – Whether to include file-based bonds (CONECT, etc.)

Return type:

BondList

Returns:

BondList containing all detected bonds

Default bond detector that combines residue templates and distance-based detection.

Bond Detection Functions#

molr.bond_detection.detect_bonds(structure, vdw_factor=0.75, use_file_bonds=True)[source]#

Convenience function to detect bonds in a structure.

Parameters:

structure (Structure) – Structure to analyze
vdw_factor (float, default: 0.75) – Factor for VdW radii in distance detection
use_file_bonds (bool, default: True) – Whether to include file-based bonds

Return type:

BondList

Returns:

BondList with detected bonds

Main function for bond detection in molecular structures.

I/O Parsers#

PDB Parser#

class molr.PDBParser[source]#

Bases: object

PDB file parser for the space module.

Designed specifically for the NumPy-based Structure class, this parser converts pdbreader output directly to NumPy arrays for optimal performance.

Features: - Direct conversion to NumPy arrays - CONECT record parsing for explicit bonds - Multi-model support for trajectories - Efficient memory usage - Full PDB annotation support

__init__()[source]#: Initialize the PDB parser.

parse_file(filename)[source]#

Parse a PDB file and return a Structure.

Parameters:

filename (str) – Path to the PDB file

Return type:

Returns:

Structure object with all atoms and annotations

Raises:

IOError – If file cannot be read
ValueError – If PDB format is invalid

parse_string(pdb_content)[source]#

Parse PDB content from a string.

Parameters:: pdb_content (str) – PDB file content as string
Return type:: Union[Structure, StructureEnsemble]
Returns:: Structure object with all atoms and annotations

Parser for PDB format files with support for:

Multi-model structures
CONECT record parsing
Alternate conformations
Insertion codes
Crystal information

mmCIF Parser#

class molr.mmCIFParser[source]#

Bases: object

mmCIF file parser for the space module.

Designed specifically for the NumPy-based Structure class, this parser converts mmcif output directly to NumPy arrays for optimal performance.

Features: - Direct conversion to NumPy arrays - Multi-model support for trajectories - Efficient memory usage - Full mmCIF annotation support - Chemical bond information from mmCIF data

__init__()[source]#: Initialize the mmCIF parser.

parse_file(filename)[source]#

Parse an mmCIF file and return a Structure or StructureEnsemble.

Parameters:

filename (str) – Path to the mmCIF file

Return type:

Union[Structure, StructureEnsemble]

Returns:

Structure object for single model, StructureEnsemble for multi-model

Raises:

IOError – If file cannot be read
ValueError – If mmCIF format is invalid

parse_string(mmcif_content)[source]#

Parse mmCIF content from a string.

Parameters:: mmcif_content (str) – mmCIF file content as string
Return type:: Union[Structure, StructureEnsemble]
Returns:: Structure object with all atoms and annotations

Parser for mmCIF format files with support for:

Chemical bond information
Large structure handling
Complete metadata extraction

Selection System#

Selection Engine#

class molr.selection.SelectionEngine(cache_size=100)[source]#

Bases: object

Engine for evaluating atom selections on structures.

Provides caching and optimization for repeated selections.

Parameters:: cache_size (int, default: 100)

__init__(cache_size=100)[source]#

Initialize selection engine.

Parameters:: cache_size (int, default: 100) – Maximum number of cached selections

select(structure, selection)[source]#

Select atoms from a structure.

Parameters:

structure (Structure) – The structure to select from
selection (Union[str, SelectionExpression]) – Selection string or expression

Return type:

Returns:

Boolean array indicating selected atoms

Raises:

ParseException – If selection string is invalid

select_atoms(structure, selection)[source]#

Return a new Structure containing only selected atoms.

Parameters:

structure (Structure) – The structure to select from
selection (Union[str, SelectionExpression]) – Selection string or expression

Return type:

Structure

Returns:

New Structure with selected atoms

count(structure, selection)[source]#

Count atoms matching selection.

Parameters:

structure (Structure) – The structure to select from
selection (Union[str, SelectionExpression]) – Selection string or expression

Return type:

int

Returns:

Number of selected atoms

get_indices(structure, selection)[source]#

Get indices of atoms matching selection.

Parameters:

structure (Structure) – The structure to select from
selection (Union[str, SelectionExpression]) – Selection string or expression

Return type:

Returns:

Array of atom indices

clear_cache()[source]#

Clear the selection cache.

Return type:: None

Main engine for parsing and evaluating selection expressions.

Selection Functions#

molr.selection.select(structure, selection)[source]#

Select atoms from a structure.

Parameters:

structure (Structure) – The structure to select from
selection (Union[str, SelectionExpression]) – Selection string or expression

Return type:

Returns:

Boolean array indicating selected atoms

Main selection function for atom queries.

molr.selection.select_atoms(structure, selection)[source]#

Return a new Structure containing only selected atoms.

Parameters:

structure (Structure) – The structure to select from
selection (Union[str, SelectionExpression]) – Selection string or expression

Return type:

Structure

Returns:

New Structure with selected atoms

Alternative selection function.

Selection Parser#

class molr.selection.SelectionParser[source]#

Bases: object

Parser for atom selection language.

Supports syntax like:

“protein and backbone”
“resname ALA GLY”
“chain A and resid 1:100”
“element C N O”
“not water”
“(protein and chain A) or ligand”
“byres (ligand and within 5 of protein)”

__init__()[source]#: Initialize the parser with grammar rules.

parse(selection_string)[source]#

Parse a selection string into a SelectionExpression.

Parameters:: selection_string (str) – The selection string to parse
Return type:: SelectionExpression
Returns:: SelectionExpression object
Raises:: ParseException – If the string cannot be parsed

classmethod parse_selection(selection_string)[source]#

Convenience class method to parse a selection string.

Parameters:: selection_string (str) – The selection string to parse
Return type:: SelectionExpression
Returns:: SelectionExpression object

pyparsing-based parser for selection language syntax.

Supported Expressions:

Atom properties: name, element, resname, chain
Spatial queries: within, around, cog
Boolean operations: and, or, not
Residue modifiers: byres
Predefined groups: protein, backbone, sidechain

Expression Classes#

Base Expression#

class molr.selection.SelectionExpression[source]#

Bases: ABC

Abstract base class for all selection expressions.

Selection expressions form a tree structure that can be evaluated against a Structure to produce a boolean mask indicating which atoms are selected.

abstractmethod evaluate(structure)[source]#

Evaluate the expression against a structure.

Parameters:: structure (Structure) – The molecular structure to evaluate against
Return type:: ndarray
Returns:: Boolean array with True for selected atoms

__and__(other)[source]#

Create AND expression using & operator.

Parameters:: other (SelectionExpression)
Return type:: SelectionExpression

__or__(other)[source]#

Create OR expression using | operator.

Parameters:: other (SelectionExpression)
Return type:: SelectionExpression

__invert__()[source]#

Create NOT expression using ~ operator.

Return type:: SelectionExpression

abstractmethod __repr__()[source]#

String representation of the expression.

Return type:: str

Atom Property Expressions#

class molr.selection.ElementExpression(elements)[source]#

Select atoms by element type.

Parameters:: elements (Union[str, List[str]])

__init__(elements)[source]#

Initialize element selection.

Parameters:: elements (Union[str, List[str]]) – Element symbol(s) to select

evaluate(structure)[source]#

Select atoms matching the specified elements.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.AtomNameExpression(names)[source]#

Select atoms by atom name.

Parameters:: names (Union[str, List[str]])

__init__(names)[source]#

Initialize atom name selection.

Parameters:: names (Union[str, List[str]]) – Atom name(s) to select

evaluate(structure)[source]#

Select atoms matching the specified names.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.ResidueNameExpression(resnames)[source]#

Select atoms by residue name.

Parameters:: resnames (Union[str, List[str]])

__init__(resnames)[source]#

Initialize residue name selection.

Parameters:: resnames (Union[str, List[str]]) – Residue name(s) to select

evaluate(structure)[source]#

Select atoms in residues matching the specified names.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.ResidueIdExpression(resids)[source]#

Select atoms by residue ID.

Parameters:: resids (Union[int, List[int], range])

__init__(resids)[source]#

Initialize residue ID selection.

Parameters:: resids (Union[int, List[int], range]) – Residue ID(s) to select

evaluate(structure)[source]#

Select atoms in residues matching the specified IDs.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.ChainExpression(chains)[source]#

Select atoms by chain ID.

Parameters:: chains (Union[str, List[str]])

__init__(chains)[source]#

Initialize chain selection.

Parameters:: chains (Union[str, List[str]]) – Chain ID(s) to select

evaluate(structure)[source]#

Select atoms in the specified chains.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.IndexExpression(indices)[source]#

Select atoms by index.

Parameters:: indices (Union[int, List[int], range, slice])

__init__(indices)[source]#

Initialize index selection.

Parameters:: indices (Union[int, List[int], range, slice]) – Atom indices to select

evaluate(structure)[source]#

Select atoms at the specified indices.

Parameters:: structure (Structure)
Return type:: ndarray

Structural Expressions#

class molr.selection.BackboneExpression[source]#

Select backbone atoms.

evaluate(structure)[source]#

Select atoms that are part of the backbone.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.SidechainExpression[source]#

Select sidechain atoms.

evaluate(structure)[source]#

Select atoms that are part of sidechains.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.ProteinExpression[source]#

Select protein atoms.

evaluate(structure)[source]#

Select atoms that are part of protein residues.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.NucleicExpression[source]#

Select nucleic acid atoms.

evaluate(structure)[source]#

Select atoms that are part of DNA or RNA.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.DNAExpression[source]#

Select DNA atoms.

evaluate(structure)[source]#

Select atoms that are part of DNA.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.RNAExpression[source]#

Select RNA atoms.

evaluate(structure)[source]#

Select atoms that are part of RNA.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.LigandExpression[source]#

Select ligand atoms.

evaluate(structure)[source]#

Select atoms that are part of ligands.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.AromaticExpression[source]#

Select aromatic atoms.

evaluate(structure)[source]#

Select atoms that are part of aromatic systems.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.WaterExpression[source]#

Select water molecules.

evaluate(structure)[source]#

Select atoms that are part of water molecules.

Parameters:: structure (Structure)
Return type:: ndarray

Boolean Expressions#

class molr.selection.AndExpression(left, right)[source]#

Logical AND of two expressions.

Parameters:

left (SelectionExpression)
right (SelectionExpression)

__init__(left, right)[source]#

Initialize AND expression.

Parameters:

left (SelectionExpression) – Left operand
right (SelectionExpression) – Right operand

evaluate(structure)[source]#

Return atoms selected by both expressions.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.OrExpression(left, right)[source]#

Logical OR of two expressions.

Parameters:

left (SelectionExpression)
right (SelectionExpression)

__init__(left, right)[source]#

Initialize OR expression.

Parameters:

left (SelectionExpression) – Left operand
right (SelectionExpression) – Right operand

evaluate(structure)[source]#

Return atoms selected by either expression.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.NotExpression(operand)[source]#

Logical NOT of an expression.

Parameters:: operand (SelectionExpression)

__init__(operand)[source]#

Initialize NOT expression.

Parameters:: operand (SelectionExpression) – Expression to negate

evaluate(structure)[source]#

Return atoms not selected by the expression.

Parameters:: structure (Structure)
Return type:: ndarray

Special Expressions#

class molr.selection.AllExpression[source]#

Select all atoms.

evaluate(structure)[source]#

Return True for all atoms.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.NoneExpression[source]#

Select no atoms.

evaluate(structure)[source]#

Return False for all atoms.

Parameters:: structure (Structure)
Return type:: ndarray

class molr.selection.ByResidueExpression(atom_selection)[source]#