DAQplugin

DAQplugin is a collection of tools for computing, visualizing, and exporting DAQ scores for protein atomic models in cryo-EM maps.

This repository provides:

Google Colab-ready Jupyter notebooks for DAQ score computation and NPY file generation (DAQ_Score_Grid.ipynb)
A ChimeraX plugin (daqcolor) for interactive coloring and visualization
Command-line utilities for processing and file export

DAQ is included as a Git submodule to ensure consistency with published methods.

If you use DAQplugin, please cite the following paper:

Terashi, G., Wang, X., Maddhuri Venkata Subramaniya, S. R., Tesmer, J. J., & Kihara, D. (2022). Residue-wise local quality estimation for protein models from cryo-EM maps. Nature Methods, 19(9), 1116-1125.

DAQ score monitoring (7jsn version 1.1 and EMD-22458)

Repository Structure

DAQplugin/
├── DAQ/                  # DAQ core (git submodule)
├── daqcolor/             # ChimeraX plugin
│   ├── src/
│   ├── bundle_info.xml
│   ├── cmd.py            # commands for ChimeraX
│   ├── compute.py        # DAQ for ChimeraX
│   ├── onnx_model.py     # DL for ChimeraX
│   └── 00README.txt
├── cli/                  # Command-line scripts
├── map_util/             # Map preprocessing utilities
├── DAQ_Score.ipynb       # Original DAQ score calculation notebook
├── DAQ_Score_Grid.ipynb  # Grid / NPY generation notebook
├── DAQ_Score_Pdb.ipynb   # PDB coordinate based DAQ score calculation notebook
├── README.md
└── LICENSE

Installation on ChimeraX Toolshed

In the Menu bar: Tools > More Tools > DAQplugin page > Click [Download]

Start GUI

Tools > Validation > DAQplugin

Use GUI

The DAQplugin GUI supports grid-based DAQ computation from cryo-EM maps, structure-based DAQ computation (the original DAQ style), and real-time coloring and monitoring.

1. Inputs

Structure: Select a loaded atomic model (PDB/mmCIF) from the ChimeraX session. Example: #2 7jsn
Map: Select a loaded cryo-EM density map. Example: #1 emd_22458.mrc
Output/Overwrite NPY: Specify a path where the computed DAQ scores are saved as an .npy file.
Load Existing NPY: Load an existing .npy file for coloring and monitoring only.

2. Compute Options

batch_size: Number of grid points processed per batch during grid-based DAQ computation. Larger values run faster but use more memory. Default: 512.
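To make the trade-off concrete, here is a minimal Python sketch of batched processing. It is not the plugin's code: `iter_batches` is a hypothetical helper, and the only value taken from this README is the default batch size of 512.

```python
def iter_batches(n_points, batch_size=512):
    """Yield (start, stop) index ranges that cover n_points in order."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for start in range(0, n_points, batch_size):
        yield start, min(start + batch_size, n_points)

# 1300 points with the default batch size are split into 3 batches;
# only one batch needs to be held in memory at a time.
batches = list(iter_batches(1300))
print(batches)  # [(0, 512), (512, 1024), (1024, 1300)]
```

A larger batch_size means fewer iterations, but each iteration holds more points in memory at once.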

3. Grid-based DAQ Score Computation

This mode computes DAQ scores by scanning the EM map on a grid.

Parameters
- contour: Density threshold used to select grid points. Only grid points with density ≥ contour are used for the normalization process. This value should typically match the contour level used for map visualization in ChimeraX.
- stride: Grid sampling interval (in voxels). 1 = dense sampling (slowest, most accurate); 2 or higher = faster computation with reduced sampling. Recommended: 2.
- max_points: Maximum number of grid points to evaluate. Useful for very large maps to limit memory and runtime.

Run
- Click [Run Grid-based DAQ score computation] to:
  - Scan the map above the specified contour level
  - Compute DAQ scores
  - Save results to the specified .npy file
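The way contour, stride, and max_points interact can be sketched with NumPy. This is an illustration under stated assumptions, not the plugin's implementation: `select_grid_points` is a hypothetical name, and random subsampling when the selection exceeds max_points is an assumption (the README does not say how the limit is enforced).

```python
import numpy as np

def select_grid_points(volume, contour, stride=2, max_points=500000, seed=0):
    """Return (N, 3) voxel indices of strided grid points with density >= contour."""
    sub = volume[::stride, ::stride, ::stride]
    # Indices in the strided view, scaled back to full-grid voxel indices.
    idx = np.argwhere(sub >= contour) * stride
    if len(idx) > max_points:
        # Assumption: an over-full selection is randomly subsampled.
        rng = np.random.default_rng(seed)
        keep = rng.choice(len(idx), size=max_points, replace=False)
        idx = idx[np.sort(keep)]
    return idx

# Synthetic 8x8x8 map with a 4x4x4 block of density 1.0 in the middle.
volume = np.zeros((8, 8, 8), dtype=np.float32)
volume[2:6, 2:6, 2:6] = 1.0
points = select_grid_points(volume, contour=0.5, stride=2)
print(points.shape)  # (8, 3)
```

With stride 1 the same map yields 64 points, which shows why a stride of 2 cuts the workload by roughly a factor of eight.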

4. Coloring / Monitoring with Existing NPY Scores

This section is used without recomputing DAQ scores, relying instead on an existing .npy file.

npy_path: Automatically taken from Output/Load Existing NPY if specified above.
metric: Select which DAQ metric to visualize:
- aa_score – DAQ(AA), amino-acid-wise score
- atom_score – DAQ(Cα), Cα likelihood score
k: Neighborhood size for smoothing (number of neighboring residues). Recommended: 1
half_window: Half-size of the sliding window used for sequence-based averaging. Example: 9 → window size = 19 residues
clamp_min / clamp_max: Clamp score values for coloring. Typical range: -1.0 to 1.0
interval (sec): Update interval for monitoring mode.
Apply coloring: Colors the selected structure based on the chosen DAQ metric.
Start monitor: Continuously updates DAQ score coloring.
Stop monitor: Stops the monitoring.
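As an illustration of what clamp_min and clamp_max do before coloring, the sketch below clamps a score and maps it onto a red-white-blue ramp. The ramp itself is an assumption (red for negative and blue for positive scores follows the example captions in this README, but the plugin's actual colormap may differ), and `clamp` and `score_to_rgb` are hypothetical helpers.

```python
def clamp(value, lo=-1.0, hi=1.0):
    """Limit value to the closed range [lo, hi]."""
    return max(lo, min(hi, value))

def score_to_rgb(score, clamp_min=-1.0, clamp_max=1.0):
    """Map a score to (r, g, b) in 0..1: red at clamp_min, white midway, blue at clamp_max."""
    s = clamp(score, clamp_min, clamp_max)
    t = (s - clamp_min) / (clamp_max - clamp_min)  # position in 0..1
    if t < 0.5:
        f = t / 0.5
        return (1.0, f, f)      # red fading to white
    f = (1.0 - t) / 0.5
    return (f, f, 1.0)          # white fading to blue

print(score_to_rgb(-3.0))  # (1.0, 0.0, 0.0): clamped to -1.0, fully red
print(score_to_rgb(0.0))   # (1.0, 1.0, 1.0): midpoint, white
print(score_to_rgb(2.5))   # (0.0, 0.0, 1.0): clamped to 1.0, fully blue
```

Clamping keeps a few extreme scores from compressing the color range for the rest of the model.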

5. Structure-based DAQ Score Computation (Original DAQ)

This mode computes DAQ scores using the original structure-based DAQ protocol, without grid-based scanning. It takes heavy-atom positions directly from the atomic model and then normalizes the DAQ scores. This mode is suitable for direct comparison with previously published DAQ results, but it cannot be used for monitoring.

Click [Run Structure-based DAQ score computation] to execute.

6. Typical Workflows

A. Full DAQ computation and visualization

Load model and map into ChimeraX
Set contour
Run Grid-based DAQ score computation
Apply coloring using aa_score

B. Coloring only (use precomputed scores npy file)

Load model and map
Specify an existing .npy file
Click Apply coloring or Start monitor

C. DAQ score Monitoring with ISOLDE

Start an external refinement pipeline (ISOLDE).
Run Grid-based DAQ score computation, or load an existing .npy file in DAQplugin
Click Start monitor
Observe DAQ score changes during the refinement process.
Click Stop monitor

Installation from GitHub

Clone the Repository (IMPORTANT)

This repository uses Git submodules.

Clone with submodules enabled:

git clone --recurse-submodules https://github.com/gterashi/DAQplugin.git

If you already cloned without submodules:

git submodule update --init --recursive

1. DAQ Score and NPY file Computation (Jupyter Notebook on Google Colab)

Notebook

DAQ_Score_Grid.ipynb

Purpose

This notebook computes:

DAQ scores from atomic models (PDB/CIF) and cryo-EM maps (MRC/MAP)
Numpy files (.npy) containing per-point probability and score information

The generated .npy files are used by the ChimeraX plugin (daqcolor) for visualization.
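Before passing a points file to the plugin, it can be useful to confirm that it loads and has the expected shape. The sketch below is a hedged example: the N×32 layout comes from the CLI option description in this README, the meaning of the individual columns is not documented here, and `load_points` is a hypothetical helper. A synthetic stand-in file is used instead of real DAQ output.

```python
import os
import tempfile

import numpy as np

def load_points(npy_path, expected_columns=32):
    """Load a points file and check that it is a 2-D array of the expected width."""
    points = np.load(npy_path)
    if points.ndim != 2 or points.shape[1] != expected_columns:
        raise ValueError(
            f"expected an (N, {expected_columns}) array, got {points.shape}"
        )
    return points

# Write and reload a synthetic file so the example is self-contained.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "points_example.npy")
    np.save(path, np.zeros((5, 32), dtype=np.float32))
    pts = load_points(path)

print(pts.shape)  # (5, 32)
```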

Typical Workflow

Provide:
- Atomic model (.pdb or .cif)
- Cryo-EM map (.mrc or .map)
Run the notebook cells sequentially
Output:
- points_AA_ATOM_SS_swap.npy
- Optional: PDB file with DAQ score

Notebook

DAQ_Score_Pdb.ipynb

Purpose

This notebook computes:

DAQ scores from atomic models (PDB/CIF) and cryo-EM maps (MRC/MAP)
Atomic models annotated with per-residue DAQ scores

Typical Workflow

Provide:
- Atomic model (.pdb or .cif)
- Cryo-EM map (.mrc or .map)
Run the notebook cells sequentially
Output:
- PDB file with DAQ score

2. ChimeraX Plugin: `daqcolor`

The daqcolor plugin enables interactive coloring and visualization of DAQ scores in ChimeraX.

Installation (Developer Mode)

From the ChimeraX command line:

# Uninstall (if already installed)
devel clean [DAQplugin PATH]/daqcolor

# Install
devel install [DAQplugin PATH]/daqcolor

Note
The devel command requires ChimeraX developer tools.

Help

help daqcolor

Commands

Apply DAQ coloring once

daqcolor apply npyPath model [k N] [half_window N] [colormap] [metric] [atom_name CA] [clamp_min clampMin] [clamp_max clampMax]

npyPath : Path to the numpy file computed by the notebook (positional argument).
model : ChimeraX model ID (e.g., #1)
k : Number of nearest neighbors for kNN (default: 1)
half_window : Window averaging half-width (n±half_window, default: 9)
colormap : Optional colormap for visualization
metric :
- aa_score — DAQ(AA) score
- atom_score — DAQ(CA) score
- aa_conf:<AA> — DAQ confidence for a specific amino-acid type
atom_name : Atom name (default: CA)
clamp_min, clamp_max : Optional score clamping

Examples

# Color model #2 by amino-acid DAQ score
daqcolor apply ./points_AA_ATOM_SS_swap.npy #2 metric aa_score 

# Color by atom (CA) DAQ score
daqcolor apply ./points_AA_ATOM_SS_swap.npy #1 metric atom_score

Live Monitoring

The daqcolor monitor command displays DAQ scores based on the current atom coordinates.

daqcolor monitor model [npy_path npyPath] [k N] [half_window N] [colormap] [metric] [atom_name CA] [on true|false] [interval N]

Parameters:

model : ChimeraX model ID (e.g., #1) - required
npy_path : Path to the numpy file - required when turning monitor on, not needed when turning off
k : Number of nearest neighbors for kNN (default: 1)
half_window : Window averaging half-width (default: 9)
colormap : Optional colormap for visualization
metric : Scoring metric (aa_score, atom_score, or aa_conf:<AA>)
atom_name : Atom name (default: CA)
on : Enable (true) or disable (false) monitoring (default: true)
interval : Update frequency in seconds (default: 0.5)

Examples:

# Start monitoring (npy_path required)
daqcolor monitor #2 npy_path ./points_AA_ATOM_SS_swap.npy metric aa_score

# Start monitoring with custom update interval
daqcolor monitor #2 npy_path ./points_AA_ATOM_SS_swap.npy metric aa_score interval 1.0

# Stop monitoring (simpler - no npy_path needed)
daqcolor monitor #2 on false

Notes:

If you run daqcolor monitor on the same model multiple times, it will automatically replace the previous monitor
To stop monitoring, use on false without specifying the npy_path
The interval parameter controls how frequently the coloring is updated (throttling)
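The throttling behaviour described in the notes can be pictured with a small Python sketch. It is not taken from the plugin: `Throttle` is a hypothetical class, and the only value borrowed from this README is the default interval of 0.5 seconds. A simulated clock makes the example deterministic.

```python
import time

class Throttle:
    """Allow an action at most once per `interval` seconds."""

    def __init__(self, interval=0.5, clock=time.monotonic):
        self.interval = interval
        self.clock = clock
        self._last = None  # time of the last allowed action

    def ready(self):
        """Return True (and record the time) if enough time has passed."""
        now = self.clock()
        if self._last is None or now - self._last >= self.interval:
            self._last = now
            return True
        return False

# Simulated coordinate-change events at these timestamps (seconds).
fake_times = iter([0.0, 0.1, 0.4, 0.5, 0.9, 1.0])
throttle = Throttle(interval=0.5, clock=lambda: next(fake_times))
fired = [throttle.ready() for _ in range(6)]
print(fired)  # [True, False, False, True, False, True]
```

Events that arrive faster than the interval are skipped, so a larger interval trades responsiveness for a lower recoloring cost.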

Example: EMD-22458 and mis-aligned model

Mis-aligned model: Red indicates negative DAQ scores. Green regions are located outside the contour level.
Aligned model using the FitMap command in ChimeraX: Blue indicates positive DAQ scores.
PDB 7JSN (version 1.1): DAQ detects modeling errors in this deposited model.

Visualize point clouds

daqcolor points npyPath [radius] [metric] [colormap] [clamp_min clampMin] [clamp_max clampMax]

Parameters:

npyPath : Path to the numpy file (positional argument)
radius : Marker radius (default: 0.4)
metric : Optional metric for coloring:
- aa_conf — Maximum confidence across all amino acids
- aa_top:<AA> — Confidence for a specific amino acid (e.g., aa_top:ALA)
colormap : Optional colormap for visualization
clamp_min, clamp_max : Optional score clamping

Examples:

# Show points without coloring
daqcolor points ./points_AA_ATOM_SS_swap.npy radius 0.6

# Show points colored by maximum confidence
daqcolor points ./points_AA_ATOM_SS_swap.npy radius 0.6 metric aa_conf

# Show points colored by specific amino acid confidence
daqcolor points ./points_AA_ATOM_SS_swap.npy radius 0.6 metric aa_top:ALA

Clear markers:

daqcolor clear

DAQ Score Computation (ChimeraX)

The daqscore commands allow you to compute DAQ scores directly within ChimeraX using ONNX Runtime inference.

Recommendation
For large maps or a slow CPU, consider using the Google Colab notebook instead; it provides free GPU acceleration and handles larger datasets more efficiently.

Compute DAQ scores from a map (grid-based)

daqscore compute_grid mapInput contour [output npyPath] [stride N] [batch_size N] [max_points N] [ckpt ckptPath] [structure #model] [monitor true|false] [metric] [k N] [colormap] [half_window N]

Parameters:

mapInput: Path to MRC/MAP file OR ChimeraX Volume model (e.g., #1) - required (positional)
contour: Contour threshold value - required (positional)
output: Path to save output NPY file (auto-generated if not specified)
stride: Stride for point sampling (default: 2, higher=faster but less dense)
batch_size: Batch size for inference (default: 512)
max_points: Maximum number of points to sample (default: 500000)
ckpt: Optional path to ONNX checkpoint/model file (uses bundled model if not specified)
structure: Optional structure model to apply coloring after computation
monitor: If true and structure is specified, start live monitoring (default: false)
metric: Coloring metric (aa_score, atom_score, or aa_conf:<AA>, default: aa_score)
k: Number of nearest neighbors for kNN (default: 1)
colormap: Optional colormap for visualization
half_window: Half window size for score smoothing (default: 9)

Examples:

# Compute from a file path
daqscore compute_grid /path/to/map.mrc 0.5 output /path/to/output.npy

# Compute from loaded volume (contour value required)
daqscore compute_grid #1 0.5

# Compute and apply coloring to structure
daqscore compute_grid #1 0.5 structure #2 metric aa_score

# Compute, apply coloring, and start monitoring
daqscore compute_grid #1 0.5 structure #2 monitor true metric aa_score half_window 9

# Stop monitoring of structure #2
daqcolor monitor #2 on false

Compute DAQ scores from a map (PDB-based)

This command computes DAQ scores using heavy atom positions from a PDB structure as query points, instead of grid points from the map. Reference distributions are computed from atoms with density >= 0.

daqscore compute_pdb mapInput structure #model [output npyPath] [batch_size N] [ckpt ckptPath] [metric] [k N] [colormap] [half_window N] [apply_color true|false] [save_model modelPath]

Parameters:

mapInput: Path to MRC/MAP file OR ChimeraX Volume model (e.g., #1) - required (positional)
structure: Structure model whose heavy atom coordinates will be used - required (keyword)
output: Path to save output NPY file (auto-generated if not specified)
batch_size: Batch size for inference (default: 512)
ckpt: Optional path to ONNX checkpoint/model file (uses bundled model if not specified)
metric: Coloring metric (aa_score, atom_score, or aa_conf:<AA>, default: aa_score)
k: Number of nearest neighbors for kNN (default: 1)
colormap: Optional colormap for visualization
half_window: Half window size for score smoothing (default: 9)
apply_color: If true, apply coloring to structure after computation (default: true)
save_model: Optional path to save the scored structure model (PDB or CIF format). Scores are written to B-factor field.

Examples:

# Compute scores at heavy atom positions and apply coloring
daqscore compute_pdb #1 structure #2 metric aa_score

# Compute without applying color
daqscore compute_pdb #1 structure #2 apply_color false

# Compute and save scored model
daqscore compute_pdb #1 structure #2 metric aa_score save_model scored_model.pdb

# With custom parameters
daqscore compute_pdb #1 structure #2 metric atom_score k 1 half_window 9 save_model output.cif

Saving Colored Models

Once colored, models can be exported using ChimeraX:

For example, to save model #1 as colored.pdb:

save colored.pdb #1

DAQ scores are written to the B-factor field
Window-averaged scores (defined by half_window and k) are preserved
Both PDB and CIF formats are supported

3. Command-Line Usage (CLI)

DAQ Score Export to B-factor (CLI)

The script daq_write_bfactor.py writes DAQ-style scores into the B-factor field of a protein structure file (PDB or mmCIF), using the same scoring logic as the ChimeraX daqcolor plugin.

Requirements

Python 3.8+
NumPy
SciPy (optional, for fast kNN; NumPy fallback is used if unavailable)
gemmi (required for PDB/mmCIF I/O)

Install dependencies:

pip install numpy scipy gemmi

Basic Usage

python daq_write_bfactor.py \
    -i model.cif \
    -p points_AA_ATOM_SS_swap.npy \
    -m aa_score \
    -o model.daq.b.cif

This command:

Computes DAQ scores per residue
Writes the scores to the B-factor field
Preserves the input file format (PDB or mmCIF)
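To show what "writing scores to the B-factor field" means for the PDB format, the sketch below rewrites columns 61-66 of ATOM/HETATM records in pure Python. It is only an illustration of the fixed-column layout: the actual script relies on gemmi and also handles mmCIF, and `atom_line` and `set_bfactors` are hypothetical helpers. Replacing non-finite scores with a fill value mirrors the --nan-fill option.

```python
import math

def atom_line(serial, name, resname, chain, resnum, x, y, z, b=0.0):
    """Build a minimal fixed-column PDB ATOM record (66 characters)."""
    return (f"ATOM  {serial:5d} {name:<4s} {resname:>3s} {chain}{resnum:4d}    "
            f"{x:8.3f}{y:8.3f}{z:8.3f}{1.0:6.2f}{b:6.2f}")

def set_bfactors(pdb_lines, scores, nan_fill=0.0):
    """Return PDB lines (no trailing newlines) with per-residue scores in columns 61-66.

    `scores` maps (chain_id, residue_number) to a float.
    """
    out = []
    for line in pdb_lines:
        if line.startswith(("ATOM", "HETATM")):
            key = (line[21], int(line[22:26]))  # chain ID, residue number
            value = scores.get(key, nan_fill)
            if not math.isfinite(value):
                value = nan_fill
            line = line.ljust(66)
            line = f"{line[:60]}{value:6.2f}{line[66:]}"
        out.append(line)
    return out

lines = [
    atom_line(1, " CA", "ALA", "A", 1, 11.104, 6.134, -6.504),
    atom_line(2, " CA", "GLY", "A", 2, 12.560, 7.001, -5.210),
]
scored = set_bfactors(lines, {("A", 1): 0.73, ("A", 2): float("nan")})
print(repr(scored[0][60:66]))  # '  0.73'
print(repr(scored[1][60:66]))  # '  0.00' (NaN replaced by nan_fill)
```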

Command-Line Options

-i, --input        Input structure file (.pdb/.cif/.mmcif) [required]
-o, --output       Output structure file (.pdb/.cif/.mmcif) [required]
-p, --points       Points file (N×32 numpy file) [required]

-m, --metric       Scoring metric:
                     aa_score        DAQ(AA) score (per-residue)
                     atom_score      DAQ(CA) score
                     aa_conf:ALA     Confidence for a specific AA type

--atom-name        Atom name used to define residue coordinates (default: CA)
-k                 Number of nearest neighbors for kNN (default: 1)
--radius           Distance cutoff for kNN in Å (default: 3.0; <=0 disables)
--half-window      Window averaging half-width (n±half_window, default: 9)
--no-window        Disable window averaging
--nan-fill         Value written when score is NaN/inf (default: 0.0)
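How -k and --radius might combine can be sketched with a brute-force NumPy lookup, in the spirit of the NumPy fallback mentioned under Requirements. Several details are assumptions rather than documented behaviour: averaging the scores of the k neighbors, returning NaN when no neighbor lies within the radius, and the function name `knn_score`.

```python
import numpy as np

def knn_score(query, points_xyz, point_scores, k=1, radius=3.0):
    """Mean score of the k nearest points to `query`.

    Neighbors farther than `radius` (in Å) are dropped; radius <= 0 disables
    the cutoff. Returns NaN if no neighbor remains.
    """
    d = np.linalg.norm(points_xyz - query, axis=1)
    nearest = np.argsort(d)[:k]
    if radius > 0:
        nearest = nearest[d[nearest] <= radius]
    if nearest.size == 0:
        return float("nan")
    return float(point_scores[nearest].mean())

points_xyz = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [10.0, 0.0, 0.0]])
point_scores = np.array([0.2, 0.6, -1.0])

near = np.array([0.1, 0.0, 0.0])
far = np.array([20.0, 0.0, 0.0])
print(knn_score(near, points_xyz, point_scores, k=1))            # score of the closest point
print(knn_score(far, points_xyz, point_scores, k=1))             # nan: nearest point is 10 Å away
print(knn_score(far, points_xyz, point_scores, k=1, radius=0))   # cutoff disabled
```

The NaN result in the second call is the kind of value that --nan-fill would replace before writing the output file.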

Scoring Metrics

aa_score DAQ score for the native residue type
atom_score DAQ score based on CA atom probability
aa_conf:XXX DAQ confidence for a specific amino acid (e.g. aa_conf:ALA)

Window Averaging

By default, scores are smoothed using chain-aware window averaging:

Residues within residue_number ± half_window (default: ±9 residues) are averaged
Only residues in the same chain are considered
Non-finite values are ignored

Disable window averaging:

--no-window
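The chain-aware averaging described in this section can be sketched in pure Python. This is a simplified illustration, not the script's code: `window_average` is a hypothetical function, and giving a residue with a non-finite score the average of its finite neighbors is an assumption.

```python
import math

def window_average(residues, half_window=9):
    """Smooth per-residue scores within each chain.

    `residues` is a list of (chain_id, residue_number, score) tuples.
    Returns the averaged scores in the same order.
    """
    averaged = []
    for chain, number, _ in residues:
        # Same chain, within ±half_window in residue number, finite scores only.
        window = [
            s for c, n, s in residues
            if c == chain and abs(n - number) <= half_window and math.isfinite(s)
        ]
        averaged.append(sum(window) / len(window) if window else float("nan"))
    return averaged

residues = [
    ("A", 1, 1.0),
    ("A", 2, float("nan")),
    ("A", 3, 3.0),
    ("B", 1, -2.0),
]
print(window_average(residues, half_window=1))  # [1.0, 2.0, 3.0, -2.0]
```

Chain B's residue is unaffected by chain A even though the residue numbers overlap. The quadratic loop keeps the sketch short; a real implementation would likely group residues by chain first.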

Notes

DAQ is included as a submodule to ensure consistency with published methods.
The ChimeraX plugin is intended for visualization and inspection.
Numerical analysis should be performed via notebooks or CLI tools.
This repository is under active development.

License

See the LICENSE file for details.
