fMRI Preprocessing and Quality Assessment¶

You should land on this page after having collected your (f)MRI data and converted it to BIDS.

Preprocessing & Quality Assessment Overview¶

On this page, you will learn how to preprocess fMRI data using fMRIPrep and perform quality assessment with MRIQC. We will cover:

Running fMRIPrep
Step-by-step guide to run fMRIPrep, including the required command structure, key options, and output directory organization.
Performing Quality Control with MRIQC
Use MRIQC to assess the quality of your MRI data. Identify potential artifacts and ensure data suitability for further analysis.
Interpreting fMRIPrep Outputs
Understand the content of the fMRIPrep HTML report, including motion parameters, anatomical alignment, and other key quality checks.
Reviewing MRIQC Reports
Learn how to interpret MRIQC's visual reports and quality metrics, such as SNR and temporal SNR, to evaluate the data's integrity.
Troubleshooting Common Issues
Find solutions to common challenges with fMRIPrep and MRIQC, including memory management and output interpretation.
Next Steps: GLM Analysis
Once your data is preprocessed and quality-checked, move on to first-level analysis with the General Linear Model.

fMRIPrep Documentation
Get detailed insights into the preprocessing steps, output formats, and recommended practices.
MRIQC Documentation
Explore MRIQC's metrics and recommendations for improving MRI data quality.
NeuroStars Community
A valuable resource for troubleshooting and community discussions related to fMRIPrep and MRIQC.
YouTube: Reviewing fMRIPrep Outputs

Tip

Before proceeding, ensure that your fMRI data is converted into BIDS format. Refer to the BIDS Conversion Guide for more details.

Preprocessing with fMRIPrep¶

1. Setting Up fMRIPrep¶

To use fMRIPrep, ensure that you have:

Docker (or Singularity for HPC environments).
The fmriprep-docker wrapper, installed for easier command-line usage:

pip install fmriprep-docker

A valid FreeSurfer license (license.txt) saved in a path accessible by fMRIPrep. This is needed for surface-based preprocessing.

System Requirements

fMRIPrep is resource-intensive. For optimal performance, allocate:

At least 16 GB RAM and 4 CPUs.
A high-speed SSD for the working directory to improve I/O performance.

For detailed instructions, visit the fMRIPrep Installation Guide.

2. Running fMRIPrep¶

Once your environment is ready, you can run fMRIPrep using the following command:

fmriprep-docker /path/to/BIDS /path/to/derivatives/fmriprep participant \
    --work-dir /path/to/temp_fmriprep \
    --fs-license-file /path/to/license.txt \
    --output-spaces MNI152NLin2009cAsym:res-2 anat fsnative \
    --participant-label <SUBJECT_ID> \
    --n-cpus 8 --mem-mb 16000 --notrack

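Replace:

/path/to/BIDS with the path to your BIDS directory.
/path/to/derivatives/fmriprep with where you want to store fMRIPrep outputs.
/path/to/temp_fmriprep with a working directory for intermediate files (ideally on a fast disk, outside the BIDS directory).
The path after --fs-license-file with the location of your FreeSurfer license file.
<SUBJECT_ID> with the ID of the subject being processed (for example, 01).

Adjust --n-cpus and --mem-mb to the resources available on your machine.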

Why specify output spaces?

--output-spaces defines the spaces in which your data will be resampled. Common options include:

MNI152NLin2009cAsym: Standard volumetric template.
anat: Subject’s native T1w space.
fsnative: FreeSurfer's subject-specific surface space.
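
To process several subjects, the command above can be wrapped in a loop. The following is a minimal sketch, not part of fMRIPrep: the helper names list_subjects and print_fmriprep_cmd are hypothetical, and the script only prints one command per sub-* folder so that you can review it before running anything.

```shell
#!/bin/sh
# Hypothetical helpers (not part of fMRIPrep): list the subjects found in a
# BIDS directory and print the fmriprep-docker command for each of them.

# Print one subject label (without the "sub-" prefix) per line.
list_subjects() {
    bids_dir=$1
    for d in "$bids_dir"/sub-*/; do
        [ -d "$d" ] || continue          # the glob matched nothing
        label=$(basename "$d")
        echo "${label#sub-}"
    done
}

# Print (do not execute) the command for one subject.
# Any extra arguments are appended unchanged, e.g. --work-dir or --n-cpus.
print_fmriprep_cmd() {
    bids_dir=$1; out_dir=$2; subject=$3
    shift 3
    cmd="fmriprep-docker $bids_dir $out_dir participant --participant-label $subject"
    if [ $# -gt 0 ]; then
        cmd="$cmd $*"
    fi
    echo "$cmd"
}

# Example (all paths are placeholders):
# for s in $(list_subjects /path/to/BIDS); do
#     print_fmriprep_cmd /path/to/BIDS /path/to/derivatives/fmriprep "$s" \
#         --work-dir /path/to/temp_fmriprep --n-cpus 8 --mem-mb 16000
# done
```

Printing the commands first makes it easy to spot a wrong path or subject label before starting a job that may run for several hours per subject.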

3. Output Structure and Files¶

After running fMRIPrep, the output will be in the derivatives/fmriprep folder. This includes:

Preprocessed anatomical images (T1w, T2w).
Preprocessed functional images (BOLD series).
Confounds: .tsv files containing motion parameters and other potential noise regressors.
Reports: sub-xx.html files with a summary of the preprocessing.

Refer to the fMRIPrep Output Documentation for more information.
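
A quick way to confirm that a run produced the files listed above is to look for them explicitly. This is only a sketch under two assumptions that you should check against your own output folder: that the report is written as sub-<label>.html at the top of the fMRIPrep output directory, and that confounds files contain _desc-confounds_ in their name. The function name check_fmriprep_outputs is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: verify that a subject has an HTML report and at least
# one confounds file. The file layout is an assumption; adjust it if needed.
check_fmriprep_outputs() {
    out_dir=$1; subject=$2
    status=0

    if [ ! -f "$out_dir/sub-$subject.html" ]; then
        echo "missing report: $out_dir/sub-$subject.html"
        status=1
    fi

    # Count confounds files anywhere below the subject folder (covers sessions).
    n=$(find "$out_dir/sub-$subject" -name '*_desc-confounds_*.tsv' 2>/dev/null \
        | wc -l | tr -d ' ')
    if [ "$n" -eq 0 ]; then
        echo "no confounds files found for sub-$subject"
        status=1
    fi

    return $status
}

# Example (placeholder path):
# check_fmriprep_outputs /path/to/derivatives/fmriprep 01 && echo "looks complete"
```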

Quality Assessment with MRIQC¶

1. Running MRIQC¶

MRIQC helps identify potential issues in your data by generating quality metrics. Run MRIQC using Docker with the following command:

docker run -it --rm \
    -v /path/to/BIDS:/data:ro \
    -v /path/to/derivatives/mriqc:/out \
    nipreps/mriqc:latest /data /out participant \
    --participant-label <SUBJECT_ID> --nprocs 8 --mem-gb 16 --verbose-reports

This command will analyze individual subjects and save the results in the specified output directory. Replace the paths as appropriate.

Running Group-Level Analysis

After processing individual subjects, you can run a group-level analysis to compare metrics across subjects:

docker run -it --rm \
    -v /path/to/BIDS:/data:ro \
    -v /path/to/derivatives/mriqc:/out \
    nipreps/mriqc:latest /data /out group \
    --nprocs 8 --mem-gb 16 --verbose-reports

2. Understanding MRIQC Outputs¶

MRIQC generates:

Visual reports (sub-xx.html) for each subject.
JSON files with image quality metrics (IQMs) for each scan.
Group-level reports and tables (e.g., group_bold.tsv) summarizing the metrics across subjects.

Refer to the MRIQC Documentation for a detailed explanation of each metric.

Interpreting fMRIPrep and MRIQC Reports¶

fMRIPrep HTML Report¶

After running fMRIPrep, the outputs will be stored in the derivatives/fmriprep directory, with each subject's data organized into subfolders like sub-01. These folders contain both the preprocessed functional and anatomical data, alongside JSON files with metadata.

Each subject’s report (sub-xx.html) includes:

Registration Plots: Check the alignment of functional and anatomical images.
Field Map Corrections: Review the effect of susceptibility distortion corrections.
Motion Correction: Look for high-motion frames using Framewise Displacement (FD) plots.

What is Framewise Displacement (FD)?

FD is a measure of head movement between frames. High FD values indicate potential motion artifacts.
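
For reference, the commonly used definition of FD (Power et al., 2012) sums the absolute frame-to-frame changes of the six rigid-body motion parameters. Rotations (in radians) are converted to millimetres by multiplying them by the radius r of a sphere approximating the head; r = 50 mm is a typical choice and is stated here as an assumption, so check the documentation of your fMRIPrep version for the exact value.

```latex
\mathrm{FD}_t = |\Delta x_t| + |\Delta y_t| + |\Delta z_t|
              + r \,\bigl( |\Delta\alpha_t| + |\Delta\beta_t| + |\Delta\gamma_t| \bigr),
\qquad \Delta x_t = x_t - x_{t-1}
```

Here x, y, z are the translations and α, β, γ the rotations estimated for frame t. FD is undefined for the first frame, because there is no previous frame to compare it with.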

Let’s walk through the key components of the output and how to interpret the HTML summary reports.

1. Output Directory Structure¶

Within each subject's directory (sub-01):

anat/ folder: Contains anatomical images, including normalized versions (e.g., MNI152 template) and images in native space.
func/ folder: Contains functional data for each run, including:
Confound Regressors (.tsv): Time series of noise estimates like white matter and cerebrospinal fluid (CSF).
Preprocessed Functional Images: Aligned to templates like MNI152.
Brain Masks: Estimated masks for the brain, used in further analyses.

These files will be referenced in the HTML summary report, which provides an overview of the preprocessing steps and quality metrics.

2. Opening the HTML Summary Report¶

To view the HTML report, navigate to derivatives/fmriprep/ and open sub-01.html in a web browser, either by double-clicking it or by opening it from the terminal.
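
From the terminal, the report can be opened with the system's default browser. The helper below is a hypothetical sketch: it assumes that xdg-open (most Linux desktops) or open (macOS) is available and otherwise just prints the path.

```shell
#!/bin/sh
# Hypothetical helper: open an HTML report with the default browser.
open_report() {
    report=$1
    if [ ! -f "$report" ]; then
        echo "Report not found: $report" >&2
        return 1
    fi
    if command -v xdg-open >/dev/null 2>&1; then
        xdg-open "$report"        # most Linux desktops
    elif command -v open >/dev/null 2>&1; then
        open "$report"            # macOS
    else
        echo "No opener found; open this file in your browser: $report"
    fi
}

# Example (placeholder path):
# open_report /path/to/sub-01.html
```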

The report contains the following sections: Summary, Anatomical, Functional, About, Methods, and Errors. Use the tabs at the top of the report to navigate these sections.

3. Understanding the Summary Section¶

The Summary tab includes:

Number of Structural and Functional Images: Lists the number of anatomical and functional images processed.
Normalization Template: Shows the template used for alignment (e.g., MNI152NLin2009cAsym).
FreeSurfer: Indicates whether surface-based preprocessing was performed.

Make sure these details match the parameters specified in your fMRIPrep command.

4. Anatomical Quality Checks¶

The Anatomical section provides:

Brain Mask Overlay: Displays the brain mask (red outline), gray matter (magenta), and white matter boundaries (blue) overlaid on the anatomical image in sagittal, axial, and coronal views.

Normalization Check: A GIF compares the subject’s anatomical image with the MNI template. Check that:
The outlines of the brain and internal structures (e.g., ventricles) align well.
There is no visible misalignment; any mismatch could indicate poor normalization and should be inspected further.

Tip

Hover over the GIF to see the back-and-forth comparison between the subject's brain and the template. Look closely at the alignment of internal brain structures.

Surface Reconstruction: Included only if you ran the recon-all routine in fMRIPrep. Check that the reconstructed surfaces follow the gray and white matter boundaries.

5. Functional Quality Checks¶

In the Functional section, you’ll find:

Functional-to-Anatomical Alignment: A GIF shows how well the preprocessed functional images align with the anatomical image.

Check for alignment

Check for alignment between internal structures like ventricles in the functional and anatomical images. Open the image in a new tab (Right Click on the image -> Open in a new tab) and hover to see the dynamic image.

CompCor Masks: Displays the masks used for component-based noise correction (CompCor):
White Matter and CSF (Magenta): Masks used to extract noise components for Anatomical Component Correction (aCompCor).
High-Variance Voxels (Blue): Used for Temporal Component Correction (tCompCor).

Assessing Alignment

Good alignment between functional and anatomical images is crucial for accurate analysis. Pay special attention to lighter fluid-filled regions in the functional image, which should correspond with dark CSF areas in the anatomical image.

6. BOLD Summary and Carpet plot¶

The report includes time series plots for various confounds:

Global Signal (GS): Measures signal fluctuations across the entire brain.
CSF Signal (GSCSF) and White Matter Signal: Represent fluctuations in specific tissue types.
Motion Metrics (DVARS, Framewise Displacement):
DVARS: Shows changes in BOLD signal intensity from one time point to the next.
Framewise Displacement (FD): Tracks the amount of head movement between frames.
Use DVARS and FD to identify frames with high motion that could affect data quality.
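
To put a number on "frames with high motion", you can count the frames whose FD exceeds a threshold. The sketch below assumes a tab-separated confounds file with a framewise_displacement column; the function name count_high_fd is hypothetical, and the 0.5 mm default is a common but arbitrary choice that you should adapt to your study.

```shell
#!/bin/sh
# Hypothetical helper: count frames whose FD exceeds a threshold (in mm).
# Assumes a tab-separated file with a "framewise_displacement" column.
# The first frame has no FD value ("n/a") and is skipped.
count_high_fd() {
    confounds=$1
    threshold=${2:-0.5}     # assumed default; not an fMRIPrep recommendation
    awk -F '\t' -v thr="$threshold" '
        NR == 1 {
            # Locate the FD column by name instead of by position.
            for (i = 1; i <= NF; i++)
                if ($i == "framewise_displacement") col = i
            if (!col) { print "column not found" > "/dev/stderr"; exit 1 }
            next
        }
        $col != "n/a" && $col + 0 > thr + 0 { n++ }
        END { if (col) print n + 0 }
    ' "$confounds"
}

# Example (placeholder file name):
# count_high_fd sub-01_task-rest_desc-confounds_timeseries.tsv 0.5
```

A run in which a large share of the frames exceeds the threshold is a candidate for closer inspection or exclusion.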

Tip

High motion values often correlate with changes in global signal. Consider including these regressors in your GLM to account for motion-related noise.

The carpet plot displays time series of BOLD signals across different brain regions:

Cortical Gray Matter (blue), Subcortical Gray Matter (orange), Cerebellum (green), and White Matter/CSF (red).
Look for sudden changes across a column, which may indicate motion artifacts affecting the entire brain at a particular time point.

7. Correlation Matrix of Confound Regressors¶

The report also includes a correlation matrix showing relationships between confound regressors:

High correlations between CSF and motion regressors may indicate that motion affects CSF signals.
Use this matrix to decide which regressors to include in your GLM for better noise correction.

High Correlations

High correlation values may suggest redundancy among some regressors. Consider removing or combining them to avoid overfitting when building your GLM.
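
If you want to check a specific pair of regressors outside the report, the Pearson correlation can be computed directly from the confounds file. This is a minimal sketch: the function name confound_corr is hypothetical, and the column names in the example (csf, framewise_displacement) are assumed to be present in your file. Rows where either value is n/a are skipped.

```shell
#!/bin/sh
# Hypothetical helper: Pearson correlation between two columns of a
# tab-separated confounds file, selected by their header names.
confound_corr() {
    confounds=$1; a=$2; b=$3
    awk -F '\t' -v a="$a" -v b="$b" '
        NR == 1 {
            for (i = 1; i <= NF; i++) {
                if ($i == a) ca = i
                if ($i == b) cb = i
            }
            if (!ca || !cb) {
                print "column not found" > "/dev/stderr"
                bad = 1
                exit 1
            }
            next
        }
        $ca == "n/a" || $cb == "n/a" { next }
        {
            n++; x = $ca; y = $cb
            sx += x; sy += y; sxx += x * x; syy += y * y; sxy += x * y
        }
        END {
            if (bad || n < 2) exit 1
            num = n * sxy - sx * sy
            den = sqrt((n * sxx - sx * sx) * (n * syy - sy * sy))
            if (den == 0) exit 1      # one of the columns is constant
            printf "%.3f\n", num / den
        }
    ' "$confounds"
}

# Example (placeholder file name):
# confound_corr sub-01_task-rest_desc-confounds_timeseries.tsv csf framewise_displacement
```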

8. Making Decisions for Further Analysis¶

After reviewing the report:

Identify Good Quality Runs: Look for well-aligned images and minimal motion artifacts.
Decide on Regressors: Choose confounds like DVARS, FD, and CompCor components to include in your GLM.

What confound regressors should I use in my GLM?

A common choice is to include at least the six head motion parameters and, optionally, FD and Global Signal as nuisance regressors in your GLM.

See this awesome NeuroStars conversation with advice on choosing regressors and relevant resources.
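
Once you have decided on a set of regressors, you can extract just those columns from the confounds file. The sketch below assumes the usual column names (trans_x, trans_y, trans_z, rot_x, rot_y, rot_z, framewise_displacement, global_signal); verify them against the header of your own file. The function name select_confounds is hypothetical, and replacing n/a with 0 (relevant for the first FD value) is a simplification that you may want to handle differently.

```shell
#!/bin/sh
# Hypothetical helper: keep only the named columns of a tab-separated
# confounds file, in the order given. "n/a" entries are written as 0.
select_confounds() {
    confounds=$1
    shift
    awk -F '\t' -v OFS='\t' -v wanted="$*" '
        NR == 1 {
            n = split(wanted, names, " ")
            for (i = 1; i <= NF; i++) idx[$i] = i
            for (j = 1; j <= n; j++)
                if (!(names[j] in idx)) {
                    print "column not found: " names[j] > "/dev/stderr"
                    exit 1
                }
        }
        {
            line = ""
            for (j = 1; j <= n; j++) {
                v = $(idx[names[j]])
                if (v == "n/a") v = 0      # simplification, see text above
                line = line (j > 1 ? OFS : "") v
            }
            print line
        }
    ' "$confounds"
}

# Example (placeholder file names):
# select_confounds sub-01_task-rest_desc-confounds_timeseries.tsv \
#     trans_x trans_y trans_z rot_x rot_y rot_z \
#     framewise_displacement global_signal > sub-01_nuisance.tsv
```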

For more details on interpreting fMRIPrep reports, see the fMRIPrep Outputs Documentation and discussions on NeuroStars.

MRIQC HTML Report¶

The MRIQC report highlights:

Summary Image: A visual overview of key metrics, including signal-to-noise ratio (SNR) and temporal SNR (tSNR).
Detailed Metrics: Click through different tabs to examine metrics like Mean Framewise Displacement, EPI-to-T1w registration quality, and artifact presence.

Interpreting tSNR

Higher temporal SNR (tSNR) values indicate better data quality. Typical values range from 30-60 for fMRI. Low tSNR may suggest issues like excessive noise or scanner artifacts. Review the group-level metrics to identify subjects with unusually high motion or low tSNR.

For more information on understanding these metrics, check out the MRIQC interpretation guide on NeuroStars.
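
The group-level screening mentioned above can be sketched with a few lines of shell. The example assumes that the MRIQC group run wrote a tab-separated table (such as group_bold.tsv) with bids_name and tsnr columns; check the header of your own file. The function name flag_low_tsnr is hypothetical, and the default cut-off of 30 simply mirrors the lower end of the range quoted above.

```shell
#!/bin/sh
# Hypothetical helper: list scans whose tSNR is below a cut-off.
# Assumes a tab-separated table with "bids_name" and "tsnr" columns.
flag_low_tsnr() {
    group_tsv=$1
    min_tsnr=${2:-30}       # assumed default, taken from the range in the text
    awk -F '\t' -v min="$min_tsnr" '
        NR == 1 {
            for (i = 1; i <= NF; i++) {
                if ($i == "bids_name") cn = i
                if ($i == "tsnr") ct = i
            }
            if (!cn || !ct) {
                print "expected columns not found" > "/dev/stderr"
                exit 1
            }
            next
        }
        $ct + 0 < min + 0 { print $cn "\t" $ct }
    ' "$group_tsv"
}

# Example (placeholder path):
# flag_low_tsnr /path/to/derivatives/mriqc/group_bold.tsv 30
```

Treat the output as a list of scans to look at more closely, not as an automatic exclusion rule.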

Common Issues with fMRIPrep and MRIQC¶

Memory Errors: Out of Memory (OOM) or Crash

Problem: fMRIPrep crashes or terminates unexpectedly due to insufficient memory.
Solution: Lower the --mem-mb value so that fMRIPrep plans its work within a smaller memory budget (keep it below the RAM actually available to Docker), or increase the swap space available on your system. This can help prevent OOM errors.
Tip: Monitor your memory usage during processing using tools like htop (Linux) or Activity Monitor (Mac). Aim to use around 80-90% of your available RAM without exceeding it.
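
As a rough starting point for --mem-mb, you can take a fixed share of the total RAM. The helper name suggest_mem_mb is hypothetical and the 85% default is only the midpoint of the range mentioned above; when using Docker Desktop, base the calculation on the memory assigned to Docker, not on the physical RAM.

```shell
#!/bin/sh
# Hypothetical helper: suggest a --mem-mb value as a percentage of total RAM.
suggest_mem_mb() {
    total_mb=$1
    percent=${2:-85}        # assumed default (midpoint of 80-90%)
    echo $(( total_mb * percent / 100 ))
}

# Linux example: read the total RAM (in MB) from /proc/meminfo.
# total_mb=$(awk '/^MemTotal:/ { print int($2 / 1024) }' /proc/meminfo)
# suggest_mem_mb "$total_mb"
```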

Docker File Permissions Error

Problem: fMRIPrep cannot access input or output directories due to file permissions.
Solution: Ensure that Docker has read and write permissions to the directories being mounted. Adjust permissions using:
```
chmod -R 755 /path/to/BIDS /path/to/derivatives
```
Tip: On Windows, ensure that Shared Drives are enabled in Docker Desktop settings.

Missing Fields in JSON Files

Problem: fMRIPrep fails due to missing SliceTiming or PhaseEncodingDirection fields in the JSON sidecar files.
Solution: Verify that all required metadata fields are present using the BIDS Validator. For guidance on JSON sidecar fields, see the BIDS Specification.
Tip: If using custom acquisition parameters, manually edit JSON files to include the missing fields.
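
Before launching a long run, a simple text search can show which functional sidecars lack these fields. This is a rough sketch, not a replacement for the BIDS Validator: the function name check_sidecar_fields is hypothetical, it only inspects files ending in _bold.json, and it does not follow BIDS inheritance, so a field defined in a higher-level JSON file will still be reported as missing.

```shell
#!/bin/sh
# Hypothetical helper: report *_bold.json sidecars that do not mention
# SliceTiming or PhaseEncodingDirection. Plain text search, no JSON parsing.
check_sidecar_fields() {
    bids_dir=$1
    find "$bids_dir" -name '*_bold.json' | sort | while IFS= read -r f; do
        for field in SliceTiming PhaseEncodingDirection; do
            grep -q "\"$field\"" "$f" || echo "$f: missing $field"
        done
    done
}

# Example (placeholder path):
# check_sidecar_fields /path/to/BIDS
```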

RuntimeError: Fieldmap Issues

Problem: fMRIPrep throws a RuntimeError related to fieldmaps, such as missing or improperly specified fieldmaps.
Solution: Ensure that fieldmaps are correctly specified in your BIDS dataset according to the BIDS Fieldmap documentation.
Tip: If your study does not require fieldmap correction, you can skip this step by specifying --ignore fieldmaps in your fMRIPrep command.

MRIQC: NaN Values in JSON Files

Problem: MRIQC fails when encountering NaN values in JSON metadata files.
Solution: Use a script like sanitize_json.py to replace NaN values with valid placeholders before running MRIQC.
Tip: Validate your JSON files before running MRIQC to avoid processing interruptions.
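
If you do not have such a script at hand, a one-line substitution covers the simplest cases. The sketch below is not the sanitize_json.py script mentioned above; it is a hypothetical shell stand-in that writes a cleaned copy to standard output, replacing bare NaN values with null. Using null as the placeholder is an assumption, and because the text is not parsed as JSON, a string that happens to contain ", NaN" would be changed as well, so review the result before overwriting anything.

```shell
#!/bin/sh
# Hypothetical helper: print a copy of a JSON file in which bare NaN values
# (after ":", ",", "[" or at the start of a line) are replaced by null.
sanitize_json_nan() {
    sed -E 's/(^|[,:[])([[:space:]]*)NaN/\1\2null/g' "$1"
}

# Example (placeholder file names):
# sanitize_json_nan sub-01_task-rest_bold.json > sub-01_task-rest_bold.clean.json
```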

Docker: Cannot Allocate Memory

Problem: fMRIPrep crashes with the error cannot allocate memory when using Docker.
Solution: Restart the Docker service or allocate more memory and CPUs through the Docker Desktop settings under Resources.
Tip: Increase memory allocation gradually (e.g., 2-4 GB increments) until fMRIPrep runs smoothly.

Slow Processing: fMRIPrep Takes Too Long

Problem: fMRIPrep runs slowly, taking an excessively long time for each subject.
Solution: Use a faster SSD for the --work-dir to improve read/write speeds and reduce processing time. Also, ensure --n-cpus is set to the majority of available cores, but not all, to avoid system slowdowns.
Tip: Consider running fMRIPrep on a high-performance computing (HPC) cluster if available.

Missing or Corrupted Output Files

Problem: After running fMRIPrep or MRIQC, certain output files (e.g., sub-xx.html reports) are missing or corrupted.
Solution: Check for errors in the log files generated during the run. Often, disk space issues or interruptions during processing can cause missing files. Re-run the affected subjects with sufficient disk space.
Tip: Use a dedicated work directory and ensure it has at least 100 GB of free space to accommodate intermediate files.
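
You can check the free space of the work directory before starting a run. The helper name free_gb is hypothetical; it relies on the POSIX output format of df and reports whole gigabytes, rounded down.

```shell
#!/bin/sh
# Hypothetical helper: print the free space (in whole GB) of the filesystem
# that contains the given directory.
free_gb() {
    # -P: POSIX format (one line per filesystem), -k: sizes in 1 KB blocks.
    df -Pk "$1" | awk 'NR == 2 { print int($4 / 1024 / 1024) }'
}

# Example (placeholder path):
# if [ "$(free_gb /path/to/temp_fmriprep)" -lt 100 ]; then
#     echo "Less than 100 GB free in the work directory"
# fi
```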

MRIQC: No Group Report Generated

Problem: Group-level analysis in MRIQC does not produce a report.
Solution: Ensure that MRIQC was run in group mode using the correct group argument. Check if all individual reports are present in the output directory before running the group-level command.
Tip: Verify that the derivatives/mriqc directory has read and write access for Docker.

fMRIPrep output: empty surf files

Problem: Some files in freesurfer/sub-xx/surf are empty (0 KB), namely:
- *h.fsaverage.sphere.reg
- *h.pial
- *h.white.H
- *h.white.K
These files are supposed to be symbolic links pointing to other outputs in the folder. A 0 KB size indicates that the link is broken. This often happens if preprocessing was done on Windows, since Windows does not fully preserve these link-type files.

Even if the symbolic link is broken, the files to which the links originally pointed are likely still present in your surf/ folder, so you do not need to re-run recon-all or fmriprep.

Solution: If you need any of these files, you can either use the corresponding “original” file directly, or recreate the symbolic link (or a duplicate file) so external tools can see it under the expected filename. Below are the relevant file mappings:

| Broken (link) file | Original (target) file |
| --- | --- |
| `*h.fsaverage.sphere.reg` | `*h.sphere.reg` |
| `*h.pial` | `*h.pial.T1` |
| `*h.white.K` | `*h.white.preaparc.K` |
| `*h.white.H` | `*h.white.preaparc.H` |

For instance, if you need lh.pial and it’s empty, you can create it by copying lh.pial.T1 with the following command:

cp lh.pial.T1 lh.pial

To fix these links automatically across multiple subjects (on Windows, use the WSL terminal, not the native PowerShell / Windows terminal):

Set your FREESURFER_PATH (the folder containing your existing recon-all output):
```
export FREESURFER_PATH=/BIDS/derivatives/freesurfer
```
Copy and paste the script below into an empty file and save it as fix_surf_files.sh.
Open a terminal and navigate to the folder where you saved the file (e.g., cd ~/Documents).
Make the script executable: chmod +x fix_surf_files.sh
Run the script: ./fix_surf_files.sh

Here is the full script:

#!/bin/bash

# ==============================================================================
# Fix Broken Files in FreeSurfer Directories
#
# This script checks for specific broken or empty files in a FreeSurfer directory.
# If a broken file is found, it creates a symbolic link to its corresponding
# original file.
#
# Usage:
#   ./fix_surf_files.sh         # Process all subjects in $FREESURFER_PATH
#   ./fix_surf_files.sh sub-01  # Process only the given subject(s)
#
# Requirements:
#   - The environment variable $FREESURFER_PATH must be set and point to the 
#     directory containing the subject folders.
#   - FreeSurfer outputs must exist for the fix to work.
#   - Bash 4 or newer (the script uses an associative array).
# ==============================================================================

# Check if FREESURFER_PATH is set
if [[ -z "$FREESURFER_PATH" ]]; then
    echo "❌ Error: FREESURFER_PATH is not set. Please export FREESURFER_PATH first."
    exit 1
fi

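# If no subject is provided, process all sub-* folders in the FreeSurfer directory
if [[ $# -eq 0 ]]; then
    # Use a glob instead of parsing `ls`, so that non-subject entries
    # (e.g., fsaverage) are not treated as subjects
    SUBJS=()
    for DIR in "$FREESURFER_PATH"/sub-*/; do
        [[ -d "$DIR" ]] && SUBJS+=("$(basename "$DIR")")
    done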
else
    SUBJS=("$@")  # Use provided subjects
fi

# Define file mappings: (broken file → target file)
declare -A FILE_MAP=(
    ["lh.fsaverage.sphere.reg"]="lh.sphere.reg"
    ["rh.fsaverage.sphere.reg"]="rh.sphere.reg"
    ["lh.pial"]="lh.pial.T1"
    ["rh.pial"]="rh.pial.T1"
    ["lh.white.K"]="lh.white.preaparc.K"
    ["rh.white.K"]="rh.white.preaparc.K"
    ["lh.white.H"]="lh.white.preaparc.H"
    ["rh.white.H"]="rh.white.preaparc.H"
)

# Loop over subjects
for SUBJ in "${SUBJS[@]}"; do
    SURF_PATH="${FREESURFER_PATH}/${SUBJ}/surf"

    # Check if the subject directory exists
    if [[ ! -d "$SURF_PATH" ]]; then
        echo "⚠️ Warning: Subject directory not found for $SUBJ. Skipping..."
        continue
    fi

    # Loop over each broken file type
    for BROKEN_FILE in "${!FILE_MAP[@]}"; do
        TARGET_FILE="${FILE_MAP[$BROKEN_FILE]}"
        BROKEN_PATH="${SURF_PATH}/${BROKEN_FILE}"
        TARGET_PATH="${SURF_PATH}/${TARGET_FILE}"

        # Check if the file is an empty regular file or a dangling symbolic link
        if [[ ( -e "$BROKEN_PATH" && ! -s "$BROKEN_PATH" ) || ( -L "$BROKEN_PATH" && ! -e "$BROKEN_PATH" ) ]]; then
            echo "🛠 Fixing $BROKEN_FILE for $SUBJ..."

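            # Check if the corresponding target file exists before creating the link
            if [[ -e "$TARGET_PATH" ]]; then
                # Link by file name only: both files live in the same surf/ folder,
                # so a relative link keeps working if the dataset is moved
                ln -sf "$TARGET_FILE" "$BROKEN_PATH"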
                echo "✔ Created symbolic link: $BROKEN_FILE → $TARGET_FILE"
            else
                echo "⚠️ Warning: $TARGET_FILE not found for $SUBJ. Cannot create link."
            fi
        else
            echo "✅ $BROKEN_FILE for $SUBJ is fine. No action needed."
        fi
    done

done

echo "✅ Done."

With these quality checks complete, you're ready to proceed to the General Linear Model (GLM) analysis. See the next guide for instructions on setting up your GLM. → Go to GLM