CoBRA: Containerized Bioinformatics Workflow for Reproducible ChIP/ATAC-seq Analysis

Xintao Qiu, Avery S. Feit, Ariel Feiglin, Yingtian Xie, Nikolas Kesten, Len Taing, Joseph Perkins, Shengqing Gu, Yihao Li, Paloma Cejas, Ningxuan Zhou, Rinath Jeselsohn, Myles Brown, X. Shirley Liu, Henry W. Long

Research output: Contribution to journal › Article › peer-review

23 Scopus citations

Abstract

Chromatin immunoprecipitation sequencing (ChIP-seq) and the Assay for Transposase-Accessible Chromatin with high-throughput sequencing (ATAC-seq) have become essential technologies to effectively measure protein–DNA interactions and chromatin accessibility. However, there is a need for a scalable and reproducible pipeline that incorporates proper normalization between samples, correction of copy number variations, and integration of new downstream analysis tools. Here we present Containerized Bioinformatics workflow for Reproducible ChIP/ATAC-seq Analysis (CoBRA), a modularized computational workflow which quantifies ChIP-seq and ATAC-seq peak regions and performs unsupervised and supervised analyses. CoBRA provides a comprehensive state-of-the-art ChIP-seq and ATAC-seq analysis pipeline that can be used by scientists with limited computational experience. This enables researchers to gain rapid insight into protein–DNA interactions and chromatin accessibility through sample clustering, differential peak calling, motif enrichment, comparison of sites to a reference database, and pathway analysis. CoBRA is publicly available online at https://bitbucket.org/cfce/cobra.

Original language: English
Pages (from-to): 652-661
Number of pages: 10
Journal: Genomics, Proteomics and Bioinformatics
Volume: 19
Issue number: 4
State: Published - Aug 2021
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2022

Funding

Henry W. Long and Myles Brown acknowledge funding from the National Institutes of Health, United States (Grant Nos. 2PO1CA163227 and P01CA080111).

Funders: National Institutes of Health
Funder number: P01CA080111, 2PO1CA163227

    Keywords

    • ATAC-seq
    • ChIP-seq
    • Docker
    • Snakemake
    • Workflow
