Abstract
Chromatin immunoprecipitation sequencing (ChIP-seq) and the Assay for Transposase-Accessible Chromatin with high-throughput sequencing (ATAC-seq) have become essential technologies to effectively measure protein–DNA interactions and chromatin accessibility. However, there is a need for a scalable and reproducible pipeline that incorporates proper normalization between samples, correction of copy number variations, and integration of new downstream analysis tools. Here we present Containerized Bioinformatics workflow for Reproducible ChIP/ATAC-seq Analysis (CoBRA), a modularized computational workflow which quantifies ChIP-seq and ATAC-seq peak regions and performs unsupervised and supervised analyses. CoBRA provides a comprehensive state-of-the-art ChIP-seq and ATAC-seq analysis pipeline that can be used by scientists with limited computational experience. This enables researchers to gain rapid insight into protein–DNA interactions and chromatin accessibility through sample clustering, differential peak calling, motif enrichment, comparison of sites to a reference database, and pathway analysis. CoBRA is publicly available online at https://bitbucket.org/cfce/cobra
Original language | English |
---|---|
Pages (from-to) | 652-661 |
Number of pages | 10 |
Journal | Genomics, Proteomics and Bioinformatics |
Volume | 19 |
Issue number | 4 |
DOIs | |
State | Published - Aug 2021 |
Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2022
Funding
Henry W. Long and Myles Brown acknowledge funding from the National Institutes of Health, United States (Grant Nos. 2PO1CA163227 and P01CA080111 ).
Funders | Funder number |
---|---|
National Institutes of Health | P01CA080111, 2PO1CA163227 |
Keywords
- ATAC-seq
- ChIP-seq
- Docker
- Snakemake
- Workflow