QC Report


general
Report generated at2021-11-16 23:01:52
TitleENCSR274HQD-1
DescriptionNo description
Pipeline versionv1.10.0
Pipeline typeatac
Genomehg38
Alignerbowtie2
Sequencing endedness{'rep1': {'paired_end': True}}
Peak callermacs2

Alignment quality metrics


Fragment length statistics (filtered/deduped BAM)

rep1
Fraction of reads in NFR0.5269639200101196
Fraction of reads in NFR (QC pass)True
Fraction of reads in NFR (QC reason)OK
NFR / mono-nuc reads1.8535732640575608
NFR / mono-nuc reads (QC pass)False
NFR / mono-nuc reads (QC reason)out of range [2.5, inf]
Presence of NFR peakTrue
Presence of Mono-Nuc peakTrue
Presence of Di-Nuc peakTrue

rep1
rep1

Open chromatin assays show distinct fragment length enrichments, as the cut sites are only in open chromatin and not in nucleosomes. As such, peaks representing different n-nucleosomal (ex mono-nucleosomal, di-nucleosomal) fragment lengths will arise. Good libraries will show these peaks in a fragment length distribution and will show specific peak ratios.



Sequence quality metrics (filtered/deduped BAM)

rep1
rep1

Open chromatin assays are known to have significant GC bias. Please take this into consideration as necessary.


Annotated genomic region enrichment

rep1
Fraction of Reads in universal DHS regions0.339396533048957
Fraction of Reads in blacklist regions0.008079798447170279
Fraction of Reads in promoter regions0.14959440904450713
Fraction of Reads in enhancer regions0.2759019758842537

Signal to noise can be assessed by considering whether reads are falling into known open regions (such as DHS regions) or not. A high fraction of reads should fall into the universal (across cell type) DHS set. A small fraction should fall into the blacklist regions. A high set (though not all) should fall into the promoter regions. A high set (though not all) should fall into the enhancer regions. The promoter regions should not take up all reads, as it is known that there is a bias for promoters in open chromatin assays.


Replication quality metrics


Number of raw peaks

rep1
Number of peaks217779

Top 300000 raw peaks from macs2 with p-val threshold 0.01

Peak calling statistics


Peak region size

rep1
Min size150.0
25 percentile196.0
50 percentile (median)318.0
75 percentile641.0
Max size2382.0
Mean472.15733381088165

rep1
rep1

Enrichment / Signal-to-noise ratio


Jensen-Shannon distance (filtered/deduped BAM)

rep1
AUC0.22611902190935435
Synthetic AUC0.493734591418901
X-intercept0.13484123271074003
Synthetic X-intercept2.711128502768229e-218
Elbow Point0.6875477341522614
Synthetic Elbow Point0.5083180640080235
Synthetic JS Distance0.3735787236899883

Peak enrichment


Fraction of reads in peaks (FRiP)

FRiP for macs2 raw peaks

rep1
Fraction of Reads in Peaks0.2355531236956968

For macs2 raw peaks:


For overlap/IDR peaks: