QC Report


general
Report generated at2019-10-21 00:40:39
Titleh1 oct4 cutrun
DescriptionThis is a cut n run data for h1 oct4.
Pipeline versiondev-v0.1.0
Pipeline typecut_n_run
Genomehg19
Paired-end per replicate[True, True]
Alignerbowtie2
Peak callermacs2
Control paired-end per replicate[True, True]

Alignment quality metrics


SAMstat (raw BAM)

rep1rep2
Total Reads2523205818152438
Total Reads (QC-failed)00
Duplicate Reads00
Duplicate Reads (QC-failed)00
Mapped Reads2382588816411382
Mapped Reads (QC-failed)00
% Mapped Reads94.3999999999999990.4
Paired Reads2523205818152438
Paired Reads (QC-failed)00
Read1126160299076219
Read1 (QC-failed)00
Read2126160299076219
Read2 (QC-failed)00
Properly Paired Reads2300687215985094
Properly Paired Reads (QC-failed)00
% Properly Paired Reads91.288.1
With itself2356080016164128
With itself (QC-failed)00
Singletons265088247254
Singletons (QC-failed)00
% Singleton1.09999999999999991.4000000000000001
Diff. Chroms352452103285
Diff. Chroms (QC-failed)00

Marking duplicates (filtered BAMs)

rep1rep2
Unpaired Reads00
Paired Reads95977506807264
Unmapped Reads00
Unpaired Duplicate Reads00
Paired Duplicate Reads13019071725975
Paired Optical Duplicate Reads00
% Duplicate Reads13.56469999999999825.3549

Filtered out (samtools view -F 1804):


Fraction of mitochondrial reads

rep1rep2
Rm = Number of Non-mitochondrial Reads1978920914773119
Rn = Number of Mitochondrial Reads45971951862893
Rm/(Rn+Rm) = Frac. of mitochondrial reads0.188514673996215270.11197954173151595

SAMstat (filtered/deduped BAM)

rep1rep2
Total Reads158911289785264
Total Reads (QC-failed)00
Duplicate Reads00
Duplicate Reads (QC-failed)00
Mapped Reads158911289785264
Mapped Reads (QC-failed)00
% Mapped Reads100.0100.0
Paired Reads158911289785264
Paired Reads (QC-failed)00
Read179455644892632
Read1 (QC-failed)00
Read279455644892632
Read2 (QC-failed)00
Properly Paired Reads158911289785264
Properly Paired Reads (QC-failed)00
% Properly Paired Reads100.0100.0
With itself158911289785264
With itself (QC-failed)00
Singletons00
Singletons (QC-failed)00
% Singleton0.00.0
Diff. Chroms00
Diff. Chroms (QC-failed)00

Filtered and duplicates removed


Library complexity quality metrics


Library complexity (filtered non-mito BAM)

rep1rep2
Total Fragments80254566161792
Distinct Fragments79455644892638
Positions with Two Read66120851536
NRF = Distinct/Total0.9900450.794028
PBC1 = OneRead/Distinct0.9914420.787425
PBC2 = OneRead/TwoRead119.1403964.524278

Mitochondrial reads are filtered out by default. The non-redundant fraction (NRF) is the fraction of non-redundant mapped reads in a dataset; it is the ratio between the number of positions in the genome that uniquely mapped reads map to and the total number of uniquely mappable reads. The NRF should be > 0.8. The PBC1 is the ratio of genomic locations with EXACTLY one read pair over the genomic locations with AT LEAST one read pair. PBC1 is the primary measure, and the PBC1 should be close to 1. Provisionally 0-0.5 is severe bottlenecking, 0.5-0.8 is moderate bottlenecking, 0.8-0.9 is mild bottlenecking, and 0.9-1.0 is no bottlenecking. The PBC2 is the ratio of genomic locations with EXACTLY one read pair over the genomic locations with EXACTLY two read pairs. The PBC2 should be significantly greater than 1.


NRF (non redundant fraction)
PBC1 (PCR Bottleneck coefficient 1)
PBC2 (PCR Bottleneck coefficient 2)
PBC1 is the primary measure. Provisionally


Replication quality metrics


IDR (Irreproducible Discovery Rate) plots

rep1_vs_rep2
rep1_vs_rep2
rep1-pr1_vs_rep1-pr2
rep1-pr1_vs_rep1-pr2
rep2-pr1_vs_rep2-pr2
rep2-pr1_vs_rep2-pr2
pooled-pr1_vs_pooled-pr2
pooled-pr1_vs_pooled-pr2

Reproducibility QC and peak detection statistics

overlapidr
Nt2833936
N12378465
N279311480
Np40401242
N optimal40401242
N conservative2833936
Optimal Setpooled-pr1_vs_pooled-pr2pooled-pr1_vs_pooled-pr2
Conservative Setrep1_vs_rep2rep1_vs_rep2
Rescue Ratio1.42605012354394641.3269230769230769
Self Consistency Ratio3.335155592935243.182795698924731
Reproducibility Testborderlineborderline

Reproducibility QC


Number of raw peaks

rep1rep2
Number of peaks446622882

Top 300000 raw peaks from macs2 with p-val threshold 0.01

Peak calling statistics


Peak region size

rep1rep2idr_optoverlap_opt
Min size500.0500.0502.0500.0
25 percentile530.25521.0761.25607.0
50 percentile (median)621.0595.0932.0737.0
75 percentile793.0749.01196.75935.0
Max size3372.04756.04453.04453.0
Mean719.4180474697716693.26710951839881048.8582930756843831.5925742574258

rep1
rep1
rep2
rep2
idr_opt
idr_opt
overlap_opt
overlap_opt

Enrichment / Signal-to-noise ratio


Jensen-Shannon distance

rep1rep2
AUC0.243815002873016470.18825980265990153
Synthetic AUC0.482304130413392650.4774740650980266
X-intercept0.256157970414544360.4173241101265291
Synthetic X-intercept1.4542802125367884e-263.3345397088789085e-16
Elbow Point0.57483856388350610.5747565709348996
Synthetic Elbow Point0.50025530796837130.5211533434732087
Synthetic JS Distance0.215107509785987860.2142757515214338

Peak enrichment


Fraction of reads in peaks (FRiP)

FRiP for macs2 raw peaks

rep1rep2rep1-pr1rep2-pr1rep1-pr2rep2-pr2pooledpooled-pr1pooled-pr2
Fraction of Reads in Peaks0.0090560594565722450.047306746143997750.0258308912998498270.073625198052908940.0257357438691576820.073501951505856160.014760562932673720.019106500632954970.018991375423774493

FRiP for overlap peaks

rep1_vs_rep2rep1-pr1_vs_rep1-pr2rep2-pr1_vs_rep2-pr2pooled-pr1_vs_pooled-pr2
Fraction of Reads in Peaks0.0091496499975541740.0063516573524547780.027683565818970240.0118446158634749

FRiP for IDR peaks

rep1_vs_rep2rep1-pr1_vs_rep1-pr2rep2-pr1_vs_rep2-pr2pooled-pr1_vs_pooled-pr2
Fraction of Reads in Peaks0.0048851879189256810.00245847871843962250.0106680821283922440.006041152510835634

For macs2 raw peaks:


For overlap/IDR peaks: