QC Report


general
Report generated at2019-10-22 17:33:37
Titleh1 ctcf cutrun auto a
DescriptionThis is a cut n run data for h1 ctcf with auto a control.
Pipeline versiondev-v0.1.0
Pipeline typecut_n_run
Genomehg19
Paired-end per replicate[True, True]
Alignerbowtie2
Peak callermacs2
Control paired-end per replicate[True, True]

Alignment quality metrics


SAMstat (raw BAM)

rep1rep2
Total Reads1627057816068156
Total Reads (QC-failed)00
Duplicate Reads00
Duplicate Reads (QC-failed)00
Mapped Reads1532045415377563
Mapped Reads (QC-failed)00
% Mapped Reads94.1999999999999995.7
Paired Reads1627057816068156
Paired Reads (QC-failed)00
Read181352898034078
Read1 (QC-failed)00
Read281352898034078
Read2 (QC-failed)00
Properly Paired Reads1503549615113712
Properly Paired Reads (QC-failed)00
% Properly Paired Reads92.494.1
With itself1518790415255914
With itself (QC-failed)00
Singletons132550121649
Singletons (QC-failed)00
% Singleton0.80.8
Diff. Chroms112545105493
Diff. Chroms (QC-failed)00

Marking duplicates (filtered BAMs)

rep1rep2
Unpaired Reads00
Paired Reads69744597018816
Unmapped Reads00
Unpaired Duplicate Reads00
Paired Duplicate Reads18645141598947
Paired Optical Duplicate Reads00
% Duplicate Reads26.733522.780900000000003

Filtered out (samtools view -F 1804):


Fraction of mitochondrial reads

rep1rep2
Rm = Number of Non-mitochondrial Reads1497810215033209
Rn = Number of Mitochondrial Reads392120394996
Rm/(Rn+Rm) = Frac. of mitochondrial reads0.025511667951185090.02560220064485791

SAMstat (filtered/deduped BAM)

rep1rep2
Total Reads1010830410724558
Total Reads (QC-failed)00
Duplicate Reads00
Duplicate Reads (QC-failed)00
Mapped Reads1010830410724558
Mapped Reads (QC-failed)00
% Mapped Reads100.0100.0
Paired Reads1010830410724558
Paired Reads (QC-failed)00
Read150541525362279
Read1 (QC-failed)00
Read250541525362279
Read2 (QC-failed)00
Properly Paired Reads1010830410724558
Properly Paired Reads (QC-failed)00
% Properly Paired Reads100.0100.0
With itself1010830410724558
With itself (QC-failed)00
Singletons00
Singletons (QC-failed)00
% Singleton0.00.0
Diff. Chroms00
Diff. Chroms (QC-failed)00

Filtered and duplicates removed


Library complexity quality metrics


Library complexity (filtered non-mito BAM)

rep1rep2
Total Fragments68407056883945
Distinct Fragments50541565362286
Positions with Two Read992324906514
NRF = Distinct/Total0.7388360.778955
PBC1 = OneRead/Distinct0.7406030.785402
PBC2 = OneRead/TwoRead3.7720764.645873

Mitochondrial reads are filtered out by default. The non-redundant fraction (NRF) is the fraction of non-redundant mapped reads in a dataset; it is the ratio between the number of positions in the genome that uniquely mapped reads map to and the total number of uniquely mappable reads. The NRF should be > 0.8. The PBC1 is the ratio of genomic locations with EXACTLY one read pair over the genomic locations with AT LEAST one read pair. PBC1 is the primary measure, and the PBC1 should be close to 1. Provisionally 0-0.5 is severe bottlenecking, 0.5-0.8 is moderate bottlenecking, 0.8-0.9 is mild bottlenecking, and 0.9-1.0 is no bottlenecking. The PBC2 is the ratio of genomic locations with EXACTLY one read pair over the genomic locations with EXACTLY two read pairs. The PBC2 should be significantly greater than 1.


NRF (non redundant fraction)
PBC1 (PCR Bottleneck coefficient 1)
PBC2 (PCR Bottleneck coefficient 2)
PBC1 is the primary measure. Provisionally


Replication quality metrics


IDR (Irreproducible Discovery Rate) plots

rep1_vs_rep2
rep1_vs_rep2
rep1-pr1_vs_rep1-pr2
rep1-pr1_vs_rep1-pr2
rep2-pr1_vs_rep2-pr2
rep2-pr1_vs_rep2-pr2
pooled-pr1_vs_pooled-pr2
pooled-pr1_vs_pooled-pr2

Reproducibility QC and peak detection statistics

overlapidr
Nt7587045726
N17392542632
N27407741649
Np7419245558
N optimal7587045726
N conservative7587045726
Optimal Setrep1_vs_rep2rep1_vs_rep2
Conservative Setrep1_vs_rep2rep1_vs_rep2
Rescue Ratio1.02261699374595641.0036876070064533
Self Consistency Ratio1.002056137977681.0236020072510745
Reproducibility Testpasspass

Reproducibility QC


Number of raw peaks

rep1rep2
Number of peaks9905996342

Top 300000 raw peaks from macs2 with p-val threshold 0.01

Peak calling statistics


Peak region size

rep1rep2idr_optoverlap_opt
Min size243.0243.0243.0243.0
25 percentile321.0323.0722.0430.0
50 percentile (median)547.0547.5989.0711.0
75 percentile990.0964.01343.01112.0
Max size5288.05210.05365.05365.0
Mean729.1375241017979717.41898652716361093.6416262082842840.7859101093976

rep1
rep1
rep2
rep2
idr_opt
idr_opt
overlap_opt
overlap_opt

Enrichment / Signal-to-noise ratio


Jensen-Shannon distance

rep1rep2
AUC0.074026494253863510.07529275480782728
Synthetic AUC0.477866835111407040.47852711468616993
X-intercept0.6137312191151560.6045320102471188
Synthetic X-intercept8.475887066080954e-177.113487026695306e-18
Elbow Point0.76134652419891890.7506954401921435
Synthetic Elbow Point0.495558977269133560.5237477916163796
Synthetic JS Distance0.45434786899391970.45745915738309195

Peak enrichment


Fraction of reads in peaks (FRiP)

FRiP for macs2 raw peaks

rep1rep2rep1-pr1rep2-pr1rep1-pr2rep2-pr2pooledpooled-pr1pooled-pr2
Fraction of Reads in Peaks0.439650608054526270.435343908811906270.45922500945757070.464321892926143340.45878853663285160.464320947179538260.42179855077041260.43277179748305370.4325151707446793

FRiP for overlap peaks

rep1_vs_rep2rep1-pr1_vs_rep1-pr2rep2-pr1_vs_rep2-pr2pooled-pr1_vs_pooled-pr2
Fraction of Reads in Peaks0.408611740432015540.41058074628543030.41081730361288550.40647991620162416

FRiP for IDR peaks

rep1_vs_rep2rep1-pr1_vs_rep1-pr2rep2-pr1_vs_rep2-pr2pooled-pr1_vs_pooled-pr2
Fraction of Reads in Peaks0.348817219640777170.33671009498725010.33691560994868040.3481428523838923

For macs2 raw peaks:


For overlap/IDR peaks: