Untitled

No description

Pipeline version: v1.1.7

Report generated at 2019-10-12 06:34:17

Paired-end: False

Pipeline type: TF ChIP-Seq

Genome: hg19_klab.tsv

Peak caller: SPP

Alignment


Flagstat (raw BAM)

rep1rep2ctl1
Total181531533525361357444550
Total(QC-failed)000
Dupes000
Dupes(QC-failed)000
Mapped162180403099564734552286
Mapped(QC-failed)000
% Mapped89.340087.920060.1500
Paired000
Paired(QC-failed)000
Read1000
Read1(QC-failed)000
Read2000
Read2(QC-failed)000
Properly Paired000
Properly Paired(QC-failed)000
% Properly Paired0.00000.00000.0000
With itself000
With itself(QC-failed)000
Singletons000
Singletons(QC-failed)000
% Singleton0.00000.00000.0000
Diff. Chroms000
Diff. Chroms (QC-failed)000

Marking duplicates (filtered BAM)

Filtered out (samtools view -F 1804):


rep1rep2ctl1
Unpaired Reads119366182364182422975627
Paired Reads000
Unmapped Reads000
Unpaired Dupes25925818186560206977
Paired Dupes000
Paired Opt. Dupes000
% Dupes/1000.21720.34630.0090

Library complexity (filtered non-mito BAM)

rep1rep2ctl1
Total Reads118758702357745722886568
Distinct Reads94408521608907422788081
One Read74398421069658622701124
Two Reads1657392384536683850
NRF = Distinct/Total0.79500.68240.9957
PBC1 = OneRead/Distinct0.78800.66480.9962
PBC2 = OneRead/TwoReads4.48892.7817270.7349

Mitochondrial reads are filtered out.

NRF (non redundant fraction)
PBC1 (PCR Bottleneck coefficient 1)
PBC2 (PCR Bottleneck coefficient 2)
PBC1 is the primary measure. Provisionally


Flagstat (filtered/deduped BAM)

Filtered and duplicates removed

rep1rep2ctl1
Total93440371545526422768650
Total(QC-failed)000
Dupes000
Dupes(QC-failed)000
Mapped93440371545526422768650
Mapped(QC-failed)000
% Mapped100.0000100.0000100.0000
Paired000
Paired(QC-failed)000
Read1000
Read1(QC-failed)000
Read2000
Read2(QC-failed)000
Properly Paired000
Properly Paired(QC-failed)000
% Properly Paired0.00000.00000.0000
With itself000
With itself(QC-failed)000
Singletons000
Singletons(QC-failed)000
% Singleton0.00000.00000.0000
Diff. Chroms000
Diff. Chroms (QC-failed)000

Peak calling


IDR (Irreproducible Discovery Rate) plots

rep1-rep2
rep1-rep2
rep1-pr
rep1-pr
rep2-pr
rep2-pr
ppr
ppr

Reproducibility QC and peak detection statistics

The number of peaks is capped at 300K for peak-caller SPP


overlapIDR
Nt250373483
N1103992343
N2152751501
Np333554843
N optimal333554843
N conservative250373483
Optimal Setpprppr
Conservative Setrep1-rep2rep1-rep2
Rescue Ratio1.33221.3905
Self Consistency Ratio1.46891.5610
Reproducibilitypasspass

Overlapping peaks


IDR (Irreproducible Discovery Rate) peaks


Enrichment


Strand cross-correlation measures

Performed on subsampled reads (15M)

rep1rep2
Reads1500000015000000
Est. Fragment Len.140130
Corr. Est. Fragment Len.0.14610.1437
Phantom Peak3535
Corr. Phantom Peak0.15980.1553
Argmin. Corr.15001500
Min. Corr.0.12860.1282
NSC1.13601.1206
RSC0.56030.5712

NOTE1: For SE datasets, reads from replicates are randomly subsampled.
NOTE2: For PE datasets, the first end of each read-pair is selected and the reads are then randomly subsampled.


rep1
rep1
rep2
rep2

Fraction of reads in overlapping peaks

rep1-rep2rep1-prrep2-prppr
Fraction of Reads in Peak0.01740.01560.01080.0208


Fraction of reads in IDR peaks

rep1-rep2rep1-prrep2-prppr
Fraction of Reads in Peak0.00580.00680.00260.0072


Other quality metrics


Fingerprint and Jensen-Shannon distance

rep1rep2
% genome enriched0.20340.2492
AUC0.48080.4851
CHANCE divergence0.35460.2251
Elbow Point0.00000.0000
JS Distance0.61320.5739
Synthetic AUC0.52990.5037
Synthetic Elbow Point0.08100.0453
Synthetic JS Distance0.23300.2327