Title: Zfp281

Report generated at Fri Dec 21 14:10:41 PST 2018

Command line arguments

-species mm10 -filt_bam0 /home/users/avsec/oak/basepair-workflow/data/raw/deduplicated_bam/mesc_zfp281_nexus_1_id3546_filtered.bam -filt_bam1 /home/users/avsec/oak/basepair-workflow/data/raw/deduplicated_bam/mesc_zfp281_nexus_2_id3674_filtered.bam -filt_bam2 /home/users/avsec/oak/basepair-workflow/data/raw/deduplicated_bam/mesc_zfp281_nexus_3_id3594_filtered.bam -peak_caller macs2 -shift_macs2 -75 -extsize_macs2 150 -nth 8 -idr_thresh 0.05 -no_dup_removal -out_dir data/Zfp281 -title Zfp281

Pipeline version

Latest git commit SHA1: c44dcbf74eb0935a3a663db32cedc8ce35a4eb79 (Mon Dec 10 14:17:30 2018)


Directories and files
Files    Path
Alignment
Replicate 1
Filtered & deduped bam /home/users/avsec/oak/basepair-workflow/data/raw/deduplicated_bam/mesc_zfp281_nexus_2_id3674_filtered.bam
Tag-align ./align/rep1/mesc_zfp281_nexus_2_id3674_filtered.tagAlign.gz
Replicate 2
Filtered & deduped bam /home/users/avsec/oak/basepair-workflow/data/raw/deduplicated_bam/mesc_zfp281_nexus_3_id3594_filtered.bam
Tag-align ./align/rep2/mesc_zfp281_nexus_3_id3594_filtered.tagAlign.gz
Pooled replicate
Tag-align ./align/pooled_rep/mesc_zfp281_nexus_2_id3674_filtered_mesc_zfp281_nexus_3_id3594_filtered.tagAlign.gz
Pseudo-replicates
Replicate 1
Pseudo-replicate 1
Tag-align ./align/pseudo_reps/rep1/pr1/mesc_zfp281_nexus_2_id3674_filtered.pr1.tagAlign.gz
Pseudo-replicate 2
Tag-align ./align/pseudo_reps/rep1/pr2/mesc_zfp281_nexus_2_id3674_filtered.pr2.tagAlign.gz
Replicate 2
Pseudo-replicate 1
Tag-align ./align/pseudo_reps/rep2/pr1/mesc_zfp281_nexus_3_id3594_filtered.pr1.tagAlign.gz
Pseudo-replicate 2
Tag-align ./align/pseudo_reps/rep2/pr2/mesc_zfp281_nexus_3_id3594_filtered.pr2.tagAlign.gz
Pooled pseudo-replicates
Pooled pseudo-replicate 1
Tag-align ./align/pooled_pseudo_reps/ppr1/mesc_zfp281_nexus_2_id3674_filtered.pr1_pooled.tagAlign.gz
Pooled pseudo-replicate 2
Tag-align ./align/pooled_pseudo_reps/ppr2/mesc_zfp281_nexus_2_id3674_filtered.pr2_pooled.tagAlign.gz
Signal tracks
MACS2
Replicate 1
P-value ./signal/macs2/rep1/mesc_zfp281_nexus_2_id3674_filtered.pval.signal.bw
Fold enrichment ./signal/macs2/rep1/mesc_zfp281_nexus_2_id3674_filtered.fc.signal.bw
Replicate 2
P-value ./signal/macs2/rep2/mesc_zfp281_nexus_3_id3594_filtered.pval.signal.bw
Fold enrichment ./signal/macs2/rep2/mesc_zfp281_nexus_3_id3594_filtered.fc.signal.bw
Pooled replicate
P-value ./signal/macs2/pooled_rep/mesc_zfp281_nexus_2_id3674_filtered_mesc_zfp281_nexus_3_id3594_filtered.pval.signal.bw
Fold enrichment ./signal/macs2/pooled_rep/mesc_zfp281_nexus_2_id3674_filtered_mesc_zfp281_nexus_3_id3594_filtered.fc.signal.bw
Peaks
MACS2
Replicate 1
Narrow peak ./peak/macs2/rep1/mesc_zfp281_nexus_2_id3674_filtered.pval0.01.500K.narrowPeak.gz
Replicate 2
Narrow peak ./peak/macs2/rep2/mesc_zfp281_nexus_3_id3594_filtered.pval0.01.500K.narrowPeak.gz
Pooled replicate
Narrow peak ./peak/macs2/pooled_rep/mesc_zfp281_nexus_2_id3674_filtered_mesc_zfp281_nexus_3_id3594_filtered.pval0.01.500K.narrowPeak.gz
Pseudo-replicates
Replicate 1
Pseudo-replicate 1
Narrow peak ./peak/macs2/pseudo_reps/rep1/pr1/mesc_zfp281_nexus_2_id3674_filtered.pr1.pval0.01.500K.narrowPeak.gz
Pseudo-replicate 2
Narrow peak ./peak/macs2/pseudo_reps/rep1/pr2/mesc_zfp281_nexus_2_id3674_filtered.pr2.pval0.01.500K.narrowPeak.gz
Replicate 2
Pseudo-replicate 1
Narrow peak ./peak/macs2/pseudo_reps/rep2/pr1/mesc_zfp281_nexus_3_id3594_filtered.pr1.pval0.01.500K.narrowPeak.gz
Pseudo-replicate 2
Narrow peak ./peak/macs2/pseudo_reps/rep2/pr2/mesc_zfp281_nexus_3_id3594_filtered.pr2.pval0.01.500K.narrowPeak.gz
Pooled pseudo-replicate
Pooled pseudo-replicate 1
Narrow peak ./peak/macs2/pooled_pseudo_reps/ppr1/mesc_zfp281_nexus_2_id3674_filtered.pr1_pooled.pval0.01.500K.narrowPeak.gz
Pooled pseudo-replicate 2
Narrow peak ./peak/macs2/pooled_pseudo_reps/ppr2/mesc_zfp281_nexus_2_id3674_filtered.pr2_pooled.pval0.01.500K.narrowPeak.gz
Naive overlap
True replicates
Rep. 1 vs. Rep. 2
Overlapping peak ./peak/macs2/overlap/true_reps/rep1-rep2/Zfp281_rep1-rep2.naive_overlap.filt.narrowPeak.gz
Pseudo-replicates
Replicate 1
Overlapping peak ./peak/macs2/overlap/pseudo_reps/rep1/Zfp281_rep1-pr.naive_overlap.filt.narrowPeak.gz
Replicate 2
Overlapping peak ./peak/macs2/overlap/pseudo_reps/rep2/Zfp281_rep2-pr.naive_overlap.filt.narrowPeak.gz
Pooled pseudo-replicates
Overlapping peak ./peak/macs2/overlap/pooled_pseudo_reps/Zfp281_ppr.naive_overlap.filt.narrowPeak.gz
Optimal set
Overlapping peak ./peak/macs2/overlap/optimal_set/Zfp281_ppr.naive_overlap.filt.narrowPeak.gz
Conservative set
Overlapping peak ./peak/macs2/overlap/conservative_set/Zfp281_rep1-rep2.naive_overlap.filt.narrowPeak.gz
IDR
True replicates
Rep. 1 vs. Rep. 2
IDR peak ./peak/macs2/idr/true_reps/rep1-rep2/Zfp281_rep1-rep2.IDR0.05.filt.narrowPeak.gz
Pseudo-replicates
Replicate 1
IDR peak ./peak/macs2/idr/pseudo_reps/rep1/Zfp281_rep1-pr.IDR0.05.filt.narrowPeak.gz
Replicate 2
IDR peak ./peak/macs2/idr/pseudo_reps/rep2/Zfp281_rep2-pr.IDR0.05.filt.narrowPeak.gz
Pooled pseudo-replicates
IDR peak ./peak/macs2/idr/pooled_pseudo_reps/Zfp281_ppr.IDR0.05.filt.narrowPeak.gz
Optimal set
IDR peak ./peak/macs2/idr/optimal_set/Zfp281_ppr.IDR0.05.filt.narrowPeak.gz
Conservative set
IDR peak ./peak/macs2/idr/conservative_set/Zfp281_rep1-rep2.IDR0.05.filt.narrowPeak.gz
QC and logs
Replicate 1
Cross-corr. log ./qc/rep1/mesc_zfp281_nexus_2_id3674_filtered.no_chrM.15M.cc.qc
Cross-corr. plot ./qc/rep1/mesc_zfp281_nexus_2_id3674_filtered.no_chrM.15M.cc.plot.pdf
Replicate 2
Cross-corr. log ./qc/rep2/mesc_zfp281_nexus_3_id3594_filtered.no_chrM.15M.cc.qc
Cross-corr. plot ./qc/rep2/mesc_zfp281_nexus_3_id3594_filtered.no_chrM.15M.cc.plot.pdf
Naive overlap
Overlap QC log ./qc/Zfp281_peak_overlap_final.qc
True replicates
Rep. 1 vs. Rep. 2
FRiP ./peak/macs2/overlap/true_reps/rep1-rep2/Zfp281_rep1-rep2.naive_overlap.filt.narrowPeak.FRiP.qc
Pseudo-replicates
Replicate 1
FRiP ./peak/macs2/overlap/pseudo_reps/rep1/Zfp281_rep1-pr.naive_overlap.filt.narrowPeak.FRiP.qc
Replicate 2
FRiP ./peak/macs2/overlap/pseudo_reps/rep2/Zfp281_rep2-pr.naive_overlap.filt.narrowPeak.FRiP.qc
IDR
IDR QC log ./qc/Zfp281_IDR_final.qc
True replicates
Rep. 1 vs. Rep. 2
FRiP ./peak/macs2/idr/true_reps/rep1-rep2/Zfp281_rep1-rep2.IDR0.05.filt.narrowPeak.FRiP.qc
IDR plot ./peak/macs2/idr/true_reps/rep1-rep2/Zfp281_rep1-rep2.unthresholded-peaks.txt.png
Pseudo-replicates
Replicate 1
FRiP ./peak/macs2/idr/pseudo_reps/rep1/Zfp281_rep1-pr.IDR0.05.filt.narrowPeak.FRiP.qc
IDR plot ./peak/macs2/idr/pseudo_reps/rep1/Zfp281_rep1-pr.unthresholded-peaks.txt.png
Replicate 2
FRiP ./peak/macs2/idr/pseudo_reps/rep2/Zfp281_rep2-pr.IDR0.05.filt.narrowPeak.FRiP.qc
IDR plot ./peak/macs2/idr/pseudo_reps/rep2/Zfp281_rep2-pr.unthresholded-peaks.txt.png
Pooled pseudo-replicates
IDR plot ./peak/macs2/idr/pooled_pseudo_reps/Zfp281_ppr.unthresholded-peaks.txt.png

Visualization (add -url_base [URL_ROOT_DIR_FOR_OUT_DIR] to the command line.)



Workflow diagram (g./b. peak : gapped/broad peak from MACS2, n. peak : narrow peak from MACS2, r. peak : relaxed peak from SPP)


Enrichment QC (strand cross-correlation measures)
                          rep1               rep2
Reads                     15000000           15000000
Est. Fragment Len.        -15                -25
Corr. Est. Fragment Len.  0.221142614841769  0.215661534214676
Phantom Peak              40                 40
Corr. Phantom Peak        0.2268459          0.2179461
Argmin. Corr.             1500               1500
Min. Corr.                0.1884683          0.1904444
NSC                       1.173368           1.132412
RSC                       0.8513903          0.9169314

NOTE: Reads from replicates are subsampled to a max of 15M

  • Normalized strand cross-correlation coefficient (NSC) = col9 in outFile
  • Relative strand cross-correlation coefficient (RSC) = col10 in outFile
  • Estimated fragment length = col3 in outFile, take the top value
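The column positions listed above can be read straight from a cross-correlation log. The following is a minimal sketch, assuming the log is a single tab-separated line with 1-based columns and that "top value" means the first comma-separated entry in col3. The function name `parse_cc_qc`, the file name in the example, and the example line itself are hypothetical; the numeric values are copied from the rep1 column of the table above.

```python
def parse_cc_qc(line):
    """Extract fragment length, NSC, and RSC from one tab-separated cc.qc line."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 10:
        raise ValueError(f"expected at least 10 columns, got {len(fields)}")
    # col3 may hold several comma-separated candidates; keep the first one
    # (assumption: this is what "take the top value" refers to).
    est_frag_len = int(fields[2].split(",")[0])
    nsc = float(fields[8])  # col9
    rsc = float(fields[9])  # col10
    return {"est_frag_len": est_frag_len, "nsc": nsc, "rsc": rsc}


# Hypothetical line assembled from the rep1 values reported above.
example = "\t".join([
    "rep1.tagAlign.gz", "15000000", "-15", "0.221142614841769", "40",
    "0.2268459", "1500", "0.1884683", "1.173368", "0.8513903",
])
result = parse_cc_qc(example)
print(result)
```

The reported values are consistent with NSC being the fragment-length correlation divided by the minimum correlation (0.2211426 / 0.1884683 ≈ 1.1734 for rep1).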


[Cross-correlation plots: rep1, rep2]
Number of peaks
rep1      100865
rep1-pr1  100713
rep1-pr2  100264
rep2      108765
rep2-pr1  88486
rep2-pr2  88464
pooled    131012
ppr1      104747
ppr2      104688

  • ppr1: Raw peaks called on the first pooled pseudoreplicate
  • ppr2: Raw peaks called on the second pooled pseudoreplicate
  • repi: Raw peaks called on true replicate i
  • repi-pr1: Raw peaks called on the first pseudoreplicate from replicate i
  • repi-pr2: Raw peaks called on the second pseudoreplicate from replicate i
  • overlap : Overlapped peaks (filtered if blacklist exists)


Enrichment QC (Fraction of reads in raw peaks)
                           pooled_rep  rep1      rep2
Fraction of Reads in Peak  0.179888    0.154249  0.164521

Enrichment QC (Fraction of reads in overlapping peaks)
                           rep1-pr   rep1-rep2  rep2-pr
Fraction of Reads in Peak  0.131884  0.156338   0.140813

  • ppr: Overlapping peaks comparing pooled pseudo replicates
  • rep1-pr: Overlapping peaks comparing pseudoreplicates from replicate 1
  • rep2-pr: Overlapping peaks comparing pseudoreplicates from replicate 2
  • repi-repj: Overlapping peaks comparing true replicates (rep i vs. rep j)
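The "Fraction of Reads in Peak" values above are ratios of reads falling in a peak set to all reads. The report does not show how the pipeline computes them, so the following is only a toy sketch of the idea: it assumes half-open `(chrom, start, end)` intervals and counts a read as "in peak" if it overlaps any peak by at least one base. The function names and the example coordinates are hypothetical.

```python
def overlaps(a_start, a_end, b_start, b_end):
    """True if two half-open intervals share at least one base."""
    return a_start < b_end and b_start < a_end


def frip(reads, peaks):
    """Fraction of reads that overlap at least one peak on the same chromosome."""
    if not reads:
        raise ValueError("no reads given")
    in_peaks = sum(
        1
        for chrom, start, end in reads
        if any(
            chrom == p_chrom and overlaps(start, end, p_start, p_end)
            for p_chrom, p_start, p_end in peaks
        )
    )
    return in_peaks / len(reads)


# Hypothetical toy data: two of the four reads overlap a peak.
reads = [("chr1", 100, 150), ("chr1", 400, 450), ("chr2", 10, 60), ("chr2", 500, 550)]
peaks = [("chr1", 120, 300), ("chr2", 0, 50)]
toy_frip = frip(reads, peaks)
print(toy_frip)
```

The nested scan is quadratic and only suitable for illustration; on real tagAlign and narrowPeak files an interval-intersection tool would be the practical choice.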


Reproducibility QC and Peak Detection Statistics (Overlapping peaks)
Nt                      70480
N1                      55535
N2                      56830
Np                      72459
N optimal               72459
N conservative          70480
Optimal Set             pooled_pseudo_rep
Conservative Set        rep1-rep2
Rescue Ratio            1.0280788876276958
Self Consistency Ratio  1.02331862789232
Reproducibility Test    pass

  • N1: Replicate 1 self-consistent overlapping peaks (comparing two pseudoreplicates generated by subsampling Rep1 reads)
  • N2: Replicate 2 self-consistent overlapping peaks (comparing two pseudoreplicates generated by subsampling Rep2 reads)
  • Nt: True Replicate consistent overlapping peaks (comparing true replicates Rep1 vs Rep2 )
  • Np: Pooled-pseudoreplicate consistent overlapping peaks (comparing two pseudoreplicates generated by subsampling pooled reads from Rep1 and Rep2 )
  • Self-consistency Ratio: max(N1,N2) / min (N1,N2)
  • Rescue Ratio: max(Np,Nt) / min (Np,Nt)
  • Reproducibility Test: If Self-consistency Ratio >2 AND Rescue Ratio > 2, then 'Fail' else 'Pass'
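The two ratios and the pass/fail rule defined above can be checked against the reported counts. This is a minimal sketch that follows the definitions in the list literally; the function name `reproducibility_qc` and its return layout are hypothetical, and the inputs are the overlapping-peak counts from the table above.

```python
def reproducibility_qc(n1, n2, nt, n_p):
    """Apply the ratio definitions and the fail rule exactly as listed above."""
    self_consistency = max(n1, n2) / min(n1, n2)
    rescue = max(n_p, nt) / min(n_p, nt)
    # 'Fail' only when both ratios exceed 2, otherwise 'Pass'.
    failed = self_consistency > 2 and rescue > 2
    return {
        "self_consistency_ratio": self_consistency,
        "rescue_ratio": rescue,
        "reproducibility_test": "fail" if failed else "pass",
    }


# Counts from the overlapping-peaks table: N1, N2, Nt, Np.
qc = reproducibility_qc(n1=55535, n2=56830, nt=70480, n_p=72459)
print(qc)
```

With these inputs both ratios come out close to 1.03 and 1.02, which matches the reported Rescue Ratio and Self Consistency Ratio, so the test result is "pass".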


Enrichment QC (Fraction of reads in IDR peaks)
                           rep1-pr    rep1-rep2  rep2-pr
Fraction of Reads in Peak  0.0991258  0.122527   0.104484

  • ppr: IDR peaks comparing pooled pseudo replicates
  • rep1-pr: IDR peaks comparing pseudoreplicates from replicate 1
  • rep2-pr: IDR peaks comparing pseudoreplicates from replicate 2
  • repi-repj: IDR peaks comparing true replicates (rep i vs. rep j)


Reproducibility QC and Peak Detection Statistics (Irreproducible Discovery Rate)
Nt                      33831
N1                      24075
N2                      24584
Np                      35439
N optimal               35439
N conservative          33831
Optimal Set             pooled_pseudo_rep
Conservative Set        rep1-rep2
Rescue Ratio            1.047530371552718
Self Consistency Ratio  1.0211422637590861
Reproducibility Test    pass

  • N1: Replicate 1 self-consistent IDR 0.05 peaks (comparing two pseudoreplicates generated by subsampling Rep1 reads)
  • N2: Replicate 2 self-consistent IDR 0.05 peaks (comparing two pseudoreplicates generated by subsampling Rep2 reads)
  • Nt: True Replicate consistent IDR 0.05 peaks (comparing true replicates Rep1 vs Rep2 )
  • Np: Pooled-pseudoreplicate consistent IDR 0.05 peaks (comparing two pseudoreplicates generated by subsampling pooled reads from Rep1 and Rep2 )
  • Self-consistency Ratio: max(N1,N2) / min (N1,N2)
  • Rescue Ratio: max(Np,Nt) / min (Np,Nt)
  • Reproducibility Test: If Self-consistency Ratio >2 AND Rescue Ratio > 2, then 'Fail' else 'Pass'


[IDR plots: true reps (rep1-rep2), pooled pseudo-reps, rep1 pseudo-reps, rep2 pseudo-reps]