In [1]:
# Parameters
data_name = "apTreg;8wk;D;En"
modisco_root = "/srv/scratch/msharmin/mouse_hem/with_tfd/full_mouse50/Naive_modisco2019"
tomtom_report_root = "http://mitra.stanford.edu/kundaje/msharmin/report/tomtom_outs/cells"
task_dir = "task_239-naivegw"
perf_file = "/srv/scratch/msharmin/mouse_hem/with_tfd/full_mouse50/fineFactorized/task_239-naivegw/NaiveauPRC.txt"
homer_root = "/srv/scratch/msharmin/mouse_hem/with_tfd/full_mouse50/Naive_scans"
reportfile = "/mnt/lab_data/kundaje/msharmin/annotations/filtering samples_MS2.xlsx"
sheetname = "filter23"
In [2]:
from matlas.reports import display_metadata
load data from labcluster
Using TensorFlow backend.
2019-08-30 19:34:20,300 [WARNING] git-lfs not installed
In [3]:
display_metadata(data_name, perf_file, reportfile, sheetname)
    Sample Information
    MetaData Name     Description
    Cell type         Activated primary T regulatory cells (adult-8wks)
    Cell Group        T cells
    Experiment Name   DHS
    Experiment Group  ENCODE
    Pipeline Output
    replicate                                                          rep1       rep2
    Naïve overlap peaks                                                143218     143218
    IDR peaks                                                          118386     118386
    TSS enrichment (< 8 is very poor, < 10 is low)                     5.9786     13.1811
    Final number of unique mapping, dup-filtered, chrM filtered reads  143352065  25567255
    Number of reads in called peak regions                             33136453   5311831
    Fraction of reads in called peak regions                           0.2312     0.2079
    Number of reads in promoter regions                                18530469   4615865
    Fraction of reads in promoter regions                              0.1293     0.1806
    Number of reads in enhancer regions                                53161456   8824946
    Fraction of reads in enhancer regions                              0.3709     0.3453
    Modelling Metadata
    Metric                                    Value
    auPRC                                     0.6326
    Calibrated Recall at 50% FDR              0.191
    Number of Positive Examples in Test Data  120395
    Number of Negative Examples in Test Data  7950456
    Imbalance Ratio in Test Data              0.0149
    Test Chromosomes                          chr2, chr3, chr19
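The imbalance ratio reported above can be reproduced from the two example counts. The sketch below assumes the ratio is defined as positives divided by all test examples; that definition is inferred from the numbers in the table, not taken from the `matlas` code. Under that definition the ratio is also the precision a random classifier would be expected to reach, which gives the auPRC of 0.6326 a baseline for comparison.

```python
# Counts copied from the Modelling Metadata table.
n_positive = 120395
n_negative = 7950456

# Assumed definition: fraction of test examples that are positive.
imbalance_ratio = n_positive / (n_positive + n_negative)

print(round(imbalance_ratio, 4))  # 0.0149
```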
In [4]:
from matlas.reports import display_paiwise_pattern_comparison
from matlas.reports import display_denovo_patterns
In [5]:
display_denovo_patterns(data_name, modisco_root=modisco_root)
TF-MoDISco is using the TensorFlow backend.
The following two tables list the de novo patterns and the corresponding motifs discovered by TF-MoDISco.
Denovo Patterns by TF-MoDISco: #10
(Modisco column: Sequence, Contrib Scores, and Hyp_Contrib Scores logos for each pattern; images not reproduced here.)
Pattern Name             # seqlets  TF Name(s)
metacluster_1/pattern_0  17072      Ctcf, Ctcfl
metacluster_1/pattern_1  1721       (none listed)
metacluster_1/pattern_2  1539       Ets1, Erg, Elk1, Gabpa, Etv4, Etv2, Elk4, Fli1, Elf2, Spi1,
                                    Etv6, Elk3, Spib, Etv5, Ehf, Elf1, Fev, Etv1, Elf3, Etv3, XP_911724.4,
                                    Irf8, Elf4, Irf4, Stat4, Elf5, Spic, Ets2
metacluster_1/pattern_3  1093       Irf1, Irf2, Stat1, Irf7, Stat2, Prdm1, Irf9, Irf3, Irf5, Batf3
metacluster_1/pattern_4  558        Zfp143, Thap11, Tbx2
metacluster_1/pattern_5  302        Nrf1
metacluster_1/pattern_6  187        (none listed)
metacluster_1/pattern_7  115        Rest
metacluster_1/pattern_8  114        Rfx1, Rfx3, Rfx2, Rfx4, Rfx7, Rfx6, Hic1
metacluster_1/pattern_9  110        Nfyb, Nfyc, Nfya, Foxi1, Pbx3
Motifs by TF-MoDISco: #57
TF Name      Pattern Name             Modisco Significance
Ctcf         metacluster_1/pattern_0  2.09649e-14
             metacluster_1/pattern_1  0.00797185
Ctcfl        metacluster_1/pattern_0  4.81708e-07
Ets1         metacluster_1/pattern_2  0.0048909999999999995
Erg          metacluster_1/pattern_2  0.0048909999999999995
Elk1         metacluster_1/pattern_2  0.0048909999999999995
Gabpa        metacluster_1/pattern_2  0.0048909999999999995
Etv4         metacluster_1/pattern_2  0.0048909999999999995
Etv2         metacluster_1/pattern_2  0.000724362
Elk4         metacluster_1/pattern_2  0.00652665
Fli1         metacluster_1/pattern_2  0.0048909999999999995
Elf2         metacluster_1/pattern_2  0.0138225
Spi1         metacluster_1/pattern_2  0.0009815339999999998
Etv6         metacluster_1/pattern_2  0.00451796
Elk3         metacluster_1/pattern_2  0.0048909999999999995
Spib         metacluster_1/pattern_2  0.00147342
             metacluster_1/pattern_3  0.0090384
Etv5         metacluster_1/pattern_2  0.00151232
Ehf          metacluster_1/pattern_2  0.032016300000000004
Elf1         metacluster_1/pattern_2  0.00453107
Fev          metacluster_1/pattern_2  0.00473208
Etv1         metacluster_1/pattern_2  0.00505338
Elf3         metacluster_1/pattern_2  0.0349434
Etv3         metacluster_1/pattern_2  0.00540004
XP_911724.4  metacluster_1/pattern_2  0.00558446
Irf8         metacluster_1/pattern_2  0.00983152
             metacluster_1/pattern_3  0.000243708
Elf4         metacluster_1/pattern_2  0.00986783
Irf4         metacluster_1/pattern_2  0.010817
             metacluster_1/pattern_3  0.0090384
Stat4        metacluster_1/pattern_2  0.0201183
Elf5         metacluster_1/pattern_2  0.0209007
Spic         metacluster_1/pattern_2  0.044206999999999996
Ets2         metacluster_1/pattern_2  0.0467125
Irf1         metacluster_1/pattern_3  1.19382e-07
Irf2         metacluster_1/pattern_3  0.011589700000000001
Stat1        metacluster_1/pattern_3  2.9105099999999998e-06
Irf7         metacluster_1/pattern_3  0.000117665
Stat2        metacluster_1/pattern_3  0.00017896400000000002
Prdm1        metacluster_1/pattern_3  0.00214383
Irf9         metacluster_1/pattern_3  0.0090384
Irf3         metacluster_1/pattern_3  0.0101396
Irf5         metacluster_1/pattern_3  0.026736799999999998
Batf3        metacluster_1/pattern_3  0.0590679
Zfp143       metacluster_1/pattern_4  1.28049e-21
Thap11       metacluster_1/pattern_4  4.30987e-20
Tbx2         metacluster_1/pattern_4  6.0608e-14
Nrf1         metacluster_1/pattern_5  2.95559e-07
Rest         metacluster_1/pattern_7  1.16533e-15
Rfx1         metacluster_1/pattern_8  3.25784e-07
Rfx3         metacluster_1/pattern_8  0.00387382
Rfx2         metacluster_1/pattern_8  2.8925e-08
Rfx4         metacluster_1/pattern_8  0.00012665200000000002
Rfx7         metacluster_1/pattern_8  0.00136825
Rfx6         metacluster_1/pattern_8  0.00555405
Hic1         metacluster_1/pattern_8  0.053837800000000005
Nfyb         metacluster_1/pattern_9  1.0671200000000001e-08
Nfyc         metacluster_1/pattern_9  2.67982e-05
Nfya         metacluster_1/pattern_9  3.52438e-05
Foxi1        metacluster_1/pattern_9  4.38196e-05
Pbx3         metacluster_1/pattern_9  0.000240169
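Four TFs in the motif list (Ctcf, Spib, Irf8, Irf4) match more than one pattern. A minimal sketch of picking the strongest match per TF follows. It assumes that a smaller significance value means a better match, as with a p-value or q-value; the report does not state which statistic is shown. The helper name `best_pattern_per_tf` is hypothetical and not part of `matlas`.

```python
# (TF name, pattern, significance) rows copied from the motif list
# for the TFs that match more than one pattern.
rows = [
    ("Ctcf", "metacluster_1/pattern_0", 2.09649e-14),
    ("Ctcf", "metacluster_1/pattern_1", 0.00797185),
    ("Spib", "metacluster_1/pattern_2", 0.00147342),
    ("Spib", "metacluster_1/pattern_3", 0.0090384),
    ("Irf8", "metacluster_1/pattern_2", 0.00983152),
    ("Irf8", "metacluster_1/pattern_3", 0.000243708),
    ("Irf4", "metacluster_1/pattern_2", 0.010817),
    ("Irf4", "metacluster_1/pattern_3", 0.0090384),
]


def best_pattern_per_tf(rows):
    """Keep, for each TF, the pattern with the smallest significance value.

    Assumption: smaller values indicate a stronger match.
    """
    best = {}
    for tf, pattern, significance in rows:
        if tf not in best or significance < best[tf][1]:
            best[tf] = (pattern, significance)
    return best


best = best_pattern_per_tf(rows)
for tf, (pattern, significance) in best.items():
    print(tf, pattern, significance)
```

With these rows, Ctcf and Spib keep their first listed pattern, while Irf8 and Irf4 are assigned to metacluster_1/pattern_3, the pattern annotated with the Irf and Stat families.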
In [6]:
display_paiwise_pattern_comparison(data_name, modisco_root, homer_root)
Number of CISBP TFs obtained by TF-MoDISco and Homer
Shared TFs between TF-MoDISco and Homer: #35
TF Name      Modisco Pattern          Modisco Significance    Homer Pattern  Homer Significance
Ctcfl        metacluster_1/pattern_0  4.81708e-07             motif1.motif   0.000161304
Nrf1         metacluster_1/pattern_5  2.95559e-07             motif8.motif   0.000489965
Etv2         metacluster_1/pattern_2  0.000724362             motif3.motif   0.0008935869999999999
Spi1         metacluster_1/pattern_2  0.0009815339999999998   motif3.motif   0.0597383
Erg          metacluster_1/pattern_2  0.0048909999999999995   motif3.motif   0.000232067
Spic         metacluster_1/pattern_2  0.044206999999999996    motif3.motif   0.030470599999999997
Elk1         metacluster_1/pattern_2  0.0048909999999999995   motif3.motif   0.00020406900000000003
Foxi1        metacluster_1/pattern_9  4.38196e-05             motif9.motif   0.00815845
Nfyb         metacluster_1/pattern_9  1.0671200000000001e-08  motif9.motif   1.94959e-06
Pbx3         metacluster_1/pattern_9  0.000240169             motif9.motif   0.0016124000000000002
Zfp143       metacluster_1/pattern_4  1.28049e-21             motif7.motif   1.77671e-06
Nfya         metacluster_1/pattern_9  3.52438e-05             motif9.motif   4.66513e-05
Fev          metacluster_1/pattern_2  0.00473208              motif3.motif   0.000615153
Ctcf         metacluster_1/pattern_0  2.09649e-14             motif1.motif   1.54489e-06
             metacluster_1/pattern_1  0.00797185
Thap11       metacluster_1/pattern_4  4.30987e-20             motif7.motif   5.28193e-07
Etv3         metacluster_1/pattern_2  0.00540004              motif3.motif   0.00037776300000000004
Etv4         metacluster_1/pattern_2  0.0048909999999999995   motif3.motif   0.00316138
Spib         metacluster_1/pattern_2  0.00147342              motif3.motif   0.056596900000000006
             metacluster_1/pattern_3  0.0090384
Gabpa        metacluster_1/pattern_2  0.0048909999999999995   motif3.motif   0.000245427
Nfyc         metacluster_1/pattern_9  2.67982e-05             motif9.motif   4.66513e-05
Elf4         metacluster_1/pattern_2  0.00986783              motif3.motif   0.00183367
Etv6         metacluster_1/pattern_2  0.00451796              motif3.motif   0.00626676
Elf2         metacluster_1/pattern_2  0.0138225               motif3.motif   0.00744
Elf1         metacluster_1/pattern_2  0.00453107              motif3.motif   0.0006631139999999999
Fli1         metacluster_1/pattern_2  0.0048909999999999995   motif3.motif   0.000232067
Elk4         metacluster_1/pattern_2  0.00652665              motif3.motif   0.00020406900000000003
Elf5         metacluster_1/pattern_2  0.0209007               motif3.motif   0.021666
Ehf          metacluster_1/pattern_2  0.032016300000000004    motif3.motif   0.0311782
XP_911724.4  metacluster_1/pattern_2  0.00558446              motif3.motif   0.00036058300000000004
Etv1         metacluster_1/pattern_2  0.00505338              motif3.motif   0.00020406900000000003
Elk3         metacluster_1/pattern_2  0.0048909999999999995   motif3.motif   0.00020406900000000003
Elf3         metacluster_1/pattern_2  0.0349434               motif3.motif   0.0196844
Ets1         metacluster_1/pattern_2  0.0048909999999999995   motif3.motif   0.000232067
Tbx2         metacluster_1/pattern_4  6.0608e-14              motif7.motif   7.86367e-05
Etv5         metacluster_1/pattern_2  0.00151232              motif3.motif   2.83156e-06
Unique TF-MoDISco TFs: #22
TF Name      Modisco Pattern          Modisco Significance    Homer
Batf3        metacluster_1/pattern_3  0.0590679               Absent
Irf9         metacluster_1/pattern_3  0.0090384               Absent
Irf4         metacluster_1/pattern_2  0.010817                Absent
             metacluster_1/pattern_3  0.0090384
Irf5         metacluster_1/pattern_3  0.026736799999999998    Absent
Ets2         metacluster_1/pattern_2  0.0467125               Absent
Irf2         metacluster_1/pattern_3  0.011589700000000001    Absent
Rfx2         metacluster_1/pattern_8  2.8925e-08              Absent
Rfx4         metacluster_1/pattern_8  0.00012665200000000002  Absent
Irf1         metacluster_1/pattern_3  1.19382e-07             Absent
Stat2        metacluster_1/pattern_3  0.00017896400000000002  Absent
Rest         metacluster_1/pattern_7  1.16533e-15             Absent
Irf8         metacluster_1/pattern_2  0.00983152              Absent
             metacluster_1/pattern_3  0.000243708
Prdm1        metacluster_1/pattern_3  0.00214383              Absent
Irf3         metacluster_1/pattern_3  0.0101396               Absent
Rfx1         metacluster_1/pattern_8  3.25784e-07             Absent
Rfx7         metacluster_1/pattern_8  0.00136825              Absent
Rfx6         metacluster_1/pattern_8  0.00555405              Absent
Rfx3         metacluster_1/pattern_8  0.00387382              Absent
Irf7         metacluster_1/pattern_3  0.000117665             Absent
Stat1        metacluster_1/pattern_3  2.9105099999999998e-06  Absent
Stat4        metacluster_1/pattern_2  0.0201183               Absent
Hic1         metacluster_1/pattern_8  0.053837800000000005    Absent
Unique Homer TFs: #32
TF Name      Modisco  Homer Pattern  Homer Significance
Klf15        Absent   motif2.motif   0.00528251
Klf6         Absent   motif2.motif   0.000608834
Sp1          Absent   motif2.motif   0.00451957
E2f4         Absent   motif2.motif   0.000950386
E2f1         Absent   motif2.motif   0.00451957
Sp4          Absent   motif2.motif   0.00860546
Sp3          Absent   motif2.motif   0.000171477
E2f7         Absent   motif2.motif   0.020237599999999998
Rela         Absent   motif18.motif  0.00171879
Klf5         Absent   motif2.motif   0.00497398
Klf1         Absent   motif2.motif   0.035535199999999996
Zfp281       Absent   motif2.motif   0.0470607
Egr1         Absent   motif2.motif   0.040751300000000004
Taf1         Absent   motif15.motif  0.000258915
Nfkb1        Absent   motif18.motif  0.0009283910000000001
Klf7         Absent   motif2.motif   0.00018246200000000002
Sp5          Absent   motif2.motif   0.020727799999999998
Yy1          Absent   motif15.motif  3.5471600000000004e-07
Zfp42        Absent   motif15.motif  0.0108891
Klf12        Absent   motif2.motif   0.0175535
Nfkb2        Absent   motif18.motif  0.00171879
Sp2          Absent   motif2.motif   0.00018246200000000002
Egr2         Absent   motif2.motif   0.035535199999999996
Klf3         Absent   motif2.motif   0.000171477
Klf8         Absent   motif2.motif   0.00033890900000000003
E2f6         Absent   motif2.motif   0.0595131
Spdef        Absent   motif3.motif   0.040258999999999996
Zbtb33       Absent   motif17.motif  0.00985755
Klf4         Absent   motif2.motif   0.0207095
Zbtb1        Absent   motif2.motif   0.0310535
Relb         Absent   motif18.motif  0.0120371
Zbtb17       Absent   motif2.motif   0.00387805
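The comparison above splits the TFs into three groups: shared, unique to TF-MoDISco, and unique to Homer. This corresponds to a set intersection and two set differences over the TF names. The sketch below illustrates that grouping on a small subset of names taken from the tables. The helper `compare_tf_sets` is hypothetical, and this is not how `display_paiwise_pattern_comparison` is known to be implemented.

```python
def compare_tf_sets(modisco_tfs, homer_tfs):
    """Split TF names into shared, TF-MoDISco-only, and Homer-only groups."""
    modisco_tfs, homer_tfs = set(modisco_tfs), set(homer_tfs)
    return {
        "shared": modisco_tfs & homer_tfs,
        "modisco_only": modisco_tfs - homer_tfs,
        "homer_only": homer_tfs - modisco_tfs,
    }


# Illustrative subset of the TF names reported above, not the full lists.
modisco_subset = ["Ctcf", "Nrf1", "Zfp143", "Rest", "Irf1", "Rfx1"]
homer_subset = ["Ctcf", "Nrf1", "Zfp143", "Sp1", "Yy1", "Rela"]

groups = compare_tf_sets(modisco_subset, homer_subset)
for name, tfs in groups.items():
    print(name, sorted(tfs))

# Consistency check on the reported counts: shared plus TF-MoDISco-only
# should equal the 57 TFs in the TF-MoDISco motif list.
n_shared, n_modisco_only, n_homer_only = 35, 22, 32
print(n_shared + n_modisco_only)  # 57
print(n_shared + n_homer_only)    # 67
```

The reported counts are consistent with this reading: 35 shared plus 22 TF-MoDISco-only gives the 57 TFs listed under "Motifs by TF-MoDISco", and Homer matches 67 TFs in total.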