In [1]:
# Parameters
data_name = "immatureB_SPLND;CD43-_CD11b-;D;GEO"
modisco_root = "/srv/scratch/msharmin/mouse_hem/with_tfd/full_mouse50/Naive_modisco2019"
tomtom_report_root = "http://mitra.stanford.edu/kundaje/msharmin/report/tomtom_outs/cells"
task_dir = "task_210-naivegw"
perf_file = "/srv/scratch/msharmin/mouse_hem/with_tfd/full_mouse50/fineFactorized/task_210-naivegw/NaiveauPRC.txt"
homer_root = "/srv/scratch/msharmin/mouse_hem/with_tfd/full_mouse50/Naive_scans"
reportfile = "/mnt/lab_data/kundaje/msharmin/annotations/filtering samples_MS2.xlsx"
sheetname = "filter23"
In [2]:
from matlas.reports import display_metadata
load data from labcluster
Using TensorFlow backend.
2019-08-30 19:28:21,512 [WARNING] git-lfs not installed
In [3]:
display_metadata(data_name, perf_file, reportfile, sheetname)
    Sample Information
    MetaData Name | Description
    Cell type | mouse spleen B cells, CD43-,CD11b- (immature B cells)
    Cell Group | B cells
    Experiment Name | DHS
    Experiment Group | GEO
    Pipeline Output
    replicate | Naïve overlap peaks | IDR peaks | TSS enrichment (< 8 is very poor, < 10 is low) | Final number of unique mapping, dup-filtered, chrM filtered reads | Number of reads in called peak regions | Fraction of reads in called peak regions | Number of reads in promoter regions | Fraction of reads in promoter regions | Number of reads in enhancer regions | Fraction of reads in enhancer regions
    rep1 125656994077.0454180941018 NA NA NA NA NA NA
    Modelling Metadata
    Metric | Value
    auPRC | 0.6158
    Calibrated Recall at 50% FDR | 0.217
    Number of Positive Examples in Test Data | 84892
    Number of Negative Examples in Test Data | 7985959
    Imbalance Ratio in Test Data | 0.0105
    Test Chromosomes | chr2, chr3, chr19
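As a quick sanity check on the modelling metadata, the reported imbalance ratio can be recomputed from the two example counts. The sketch below assumes the ratio is defined as positives divided by all test examples. The report does not state the definition, but this one reproduces the reported 0.0105, whereas positives divided by negatives rounds to 0.0106. All variable names are illustrative and not part of matlas.

```python
# Counts copied from the Modelling Metadata table.
n_pos = 84892
n_neg = 7985959

# Assumption: imbalance ratio = positives / (positives + negatives).
ratio_of_total = n_pos / (n_pos + n_neg)

# Alternative definition, shown only for comparison.
ratio_of_negatives = n_pos / n_neg

print(round(ratio_of_total, 4))      # 0.0105
print(round(ratio_of_negatives, 4))  # 0.0106
```

The positive fraction is also roughly the auPRC a random classifier would be expected to reach, which gives some context for the reported auPRC of 0.6158.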
In [4]:
from matlas.reports import display_paiwise_pattern_comparison
from matlas.reports import display_denovo_patterns
In [5]:
display_denovo_patterns(data_name, modisco_root=modisco_root)
TF-MoDISco is using the TensorFlow backend.
The following two lists show the de novo patterns and the corresponding motifs discovered by TF-MoDISco.
Denovo Patterns by TF-MoDISco: #8
Modisco logo panels per pattern: Sequence, Contrib Scores, Hyp_Contrib Scores
Pattern Name | # seqlets | TF Name(s)
metacluster_1/pattern_0 | 11825 | Ctcf, Ctcfl
metacluster_1/pattern_1 | 1459 | Spi1, Spib, Irf8, Irf4, Prdm1, Elf3, Spic, Irf3, Ehf, Etv2, Ets1, Irf2, Irf1, Elf5, Gabpa, Elf1, Erg, Etv6, Etv4, Elk1, Elf4, Etv3, Elf2, Fli1, XP_911724.4, Bcl6
metacluster_1/pattern_2 | 893 |
metacluster_1/pattern_3 | 861 | Stat2, Zscan10, Irf7, Irf9, Stat1
metacluster_1/pattern_4 | 411 | Zfp143, Thap11, Tbx2
metacluster_1/pattern_5 | 39 | Nrf1, Sp2, Sp3, Klf3, Trp53, Zfx, Sp1
metacluster_1/pattern_6 | 33 | Rfx1, Rfx3, Rfx2, Rfx4, Rfx7, Rfx6
metacluster_1/pattern_7 | 57 | Usf2, Elk4, Etv1, Creb1, Elk3, Etv5, Fev, Atf1, Atf2, Atf7, Crem
Motifs by TF-MoDISco: #60
TF Name | Pattern Name | Significance
Ctcf | metacluster_1/pattern_0 | 2.57772e-15
Ctcf | metacluster_1/pattern_2 | 0.00419748
Ctcfl | metacluster_1/pattern_0 | 3.06874e-07
Ctcfl | metacluster_1/pattern_2 | 0.051682500000000006
Spi1 | metacluster_1/pattern_1 | 0.00140627
Spi1 | metacluster_1/pattern_3 | 0.0060034
Spib | metacluster_1/pattern_1 | 5.977530000000001e-10
Spib | metacluster_1/pattern_3 | 5.21979e-07
Irf8 | metacluster_1/pattern_1 | 7.37669e-07
Irf8 | metacluster_1/pattern_3 | 4.5965699999999997e-14
Irf4 | metacluster_1/pattern_1 | 1.3677500000000002e-06
Irf4 | metacluster_1/pattern_3 | 0.0481952
Prdm1 | metacluster_1/pattern_1 | 0.00037988199999999995
Prdm1 | metacluster_1/pattern_3 | 0.00817878
Elf3 | metacluster_1/pattern_1 | 0.039593500000000004
Spic | metacluster_1/pattern_1 | 0.00442074
Spic | metacluster_1/pattern_3 | 0.0491048
Irf3 | metacluster_1/pattern_1 | 0.000728658
Irf3 | metacluster_1/pattern_3 | 0.00212942
Ehf | metacluster_1/pattern_1 | 0.0008959430000000001
Etv2 | metacluster_1/pattern_1 | 0.00103638
Ets1 | metacluster_1/pattern_1 | 0.039267800000000005
Ets1 | metacluster_1/pattern_7 | 0.0347686
Irf2 | metacluster_1/pattern_1 | 0.00224878
Irf2 | metacluster_1/pattern_3 | 0.043151
Irf1 | metacluster_1/pattern_1 | 0.00320247
Irf1 | metacluster_1/pattern_3 | 4.81017e-05
Elf5 | metacluster_1/pattern_1 | 0.051592700000000005
Gabpa | metacluster_1/pattern_1 | 0.0431615
Gabpa | metacluster_1/pattern_7 | 0.0205861
Elf1 | metacluster_1/pattern_1 | 0.00904088
Elf1 | metacluster_1/pattern_7 | 0.014602799999999999
Erg | metacluster_1/pattern_1 | 0.043825499999999996
Erg | metacluster_1/pattern_7 | 0.026714599999999995
Etv6 | metacluster_1/pattern_1 | 0.023893
Etv4 | metacluster_1/pattern_1 | 0.0180588
Etv4 | metacluster_1/pattern_7 | 0.020133900000000003
Elk1 | metacluster_1/pattern_1 | 0.045883600000000004
Elk1 | metacluster_1/pattern_7 | 0.0172563
Elf4 | metacluster_1/pattern_1 | 0.0335649
Etv3 | metacluster_1/pattern_1 | 0.039267800000000005
Elf2 | metacluster_1/pattern_1 | 0.0508199
Fli1 | metacluster_1/pattern_1 | 0.051592700000000005
Fli1 | metacluster_1/pattern_7 | 0.033017000000000005
XP_911724.4 | metacluster_1/pattern_1 | 0.043825499999999996
Bcl6 | metacluster_1/pattern_1 | 0.0508199
Stat2 | metacluster_1/pattern_3 | 0.00701224
Zscan10 | metacluster_1/pattern_3 | 0.00893613
Irf7 | metacluster_1/pattern_3 | 0.00902612
Irf9 | metacluster_1/pattern_3 | 0.016780200000000002
Stat1 | metacluster_1/pattern_3 | 0.018491099999999996
Zfp143 | metacluster_1/pattern_4 | 3.9730399999999997e-19
Thap11 | metacluster_1/pattern_4 | 1.8293599999999996e-18
Tbx2 | metacluster_1/pattern_4 | 1.77113e-13
Nrf1 | metacluster_1/pattern_5 | 9.061260000000001e-05
Sp2 | metacluster_1/pattern_5 | 0.00933334
Sp2 | metacluster_1/pattern_7 | 0.00480167
Sp3 | metacluster_1/pattern_5 | 0.026040300000000002
Sp3 | metacluster_1/pattern_7 | 0.00748667
Klf3 | metacluster_1/pattern_5 | 0.0359087
Trp53 | metacluster_1/pattern_5 | 0.057483000000000006
Zfx | metacluster_1/pattern_5 | 0.057483000000000006
Zfx | metacluster_1/pattern_7 | 0.00752257
Sp1 | metacluster_1/pattern_5 | 0.057483000000000006
Sp1 | metacluster_1/pattern_7 | 0.020701400000000002
Rfx1 | metacluster_1/pattern_6 | 5.15369e-05
Rfx3 | metacluster_1/pattern_6 | 0.029669099999999997
Rfx2 | metacluster_1/pattern_6 | 2.66701e-07
Rfx4 | metacluster_1/pattern_6 | 0.000788975
Rfx7 | metacluster_1/pattern_6 | 0.00900075
Rfx6 | metacluster_1/pattern_6 | 0.00437395
Usf2 | metacluster_1/pattern_7 | 0.00294219
Elk4 | metacluster_1/pattern_7 | 0.0241391
Etv1 | metacluster_1/pattern_7 | 0.00661023
Creb1 | metacluster_1/pattern_7 | 0.020701400000000002
Elk3 | metacluster_1/pattern_7 | 0.00684215
Etv5 | metacluster_1/pattern_7 | 0.00661023
Fev | metacluster_1/pattern_7 | 0.00684215
Atf1 | metacluster_1/pattern_7 | 0.00715435
Atf2 | metacluster_1/pattern_7 | 0.026714599999999995
Atf7 | metacluster_1/pattern_7 | 0.046692199999999996
Crem | metacluster_1/pattern_7 | 0.0486329
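Several TFs match more than one pattern, so it can help to pick the strongest match per TF. The sketch below does this for a few rows copied from the motif list above. It assumes that a smaller significance value means a stronger match, which the report does not state explicitly, and the `rows` and `best` names are illustrative rather than part of matlas.

```python
# A few (TF, pattern, significance) rows copied from the motif list.
rows = [
    ("Ctcf", "metacluster_1/pattern_0", 2.57772e-15),
    ("Ctcf", "metacluster_1/pattern_2", 0.00419748),
    ("Irf8", "metacluster_1/pattern_1", 7.37669e-07),
    ("Irf8", "metacluster_1/pattern_3", 4.5965699999999997e-14),
    ("Ets1", "metacluster_1/pattern_1", 0.039267800000000005),
    ("Ets1", "metacluster_1/pattern_7", 0.0347686),
]

# Assumption: a smaller significance value means a stronger match.
best = {}
for tf, pattern, significance in rows:
    if tf not in best or significance < best[tf][1]:
        best[tf] = (pattern, significance)

for tf, (pattern, significance) in best.items():
    print(f"{tf}: {pattern} ({significance:.3g})")
```

Under that assumption, Irf8 and Ets1 are best matched by pattern_3 and pattern_7 respectively, although both are listed under pattern_1 in the de novo pattern list.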
In [6]:
display_paiwise_pattern_comparison(data_name, modisco_root, homer_root)
Number of CISBP TFs obtained by TF-MoDISco and Homer
Shared TFs between TF-MoDISco and Homer: #48
TF Name | Method | Pattern Name | Significance
XP_911724.4 | Modisco | metacluster_1/pattern_1 | 0.043825499999999996
XP_911724.4 | Homer | motif3.motif | 0.000169657
Elf1 | Modisco | metacluster_1/pattern_1 | 0.00904088
Elf1 | Modisco | metacluster_1/pattern_7 | 0.014602799999999999
Elf1 | Homer | motif3.motif | 6.58593e-05
Prdm1 | Modisco | metacluster_1/pattern_1 | 0.00037988199999999995
Prdm1 | Modisco | metacluster_1/pattern_3 | 0.00817878
Prdm1 | Homer | motif3.motif | 0.015265299999999999
Prdm1 | Homer | motif11.motif | 0.00890098
Elf2 | Modisco | metacluster_1/pattern_1 | 0.0508199
Elf2 | Homer | motif3.motif | 0.037856400000000005
Stat1 | Modisco | metacluster_1/pattern_3 | 0.018491099999999996
Stat1 | Homer | motif11.motif | 0.0522766
Irf4 | Modisco | metacluster_1/pattern_1 | 1.3677500000000002e-06
Irf4 | Modisco | metacluster_1/pattern_3 | 0.0481952
Irf4 | Homer | motif3.motif | 0.00155049
Irf4 | Homer | motif11.motif | 0.00316696
Sp2 | Modisco | metacluster_1/pattern_5 | 0.00933334
Sp2 | Modisco | metacluster_1/pattern_7 | 0.00480167
Sp2 | Homer | motif2.motif | 0.000427125
Fev | Modisco | metacluster_1/pattern_7 | 0.00684215
Fev | Homer | motif3.motif | 0.000646821
Ctcfl | Modisco | metacluster_1/pattern_0 | 3.06874e-07
Ctcfl | Modisco | metacluster_1/pattern_2 | 0.051682500000000006
Ctcfl | Homer | motif1.motif | 4.6891499999999996e-05
Etv6 | Modisco | metacluster_1/pattern_1 | 0.023893
Etv6 | Homer | motif3.motif | 0.00700372
Klf3 | Modisco | metacluster_1/pattern_5 | 0.0359087
Klf3 | Homer | motif2.motif | 0.000427035
Spi1 | Modisco | metacluster_1/pattern_1 | 0.00140627
Spi1 | Modisco | metacluster_1/pattern_3 | 0.0060034
Spi1 | Homer | motif3.motif | 0.00241045
Creb1 | Modisco | metacluster_1/pattern_7 | 0.020701400000000002
Creb1 | Homer | motif15.motif | 0.0380614
Etv3 | Modisco | metacluster_1/pattern_1 | 0.039267800000000005
Etv3 | Homer | motif3.motif | 0.000169657
Elk3 | Modisco | metacluster_1/pattern_7 | 0.00684215
Elk3 | Homer | motif3.motif | 0.000169657
Erg | Modisco | metacluster_1/pattern_1 | 0.043825499999999996
Erg | Modisco | metacluster_1/pattern_7 | 0.026714599999999995
Erg | Homer | motif3.motif | 6.58593e-05
Crem | Modisco | metacluster_1/pattern_7 | 0.0486329
Crem | Homer | motif15.motif | 0.0380614
Sp3 | Modisco | metacluster_1/pattern_5 | 0.026040300000000002
Sp3 | Modisco | metacluster_1/pattern_7 | 0.00748667
Sp3 | Homer | motif2.motif | 0.000427035
Atf1 | Modisco | metacluster_1/pattern_7 | 0.00715435
Atf1 | Homer | motif15.motif | 0.045450800000000006
Irf1 | Modisco | metacluster_1/pattern_1 | 0.00320247
Irf1 | Modisco | metacluster_1/pattern_3 | 4.81017e-05
Irf1 | Homer | motif3.motif | 0.0322095
Irf1 | Homer | motif11.motif | 0.000782635
Stat2 | Modisco | metacluster_1/pattern_3 | 0.00701224
Stat2 | Homer | motif11.motif | 0.0159467
Etv2 | Modisco | metacluster_1/pattern_1 | 0.00103638
Etv2 | Homer | motif3.motif | 0.00020083200000000002
Spib | Modisco | metacluster_1/pattern_1 | 5.977530000000001e-10
Spib | Modisco | metacluster_1/pattern_3 | 5.21979e-07
Spib | Homer | motif3.motif | 0.000206299
Spib | Homer | motif11.motif | 0.0129755
Elk4 | Modisco | metacluster_1/pattern_7 | 0.0241391
Elk4 | Homer | motif3.motif | 6.58593e-05
Irf8 | Modisco | metacluster_1/pattern_1 | 7.37669e-07
Irf8 | Modisco | metacluster_1/pattern_3 | 4.5965699999999997e-14
Irf8 | Homer | motif3.motif | 0.00133733
Irf8 | Homer | motif11.motif | 0.000782635
Elf4 | Modisco | metacluster_1/pattern_1 | 0.0335649
Elf4 | Homer | motif3.motif | 0.000964342
Fli1 | Modisco | metacluster_1/pattern_1 | 0.051592700000000005
Fli1 | Modisco | metacluster_1/pattern_7 | 0.033017000000000005
Fli1 | Homer | motif3.motif | 6.58593e-05
Etv1 | Modisco | metacluster_1/pattern_7 | 0.00661023
Etv1 | Homer | motif3.motif | 0.00012161299999999999
Thap11 | Modisco | metacluster_1/pattern_4 | 1.8293599999999996e-18
Thap11 | Homer | motif5.motif | 2.96253e-07
Nrf1 | Modisco | metacluster_1/pattern_5 | 9.061260000000001e-05
Nrf1 | Homer | motif6.motif | 0.00143337
Ets1 | Modisco | metacluster_1/pattern_1 | 0.039267800000000005
Ets1 | Modisco | metacluster_1/pattern_7 | 0.0347686
Ets1 | Homer | motif3.motif | 6.58593e-05
Elk1 | Modisco | metacluster_1/pattern_1 | 0.045883600000000004
Elk1 | Modisco | metacluster_1/pattern_7 | 0.0172563
Elk1 | Homer | motif3.motif | 6.58593e-05
Elf3 | Modisco | metacluster_1/pattern_1 | 0.039593500000000004
Elf3 | Homer | motif3.motif | 0.00938877
Tbx2 | Modisco | metacluster_1/pattern_4 | 1.77113e-13
Tbx2 | Homer | motif5.motif | 5.184399999999999e-07
Irf2 | Modisco | metacluster_1/pattern_1 | 0.00224878
Irf2 | Modisco | metacluster_1/pattern_3 | 0.043151
Irf2 | Homer | motif3.motif | 0.020703299999999997
Irf2 | Homer | motif11.motif | 0.00205606
Irf7 | Modisco | metacluster_1/pattern_3 | 0.00902612
Irf7 | Homer | motif11.motif | 0.0101635
Atf2 | Modisco | metacluster_1/pattern_7 | 0.026714599999999995
Atf2 | Homer | motif15.motif | 0.0380614
Atf7 | Modisco | metacluster_1/pattern_7 | 0.046692199999999996
Atf7 | Homer | motif15.motif | 0.0380614
Etv4 | Modisco | metacluster_1/pattern_1 | 0.0180588
Etv4 | Modisco | metacluster_1/pattern_7 | 0.020133900000000003
Etv4 | Homer | motif3.motif | 0.00157346
Irf3 | Modisco | metacluster_1/pattern_1 | 0.000728658
Irf3 | Modisco | metacluster_1/pattern_3 | 0.00212942
Irf3 | Homer | motif3.motif | 0.0476619
Gabpa | Modisco | metacluster_1/pattern_1 | 0.0431615
Gabpa | Modisco | metacluster_1/pattern_7 | 0.0205861
Gabpa | Homer | motif3.motif | 0.000851506
Sp1 | Modisco | metacluster_1/pattern_5 | 0.057483000000000006
Sp1 | Modisco | metacluster_1/pattern_7 | 0.020701400000000002
Sp1 | Homer | motif2.motif | 0.00928863
Elf5 | Modisco | metacluster_1/pattern_1 | 0.051592700000000005
Elf5 | Homer | motif3.motif | 0.0452055
Etv5 | Modisco | metacluster_1/pattern_7 | 0.00661023
Etv5 | Homer | motif3.motif | 0.000169657
Ctcf | Modisco | metacluster_1/pattern_0 | 2.57772e-15
Ctcf | Modisco | metacluster_1/pattern_2 | 0.00419748
Ctcf | Homer | motif1.motif | 7.38343e-08
Ehf | Modisco | metacluster_1/pattern_1 | 0.0008959430000000001
Ehf | Homer | motif3.motif | 0.00381713
Spic | Modisco | metacluster_1/pattern_1 | 0.00442074
Spic | Modisco | metacluster_1/pattern_3 | 0.0491048
Spic | Homer | motif3.motif | 0.00297662
Zfp143 | Modisco | metacluster_1/pattern_4 | 3.9730399999999997e-19
Zfp143 | Homer | motif5.motif | 2.96253e-07
Unique TF-MoDISco TFs: #12
TF Name | Modisco Pattern Name | Modisco Significance | Homer
Trp53 | metacluster_1/pattern_5 | 0.057483000000000006 | Absent
Rfx2 | metacluster_1/pattern_6 | 2.66701e-07 | Absent
Bcl6 | metacluster_1/pattern_1 | 0.0508199 | Absent
Usf2 | metacluster_1/pattern_7 | 0.00294219 | Absent
Rfx4 | metacluster_1/pattern_6 | 0.000788975 | Absent
Rfx1 | metacluster_1/pattern_6 | 5.15369e-05 | Absent
Rfx3 | metacluster_1/pattern_6 | 0.029669099999999997 | Absent
Rfx7 | metacluster_1/pattern_6 | 0.00900075 | Absent
Rfx6 | metacluster_1/pattern_6 | 0.00437395 | Absent
Zfx | metacluster_1/pattern_5 | 0.057483000000000006 | Absent
Zfx | metacluster_1/pattern_7 | 0.00752257 | Absent
Zscan10 | metacluster_1/pattern_3 | 0.00893613 | Absent
Irf9 | metacluster_1/pattern_3 | 0.016780200000000002 | Absent
Unique Homer TFs: #29
TF Name | Modisco | Homer Pattern Name | Homer Significance
Rela | Absent | motif21.motif | 0.0012963
Spdef | Absent | motif3.motif | 0.015265299999999999
Foxi1 | Absent | motif7.motif | 0.00096093
Klf6 | Absent | motif2.motif | 0.00423613
Snai1 | Absent | motif1.motif | 0.0503809
Egr1 | Absent | motif2.motif | 0.018080799999999998
Klf7 | Absent | motif2.motif | 0.000427125
Klf1 | Absent | motif2.motif | 0.040231
E2f4 | Absent | motif2.motif | 0.00126624
Klf8 | Absent | motif2.motif | 0.0009202739999999999
Relb | Absent | motif21.motif | 0.00370558
Zbtb1 | Absent | motif2.motif | 0.051469900000000006
Yy1 | Absent | motif16.motif | 1.07651e-06
Zfp42 | Absent | motif16.motif | 0.00047637900000000003
Nfyb | Absent | motif7.motif | 1.04471e-06
Klf5 | Absent | motif2.motif | 0.025638099999999997
Rel | Absent | motif21.motif | 0.010101899999999999
Taf1 | Absent | motif16.motif | 2.99143e-05
Klf4 | Absent | motif2.motif | 0.000463374
Zic2 | Absent | motif1.motif | 0.0336453
Nfyc | Absent | motif7.motif | 1.5457e-05
Nfya | Absent | motif7.motif | 9.14635e-06
Sp4 | Absent | motif2.motif | 0.019171200000000003
Nfkb1 | Absent | motif21.motif | 0.000112727
Pbx3 | Absent | motif7.motif | 0.000875804
Klf12 | Absent | motif2.motif | 0.014141599999999999
Nfkb2 | Absent | motif21.motif | 0.000112727
Mafb | Absent | motif15.motif | 0.045450800000000006
Zic3 | Absent | motif1.motif | 0.0336453
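The shared and unique groups above amount to a set comparison between the TFs matched by each method. The sketch below illustrates this with small subsets of the TF names from this report (not the full lists) and then checks that the reported group sizes are mutually consistent. How matlas computes these groups internally is not shown in this notebook, so this is an illustration only.

```python
# Small illustrative subsets of the TF names reported above (not the full lists).
modisco_tfs = {"Ctcf", "Spi1", "Irf8", "Rfx2", "Zscan10"}
homer_tfs = {"Ctcf", "Spi1", "Irf8", "Yy1", "Nfkb1"}

shared = modisco_tfs & homer_tfs        # matched by both methods
only_modisco = modisco_tfs - homer_tfs  # matched by TF-MoDISco only
only_homer = homer_tfs - modisco_tfs    # matched by Homer only

print(sorted(shared))
print(sorted(only_modisco))
print(sorted(only_homer))

# Group sizes as reported in this notebook.
n_shared, n_only_modisco, n_only_homer = 48, 12, 29

# Shared plus TF-MoDISco-only should equal the 60 motifs listed for TF-MoDISco.
n_modisco_total = n_shared + n_only_modisco
# Shared plus Homer-only gives the implied number of Homer TFs.
n_homer_total = n_shared + n_only_homer
print(n_modisco_total, n_homer_total)
```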