In [1]:
# Parameters
data_name = "apTreg;D;GEO"
modisco_root = "/srv/scratch/msharmin/mouse_hem/with_tfd/full_mouse50/Naive_modisco2019"
tomtom_report_root = "http://mitra.stanford.edu/kundaje/msharmin/report/tomtom_outs/cells"
task_dir = "task_209-naivegw"
perf_file = "/srv/scratch/msharmin/mouse_hem/with_tfd/full_mouse50/fineFactorized/task_209-naivegw/NaiveauPRC.txt"
homer_root = "/srv/scratch/msharmin/mouse_hem/with_tfd/full_mouse50/Naive_scans"
reportfile = "/mnt/lab_data/kundaje/msharmin/annotations/filtering samples_MS2.xlsx"
sheetname = "filter23"
In [2]:
from matlas.reports import display_metadata
load data from labcluster
Using TensorFlow backend.
2019-08-30 19:28:16,107 [WARNING] git-lfs not installed
In [3]:
display_metadata(data_name, perf_file, reportfile, sheetname)
    Sample Information
    MetaData Name     Description
    Cell type         Activated primary T regulatory cells, isolated ex vivo
    Cell Group        T cells
    Experiment Name   DHS
    Experiment Group  GEO
    Pipeline Output
    replicate                                                          rep1
    Naïve overlap peaks                                                138182
    IDR peaks                                                          116215
    TSS enrichment (< 8 is very poor, < 10 is low)                     5.9779
    Final number of unique mapping, dup-filtered, chrM filtered reads  143353004
    Number of reads in called peak regions                             33180925
    Fraction of reads in called peak regions                           0.2315
    Number of reads in promoter regions                                18530561
    Fraction of reads in promoter regions                              0.1293
    Number of reads in enhancer regions                                53162003
    Fraction of reads in enhancer regions                              0.3709
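The read fractions in the rep1 row can be cross-checked against the read counts. The sketch below assumes the row reads as 143353004 final reads, 33180925 reads in called peaks, 18530561 in promoters, and 53162003 in enhancers (a split inferred from the fused export, not stated by the pipeline), and assumes each fraction is the region count divided by the final read count. The names `FINAL_READS`, `ROWS`, and `fraction_gap` are illustrative only.

```python
# Read counts and reported fractions for rep1 (assumed split of the fused row).
FINAL_READS = 143_353_004
ROWS = {
    "called peak": (33_180_925, 0.2315),
    "promoter": (18_530_561, 0.1293),
    "enhancer": (53_162_003, 0.3709),
}


def fraction_gap(count, reported, total=FINAL_READS):
    """Absolute difference between count / total and the reported fraction."""
    return abs(count / total - reported)


for region, (count, reported) in ROWS.items():
    gap = fraction_gap(count, reported)
    print(f"{region}: computed {count / FINAL_READS:.5f}, reported {reported}, gap {gap:.1e}")
```

Under these assumptions every computed fraction agrees with the reported one to within 1e-4. The computed values sit slightly below the reported ones, so the pipeline may use a marginally different denominator or rounding rule.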
    Modelling Metadata
    Metric                                    Value
    auPRC                                     0.6033
    Calibrated Recall at 50% FDR              0.212
    Number of Positive Examples in Test Data  103253
    Number of Negative Examples in Test Data  7967598
    Imbalance Ratio in Test Data              0.0128
    Test Chromosomes                          chr2, chr3, chr19
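The report does not define the imbalance ratio. A quick check, sketched below with illustrative variable names, suggests that the reported 0.0128 is consistent with positives divided by all test examples, rather than positives divided by negatives.

```python
# Test-set counts taken from the Modelling Metadata table.
positives = 103_253
negatives = 7_967_598

# Candidate definitions of the imbalance ratio (both are assumptions).
positive_fraction = positives / (positives + negatives)
positive_to_negative = positives / negatives

print(round(positive_fraction, 4))     # 0.0128, matches the reported value
print(round(positive_to_negative, 4))  # 0.013, does not match
```

If the ratio is indeed the positive fraction, it also gives the auPRC expected from a random classifier (about 0.0128), which puts the reported auPRC of 0.6033 in context.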
In [4]:
from matlas.reports import display_paiwise_pattern_comparison
from matlas.reports import display_denovo_patterns
In [5]:
display_denovo_patterns(data_name, modisco_root=modisco_root)
TF-MoDISco is using the TensorFlow backend.
The following two lists show the Denovo Patterns and the corresponding Motifs discovered by TF-MoDISco.
Denovo Patterns by TF-MoDISco: #8
Pattern Name             # seqlets  TF Name(s)
metacluster_1/pattern_0  14017      Ctcf, Ctcfl
metacluster_1/pattern_1  1129       -
metacluster_1/pattern_2  1046       Etv2, Ets1, Spi1, Erg, Elk1, Spib, Ehf, Elf3, Etv4, Elf1, Gabpa, Elk4, Etv6, Fli1, Etv3, Elk3, Irf4, XP_911724.4, Irf8, Etv1, Etv5, Elf4, Elf2, Spic, Elf5, Fev, Prdm1, Irf3, Spdef
metacluster_1/pattern_3  732        Irf1, Irf2, Stat1, Irf7, Stat2, Irf9, Batf3
metacluster_1/pattern_4  102        Jdp2, Batf, Fosl2, Fosb, Junb, Jun, Jund, Fos, Atf3, Fosl1, Nfe2l2, Bach2, Nfatc1
metacluster_1/pattern_5  53         Zfp143, Tbx2, Thap11, Stat3
metacluster_1/pattern_6  46         T, Rorc
metacluster_1/pattern_7  41         Rest
[Modisco column: Sequence, Contrib Scores, and Hyp_Contrib Scores images for each pattern]
Motifs by TF-MoDISco: #58
TF Name      Pattern Name             Significance
Ctcf         metacluster_1/pattern_0  2.09254e-14
             metacluster_1/pattern_1  0.00293923
Ctcfl        metacluster_1/pattern_0  4.974060000000001e-07
             metacluster_1/pattern_1  0.057135000000000005
Etv2         metacluster_1/pattern_2  7.060310000000001e-05
             metacluster_1/pattern_6  0.018559299999999997
Ets1         metacluster_1/pattern_2  0.00257166
             metacluster_1/pattern_6  0.0456245
Spi1         metacluster_1/pattern_2  0.0370544
             metacluster_1/pattern_6  0.0381566
Erg          metacluster_1/pattern_2  0.00257166
             metacluster_1/pattern_6  0.036686699999999996
Elk1         metacluster_1/pattern_2  0.00257166
             metacluster_1/pattern_6  0.0583058
Spib         metacluster_1/pattern_2  0.000644258
             metacluster_1/pattern_3  0.00236355
Ehf          metacluster_1/pattern_2  0.0303231
             metacluster_1/pattern_6  0.050570699999999996
Elf3         metacluster_1/pattern_2  0.022549200000000002
Etv4         metacluster_1/pattern_2  0.00463278
             metacluster_1/pattern_6  0.021790900000000002
Elf1         metacluster_1/pattern_2  0.00106377
Gabpa        metacluster_1/pattern_2  0.00513367
             metacluster_1/pattern_6  0.0407209
Elk4         metacluster_1/pattern_2  0.00355199
             metacluster_1/pattern_6  0.0583058
Etv6         metacluster_1/pattern_2  0.00355199
             metacluster_1/pattern_6  0.0120573
Fli1         metacluster_1/pattern_2  0.00355199
             metacluster_1/pattern_6  0.0381566
Etv3         metacluster_1/pattern_2  0.00355199
Elk3         metacluster_1/pattern_2  0.00397752
             metacluster_1/pattern_6  0.0248876
Irf4         metacluster_1/pattern_2  0.00355199
             metacluster_1/pattern_3  0.0199363
XP_911724.4  metacluster_1/pattern_2  0.00355199
Irf8         metacluster_1/pattern_2  0.00355199
             metacluster_1/pattern_3  4.3180200000000004e-05
Etv1         metacluster_1/pattern_2  0.0048786
             metacluster_1/pattern_6  0.0248876
Etv5         metacluster_1/pattern_2  0.00483995
             metacluster_1/pattern_6  0.018559299999999997
Elf4         metacluster_1/pattern_2  0.00566999
Elf2         metacluster_1/pattern_2  0.00986639
             metacluster_1/pattern_6  0.018559299999999997
Spic         metacluster_1/pattern_2  0.00790418
Elf5         metacluster_1/pattern_2  0.043557099999999994
Fev          metacluster_1/pattern_2  0.0140345
             metacluster_1/pattern_6  0.0407209
Prdm1        metacluster_1/pattern_2  0.0145505
             metacluster_1/pattern_3  0.000446728
Irf3         metacluster_1/pattern_2  0.029454400000000006
             metacluster_1/pattern_3  0.00223351
Spdef        metacluster_1/pattern_2  0.043557099999999994
Irf1         metacluster_1/pattern_3  5.9939e-09
Irf2         metacluster_1/pattern_3  0.007568000000000001
Stat1        metacluster_1/pattern_3  2.8692e-06
Irf7         metacluster_1/pattern_3  4.3180200000000004e-05
Stat2        metacluster_1/pattern_3  6.40394e-05
Irf9         metacluster_1/pattern_3  0.00223351
Batf3        metacluster_1/pattern_3  0.0360824
             metacluster_1/pattern_4  0.00189355
Jdp2         metacluster_1/pattern_4  4.6230599999999996e-06
             metacluster_1/pattern_6  0.018559299999999997
Batf         metacluster_1/pattern_4  0.00017937099999999997
             metacluster_1/pattern_6  0.0178092
Fosl2        metacluster_1/pattern_4  0.00626675
             metacluster_1/pattern_6  0.018559299999999997
Fosb         metacluster_1/pattern_4  0.00155978
             metacluster_1/pattern_6  0.018559299999999997
Junb         metacluster_1/pattern_4  0.017496400000000002
             metacluster_1/pattern_6  0.030546499999999997
Jun          metacluster_1/pattern_4  0.058202
             metacluster_1/pattern_6  0.0178092
Jund         metacluster_1/pattern_4  0.00394352
             metacluster_1/pattern_6  0.0248876
Fos          metacluster_1/pattern_4  0.00209698
             metacluster_1/pattern_6  0.0381566
Atf3         metacluster_1/pattern_4  0.00218776
             metacluster_1/pattern_6  0.0220298
Fosl1        metacluster_1/pattern_4  0.00344027
             metacluster_1/pattern_6  0.0381566
Nfe2l2       metacluster_1/pattern_4  0.00686171
Bach2        metacluster_1/pattern_4  0.020729499999999998
Nfatc1       metacluster_1/pattern_4  0.051769
Zfp143       metacluster_1/pattern_5  7.50483e-09
Tbx2         metacluster_1/pattern_5  7.50483e-09
Thap11       metacluster_1/pattern_5  3.10351e-08
Stat3        metacluster_1/pattern_5  0.028527499999999997
T            metacluster_1/pattern_6  0.018559299999999997
Rorc         metacluster_1/pattern_6  0.0248876
Rest         metacluster_1/pattern_7  1.51099e-14
[Modisco column: motif image for each pattern]
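The two lists above appear to be related as follows: each TF shows up once in the Denovo Patterns list, under the first pattern listed for it in the Motifs list, which is not always its lowest-significance match (Jun is listed under pattern_4 although its pattern_6 value is lower). This is an observation from this report, not documented behaviour of matlas. The sketch below reproduces that grouping for a few rows copied from the table; `motif_matches`, `group_by_first_pattern`, and `best_pattern` are hypothetical names.

```python
from collections import defaultdict

# A few TF -> [(pattern, significance), ...] rows copied from the Motifs list.
motif_matches = {
    "Ctcf": [("metacluster_1/pattern_0", 2.09254e-14),
             ("metacluster_1/pattern_1", 0.00293923)],
    "Batf3": [("metacluster_1/pattern_3", 0.0360824),
              ("metacluster_1/pattern_4", 0.00189355)],
    "Jun": [("metacluster_1/pattern_4", 0.058202),
            ("metacluster_1/pattern_6", 0.0178092)],
    "Rorc": [("metacluster_1/pattern_6", 0.0248876)],
}


def group_by_first_pattern(matches):
    """Group TF names under the first pattern listed for each TF."""
    grouped = defaultdict(list)
    for tf, hits in matches.items():
        first_pattern = hits[0][0]
        grouped[first_pattern].append(tf)
    return dict(grouped)


def best_pattern(hits):
    """Return the pattern with the smallest significance value."""
    return min(hits, key=lambda hit: hit[1])[0]


for pattern, tfs in group_by_first_pattern(motif_matches).items():
    print(pattern, tfs)
print("Jun best match:", best_pattern(motif_matches["Jun"]))
```

Grouping by `best_pattern` instead would move Jun to pattern_6 and Batf3 to pattern_4, so the choice of grouping rule matters when reading the pattern-level TF lists.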
In [6]:
display_paiwise_pattern_comparison(data_name, modisco_root, homer_root)
Number of CISBP TFs obtained by TF-MoDISco and Homer
Shared TFs between TF-MoDISco and Homer: #33
TF Name      Source   Pattern Name             Significance
Etv1         Modisco  metacluster_1/pattern_2  0.0048786
             Modisco  metacluster_1/pattern_6  0.0248876
             Homer    motif2.motif             0.0008750380000000001
Erg          Modisco  metacluster_1/pattern_2  0.00257166
             Modisco  metacluster_1/pattern_6  0.036686699999999996
             Homer    motif2.motif             0.0007655639999999999
Fev          Modisco  metacluster_1/pattern_2  0.0140345
             Modisco  metacluster_1/pattern_6  0.0407209
             Homer    motif2.motif             0.000167669
Irf3         Modisco  metacluster_1/pattern_2  0.029454400000000006
             Modisco  metacluster_1/pattern_3  0.00223351
             Homer    motif6.motif             0.0109244
Etv4         Modisco  metacluster_1/pattern_2  0.00463278
             Modisco  metacluster_1/pattern_6  0.021790900000000002
             Homer    motif2.motif             0.00339492
Thap11       Modisco  metacluster_1/pattern_5  3.10351e-08
             Homer    motif7.motif             1.0417700000000001e-06
Irf4         Modisco  metacluster_1/pattern_2  0.00355199
             Modisco  metacluster_1/pattern_3  0.0199363
             Homer    motif6.motif             0.0243433
Elk1         Modisco  metacluster_1/pattern_2  0.00257166
             Modisco  metacluster_1/pattern_6  0.0583058
             Homer    motif2.motif             0.000774308
Elf5         Modisco  metacluster_1/pattern_2  0.043557099999999994
             Homer    motif2.motif             0.023465200000000002
Fli1         Modisco  metacluster_1/pattern_2  0.00355199
             Modisco  metacluster_1/pattern_6  0.0381566
             Homer    motif2.motif             0.0007655639999999999
Etv3         Modisco  metacluster_1/pattern_2  0.00355199
             Homer    motif2.motif             0.0008750380000000001
Irf2         Modisco  metacluster_1/pattern_3  0.007568000000000001
             Homer    motif6.motif             0.039390100000000004
Etv6         Modisco  metacluster_1/pattern_2  0.00355199
             Modisco  metacluster_1/pattern_6  0.0120573
             Homer    motif2.motif             0.0152125
Elf4         Modisco  metacluster_1/pattern_2  0.00566999
             Homer    motif2.motif             0.00895127
Elk3         Modisco  metacluster_1/pattern_2  0.00397752
             Modisco  metacluster_1/pattern_6  0.0248876
             Homer    motif2.motif             0.0007655639999999999
Spdef        Modisco  metacluster_1/pattern_2  0.043557099999999994
             Homer    motif2.motif             0.0421949
Etv5         Modisco  metacluster_1/pattern_2  0.00483995
             Modisco  metacluster_1/pattern_6  0.018559299999999997
             Homer    motif2.motif             0.00022113900000000003
Etv2         Modisco  metacluster_1/pattern_2  7.060310000000001e-05
             Modisco  metacluster_1/pattern_6  0.018559299999999997
             Homer    motif2.motif             0.00147568
Zfp143       Modisco  metacluster_1/pattern_5  7.50483e-09
             Homer    motif7.motif             1.02649e-06
Elf1         Modisco  metacluster_1/pattern_2  0.00106377
             Homer    motif2.motif             0.0008750380000000001
XP_911724.4  Modisco  metacluster_1/pattern_2  0.00355199
             Homer    motif2.motif             0.0008750380000000001
Ehf          Modisco  metacluster_1/pattern_2  0.0303231
             Modisco  metacluster_1/pattern_6  0.050570699999999996
             Homer    motif2.motif             0.028776299999999998
Elk4         Modisco  metacluster_1/pattern_2  0.00355199
             Modisco  metacluster_1/pattern_6  0.0583058
             Homer    motif2.motif             0.000517533
Elf2         Modisco  metacluster_1/pattern_2  0.00986639
             Modisco  metacluster_1/pattern_6  0.018559299999999997
             Homer    motif2.motif             0.0160288
Irf7         Modisco  metacluster_1/pattern_3  4.3180200000000004e-05
             Homer    motif6.motif             0.039390100000000004
Ctcfl        Modisco  metacluster_1/pattern_0  4.974060000000001e-07
             Modisco  metacluster_1/pattern_1  0.057135000000000005
             Homer    motif1.motif             0.000189028
Elf3         Modisco  metacluster_1/pattern_2  0.022549200000000002
             Homer    motif2.motif             0.0566915
Gabpa        Modisco  metacluster_1/pattern_2  0.00513367
             Modisco  metacluster_1/pattern_6  0.0407209
             Homer    motif2.motif             0.00082812
Ets1         Modisco  metacluster_1/pattern_2  0.00257166
             Modisco  metacluster_1/pattern_6  0.0456245
             Homer    motif2.motif             0.0007655639999999999
Irf9         Modisco  metacluster_1/pattern_3  0.00223351
             Homer    motif6.motif             0.0216638
Tbx2         Modisco  metacluster_1/pattern_5  7.50483e-09
             Homer    motif7.motif             6.73135e-05
Spi1         Modisco  metacluster_1/pattern_2  0.0370544
             Modisco  metacluster_1/pattern_6  0.0381566
             Homer    motif2.motif             0.0186041
Ctcf         Modisco  metacluster_1/pattern_0  2.09254e-14
             Modisco  metacluster_1/pattern_1  0.00293923
             Homer    motif1.motif             4.923569999999999e-06
[Modisco and Homer columns: motif image for each pattern]
Unique TF-MoDISco TFs: #25
TF Name      Modisco Pattern Name     Significance            Homer
Irf8         metacluster_1/pattern_2  0.00355199              Absent
             metacluster_1/pattern_3  4.3180200000000004e-05
Spib         metacluster_1/pattern_2  0.000644258             Absent
             metacluster_1/pattern_3  0.00236355
Fosb         metacluster_1/pattern_4  0.00155978              Absent
             metacluster_1/pattern_6  0.018559299999999997
Rest         metacluster_1/pattern_7  1.51099e-14             Absent
Jund         metacluster_1/pattern_4  0.00394352              Absent
             metacluster_1/pattern_6  0.0248876
Bach2        metacluster_1/pattern_4  0.020729499999999998    Absent
T            metacluster_1/pattern_6  0.018559299999999997    Absent
Spic         metacluster_1/pattern_2  0.00790418              Absent
Rorc         metacluster_1/pattern_6  0.0248876               Absent
Stat2        metacluster_1/pattern_3  6.40394e-05             Absent
Fosl2        metacluster_1/pattern_4  0.00626675              Absent
             metacluster_1/pattern_6  0.018559299999999997
Batf3        metacluster_1/pattern_3  0.0360824               Absent
             metacluster_1/pattern_4  0.00189355
Atf3         metacluster_1/pattern_4  0.00218776              Absent
             metacluster_1/pattern_6  0.0220298
Fos          metacluster_1/pattern_4  0.00209698              Absent
             metacluster_1/pattern_6  0.0381566
Nfe2l2       metacluster_1/pattern_4  0.00686171              Absent
Stat3        metacluster_1/pattern_5  0.028527499999999997    Absent
Fosl1        metacluster_1/pattern_4  0.00344027              Absent
             metacluster_1/pattern_6  0.0381566
Prdm1        metacluster_1/pattern_2  0.0145505               Absent
             metacluster_1/pattern_3  0.000446728
Nfatc1       metacluster_1/pattern_4  0.051769                Absent
Jun          metacluster_1/pattern_4  0.058202                Absent
             metacluster_1/pattern_6  0.0178092
Stat1        metacluster_1/pattern_3  2.8692e-06              Absent
Jdp2         metacluster_1/pattern_4  4.6230599999999996e-06  Absent
             metacluster_1/pattern_6  0.018559299999999997
Irf1         metacluster_1/pattern_3  5.9939e-09              Absent
Junb         metacluster_1/pattern_4  0.017496400000000002    Absent
             metacluster_1/pattern_6  0.030546499999999997
Batf         metacluster_1/pattern_4  0.00017937099999999997  Absent
             metacluster_1/pattern_6  0.0178092
[Modisco column: motif image for each pattern]
Unique Homer TFs: #40
TF Name      Modisco  Homer Pattern Name  Significance
Klf3         Absent   motif3.motif        0.00034182
Zfp42        Absent   motif14.motif       0.025294099999999996
E2f4         Absent   motif3.motif        0.00048016800000000006
Yy1          Absent   motif14.motif       3.51526e-05
Rela         Absent   motif18.motif       0.000542213
Zfp281       Absent   motif3.motif        0.053991
Nfya         Absent   motif9.motif        2.95166e-06
Klf5         Absent   motif3.motif        0.00870626
Nfyc         Absent   motif9.motif        5.81558e-06
Pbx3         Absent   motif9.motif        0.000396653
Egr1         Absent   motif3.motif        0.047002300000000004
Taf1         Absent   motif14.motif       0.00051887
Egr2         Absent   motif3.motif        0.036674599999999995
Irf5         Absent   motif6.motif        0.030336099999999998
Klf7         Absent   motif3.motif        0.000362306
Klf12        Absent   motif3.motif        0.0325621
Sp2          Absent   motif3.motif        0.000124698
Zbtb1        Absent   motif3.motif        0.0137376
Klf1         Absent   motif3.motif        0.047002300000000004
E2f7         Absent   motif3.motif        0.00942314
Nrf1         Absent   motif10.motif       1.85491e-07
Relb         Absent   motif18.motif       0.00178309
Runx3        Absent   motif12.motif       0.0337742
E2f6         Absent   motif3.motif        0.016942099999999998
Klf6         Absent   motif3.motif        0.00221498
E2f1         Absent   motif3.motif        0.00469127
Klf15        Absent   motif3.motif        0.00942314
Sp4          Absent   motif3.motif        0.00677345
Zbtb17       Absent   motif3.motif        0.00317956
Sp1          Absent   motif3.motif        0.0028425
Sp5          Absent   motif3.motif        0.036674599999999995
Foxi1        Absent   motif9.motif        0.00133412
Nfkb2        Absent   motif18.motif       0.00037016
Nfyb         Absent   motif9.motif        4.92437e-08
Klf8         Absent   motif3.motif        0.00057705
Nfkb1        Absent   motif18.motif       8.70161e-05
Klf4         Absent   motif3.motif        0.0248979
Sp3          Absent   motif3.motif        0.000124698
Runx1        Absent   motif12.motif       0.0337742
Cbfb         Absent   motif12.motif       0.0451068
[Homer column: motif image for each pattern]
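The shared and unique groups above amount to a set comparison of the TF names matched by each tool. How `display_paiwise_pattern_comparison` computes them internally is not shown in this report, so the following is only a sketch of the idea, using small hand-picked subsets of the TF names and hypothetical variable names.

```python
# Small illustrative subsets of the TF names reported for each tool.
modisco_tfs = {"Ctcf", "Etv1", "Irf8", "Rest", "Jun"}
homer_tfs = {"Ctcf", "Etv1", "Klf3", "Nrf1"}

shared = modisco_tfs & homer_tfs        # TFs matched by both tools
only_modisco = modisco_tfs - homer_tfs  # TFs matched by TF-MoDISco only
only_homer = homer_tfs - modisco_tfs    # TFs matched by Homer only

print(sorted(shared))        # ['Ctcf', 'Etv1']
print(sorted(only_modisco))  # ['Irf8', 'Jun', 'Rest']
print(sorted(only_homer))    # ['Klf3', 'Nrf1']

# Group sizes reported in this comparison.
reported = {"shared": 33, "only_modisco": 25, "only_homer": 40}
modisco_total = reported["shared"] + reported["only_modisco"]
homer_total = reported["shared"] + reported["only_homer"]
print(modisco_total, homer_total)  # 58 73
```

The TF-MoDISco total of 58 agrees with the "#58" count given for the Motifs by TF-MoDISco list, which is a useful consistency check on the three groups.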