# WARNING: this file is not sorted! # db id alt consensus E-value adj_p-value log_adj_p-value bin_location bin_width total_width sites_in_bin total_sites p_success p-value mult_tests 1 GTGGCYGBGKBBGBGK MEME-1 GTGGCYGBGKBBGBGK 2.6e-077 3.5e-080 -182.96 0.0 99 485 337 599 0.20412 1.4e-082 242 1 YGTGGCY MEME-2 YGTGGCY 8.1e-067 1.1e-069 -158.79 0.0 92 494 299 576 0.18623 4.4e-072 246 1 MACACMCACCACMMMCACCACM MEME-3 MACACMCACCACMMMCACCACM 5.8e-001 7.8e-004 -7.15 0.0 141 479 144 354 0.29436 3.3e-006 239 2 GCCACDD DREME-1 GCCACRG 2.7e-055 3.6e-058 -132.26 0.0 86 494 257 528 0.17409 1.5e-060 246 2 CGTGGHY DREME-2 CGTGGHY 1.9e-046 2.6e-049 -111.86 0.0 118 494 247 423 0.23887 1.1e-051 246 2 GTGGMW DREME-3 GTGGMW 1.6e-029 2.2e-032 -72.89 0.0 119 495 268 557 0.24040 9.0e-035 247 2 CCACG DREME-4 CCACG 1.9e-041 2.5e-044 -100.39 0.0 94 496 244 523 0.18952 1.0e-046 247 3 M0177_1.02 (TWIST2)_(Mus_musculus)_(DBD_1.00) NCCATATGTN 2.3e-011 3.1e-014 -31.10 0.0 67 491 134 478 0.13646 1.3e-016 245 3 M0189_1.02 (ID2)_(Mus_musculus)_(DBD_0.98) RCACGTGR 2.4e-018 3.3e-021 -47.16 0.0 65 493 162 548 0.13185 1.3e-023 246 3 M0196_1.02 (NPAS2)_(Mus_musculus)_(DBD_1.00) NSCACGTGTN 4.2e-022 5.7e-025 -55.82 0.0 65 491 168 539 0.13238 2.3e-027 245 3 M0198_1.02 (SOHLH2)_(Mus_musculus)_(DBD_0.84) NGYVCGTGCN 2.1e-004 2.8e-007 -15.09 0.0 77 491 146 573 0.15682 1.1e-009 245 3 M0199_1.02 (HES2)_(Mus_musculus)_(DBD_0.83) NNNNGRCACGTGM 7.9e0000 1.1e-002 -4.55 0.0 38 488 71 565 0.07787 4.4e-005 243 3 M0211_1.02 (MLXIP)_(Mus_musculus)_(DBD_0.82) BCACGTGK 1.9e-020 2.5e-023 -52.04 0.0 71 493 177 555 0.14402 1.0e-025 246 3 M0212_1.02 (TCFL5)_(Mus_musculus)_(DBD_1.00) NBCDCGHGVN 3.7e-012 4.9e-015 -32.94 0.0 69 491 129 432 0.14053 2.0e-017 245 3 M0305_1.02 (CREB3L2)_(Mus_musculus)_(DBD_1.00) GCCACGTGD 1.1e-037 1.5e-040 -91.69 0.0 70 492 200 514 0.14228 6.2e-043 245 3 M1889_1.02 MAX RRGCACATGK 1.1e-011 1.5e-014 -31.83 0.0 57 491 132 539 0.11609 6.1e-017 245 3 M1917_1.02 USF1 GGTCACRTGRB 8.2e-012 1.1e-014 -32.14 0.0 66 490 140 505 0.13469 4.5e-017 244 3 M1927_1.02 (MYCL1)_(Mus_musculus)_(DBD_0.66) RCCACGTG 9.5e-043 1.3e-045 -103.36 0.0 71 493 222 564 0.14402 5.2e-048 246 3 M1950_1.02 ZNF354C GTGGAK 1.4e-005 1.8e-008 -17.82 0.0 127 495 225 597 0.25657 7.4e-011 247 3 M2310_1.02 RUNX2 MAAACCACARAMMMM 1.0e-014 1.3e-017 -38.85 0.0 54 486 142 577 0.11111 5.6e-020 242 3 M4451_1.02 ATF3 GGTCACGTGRS 1.3e-009 1.8e-012 -27.04 0.0 70 490 132 468 0.14286 7.4e-015 244 3 M4481_1.02 USF2 GGTCACGTGRSSSSV 8.7e-016 1.2e-018 -41.29 0.0 72 486 144 439 0.14815 4.9e-021 242 3 M4532_1.02 MYC CCACGTGSYY 2.9e-028 3.9e-031 -70.02 0.0 67 491 190 565 0.13646 1.6e-033 245 3 M4543_1.02 MXI1 VVVVCCACGTG 1.1e-031 1.5e-034 -77.88 0.0 60 490 183 558 0.12245 6.2e-037 244 3 M4553_1.02 BHLHE40 NGKCACGTGC 3.2e-011 4.3e-014 -30.77 0.0 87 491 171 524 0.17719 1.8e-016 245 3 M4680_1.02 BACH1 GTCACGTG 4.5e-005 6.0e-008 -16.63 0.0 67 493 121 504 0.13590 2.4e-010 246 3 M5234_1.02 (SIM1)_(Drosophila_melanogaster)_(DBD_0.89) GGTCACGTAC 6.5e0000 8.7e-003 -4.74 0.0 81 491 95 390 0.16497 3.6e-005 245 3 M5321_1.02 CLOCK AACACGTGTH 3.3e-011 4.4e-014 -30.76 0.0 65 491 128 463 0.13238 1.8e-016 245 3 M5325_1.02 CREB3L1 TGRTGACGTGGCAB 9.1e-018 1.2e-020 -45.85 0.0 93 487 171 430 0.19097 5.0e-023 243 3 M5493_1.02 GMEB2 KTRCGTAA 3.4e0000 4.6e-003 -5.39 0.0 169 493 200 458 0.34280 1.9e-005 246 3 M5504_1.02 HES5 YGGCACGTGCCR 2.1e-001 2.8e-004 -8.18 0.0 75 489 69 256 0.15337 1.1e-006 244 3 M5506_1.02 HES7 YGGCACGTGCCR 1.8e0000 2.4e-003 -6.01 0.0 107 489 70 198 0.21881 1.0e-005 244 3 M5509_1.02 HEY1 GRCACGTGBC 1.9e-020 2.5e-023 -52.03 0.0 71 491 165 497 0.14460 1.0e-025 245 3 M5592_1.02 KLF14 ARGKGGGCGTGGYM 1.3e-001 1.7e-004 -8.68 0.0 77 487 124 512 0.15811 7.0e-007 243 3 M5632_1.02 MLX RTCACGTGAT 1.8e-006 2.4e-009 -19.84 0.0 69 491 101 365 0.14053 9.9e-012 245 3 M5634_1.02 MNT VVCACGTGVH 8.6e-027 1.2e-029 -66.62 0.0 65 491 179 546 0.13238 4.8e-032 245 3 M5652_1.02 NEUROD2 RMCATATGBY 3.0e0000 4.0e-003 -5.51 0.0 43 491 68 459 0.08758 1.7e-005 245 3 M5856_1.02 SP8 RGKGGGCGTGGY 4.4e-002 6.0e-005 -9.72 0.0 163 489 252 579 0.33333 2.5e-007 244 3 M5932_1.02 TFEC VTCAYGTGMY 9.6e-005 1.3e-007 -15.86 0.0 69 491 128 530 0.14053 5.3e-010 245 3 M5962_1.02 ZBTB7C NTCGGTGGTCGY 2.4e-008 3.3e-011 -24.14 0.0 127 489 232 579 0.25971 1.4e-013 244 3 M6139_1.02 AHR KCACGCRAH 1.8e-004 2.4e-007 -15.26 0.0 124 492 203 550 0.25203 9.7e-010 245 3 M6150_1.02 ARNT2 GYSYSCCACGNC 2.9e-010 3.9e-013 -28.58 0.0 83 489 179 593 0.16973 1.6e-015 244 3 M6151_1.02 ARNT BYRCGTGC 1.6e-016 2.1e-019 -43.00 0.0 101 493 199 511 0.20487 8.6e-022 246 3 M6155_1.02 ATF6 GBGSTGACGTGG 1.6e-011 2.1e-014 -31.48 0.0 111 489 192 487 0.22699 8.7e-017 244 3 M6162_1.02 ARNTL GRGTCACGTGTYCM 1.7e-016 2.4e-019 -42.89 0.0 69 487 138 424 0.14168 9.7e-022 243 3 M6210_1.02 ENO1 YDSMCACRTGSYB 1.4e-014 2.0e-017 -38.47 0.0 38 488 114 579 0.07787 8.1e-020 243 3 M6265_1.02 GLI2 GTGGGTGGTCY 5.8e-002 7.8e-005 -9.46 0.0 106 490 154 490 0.21633 3.2e-007 244 3 M6266_1.02 GLI3 BTGGGTGGTCB 4.4e-001 5.9e-004 -7.43 0.0 62 490 109 556 0.12653 2.4e-006 244 3 M6271_1.02 HES1 KGKCKCGTGBCKB 1.4e-005 2.0e-008 -17.75 0.0 90 488 174 591 0.18443 8.1e-011 243 3 M6272_1.02 HESX1 AKKYYAYKWGCCRSVT 7.3e0000 9.8e-003 -4.62 0.0 123 485 180 545 0.25361 4.1e-005 242 3 M6273_1.02 HEY2 GBBGGCWCGTGGCHTBV 4.8e-011 6.4e-014 -30.37 0.0 120 484 219 537 0.24793 2.7e-016 241 3 M6275_1.02 HIF1A SBSTACGTGCSB 4.0e-014 5.4e-017 -37.46 0.0 111 489 213 530 0.22699 2.2e-019 244 3 M6326_1.02 KLF8 CMGGGKGTG 1.1e-001 1.5e-004 -8.82 0.0 70 492 122 554 0.14228 6.1e-007 245 3 M6345_1.02 MITF VKCACATGWY 5.6e-002 7.6e-005 -9.49 0.0 35 491 74 560 0.07128 3.1e-007 245 3 M6346_1.02 MLXIPL VCACGVSGKKGBCMCRTGC 4.8e-006 6.5e-009 -18.86 0.0 56 482 103 455 0.11618 2.7e-011 240 3 M6352_1.02 MYCN CCACGTGS 3.2e-025 4.3e-028 -63.01 0.0 69 493 185 560 0.13996 1.8e-030 246 3 M6416_1.02 CBFB YYTGTGGTYDB 1.4e-001 1.9e-004 -8.58 0.0 62 490 116 586 0.12653 7.7e-007 244 3 M6456_1.02 RREB1 DKGKKKGKGGKTGKTTKGGGKT 2.4e0000 3.2e-003 -5.74 0.0 83 479 124 497 0.17328 1.3e-005 239 3 M6457_1.02 RUNX1 WAACCACARW 2.6e-009 3.5e-012 -26.38 0.0 65 491 147 586 0.13238 1.4e-014 245 3 M6459_1.02 RUNX3 AACCRCAAAMCCCV 8.1e-004 1.1e-006 -13.72 0.0 71 487 128 529 0.14579 4.5e-009 243 3 M6517_1.02 TFE3 RGTCAYGTGV 1.5e-005 2.1e-008 -17.69 0.0 59 491 121 555 0.12016 8.4e-011 245 3 M6518_1.02 TFEB RGTCACGTG 2.0e-011 2.7e-014 -31.24 0.0 68 492 106 340 0.13821 1.1e-016 245 3 M6536_1.02 XBP1 GACGTGKCMHWW 1.8e-012 2.4e-015 -33.67 0.0 135 489 211 455 0.27607 9.8e-018 244 ## # Detailed descriptions of columns in this file: # # db: The name of the database (file name) that contains the motif. # id: A name for the motif that is unique in the motif database file. # alt: An alternate name of the motif that may be provided # in the motif database file. # consensus: A consensus sequence computed from the motif. # E-value: The expected number motifs that would have least one. # region as enriched for best matches to the motif as the reported region. # The E-value is the p-value multiplied by the number of motifs in the # input database(s). # adj_p-value: The probability that any tested region would be as enriched for # best matches to this motif as the reported region is. # By default the p-value is calculated by using the one-tailed binomial # test on the number of sequences with a match to the motif # that have their best match in the reported region, corrected for # the number of regions and score thresholds tested. # The test assumes that the probability that the best match in a sequence # falls in the region is the region width divided by the # number of places a motif # can align in the sequence (sequence length minus motif width plus 1). # When CentriMo is run in discriminative mode with a negative # set of sequences, the p-value of a region is calculated # using the Fisher exact test on the # enrichment of best matches in the positive sequences relative # to the negative sequences, corrected # for the number of regions and score thresholds tested. # The test assumes that the probability that the best match (if any) # falls into a given region # is the same for all positive and negative sequences. # log_adj_p-value: Log of adjusted p-value. # bin_location: Location of the center of the most enriched region. # bin_width: The width (in sequence positions) of the most enriched region. # A best match to the motif is counted as being in the region if the # center of the motif falls in the region. # total_width: The window maximal size which can be reached for this motif: # rounded(sequence length - motif length +1)/2 # sites_in_bin: The number of (positive) sequences whose best match to the motif # falls in the reported region. # Note: This number may be less than the number of # (positive) sequences that have a best match in the region. # The reason for this is that a sequence may have many matches that score # equally best. # If n matches have the best score in a sequence, 1/n is added to the # appropriate bin for each match. # total_sites: The number of sequences containing a match to the motif # above the score threshold. # p_success: The probability of falling in the enriched window: # bin width / total width # p-value: The uncorrected p-value before it gets adjusted to the # number of multiple tests to give the adjusted p-value. # mult_tests: This is the number of multiple tests (n) done for this motif. # It was used to correct the original p-value of a region for # multiple tests using the formula: # p' = 1 - (1-p)^n where p is the uncorrected p-value. # The number of multiple tests is the number of regions # considered times the number of score thresholds considered. # It depends on the motif length, sequence length, and the type of # optimizations being done (central enrichment, local enrichment, # score optimization).