# WARNING: this file is not sorted! # db id alt consensus E-value adj_p-value log_adj_p-value bin_location bin_width total_width sites_in_bin total_sites p_success p-value mult_tests 1 TGTWWRTRTGGTATAT MEME-1 TGTWWRTRTGGTATAT 6.5e-162 8.7e-165 -377.76 0.0 111 485 418 524 0.22887 3.6e-167 242 1 TRTGGTATATRT MEME-2 TRTGGTATATRT 1.0e-138 1.4e-141 -324.35 0.0 103 489 374 502 0.21063 5.6e-144 244 1 TGGAGAAGTGACWAGACAAA MEME-3 TGGAGAAGTGACWAGACAAA 6.3e-024 8.5e-027 -60.03 0.0 63 481 84 174 0.13098 3.5e-029 240 2 TGGTATAB DREME-1 TGGTATAT 5.9e-132 7.9e-135 -308.78 0.0 107 493 325 407 0.21704 3.2e-137 246 2 AASAAAG DREME-2 AASAAAG 6.4e-030 8.5e-033 -73.84 0.0 100 494 130 225 0.20243 3.5e-035 246 2 CTTTRTC DREME-3 CTTTRTC 9.6e-016 1.3e-018 -41.19 0.0 130 494 104 172 0.26316 5.2e-021 246 2 AKTGTTAG DREME-4 AGTGTTAG 2.3e-012 3.0e-015 -33.43 0.0 97 493 43 61 0.19675 1.2e-017 246 2 GTGAYAA DREME-5 GTGAYAA 1.4e-004 1.8e-007 -15.52 0.0 40 494 33 123 0.08097 7.4e-010 246 2 AATTGTGG DREME-6 AATTGTGG 5.0e-015 6.7e-018 -39.55 0.0 137 493 151 280 0.27789 2.7e-020 246 2 GYTCTGAA DREME-7 GCTCTGAA 2.3e0000 3.1e-003 -5.77 0.0 113 493 59 155 0.22921 1.3e-005 246 2 AGTGRCT DREME-9 AGTGRCT 5.9e-001 7.9e-004 -7.15 0.0 76 494 37 112 0.15385 3.2e-006 246 2 AAAGGR DREME-10 AAAGGR 1.0e-005 1.3e-008 -18.12 0.0 133 495 156 366 0.26869 5.5e-011 247 3 M0437_1.02 (ZNF32)_(Mus_musculus)_(DBD_0.99) TATRTATRT 7.0e-007 9.3e-010 -20.79 0.0 170 492 208 406 0.34553 3.8e-012 245 3 M0633_1.02 DMRT2 KAATKTATWN 1.3e-001 1.8e-004 -8.64 0.0 61 491 108 544 0.12424 7.2e-007 245 3 M0635_1.02 (DMRTC2)_(Mus_musculus)_(DBD_1.00) NAHATGTATHMWN 9.8e-001 1.3e-003 -6.64 0.0 292 488 390 566 0.59836 5.4e-006 243 3 M0737_1.02 (FOXB2)_(Mus_musculus)_(DBD_0.94) WAWGTAAAYA 7.7e0000 1.0e-002 -4.58 0.0 337 491 441 579 0.68635 4.2e-005 245 3 M1007_1.02 (HOXA11)_(Mus_musculus)_(DBD_1.00) TTWAYDDNN 2.0e-001 2.7e-004 -8.21 0.0 58 492 24 69 0.11789 1.1e-006 245 3 M1157_1.02 (HOXB9)_(PBM_CONSTRUCTS)_(DBD_1.00) WTTWATKRBN 1.0e-001 1.3e-004 -8.92 0.0 37 491 68 483 0.07536 5.5e-007 245 3 M1592_1.02 (SOX3)_(Mus_musculus)_(DBD_1.00) NNNWCAAT 1.1e-001 1.5e-004 -8.83 0.0 367 493 274 319 0.74442 6.0e-007 246 3 M2267_1.02 CDX2 TTTTATKRCHB 7.6e-004 1.0e-006 -13.80 0.0 70 490 121 502 0.14286 4.2e-009 244 3 M4636_1.02 THAP1 YTGCCCDBANYMAAGATGGCG 1.9e0000 2.5e-003 -5.98 0.0 98 480 84 265 0.20417 1.1e-005 239 3 M5304_1.02 BHLHA15 ACCATATGKT 4.1e0000 5.4e-003 -5.21 0.0 277 491 283 427 0.56415 2.2e-005 245 3 M5307_1.02 BHLHE22 NAVCATATGTTT 7.0e0000 9.3e-003 -4.67 0.0 277 489 276 416 0.56646 3.8e-005 244 3 M5460_1.02 FOXL1 RTAAACA 7.0e0000 9.4e-003 -4.67 0.0 278 494 373 579 0.56275 3.8e-005 246 3 M5551_1.02 HOXC12 TTTTATTRC 1.8e-002 2.4e-005 -10.64 0.0 38 492 75 509 0.07724 9.8e-008 245 3 M5557_1.02 HOXD12 GTAATAAAA 1.1e-001 1.5e-004 -8.79 0.0 38 492 72 505 0.07724 6.2e-007 245 3 M5581_1.02 IRX2 CWTGTCRTGTWN 7.9e0000 1.1e-002 -4.54 0.0 343 489 435 560 0.70143 4.4e-005 244 3 M5582_1.02 IRX5 SWTGTYRTGTWN 4.0e-001 5.4e-004 -7.53 0.0 343 489 448 569 0.70143 2.2e-006 244 3 M5690_1.02 OLIG1 AMCATATGKT 1.7e-002 2.3e-005 -10.66 0.0 253 491 274 427 0.51527 9.5e-008 245 3 M5691_1.02 OLIG2 AMCATATGKT 1.5e-001 2.0e-004 -8.54 0.0 253 491 301 482 0.51527 8.0e-007 245 3 M5693_1.02 OLIG3 AMCATATGBY 4.9e0000 6.5e-003 -5.03 0.0 277 491 332 508 0.56415 2.7e-005 245 3 M5738_1.02 POU3F4 ATTWGCATA 2.1e0000 2.8e-003 -5.89 0.0 324 492 384 515 0.65854 1.1e-005 245 3 M5972_1.02 ZNF410 GANTATTATGGGATGKM 9.2e-004 1.2e-006 -13.61 0.0 182 484 101 170 0.37603 5.1e-009 241 3 M6225_1.02 MECOM WAGAYAAGATAANAKW 1.6e0000 2.1e-003 -6.17 0.0 95 485 122 432 0.19588 8.7e-006 242 3 M6239_1.02 FOXF2 HWADGTAAACA 5.1e-002 6.8e-005 -9.59 0.0 282 490 380 559 0.57551 2.8e-007 244 3 M6280_1.02 HMGA2 AATWWYSSSSAATAT 6.0e0000 8.0e-003 -4.83 0.0 186 486 170 347 0.38272 3.3e-005 242 3 M6289_1.02 HOXA9 WCATAAAYYATH 1.7e-002 2.3e-005 -10.66 0.0 55 489 104 551 0.11247 9.6e-008 244 3 M6292_1.02 HOXA5 CATTAATYAR 1.2e0000 1.6e-003 -6.46 0.0 53 491 83 469 0.10794 6.4e-006 245 3 M6297_1.02 HOXB7 MATYAATCAA 1.0e-005 1.4e-008 -18.10 0.0 59 491 97 413 0.12016 5.6e-011 245 3 M6298_1.02 HOXB8 BMATTAATCAA 2.4e-004 3.3e-007 -14.93 0.0 84 490 115 391 0.17143 1.3e-009 244 3 M6327_1.02 LEF1 CTTTGWW 9.9e-007 1.3e-009 -20.44 0.0 100 494 180 549 0.20243 5.4e-012 246 3 M6368_1.02 NFIL3 VKVMRTTACRTAAY 5.8e-008 7.8e-011 -23.28 0.0 241 487 267 396 0.49487 3.2e-013 243 3 M6453_1.02 RFX3 TYRCCATGGYAACV 9.5e-013 1.3e-015 -34.30 0.0 101 487 162 412 0.20739 5.2e-018 243 3 M6457_1.02 RUNX1 WAACCACARW 2.9e-002 3.9e-005 -10.14 0.0 127 491 210 592 0.25866 1.6e-007 245 3 M6470_1.02 SOX10 BCWTTGT 5.7e-004 7.7e-007 -14.08 0.0 38 494 84 558 0.07692 3.1e-009 246 3 M6471_1.02 SOX13 YATTGTTY 2.8e-002 3.8e-005 -10.19 0.0 79 493 137 560 0.16024 1.5e-007 246 3 M6472_1.02 SOX15 CWTTGTT 1.2e-008 1.6e-011 -24.87 0.0 118 494 207 539 0.23887 6.5e-014 246 3 M6476_1.02 SOX4 GAGAACAAAGSG 2.3e0000 3.0e-003 -5.80 0.0 39 489 67 485 0.07975 1.2e-005 244 3 M6477_1.02 SOX5 WAACAATR 7.3e-008 9.8e-011 -23.05 0.0 55 493 121 555 0.11156 4.0e-013 246 3 M6478_1.02 SOX9 ARAACAATRGR 2.0e0000 2.7e-003 -5.91 0.0 38 490 68 508 0.07755 1.1e-005 244 3 M6490_1.02 SRY DAAACAAWR 4.0e-004 5.3e-007 -14.44 0.0 102 492 174 552 0.20732 2.2e-009 245 3 M6506_1.02 TCF7 VHSCTTTGWWST 6.5e0000 8.7e-003 -4.75 0.0 71 489 109 517 0.14519 3.6e-005 244 3 M6507_1.02 TEAD1 YACATTCCWSHKSH 8.0e0000 1.1e-002 -4.53 0.0 83 487 134 570 0.17043 4.5e-005 243 ## # Detailed descriptions of columns in this file: # # db: The name of the database (file name) that contains the motif. # id: A name for the motif that is unique in the motif database file. # alt: An alternate name of the motif that may be provided # in the motif database file. # consensus: A consensus sequence computed from the motif. # E-value: The expected number motifs that would have least one. # region as enriched for best matches to the motif as the reported region. # The E-value is the p-value multiplied by the number of motifs in the # input database(s). # adj_p-value: The probability that any tested region would be as enriched for # best matches to this motif as the reported region is. # By default the p-value is calculated by using the one-tailed binomial # test on the number of sequences with a match to the motif # that have their best match in the reported region, corrected for # the number of regions and score thresholds tested. # The test assumes that the probability that the best match in a sequence # falls in the region is the region width divided by the # number of places a motif # can align in the sequence (sequence length minus motif width plus 1). # When CentriMo is run in discriminative mode with a negative # set of sequences, the p-value of a region is calculated # using the Fisher exact test on the # enrichment of best matches in the positive sequences relative # to the negative sequences, corrected # for the number of regions and score thresholds tested. # The test assumes that the probability that the best match (if any) # falls into a given region # is the same for all positive and negative sequences. # log_adj_p-value: Log of adjusted p-value. # bin_location: Location of the center of the most enriched region. # bin_width: The width (in sequence positions) of the most enriched region. # A best match to the motif is counted as being in the region if the # center of the motif falls in the region. # total_width: The window maximal size which can be reached for this motif: # rounded(sequence length - motif length +1)/2 # sites_in_bin: The number of (positive) sequences whose best match to the motif # falls in the reported region. # Note: This number may be less than the number of # (positive) sequences that have a best match in the region. # The reason for this is that a sequence may have many matches that score # equally best. # If n matches have the best score in a sequence, 1/n is added to the # appropriate bin for each match. # total_sites: The number of sequences containing a match to the motif # above the score threshold. # p_success: The probability of falling in the enriched window: # bin width / total width # p-value: The uncorrected p-value before it gets adjusted to the # number of multiple tests to give the adjusted p-value. # mult_tests: This is the number of multiple tests (n) done for this motif. # It was used to correct the original p-value of a region for # multiple tests using the formula: # p' = 1 - (1-p)^n where p is the uncorrected p-value. # The number of multiple tests is the number of regions # considered times the number of score thresholds considered. # It depends on the motif length, sequence length, and the type of # optimizations being done (central enrichment, local enrichment, # score optimization).