The database of the primary motif.
The ID of the primary motif followed by the alternate ID in brackets if it has one.
The logo of the primary motif.
Sections of the motif with a gray background have been trimmed and were not used for scanning.
The number of secondary motifs found that had significant spacings in the tested region.
The list of secondary motifs found that had significant spacings in the tested region.
The name of the sequence database.
The last modified date of the sequence database.
The number of sequences in the sequence database.
The number of sequences in the sequence database which were excluded because they were shorter than twice the margin plus the primary motif length.
The number of sequences in the sequence database which were excluded because they contained large runs of ambiguous symbols (normally wildcard masking) that could bias the results.
The number of sequences in the sequence database which were excluded because no match to the primary motif could be found at a distance to the edges larger than the margin.
The number of sequences in the sequence database which were excluded because they were largly identical to other sequences when aligned on the primary motif site.
The number of sequences which were scanned with the secondary motifs.
The name of the motif database derived from the file name.
The date that the motif database was last modified.
The number of motifs loaded from the motif database. Some motifs may have been excluded.
The number of motifs with significant E-values whose significant spacings were not considered too similar to those of another motif.
The number of motifs that while having significant spacings were less signficant than another motif that matched most of the same sites.
This checkbox ensures the row stays visible after a filter operation that would normally hide it.
The ID of the secondary motif.
The alternate name of the secondary motif.
The ID of the secondary motif followed by the alternate ID in brackets if it has one.
The name of the cluster to which this secondary motif belongs. SpaMo assigns each secondary motif to a cluster, and names the cluster after the motif in it with the most significant spacing. SpaMo assigns two secondary motifs to the same cluster if the matches in their most significant spacings (from the primary motif) overlap substantially. Clustering is controlled by the -joint and -overlap options.
The E-value is the lowest p-value of any spacing of the secondary motif times the number of secondary motifs. It estimates the expected number of random secondary motifs that would have the observed minimum p-value or less.
The gap between the primary and secondary motifs for the most significant spacing.
The strand and position of the secondary motif relative to the primary motif for the most significant spacing.
The minimum score accepted as a match to either the primary or secondary motif. This value can greatly affect the results of SpaMo. If it is too high, there will be no matches to the primary motif. If too low, sequences with non-significant matches to the primary and/or secondary motif will reduce the effectiveness of the spacing analysis.
The distance either side of the primary motif site which makes up the region that can contain the secondary motif site. Additionally it is the minimum gap between the primary motif site and the edge of the sequence. These constraints mean that input sequences shorter than the trimmed length of the primary motif plus two times the margin size can not be used by SpaMo.
A histogram showing the counts for the orientation with the best spacing.
The significant spacings are highlighted in red.
The primary motif is used as the reference point for all spacing calculation.
Sections of the motif with a gray background have been trimmed and were not used for scanning.
The secondary motif occurs at the spacings relative to the primary shown in the histogram below.
Sections of the motif with a gray background have been trimmed and were not used for scanning.
The regions matching the secondary motif in the sequences with the given spacing are used to construct a motif. The logo for this "inferred" motif is shown aligned with that of the actual secondary motif.
The inferred secondary motif logo should closely resemble that of the secondary motif. If it does not, this may suggest that the observed spacing may actually be due to the enrichment of a motif that differs from the secondary motif.
You can download the inferred secondary motif by moving the mouse cursor over the logo and clicking "Download as MEME motif". You can then use this downloaded motif as an input to Tomtom to see what other known motifs it may resemble.
These are the sequence logos created by aligning all of the sequences with the significant motif spacing. Alignments are centered on the match to the primary motif and done separately for each of the quadrants that contribute to the significant spacing. The logos extend in both directions (up to) 10 positions past the maximum region considered in the significance tests.
Note 1: If you don't
see the complete logo(s), you can use the scroll bar underneath
the Alignment window. If you don't see a scroll bar and are
on a Mac, you can turn on scroll bars by clicking on the Apple Icon
at the top left of your terminal and clicking:
System Preferences/General/Show scroll bars/Always
.
Note 2:These logos are useful for detecting cases where highly similar regions (such as DNA repeats) are present among the sequences with the significant motif spacing. Such cases may indicate that the spacing is due to recent duplication events rather than to a functional biological relationship between the primary and secondary motifs. Ideally, the regions around the primary and secondary motifs should have low information content and their logos in the alignment should closely match their motifs.
This table shows the details of the significant spacings between the primary motif and the secondary motif currently selected in the "Secondaries" section, below. Click on a row in this table to select a particular spacing for detailed analysis.
The histogram below shows the frequency of spacings from the primary motif to the secondary motif.
The two quadrants on the left show spacings where the secondary motif is upstream of the primary motif and the two quadrants on the right show spacings where the secondary motif is downstream of the primary motif.
The two quadrants on the top show spacings where the secondary motif is on the same strand as the primary motif and the two quadrants on the bottom show spacings where the secondary motif is on the opposite strand to the primary motif.
Histogram bars highlighted pink are part of one of the listed significant spacings. This feature can be disabled by unchecking the "highlight all" option under the spacings.
Histogram bars highlighted red are part of the currently selected significant spacing. This feature can be disabled by unchecking the "highlight selected" option under the spacings.
The selected orientation graph shows the combined quadrants from the selected spacing with a zoomed view that only shows the portion of the graph for which significance testing was performed.
Histogram bars highlighted pink are one of the listed significant spacings for this orientation. This feature can be disabled by unchecking the "highlight all" option under the spacings.
The histogram bar highlighted red is the currently selected significant spacing. This feature can be disabled by unchecking the "highlight selected" option under the spacings.
This lists the FASTA sequence identifiers of the subset of sequences that contain the significant motif spacing.
These identifiers can be cut-and-pasted into other programs for further analysis (e.g., Genome Ontology analysis or location analysis in the case ChIP-seq peak regions).
Click on a row in this table to select one of the significant secondary motifs for detailed analysis. The details of the significant spacings between the primary motif and the secondary motif you select here will be displayed in the table and plots above.
Specify which secondary motifs to display in the Secondaries table by checking one or more of the tick boxes below and then entering filter criteria. Then click "Update" to refresh the view of the Secondaries table.
Specify the order in which secondary motifs are displayed in the Secondaries table by selecting a sorting criteria in the menu below. Then click "Update" to refresh the view of the Secondaries table.
For further information on how to interpret these results or to get a copy of the MEME software please access http://meme.nbcr.net.
If you use SpaMo in your research please cite the following paper:
Tom Whitington, Martin C. Frith, James Johnson and Timothy L. Bailey,
"Inferring transcription factor complexes from ChIP-seq data",
Nucleic Acids Research, 39(15):e98, 2011.
[full text]
Database | Name | Preview | Significant Secondaries | List |
---|---|---|---|---|
Name | Last Modified | Contained | Too Short | Too Masked | No Primary | Too Similar | Used |
---|---|---|---|---|---|---|---|
Name | Last Modified | Number of Motifs | Motifs Significant | Motifs Redundant |
---|---|---|---|---|
Secondary Motif: | Cluster: | E-value: | Best Gap: | Best Orientation: |
---|
Alignment Logo | |
---|---|
Up+ | |
Up- | |
Dn+ | |
Dn- | |
|
Gap
Gap
|
Orientation
Orientation
|
p-value
p-value
|
---|
Lock | ID | Name | Cluster | E-value | Best Gap | Best Orientation | Spacings |
---|