Seqlet comparisons are not shown because a TF-MoDISco H5 file with seqlet data is not provided.{% elif not compute_recall %}
Seqlet recall and other statistics directly comparing hits and seqlets are not computed because the -n/--no-recall argument is set.
{% endif %}
{% if use_seqlets %}
The following figures and statistics compare the called hits with the seqlets used by TF-MoDISco to construct each motif.
This figure shows the number of hits called vs. the number of TF-MoDISco seqlets identified for each motif. The dashed line is the identity line. When comparing a shared set of regions, the hit counts should be mostly greater than the corresponding seqlet counts, since TF-MoDISco stringently filters seqlets and usually uses a smaller input window.
{% endif %}
For each motif, this table examines the consistency between hits and TF-MoDISco seqlets.
The following statistics report the number of hits, seqlets, and their relationships:
Note that the seqlet counts here may be lower than those shown in the tfmodisco-lite report due to double-counting in overlapping regions. The seqlet counts shown here are unique while the counts in the tfmodisco-lite report are not de-duplicated.
{% if compute_recall %}
Note that palindromic motifs may have lower recall due to disagreements on orientation.
If seqlet recall is near zero for all motifs, the -W/--modisco-region-width argument is likely incorrect.
This value is required to infer genomic coordinates of seqlets from the tfmodisco-lite output H5.
Motif CWMs (contribution weight matrices) are average contribution scores over a set of regions. The CWMs plotted here are:
The plots span the full untrimmed motif, with the trimmed motif shaded.
The hit-seqlet similarity is the cosine similarity between the additional-restricted-hits CWM and the seqlet CWM. This statistic measures the similarity between hits that were missed by TF-MoDISco and the seqlets used to construct the motif.
| Motif Name | {% if compute_recall %}Seqlet Recall | {% endif %}Hit-Seqlet CWM Similarity | Hits | {% if compute_recall %}Restricted Hits | {% endif %} {% if use_seqlets %}Seqlets | {% endif %} {% if compute_recall %}Hit/Seqlet Overlaps | Missed Seqlets | Additional Restricted Hits | {% endif %}Hit CWM (FC) | Hit CWM (RC) | TF-MoDISco CWM (FC) | TF-MoDISco CWM (RC) | {% if compute_recall %}Missed-Seqlet-Only CWM | Additional-Restricted-Hit CWM | {% endif %}
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
{{ item.motif_name }} |
{% if compute_recall %}
{{ '%0.3f'| format(item.seqlet_recall|float) }} | {% endif %}{{ '%0.3f'| format(item.cwm_similarity|float) }} | {{ item.num_hits_total }} | {% if compute_recall %}{{ item.num_hits_restricted }} | {% endif %} {% if use_seqlets %}{{ item.num_seqlets }} | {% endif %} {% if compute_recall %}{{ item.num_overlaps }} | {{ item.num_seqlets_only }} | {{ item.num_hits_restricted_only }} | {% endif %}![]() |
![]() |
![]() |
![]() |
{% if compute_recall %}
![]() |
![]() |
{% endif %}
The following figures visualize the distribution of hits across motifs and peaks.
This plot shows the distribution of hit counts per peak for any motif. The number of peaks with no hits should be near zero.
These plots show the distribution of hit counts per peak for each motif.
| Motif Name | Hits Per Peak |
|---|---|
{{ m }} |
![]() |
This heatmap shows the co-occurrence of motifs across peaks. The color intensity here represents the cosine similarity between the motifs' occurrence across peaks, where occurence is defined as the presence of a hit for a motif in a peak.