Shasta assembly summary

Shasta version

Shasta Release 0.11.1

Reads used in this assembly

Read representation1 (RLE)
Minimum read length200
Number of reads1706664
Number of read bases5560477726
Average read length3258
Read N505563
Number of run-length encoded bases3890327243
Average length ratio of run-length encoded sequence over raw sequence0.6996
Number of reads flagged as palindromic by self alignment119421
Number of reads flagged as chimeric621

Reads discarded on input

ReadsBases
Reads discarded on input because they contained invalid bases00
Reads discarded on input because they were too short1871250931
Reads discarded on input because they contained repeat counts greater than 25572322361
Reads discarded on input, total1943573292
Fraction of reads discarded on input over total present in input files0.0011370.0001031

Marker k-mers

Length k of k-mers used as markers14
Total number of k-mers6377292
Number of k-mers used as markers638031
Fraction of k-mers used as markers0.1

Markers

Total number of markers on all reads, one strand379210669
Total number of markers on all reads, both strands758421338
Average number of markers per raw base0.0682
Average number of markers per run-length encoded base0.09748
Average base offset between markers in raw sequence14.66
Average base offset between markers in run-length encoded sequence10.26
Average base gap between markers in run-length encoded sequence-3.741

Alignments

Number of alignment candidates found by the LowHash algorithm40390
Number of good alignments36638
Number of good alignments kept in the read graph27757

Alignment criteria actually used for creation of the read graph

minAlignedMarkerCount25
minAlignedFraction0.475
maxSkip21
maxDrift10
maxTrim80

Read graph

Number of vertices3413328
Number of edges55514
ReadsBases
Isolated reads16785055365926349
Non-isolated reads28159194551377
Isolated reads fraction0.98350.965
Non-isolated reads fraction0.01650.03499

Marker graph

Total number of vertices222694
Total number of edges302514
Number of vertices that are not isolated after edge removal201106
Number of edges that were not removed199368

Assembly graph

Number of vertices3502
Number of edges1764
Number of edges assembled882

Assembled segments ("contigs")

Number of segments assembled882
Total assembled segment length2444894
Longest assembled segment length10877
Assembled segments N504170

Performance

Elapsed time (seconds)240.9
Elapsed time (minutes)4.014
Elapsed time (hours)0.0669
Average CPU utilization0.204
Peak virtual memory utilization (bytes)37619863552
Number of threads used60
Total number of virtual CPUs available20
Total physical memory available (bytes)270208720896