skip to main content
Roche logo
1. GS De Novo Assembler
1.
GS De Novo Assembler
1.1
Overview of the GS De Novo Assembler
The GS De Novo Assembler application constructs de novo assemblies of the reads from one or more sequencing Runs, using the “read flowgrams” (SFF files) as input. The GS De Novo Assembler software is an interactive application used to create assembly projects, add or remove reads from the project, specify project parameters, run the assembly algorithms on the project data, and view the output produced by the assembly computations. The application can be accessed via a Graphical User Interface (GUI) or from a command line interface (CLI).
The GS De Novo Assembler allows the inclusion of one or more Paired End Runs into the analysis, enabling the ordering and orientation of the assembled contigs from shotgun sequencing Runs into scaffolds. This requires the preparation of a separate Paired End library from the same DNA material that was used to prepare the library for shotgun sequencing. In certain cases, the presence of reads from a separate shotgun library in the project is not strictly required, as Paired End reads can themselves be assembled with each other.
The GS De Novo Assembler application can be run from the Attendant PC or a DataRig when using the GS Junior Instrument.
The GS De Novo Assembler application is not available on the GS FLX+ Instrument and must be run on a DataRig for users of this instrument.
The GS De Novo Assembler allows users to create, modify, and run assemblies in the form of projects. Both the GUI and command line interface (CLI) provide this functionality. Projects may be setup to assemble all reads at once. Alternatively, incremental operation allows additional reads to be added to an existing assembly. Results appear as output files using either the GUI or the CLI. The GUI provides a graphical interface to view many of the results from the assembly whether the project was assembled using the GUI or the CLI.
The GS De Novo Assembler application uses a folder on the file system to hold the assembly project information (whether the assembly of the reads, i.e. the computation, is carried out in project-based mode through the GUI application or through the newAssembly and related commands) and to hold the output files generated during and after the assembly computation.