1.15.2
GS De Novo Assembler cDNA / Transcriptome Output
1.15.2.1
454Isotigs.fna, 454Isotigs.qual, and 454Isotigs.txt files
The files 454LargeContigs.fna and 454LargeContigs.qual (1.15.1.2) are not generated for a cDNA / transcriptome assembly project. However, the new files 454Isotigs.fna (Figure 35) and 454Isotigs.qual are generated (in addition to the 454AllContigs.fna and 454AllContigs.qual files). Like the contig files, they contain FASTA sequences and their quality scores, respectively. These files contain the isotig sequences traversed from the multiple-alignment graph structure (i.e. from the isogroup), but they can also contain single contigs that were not traversed (and are therefore not considered isotigs) because of thresholds, limits, cycles, etc.
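As an illustration of how the sequence and quality files relate, the following Python sketch pairs each record of a .fna file with the record of the same name in the matching .qual file. It is not part of the assembler software. It assumes that both files use FASTA-style ">" header lines, that the record name is the first whitespace-delimited token of the header, and that the .qual body holds one space-separated integer score per base. The function names and the sample record are hypothetical.

```python
from typing import Dict, Iterable, List, Tuple


def read_fasta_like(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Group the body lines of a FASTA-style file under each record name."""
    records: Dict[str, List[str]] = {}
    name = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Assumption: the record name is the first token of the header line.
            parts = line[1:].split()
            name = parts[0] if parts else ""
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return records


def pair_sequences_with_quals(
    fna_lines: Iterable[str], qual_lines: Iterable[str]
) -> Dict[str, Tuple[str, List[int]]]:
    """Return {name: (sequence, scores)} and check one score per base."""
    seqs = {n: "".join(body) for n, body in read_fasta_like(fna_lines).items()}
    quals = {
        # Assumption: quality scores are space-separated integers.
        n: [int(q) for q in " ".join(body).split()]
        for n, body in read_fasta_like(qual_lines).items()
    }
    paired: Dict[str, Tuple[str, List[int]]] = {}
    for name, seq in seqs.items():
        scores = quals.get(name)
        if scores is None:
            raise ValueError(f"no quality record for {name}")
        if len(scores) != len(seq):
            raise ValueError(f"length mismatch for {name}")
        paired[name] = (seq, scores)
    return paired


if __name__ == "__main__":
    # Hypothetical in-memory records; real input would come from
    # open("454Isotigs.fna") and open("454Isotigs.qual").
    fna = [">isotig00001 example", "ACGT", "AC"]
    qual = [">isotig00001 example", "40 40 30 30", "20 20"]
    print(pair_sequences_with_quals(fna, qual))
```

Because the functions accept any iterable of lines, open file handles can be passed directly in place of the sample lists.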
1.15.2.2
454Isotigs.faa and 454IsotigOrfAlign.txt
Isotig name (as it appears in all other output files)
Isotig nucleotide end position (1-based, inclusive)
ORF frame {-3, -2, -1, +1, +2, +3}
Nucleotide sequence length (including stop codon, if present in the sequence)
Protein sequence length (excluding stop codon, if present in the sequence)
The 454IsotigOrfAlign.txt file contains one or more lines for each isotig nucleotide sequence and for each amino acid sequence encoded by an ORF found in that isotig (Figure 38). The isotig sequence always comes first, and the amino acid sequences are sorted by start position in the isotig (or by end position for negative frames). Each line consists of the following items:
Accno: The isotig name or ORF info (frame:startBase..endBase); the longest ORF for each isotig ends with an asterisk “*”
First position of the current section: Nucleotide base position for an isotig, or amino acid position for an ORF sequence; for a negative frame, this value is greater than the last position of the current section
Sequence: Section of nucleotide or amino acid sequence
Last position of the current section: Nucleotide base position for an isotig, or amino acid position for an ORF sequence; for a negative frame, this value is less than the first position of the current section
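The four items above can be read with a short Python sketch such as the one below. It is illustrative only and rests on assumptions not confirmed by this section: that the items are separated by whitespace in the order Accno, first position, sequence, last position; that the sequence section contains no internal spaces; and that the frame in the ORF info is written as in the frame list (for example -2 or +1), with the longest-ORF asterisk attached directly to the Accno. The class name, function name, and sample lines are hypothetical.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Assumed ORF Accno shape: frame:startBase..endBase, optional trailing "*".
ORF_ACCNO = re.compile(r"^([+-][123]):(\d+)\.\.(\d+)(\*?)$")


@dataclass
class OrfAlignLine:
    accno: str
    first: int
    sequence: str
    last: int
    frame: Optional[int] = None      # None for an isotig (nucleotide) line
    orf_start: Optional[int] = None
    orf_end: Optional[int] = None
    is_longest: bool = False         # True when the Accno ends with "*"


def parse_orf_align_line(line: str) -> OrfAlignLine:
    """Split one line into its four items (assumed whitespace-separated)."""
    fields = line.split()
    if len(fields) != 4:
        raise ValueError(f"expected 4 items, got {len(fields)}: {line!r}")
    accno, first, sequence, last = fields
    record = OrfAlignLine(accno, int(first), sequence, int(last))
    match = ORF_ACCNO.match(accno)
    if match:
        # The Accno looks like ORF info rather than an isotig name.
        record.frame = int(match.group(1))
        record.orf_start = int(match.group(2))
        record.orf_end = int(match.group(3))
        record.is_longest = match.group(4) == "*"
    return record


if __name__ == "__main__":
    # Hypothetical lines, not copied from a real 454IsotigOrfAlign.txt file.
    print(parse_orf_align_line("isotig00001 1 ATGGCC 6"))
    print(parse_orf_align_line("-2:10..60* 17 MKL 15"))
```

For the negative-frame sample line, the first position (17) is greater than the last position (15), which matches the description of negative frames above.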
1.15.2.3
454Isotigs.ace and consed/…
The cDNA assembler produces an ACE file similar to that produced by the genomic assembler (see 1.15.1.5). Isotigs, as well as large contigs that do not appear in any isotig, are represented in the ACE file along with the reads used to construct their multiple alignments.
1.15.2.4
454NewblerMetrics.txt
1.15.2.5
454RefLink.txt
1.15.2.6
454IsotigsLayout.txt
Figure 43: The end of the 454IsotigsLayout.txt file