skip to main content
Roche logo
3. SFF Tools Commands : 3.1 sfffile
3.1
sfffile [options...] [MIDList@](sfffile | datadir)
where “MIDList” is a multiplexing information string used to filter the set of file reads output. (See the Appendix 4.7 for details on the use of MIDs in the data analysis.)
These two options adjust the trim points for some or all reads in the output SFF file. The specified “trimfile” should contain one or more lines consisting of (1) a read accession number, (2) a starting trimpoint, and (3) an ending trimpoint, separated by whitespace characters or where the trimpoints are separated by a dash (e.g., “accno 12 543” or “accno 12-543”). The trimpoint values are 1-based positions that denote the first and last base of the trimmed region (e.g. for a read 800 bases in length, the lines above specify that bases 1-11 and 544-800 should be ignored, and bases 12-543 form the trimmed region of the read). A value of 0 specifies that the beginning or end of the read should be used (e.g. for a read 800 bases in length, the line “accno 12 0” sets the trimmed region to 12-800).
[-mcf filename]
This option tells sfffile to generate an output file containing a certain number of reads, by randomly “picking” reads from the input. The argument number can be followed by a ‘k’ or ‘m’ to specify thousands or millions of reads, respectively. Reads in any input SFF files are merged directly into the single output SFF file generated by this command. If SFF files are not present in a Data Processing folder (e.g. for a Run whose data has been processed with a version of the 454 Sequencing System software anterior to 1.0.52), the signal processing step of the GS Run Processor application must be rerun on the Data Processing folder, and can then merged into the output SFF file generated by the sfffile command. Input “D_...” directories (with data generated on a GS FLX+ Instrument) may be prepended with a list of regions separated by a colon. For example, “1,3-5,7:R_dir/D_dir” tells sfffile to use regions 1, 3, 4, 5 and 7 of R_dir/D_dir. An optional multiplexing information string can be prepended to each file/data-directory argument.