skip to main content
Roche logo
The number of CPUs is selected from the dropdown list above the “Start Computation” button (Figure 1‑44). The dropdown list contains values from 0 through the number of processors available on the Attendant PC or Datarig. The value 0 represents “All available processors” and is equivalent to selecting the highest value in the list. The number of CPUs option is the GUI equivalent to the command line option –cpu, which is described in section 3.3.2.1.
1.
Trim Read Data: for each Read Data Set, the Primer sequences and MIDs, if used, of all the reads are identified for demultiplexing purposes and the trim points are noted for the “Target” sequences. (As mentioned in section 1.1.1.3, trimming the Primers is important because any variations found therein would have no biological significance, and therefore should not be reported by the AVA software.)
a.
splitting the reads of a Read Data Set over multiple Samples, e.g. if the experiment was set up such that one or more Amplicons (which are associated with different Samples either directly, or through the use of MIDs) were present in a PicoTiterPlate Device (GS Junior Instrument) or PicoTiterPlate region (GS FLX+ System) of a sequencing Run; and/or
b.
joining of the reads from multiple Read Data Sets into any given Sample, e.g. if the experiment was set up such that multiple regions of a PicoTiterPlate Device (GS FLX+ System) and/or sequencing Runs contributed reads to the Amplicons that are associated with the Sample.
5.
In addition, the AVA software automatically searches for potential Variants not explicitly entered into the system. These Auto-Detected “Putative” Variants receive “Intelligent Names” that are unique and compact, yet descriptive (see section 4.2); these Auto-Detected Variants are not automatically loaded into the Project, but can be manually loaded based on particular selection criteria on the main Variants Tab (see section 1.5.2.6). The AVA software automatically searches for substitution (SNP) and block deletion (>=3 bp) Variants. (Insertion Variants and block deletion Variants <3 bp are not currently automatically detected, so manual browsing of the alignments may be needed if these types of Variants are anticipated.) Combined with the ability to selectively load and subsequently sort and filter these Variants based on their frequencies and read-orientation support, the vast majority of interesting Variants can be easily discovered and evaluated from the view of the data provided in the Variant tab’s Sample – Variants Table. By providing the ability to both edit the Variant Status, and filter by that Status, the AVA software provides a simple Discovery Workflow to determine which Variants have been evaluated and what the outcome of that evaluation was. See section 1.5.2.7 for more details on the proposed Variant Discovery Workflow process.
If you modified some information in the Project after computing it (e.g. imported new Read Data Sets, defined new Variants, etc.), you must re-compute it to update the results reported in all the tabs. When you re-compute a Project, the AVA software uses cached results, if possible, for any step that has not changed (except for demultiplexing, which is brief and is always carried out). This can save a lot of time in what would otherwise be needless repetition of calculations.
The Computations tab also has a “Stop computation” button that you can use to abort calculations, e.g. if you decide to make further changes to your Project set up while calculations are on the way. This can be useful for very large Projects, where the calculations can take some time. When you do this, the AVA software accepts the results that have already been re-computed, but it also keeps the results from the previous computation that have not yet been altered by the re-computation. The reason for this is that since Amplicon Projects are incremental, re-calculations are often done after adding Read Data Set(s) or Sample(s) to the Project in a manner such that much of the previously computed results are still valid. On the other hand, the results that were in the process of being re-computed at the time of the interruption could truly be corrupted; the results for these computations, caught in an intermediate state, are removed from the computation’s output (such as in the Variants Tab) and the navigation elements that would be used to load these results are disabled (such as those used to load multiple alignments in the Global Align tab).
Interrupted computations: If a computation (or re-computation) is interrupted, there is a risk that part of the output may not match the state of the saved Project. While the AVA software withholds the potentially corrupted results from the data that was being processed at the time of the interruption, it also maintains the results from previous computations that had not yet been altered at the time of the interruption. Be aware that those older results may not be consistent with more recent updates to the Project. The outcome of this is similar to the case described in the “Caution” at the end of section 1.3 (editing a Project in a manner that is germane to previously computed results). If you find that the data in these tabs does not reflect the current state of the Project, try re-computing it. The only way to be completely sure that the Project is consistent is to allow the computation to run to completion, without interruption.
Figure 1‑45 shows the different kinds of warning messages that can occur:
The message indicating that further messages in the Computation Warning window are based on the Project in its current state, i.e. as computation would see it if it were saved; this gives you warning of problems in a Project even before you save it to disk. If your Project is up to date on the disk, the other messages below may still occur but they would then concern the Project as saved.
the Reference Sequence itself is incompletely defined (e.g. it was given a name, but no actual sequence; in this case the Amplicon wouldn't likely have any Target Start/End set either). 
the Variant pattern is inconsistent with the Reference Sequence [e.g. a substitute constraint specifies the substituted nucleotide as the same as the one already in the Reference Sequence (should be a match constraint instead)]