Train a CNN model for filtering variants

    gatk CNNVariantTrain \
        --tensor-type reference \
        --input-tensor-dir my_tensor_folder \
        --model-name my_1d_model

    gatk CNNVariantTrain \
        --input-tensor-dir my_tensor_folder \
        --tensor-type read_tensor \
        --model-name my_2d_model

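The basic invocations above can be extended with the optional architecture and schedule arguments documented on this page. The following is a minimal dry-run sketch, not a recommended configuration: the layer sizes, step counts, and directory names are illustrative assumptions, and repeating `--conv-layers` and `--fc-layers` once per layer is assumed to be how list-valued arguments are supplied. The script only prints the command unless `RUN_GATK=1` is set and `gatk` is on the `PATH`.

```shell
#!/bin/sh
# Dry-run sketch: assemble a CNNVariantTrain command for a 2D (read_tensor) model.
# All values below are illustrative assumptions, not recommended settings.
TENSOR_DIR="my_tensor_folder"   # hypothetical directory of training tensors
OUTPUT_DIR="./models"           # hypothetical output directory

# Build the argument list as positional parameters so quoting is preserved.
# Assumption: list-valued arguments are given by repeating the flag.
set -- gatk CNNVariantTrain \
    --input-tensor-dir "$TENSOR_DIR" \
    --tensor-type read_tensor \
    --conv-layers 64 --conv-layers 32 \
    --conv-width 5 --conv-height 5 \
    --padding same \
    --fc-layers 24 \
    --epochs 20 --training-steps 50 --validation-steps 5 \
    --output-dir "$OUTPUT_DIR" \
    --model-name my_2d_model

CMD="$*"
ARG_COUNT=$#
echo "$CMD"

# Only execute when explicitly requested and gatk is installed.
if [ "${RUN_GATK:-0}" = "1" ] && command -v gatk >/dev/null 2>&1; then
    "$@"
fi
```

Printing the command first makes it easy to review the full argument list before committing to a training run.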
This table summarizes the command-line arguments that are specific to this tool. For more details on each argument, see the list below the table.
| Argument name(s) | Default value | Summary |
|---|---|---|
| Required Arguments | | |
| --input-tensor-dir | null | Directory of training tensors to create. |
| Optional Tool Arguments | | |
| --annotation-shortcut | false | Shortcut connections on the annotation layers. |
| --annotation-units | 16 | Number of units connected to the annotation input layer |
| --arguments_file | [] | read one or more arguments files and add them to the command line |
| --conv-batch-normalize | false | Batch normalize convolution layers |
| --conv-dropout | 0.0 | Dropout rate in convolution layers |
| --conv-height | 5 | Height of convolution kernels |
| --conv-layers | [] | List of number of filters to use in each convolutional layer |
| --conv-width | 5 | Width of convolution kernels |
| --epochs | 10 | Maximum number of training epochs. |
| --fc-batch-normalize | false | Batch normalize fully-connected layers |
| --fc-dropout | 0.0 | Dropout rate in fully-connected layers |
| --fc-layers | [] | List of number of filters to use in each fully-connected layer |
| --gcs-max-retries -gcs-retries | 20 | If the GCS bucket channel errors out, how many times it will attempt to re-initiate the connection |
| --gcs-project-for-requester-pays | "" | Project to bill when accessing "requester pays" buckets. If unset, these buckets cannot be accessed. |
| --help -h | false | display the help message |
| --image-dir | null | Path where plots and figures are saved. |
| --model-name | variant_filter_model | Name of the model to be trained. |
| --output-dir | ./ | Directory where models will be saved, defaults to current working directory. |
| --padding | valid | Padding for convolution layers, valid or same |
| --spatial-dropout | false | Spatial dropout on convolution layers |
| --tensor-type | reference | Type of tensors to use as input: reference for 1D reference tensors and read_tensor for 2D tensors. |
| --training-steps | 10 | Number of training steps per epoch. |
| --validation-steps | 2 | Number of validation steps per epoch. |
| --version | false | display the version number for this tool |
| Optional Common Arguments | | |
| --gatk-config-file | null | A configuration file to use with the GATK. |
| --QUIET | false | Whether to suppress job-summary info on System.err. |
| --tmp-dir | null | Temp directory to use. |
| --use-jdk-deflater -jdk-deflater | false | Whether to use the JdkDeflater (as opposed to IntelDeflater) |
| --use-jdk-inflater -jdk-inflater | false | Whether to use the JdkInflater (as opposed to IntelInflater) |
| --verbosity | INFO | Control verbosity of logging. |
| Advanced Arguments | | |
| --annotation-set | best_practices | Which set of annotations to use. |
| --channels-last | true | Store the channels in the last axis of tensors, tensorflow->true, theano->false |
| --showHidden | false | display hidden arguments |
Arguments in this list are specific to this tool. Keep in mind that other arguments are available that are shared with other tools (e.g. command-line GATK arguments); see Inherited arguments above.

--annotation-set
Which set of annotations to use.
String best_practices

--annotation-shortcut
Shortcut connections on the annotation layers.
boolean false

--annotation-units
Number of units connected to the annotation input layer
int 16 [ [ -∞ ∞ ] ]

--arguments_file
read one or more arguments files and add them to the command line
List[File] []
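
A minimal sketch of using `--arguments_file`, assuming the file simply holds arguments written as they would appear on the command line. The file name `train.args` and the values in it are hypothetical. As written, the script only prints the resulting command; it runs `gatk` only when `RUN_GATK=1` is set and `gatk` is on the `PATH`.

```shell
#!/bin/sh
# Sketch: keep shared training options in an arguments file.
# Assumption: the file contains arguments exactly as they would be typed
# on the command line.
ARGS_FILE="train.args"   # hypothetical file name

cat > "$ARGS_FILE" <<'EOF'
--input-tensor-dir my_tensor_folder
--tensor-type reference
--epochs 10
EOF

# Options given directly on the command line are combined with those in the file.
set -- gatk CNNVariantTrain --arguments_file "$ARGS_FILE" --model-name my_1d_model
CMD="$*"
echo "$CMD"

# Only execute when explicitly requested and gatk is installed.
if [ "${RUN_GATK:-0}" = "1" ] && command -v gatk >/dev/null 2>&1; then
    "$@"
fi
```

Keeping the architecture and schedule options in a file makes it easier to reuse the same settings across several training runs.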

--channels-last
Store the channels in the last axis of tensors, tensorflow->true, theano->false
boolean true

--conv-batch-normalize
Batch normalize convolution layers
boolean false

--conv-dropout
Dropout rate in convolution layers
float 0.0 [ [ -∞ ∞ ] ]

--conv-height
Height of convolution kernels
int 5 [ [ -∞ ∞ ] ]

--conv-layers
List of number of filters to use in each convolutional layer
List[Integer] []

--conv-width
Width of convolution kernels
int 5 [ [ -∞ ∞ ] ]

--epochs
Maximum number of training epochs.
int 10 [ [ 0 ∞ ] ]

--fc-batch-normalize
Batch normalize fully-connected layers
boolean false

--fc-dropout
Dropout rate in fully-connected layers
float 0.0 [ [ -∞ ∞ ] ]

--fc-layers
List of number of filters to use in each fully-connected layer
List[Integer] []

--gatk-config-file
A configuration file to use with the GATK.
String null

--gcs-max-retries -gcs-retries
If the GCS bucket channel errors out, how many times it will attempt to re-initiate the connection
int 20 [ [ -∞ ∞ ] ]

--gcs-project-for-requester-pays
Project to bill when accessing "requester pays" buckets. If unset, these buckets cannot be accessed.
String ""

--help -h
display the help message
boolean false

--image-dir
Path where plots and figures are saved.
String null

--input-tensor-dir
Directory of training tensors to create.
R String null

--model-name
Name of the model to be trained.
String variant_filter_model

--output-dir
Directory where models will be saved, defaults to current working directory.
String ./

--padding
Padding for convolution layers, valid or same
String valid
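
To illustrate how `--padding` interacts with the kernel size, here is a small arithmetic sketch. It assumes stride-1 convolutions and the usual meaning of the two modes (`valid` applies no padding, `same` pads so the output keeps the input size). The input width of 128 is a hypothetical value, not something this tool prescribes.

```shell
#!/bin/sh
# Sketch: output size of a stride-1 convolution along one axis.
# Assumption: "valid" means no padding; "same" means zero padding that
# preserves the input size.
INPUT_WIDTH=128   # hypothetical tensor width
CONV_WIDTH=5      # default value of --conv-width

VALID_OUT=$((INPUT_WIDTH - CONV_WIDTH + 1))
SAME_OUT=$INPUT_WIDTH

echo "valid: $VALID_OUT"
echo "same:  $SAME_OUT"
```

Under these assumptions, each `valid` layer trims `CONV_WIDTH - 1` positions from the axis, so deep stacks of convolution layers shrink the tensor unless `same` is used.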

--QUIET
Whether to suppress job-summary info on System.err.
Boolean false

--showHidden
display hidden arguments
boolean false

--spatial-dropout
Spatial dropout on convolution layers
boolean false

--tensor-type
Type of tensors to use as input: reference for 1D reference tensors and read_tensor for 2D tensors.
The --tensor-type argument is an enumerated type (TensorType), which can have one of the following values: reference, read_tensor.
TensorType reference

--tmp-dir
Temp directory to use.
String null

--training-steps
Number of training steps per epoch.
int 10 [ [ 0 ∞ ] ]

--use-jdk-deflater -jdk-deflater
Whether to use the JdkDeflater (as opposed to IntelDeflater)
boolean false

--use-jdk-inflater -jdk-inflater
Whether to use the JdkInflater (as opposed to IntelInflater)
boolean false

--validation-steps
Number of validation steps per epoch.
int 2 [ [ 0 ∞ ] ]
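
Because `--training-steps` and `--validation-steps` are counted per epoch and `--epochs` is a maximum, the product of these values gives an upper bound on the work in a run. A small sketch using the default values:

```shell
#!/bin/sh
# Sketch: upper bound on the number of steps in one training run,
# using the default argument values.
EPOCHS=10            # default --epochs (a maximum)
TRAINING_STEPS=10    # default --training-steps, per epoch
VALIDATION_STEPS=2   # default --validation-steps, per epoch

MAX_TRAINING_STEPS=$((EPOCHS * TRAINING_STEPS))
MAX_VALIDATION_STEPS=$((EPOCHS * VALIDATION_STEPS))

echo "at most $MAX_TRAINING_STEPS training steps"
echo "at most $MAX_VALIDATION_STEPS validation steps"
```

With the defaults this is at most 100 training steps and 20 validation steps. A run that stops before the maximum number of epochs would perform fewer.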

--verbosity
Control verbosity of logging.
The --verbosity argument is an enumerated type (LogLevel), which can have one of the following values: ERROR, WARNING, INFO, DEBUG.
LogLevel INFO

--version
display the version number for this tool
boolean false

GATK version 4.1.0.0 built at Tue, 29 Jan 2019 22:20:41 -0500.