SUMMARISING RUN PARAMETERS ========================== Input filename: A-15173-2.end2.fastq.gz Trimming mode: paired-end Trim Galore version: 0.4.4 Cutadapt version: 1.16 Quality Phred score cutoff: 20 Quality encoding type selected: ASCII+33 Adapter sequence: 'CTGTCTCTTATA' (Nextera Transposase sequence; auto-detected) Maximum trimming error rate: 0.1 (default) Minimum required adapter overlap (stringency): 1 bp Minimum required sequence length for both reads before a sequence pair gets removed: 20 bp All sequences will be trimmed by 1 bp on their 3' end to avoid problems with invalid paired-end alignments with Bowtie 1 Output file will be GZIP compressed This is cutadapt 1.16 with Python 2.7.13 Command line parameters: -f fastq -e 0.1 -q 20 -O 1 -a CTGTCTCTTATA A-15173-2.end2.fastq.gz Running on 1 core Trimming 1 adapter with at most 10.0% errors in single-end mode ... Finished in 907.46 s (37 us/read; 1.61 M reads/minute). === Summary === Total reads processed: 24,422,282 Reads with adapters: 9,555,470 (39.1%) Reads written (passing filters): 24,422,282 (100.0%) Total basepairs processed: 1,566,212,859 bp Quality-trimmed: 86,753 bp (0.0%) Total written (filtered): 1,549,701,259 bp (98.9%) === Adapter 1 === Sequence: CTGTCTCTTATA; Type: regular 3'; Length: 12; Trimmed: 9555470 times. No. of allowed errors: 0-9 bp: 0; 10-12 bp: 1 Bases preceding removed adapters: A: 27.7% C: 27.3% G: 23.1% T: 21.9% none/other: 0.0% Overview of removed sequences length count expect max.err error counts 1 7487962 6105570.5 0 7487962 2 1413125 1526392.6 0 1413125 3 376485 381598.2 0 376485 4 82152 95399.5 0 82152 5 21266 23849.9 0 21266 6 11497 5962.5 0 11497 7 4370 1490.6 0 4370 8 3509 372.7 0 3509 9 3714 93.2 0 2951 763 10 3743 23.3 1 2493 1250 11 4300 5.8 1 2630 1670 12 3656 1.5 1 2760 896 13 3536 1.5 1 2857 679 14 4069 1.5 1 2553 1516 15 3856 1.5 1 2390 1466 16 3041 1.5 1 2254 787 17 2939 1.5 1 2246 693 18 2756 1.5 1 2515 241 19 3458 1.5 1 2977 481 20 3678 1.5 1 3146 532 21 4029 1.5 1 3706 323 22 5359 1.5 1 4159 1200 23 5829 1.5 1 5404 425 24 5920 1.5 1 5419 501 25 4452 1.5 1 4174 278 26 3444 1.5 1 3123 321 27 3413 1.5 1 3034 379 28 3909 1.5 1 3471 438 29 4330 1.5 1 3992 338 30 4831 1.5 1 4512 319 31 4999 1.5 1 4624 375 32 5384 1.5 1 5001 383 33 10137 1.5 1 9560 577 34 4514 1.5 1 4165 349 35 7637 1.5 1 7249 388 36 3835 1.5 1 3540 295 37 3435 1.5 1 3179 256 38 5692 1.5 1 5285 407 39 1757 1.5 1 1532 225 40 2378 1.5 1 2127 251 41 1132 1.5 1 909 223 42 746 1.5 1 572 174 43 830 1.5 1 371 459 44 531 1.5 1 332 199 45 772 1.5 1 327 445 46 601 1.5 1 354 247 47 510 1.5 1 271 239 48 926 1.5 1 574 352 49 1054 1.5 1 827 227 50 1878 1.5 1 97 1781 51 362 1.5 1 16 346 52 271 1.5 1 4 267 53 451 1.5 1 8 443 54 127 1.5 1 8 119 55 445 1.5 1 4 441 56 209 1.5 1 5 204 57 152 1.5 1 2 150 58 421 1.5 1 2 419 59 292 1.5 1 2 290 60 1014 1.5 1 2 1012 61 166 1.5 1 2 164 62 221 1.5 1 18 203 63 396 1.5 1 5 391 64 122 1.5 1 10 112 65 130 1.5 1 3 127 66 166 1.5 1 1 165 67 252 1.5 1 3 249 68 383 1.5 1 3 380 69 69 1.5 1 5 64 70 85 1.5 1 7 78 71 59 1.5 1 5 54 72 92 1.5 1 12 80 73 387 1.5 1 18 369 74 140 1.5 1 34 106 75 1614 1.5 1 1379 235 76 68 1.5 1 5 63 RUN STATISTICS FOR INPUT FILE: A-15173-2.end2.fastq.gz ============================================= 24422282 sequences processed in total Total number of sequences analysed for the sequence pair length validation: 24422282 Number of sequence pairs removed because at least one read was shorter than the length cutoff (20 bp): 20253 (0.08%)