SUMMARISING RUN PARAMETERS ========================== Input filename: KO-N4371-2.end2.fastq.gz Trimming mode: paired-end Trim Galore version: 0.4.4 Cutadapt version: 1.16 Quality Phred score cutoff: 20 Quality encoding type selected: ASCII+33 Adapter sequence: 'CTGTCTCTTATA' (Nextera Transposase sequence; auto-detected) Maximum trimming error rate: 0.1 (default) Minimum required adapter overlap (stringency): 1 bp Minimum required sequence length for both reads before a sequence pair gets removed: 20 bp All sequences will be trimmed by 1 bp on their 3' end to avoid problems with invalid paired-end alignments with Bowtie 1 Output file will be GZIP compressed This is cutadapt 1.16 with Python 2.7.13 Command line parameters: -f fastq -e 0.1 -q 20 -O 1 -a CTGTCTCTTATA KO-N4371-2.end2.fastq.gz Running on 1 core Trimming 1 adapter with at most 10.0% errors in single-end mode ... Finished in 819.50 s (35 us/read; 1.69 M reads/minute). === Summary === Total reads processed: 23,086,373 Reads with adapters: 9,130,373 (39.5%) Reads written (passing filters): 23,086,373 (100.0%) Total basepairs processed: 1,485,521,156 bp Quality-trimmed: 102,208 bp (0.0%) Total written (filtered): 1,469,298,826 bp (98.9%) === Adapter 1 === Sequence: CTGTCTCTTATA; Type: regular 3'; Length: 12; Trimmed: 9130373 times. No. of allowed errors: 0-9 bp: 0; 10-12 bp: 1 Bases preceding removed adapters: A: 26.9% C: 27.6% G: 23.6% T: 21.8% none/other: 0.1% Overview of removed sequences length count expect max.err error counts 1 7123031 5771593.2 0 7123031 2 1351407 1442898.3 0 1351407 3 375235 360724.6 0 375235 4 82510 90181.1 0 82510 5 22091 22545.3 0 22091 6 12321 5636.3 0 12321 7 4418 1409.1 0 4418 8 3361 352.3 0 3361 9 3476 88.1 0 2770 706 10 3495 22.0 1 2367 1128 11 4034 5.5 1 2579 1455 12 3657 1.4 1 2746 911 13 3262 1.4 1 2638 624 14 3681 1.4 1 2599 1082 15 3317 1.4 1 2325 992 16 2627 1.4 1 2084 543 17 2724 1.4 1 2178 546 18 2754 1.4 1 2455 299 19 3428 1.4 1 3034 394 20 3447 1.4 1 3004 443 21 3951 1.4 1 3612 339 22 4980 1.4 1 4061 919 23 5549 1.4 1 5197 352 24 5461 1.4 1 5070 391 25 4105 1.4 1 3814 291 26 3234 1.4 1 2918 316 27 3331 1.4 1 2926 405 28 3751 1.4 1 3346 405 29 4156 1.4 1 3788 368 30 4649 1.4 1 4304 345 31 5110 1.4 1 4698 412 32 5620 1.4 1 5197 423 33 10335 1.4 1 9782 553 34 4339 1.4 1 4001 338 35 8028 1.4 1 7617 411 36 4159 1.4 1 3852 307 37 3934 1.4 1 3662 272 38 6756 1.4 1 6332 424 39 2138 1.4 1 1888 250 40 3420 1.4 1 3099 321 41 1597 1.4 1 1366 231 42 1129 1.4 1 921 208 43 1039 1.4 1 568 471 44 640 1.4 1 452 188 45 712 1.4 1 396 316 46 680 1.4 1 433 247 47 487 1.4 1 284 203 48 986 1.4 1 627 359 49 1105 1.4 1 900 205 50 1453 1.4 1 111 1342 51 316 1.4 1 17 299 52 228 1.4 1 10 218 53 448 1.4 1 5 443 54 122 1.4 1 5 117 55 432 1.4 1 9 423 56 206 1.4 1 4 202 57 121 1.4 1 1 120 58 427 1.4 1 8 419 59 249 1.4 1 6 243 60 987 1.4 1 7 980 61 174 1.4 1 7 167 62 256 1.4 1 63 193 63 387 1.4 1 10 377 64 136 1.4 1 23 113 65 131 1.4 1 7 124 66 156 1.4 1 5 151 67 227 1.4 1 12 215 68 373 1.4 1 7 366 69 87 1.4 1 6 81 70 90 1.4 1 17 73 71 68 1.4 1 10 58 72 117 1.4 1 30 87 73 377 1.4 1 25 352 74 199 1.4 1 95 104 75 2846 1.4 1 2632 214 76 103 1.4 1 15 88 RUN STATISTICS FOR INPUT FILE: KO-N4371-2.end2.fastq.gz ============================================= 23086373 sequences processed in total Total number of sequences analysed for the sequence pair length validation: 23086373 Number of sequence pairs removed because at least one read was shorter than the length cutoff (20 bp): 21022 (0.09%)