Basic Statistics
| Measure | Value |
|---|---|
| Filename | YD1_1.fq.gz |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 97801958 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 150 |
| %GC | 51 |
Per base sequence quality
Per tile sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| GATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGGCGTATCTCGTATG | 512373 | 0.5238882845269826 | TruSeq Adapter, Index 20 (97% over 38bp) |
| CGCGATCCCACTACTGATCAGCACGGGAGTTTTGACCTGCTCCGTTTCCG | 105977 | 0.10835877130394465 | No Hit |
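
The Percentage column appears to be each sequence's Count divided by Total Sequences from Basic Statistics, times 100. The sketch below checks that arithmetic against the two rows above. The helper name `percentage_of_total` is hypothetical and is not part of FastQC.

```python
# Values copied from the Basic Statistics and Overrepresented sequences tables.
TOTAL_SEQUENCES = 97801958

overrepresented = {
    "GATCGGAAGAGCACACGTCTGAACTCCAGTCACGAGGCGTATCTCGTATG": 512373,
    "CGCGATCCCACTACTGATCAGCACGGGAGTTTTGACCTGCTCCGTTTCCG": 105977,
}


def percentage_of_total(count, total=TOTAL_SEQUENCES):
    """Return count as a percentage of all reads (hypothetical helper)."""
    return count / total * 100


for sequence, count in overrepresented.items():
    # Show a short prefix of the sequence, the raw count, and the derived percentage.
    print(f"{sequence[:12]}...  {count:>7}  {percentage_of_total(count):.4f}")
```

Rounded to four decimal places, the derived values are 0.5239 and 0.1084, which agree with the reported percentages.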
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| GCGACAT | 24640 | 0.0 | 31.69384 | 1 |
| CGCGATC | 60305 | 0.0 | 26.442646 | 1 |
| GCGCGAT | 44140 | 0.0 | 23.51273 | 1 |
| CGACATC | 39705 | 0.0 | 23.37483 | 2 |
| CGATCCC | 74000 | 0.0 | 22.096283 | 3 |
| CCCACTA | 88690 | 0.0 | 19.093634 | 7 |
| CACTACT | 90560 | 0.0 | 18.182804 | 9 |
| TCGCTCG | 18315 | 0.0 | 17.887033 | 3 |
| CGGCGAT | 28800 | 0.0 | 17.310673 | 1 |
| CCACTAC | 102420 | 0.0 | 16.934547 | 8 |
| TACCCTA | 37775 | 0.0 | 16.879927 | 7 |
| TTCGCTC | 30645 | 0.0 | 16.117779 | 2 |
| CCCTACG | 38855 | 0.0 | 15.910637 | 9 |
| ATCCCAC | 119260 | 0.0 | 15.69024 | 5 |
| CGGCAAT | 22505 | 0.0 | 14.940971 | 1 |
| CGGCGCT | 66805 | 0.0 | 14.740245 | 1 |
| ACTACCC | 52505 | 0.0 | 14.570809 | 5 |
| GCGATCC | 111155 | 0.0 | 14.127568 | 2 |
| TTGACTA | 52275 | 0.0 | 13.869988 | 2 |
| TACTCGG | 68165 | 0.0 | 13.693669 | 7 |