Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: D10

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 67973905
ACGTcount: A:0.33, C:0.16, G:0.16, T:0.32

Warning! 1760000 characters in sequence are not A, C, G, or T


File 70 of 228

Found at i:20832561 original size:10 final size:10

Alignment explanation

Indices: 20832507--20832561 Score: 51 Period size: 10 Copynumber: 5.5 Consensus size: 10 20832497 TTTATTTCTA 20832507 TTTTATAATT 1 TTTTATAATT * 20832517 TAATTTATATTT 1 T--TTTATAATT * 20832529 TTTTATATTT 1 TTTTATAATT 20832539 TTTTAT-A-T 1 TTTTATAATT 20832547 TTTTATAATT 1 TTTTATAATT * 20832557 CTTTA 1 TTTTA 20832562 AAGTTGGTTT Statistics Matches: 38, Mismatches: 3, Indels: 8 0.78 0.06 0.16 Matches are distributed among these distances: 8 7 0.18 9 1 0.03 10 21 0.55 12 9 0.24 ACGTcount: A:0.27, C:0.02, G:0.00, T:0.71 Consensus pattern (10 bp): TTTTATAATT Found at i:20833578 original size:14 final size:14 Alignment explanation

Indices: 20833547--20833585 Score: 53 Period size: 14 Copynumber: 2.9 Consensus size: 14 20833537 TTCAACTAGT * 20833547 ATAATA-ATAATCA 1 ATAATATATAATAA * 20833560 TTAATATATAATAA 1 ATAATATATAATAA 20833574 ATAATATATAAT 1 ATAATATATAAT 20833586 GTAGTACTAT Statistics Matches: 22, Mismatches: 3, Indels: 1 0.85 0.12 0.04 Matches are distributed among these distances: 13 5 0.23 14 17 0.77 ACGTcount: A:0.59, C:0.03, G:0.00, T:0.38 Consensus pattern (14 bp): ATAATATATAATAA Found at i:20841051 original size:21 final size:22 Alignment explanation

Indices: 20840996--20841131 Score: 81 Period size: 21 Copynumber: 6.5 Consensus size: 22 20840986 GCTCTTACGG 20840996 GCTTCTGTTTAACTC-ATATGA 1 GCTTCTGTTTAACTCTATATGA * * * 20841017 GTTTCCGTTCAACTCTAT-TGA 1 GCTTCTGTTTAACTCTATATGA * * 20841038 GCTTCTATTTAACTCT-TATGG 1 GCTTCTGTTTAACTCTATATGA * * * * 20841059 GTTTCTATTCAGCTCT-TATGA 1 GCTTCTGTTTAACTCTATATGA * * * 20841080 GCTTCCGTTCAAAC-CT-TATGG 1 GCTTCTGTT-TAACTCTATATGA * * 20841101 GTTTCTGTTTAGCTC-A-ATGA 1 GCTTCTGTTTAACTCTATATGA 20841121 GCTTCTGTTTA 1 GCTTCTGTTTA 20841132 GCCTTCGAGC Statistics Matches: 86, Mismatches: 24, Indels: 11 0.71 0.20 0.09 Matches are distributed among these distances: 20 16 0.19 21 66 0.77 22 4 0.05 ACGTcount: A:0.19, C:0.21, G:0.16, T:0.44 Consensus pattern (22 bp): GCTTCTGTTTAACTCTATATGA Found at i:20841081 original size:42 final size:41 Alignment explanation

Indices: 20840996--20841131 Score: 139 Period size: 42 Copynumber: 3.3 Consensus size: 41 20840986 GCTCTTACGG * * ** * 20840996 GCTTCTGTTTAACTCATATGAGTTTCCGTTCAACTCTATTGA 1 GCTTCTGTTTAACTCTTATGGGTTTCTATTCAGCTCTA-TGA * 20841038 GCTTCTATTTAACTCTTATGGGTTTCTATTCAGCTCTTATGA 1 GCTTCTGTTTAACTCTTATGGGTTTCTATTCAGCTC-TATGA * * * * * 20841080 GCTTCCGTTCAAAC-CTTATGGGTTTCTGTTTAGCTCAATGA 1 GCTTCTGTT-TAACTCTTATGGGTTTCTATTCAGCTCTATGA 20841121 GCTTCTGTTTA 1 GCTTCTGTTTA 20841132 GCCTTCGAGC Statistics Matches: 78, Mismatches: 14, Indels: 6 0.80 0.14 0.06 Matches are distributed among these distances: 40 1 0.01 41 12 0.15 42 60 0.77 43 5 0.06 ACGTcount: A:0.19, C:0.21, G:0.16, T:0.44 Consensus pattern (41 bp): GCTTCTGTTTAACTCTTATGGGTTTCTATTCAGCTCTATGA Found at i:20844763 original size:51 final size:51 Alignment explanation

Indices: 20844612--20844851 Score: 315 Period size: 51 Copynumber: 4.7 Consensus size: 51 20844602 TAAAAGTGAA * * * * 20844612 AGTGATGGTCACATGTGTAGTACTATGTGCAGGCTACTACGTGTACCAGA- 1 AGTGATGGTCACATGTGTAGTACTATGTGTAGGCTACGACGTGTATCGGAT * * * * * * 20844662 A-TGATAGGTCGCATGTGTAGTACTATGTGCAGGCTACTATGCGTACCGGAT 1 AGTGAT-GGTCACATGTGTAGTACTATGTGTAGGCTACGACGTGTATCGGAT * * * * 20844713 AGCT-TTGATCACGTGTGTAGTACTATATGTAGGCTACGACGTGTATCGGAT 1 AG-TGATGGTCACATGTGTAGTACTATGTGTAGGCTACGACGTGTATCGGAT 20844764 AGTGATGGTCACATGTGTAGTACTATGTGTAGGCTACGACGTGTATCGGAT 1 AGTGATGGTCACATGTGTAGTACTATGTGTAGGCTACGACGTGTATCGGAT 20844815 AGTGATGGTCACATGTGTAGTACTATGTGTAGGCTAC 1 AGTGATGGTCACATGTGTAGTACTATGTGTAGGCTAC 20844852 TATGTGAACC Statistics Matches: 167, Mismatches: 18, Indels: 9 0.86 0.09 0.05 Matches are distributed among these distances: 49 4 0.02 50 42 0.25 51 119 0.71 52 1 0.01 53 1 0.01 ACGTcount: A:0.24, C:0.16, G:0.29, T:0.31 Consensus pattern (51 bp): AGTGATGGTCACATGTGTAGTACTATGTGTAGGCTACGACGTGTATCGGAT Found at i:20844833 original size:25 final size:25 Alignment explanation

Indices: 20844753--20844833 Score: 71 Period size: 25 Copynumber: 3.2 Consensus size: 25 20844743 AGGCTACGAC 20844753 GTGTATCGGATAGTGATGGTCACAT 1 GTGTATCGGATAGTGATGGTCACAT * * 20844778 GTGTA--GTACTATGTG-TAGG-CTACGAC 1 GTGTATCGGA-TA-GTGAT-GGTC-AC-AT 20844804 GTGTATCGGATAGTGATGGTCACAT 1 GTGTATCGGATAGTGATGGTCACAT 20844829 GTGTA 1 GTGTA 20844834 GTACTATGTG Statistics Matches: 43, Mismatches: 4, Indels: 18 0.66 0.06 0.28 Matches are distributed among these distances: 23 2 0.05 24 4 0.09 25 18 0.42 26 13 0.30 27 4 0.09 28 2 0.05 ACGTcount: A:0.23, C:0.12, G:0.32, T:0.32 Consensus pattern (25 bp): GTGTATCGGATAGTGATGGTCACAT Found at i:20853216 original size:19 final size:18 Alignment explanation

Indices: 20853177--20853218 Score: 50 Period size: 19 Copynumber: 2.3 Consensus size: 18 20853167 ACACTCGCCG * * 20853177 ATCA-CAACAATCTCCAC 1 ATCATCAACAATCTCAAA 20853194 ATCATCAACAATTCTCAAA 1 ATCATCAACAA-TCTCAAA 20853213 ATCATC 1 ATCATC 20853219 CTCTTAGTCA Statistics Matches: 21, Mismatches: 2, Indels: 2 0.84 0.08 0.08 Matches are distributed among these distances: 17 4 0.19 18 6 0.29 19 11 0.52 ACGTcount: A:0.43, C:0.33, G:0.00, T:0.24 Consensus pattern (18 bp): ATCATCAACAATCTCAAA Found at i:20854835 original size:37 final size:37 Alignment explanation

Indices: 20854778--20854848  Score: 124
Period size: 37  Copynumber: 1.9  Consensus size: 37

  20854768 GTTTACAGAT

                      *
  20854778 CAAATATAATTCAATCGTATCATTTACTATTCATATC
         1 CAAATATAATTAAATCGTATCATTTACTATTCATATC

                          *
  20854815 CAAATATAATTAAATTGTATCATTTACTATTCAT
         1 CAAATATAATTAAATCGTATCATTTACTATTCAT

  20854849 GCATATTCAA

Statistics
Matches: 32, Mismatches: 2, Indels: 0
   0.94         0.06       0.00

Matches are distributed among these distances:
  37   32  1.00

ACGTcount: A:0.39, C:0.15, G:0.03, T:0.42

Consensus pattern (37 bp):
CAAATATAATTAAATCGTATCATTTACTATTCATATC


Found at i:20856125 original size:27 final size:25

Alignment explanation

Indices: 20856095--20856151 Score: 60 Period size: 25 Copynumber: 2.2 Consensus size: 25 20856085 ACATAATTGG 20856095 GAAGCACACTCTCGAGCCATATAACAA 1 GAAGCACA-TCT-GAGCCATATAACAA * * * * 20856122 GAAGCTCATGTGAGCCATGTAACAG 1 GAAGCACATCTGAGCCATATAACAA 20856147 GAAGC 1 GAAGC 20856152 TTATCCGGGT Statistics Matches: 26, Mismatches: 4, Indels: 2 0.81 0.12 0.06 Matches are distributed among these distances: 25 17 0.65 26 2 0.08 27 7 0.27 ACGTcount: A:0.37, C:0.25, G:0.23, T:0.16 Consensus pattern (25 bp): GAAGCACATCTGAGCCATATAACAA Found at i:20866994 original size:21 final size:20 Alignment explanation

Indices: 20866961--20867000 Score: 53 Period size: 21 Copynumber: 1.9 Consensus size: 20 20866951 CCTTTTTAAT 20866961 ATAAAACCAAATAGAAACCA 1 ATAAAACCAAATAGAAACCA * * 20866981 ATAAATCCTAAATATAAACC 1 ATAAAACC-AAATAGAAACC 20867001 CACAAAAAGA Statistics Matches: 17, Mismatches: 2, Indels: 1 0.85 0.10 0.05 Matches are distributed among these distances: 20 7 0.41 21 10 0.59 ACGTcount: A:0.60, C:0.20, G:0.03, T:0.17 Consensus pattern (20 bp): ATAAAACCAAATAGAAACCA Found at i:20874018 original size:221 final size:219 Alignment explanation

Indices: 20873610--20874053 Score: 610 Period size: 221 Copynumber: 2.0 Consensus size: 219 20873600 AGTTTAAATC * * * 20873610 CCTTAGGACTCTCATCTTGCAGATAGACACTAATCTCAGCCACTGATGTAATTCAATCTGTACCG 1 CCTTAGGAATCTCATATTGCAGATAAACACTAATCTCAGCCACTGATGTAATTCAATCTGTACCG * * * * * * 20873675 TTGGATTTGGGGGAGCTCAACTAAATAGAGGTCTTCTCCCTCATTTGTAATCACTTAGTTATTCC 66 TTGGATCTGGGAGAGATCAACTAAATAGAGGTCTTCCCCCTCATTTGTAATCACTCAGTTAATCC * * 20873740 TTCTAAGTAATAGAATATTTTTTAGAACATTTACTCAAACACTTGGTGTGTTATTATTCTCTTTT 131 TTCTAAGTAATAGAATATTTTTTAGAACATTTACTCAAACACTTGGTGTGCTATTATTCTCTTTC 20873805 GGC-TTTTCTGTTCAATACAAGTT 196 GGCTTTTTCTGTTCAATACAAGTT * * * * 20873828 CCTTAGGAATCTCATATTGTAGATAAACACTAATC-CTAGTCATTGATGTAATTCAATTTGTACC 1 CCTTAGGAATCTCATATTGCAGATAAACACTAATCTC-AGCCACTGATGTAATTCAATCTGTACC 20873892 GTTGGATCT-GGAGAGATCAACTATAAATAGA-GTCCTTCCCCCTCATTTGTAATCACTCAGTTG 65 GTTGGATCTGGGAGAGATCAAC--TAAATAGAGGT-CTTCCCCCTCATTTGTAATCACTCAG-T- ** 20873955 TAATCCTTC-ATAGTAATAGAATATTTTTTAGAGTATTTACTCAAACACTTGGTGTGCTATTATT 125 TAATCCTTCTA-AGTAATAGAATATTTTTTAGAACATTTACTCAAACACTTGGTGTGCTATTATT * * * 20874019 TTCTTTCGGCTTTTTTTGTTCAATATAAGTT 189 CTCTTTCGGCTTTTTCTGTTCAATACAAGTT 20874050 CCTT 1 CCTT 20874054 TGTATTTTCG Statistics Matches: 198, Mismatches: 20, Indels: 12 0.86 0.09 0.05 Matches are distributed among these distances: 217 11 0.06 218 65 0.33 219 32 0.16 220 2 0.01 221 66 0.33 222 22 0.11 ACGTcount: A:0.27, C:0.19, G:0.15, T:0.39 Consensus pattern (219 bp): CCTTAGGAATCTCATATTGCAGATAAACACTAATCTCAGCCACTGATGTAATTCAATCTGTACCG TTGGATCTGGGAGAGATCAACTAAATAGAGGTCTTCCCCCTCATTTGTAATCACTCAGTTAATCC TTCTAAGTAATAGAATATTTTTTAGAACATTTACTCAAACACTTGGTGTGCTATTATTCTCTTTC GGCTTTTTCTGTTCAATACAAGTT Found at i:20874638 original size:26 final size:26 Alignment explanation

Indices: 20874609--20874667 Score: 66 Period size: 29 Copynumber: 2.2 Consensus size: 26 20874599 CCAAGTACGA * 20874609 TTTCT-AGCCTTCATTGCTGATTATTT 1 TTTCTAAGCCTTCATTGC-CATTATTT * 20874635 TTTCACTAAGCCTTCATTGCCATTTTTT 1 TTT--CTAAGCCTTCATTGCCATTATTT 20874663 TTTCT 1 TTTCT 20874668 TTCAAACCTC Statistics Matches: 28, Mismatches: 2, Indels: 6 0.78 0.06 0.17 Matches are distributed among these distances: 26 5 0.18 28 11 0.39 29 12 0.43 ACGTcount: A:0.15, C:0.22, G:0.08, T:0.54 Consensus pattern (26 bp): TTTCTAAGCCTTCATTGCCATTATTT Found at i:20887728 original size:12 final size:12 Alignment explanation

Indices: 20887713--20887740  Score: 56
Period size: 12  Copynumber: 2.3  Consensus size: 12

  20887703 AAATATTACC

  20887713 TTAAATGGAAAT
         1 TTAAATGGAAAT

  20887725 TTAAATGGAAAT
         1 TTAAATGGAAAT

  20887737 TTAA
         1 TTAA

  20887741 TATTTTTTTA

Statistics
Matches: 16, Mismatches: 0, Indels: 0
   1.00         0.00       0.00

Matches are distributed among these distances:
  12   16  1.00

ACGTcount: A:0.50, C:0.00, G:0.14, T:0.36

Consensus pattern (12 bp):
TTAAATGGAAAT


Found at i:20895697 original size:61 final size:61

Alignment explanation

Indices: 20895589--20895738 Score: 167 Period size: 61 Copynumber: 2.4 Consensus size: 61 20895579 ATTTACACAT * * * * 20895589 TCTATCAATTT-GATCCTAAATATAAAAAATAATAATAAATTTAGCCCTCAATATAAACAAAA 1 TCTATCAATTTAG-TCCTAAATCTAAAATAT-TTAATAAATTTAGCCCTCAAAATAAACAAAA * * * * * * ** 20895651 TTTGTCATTTTAGTTCTAATTCTAAAATATTTAATAAATTTGGCCCTCAAAATTTACAAAA 1 TCTATCAATTTAGTCCTAAATCTAAAATATTTAATAAATTTAGCCCTCAAAATAAACAAAA 20895712 TCTATCAATTTAGTCCTAAATCTAAAA 1 TCTATCAATTTAGTCCTAAATCTAAAA 20895739 ATTAAAATTA Statistics Matches: 70, Mismatches: 17, Indels: 3 0.78 0.19 0.03 Matches are distributed among these distances: 61 48 0.69 62 21 0.30 63 1 0.01 ACGTcount: A:0.44, C:0.15, G:0.05, T:0.37 Consensus pattern (61 bp): TCTATCAATTTAGTCCTAAATCTAAAATATTTAATAAATTTAGCCCTCAAAATAAACAAAA Found at i:20898133 original size:32 final size:32 Alignment explanation

Indices: 20898097--20898158  Score: 97
Period size: 32  Copynumber: 1.9  Consensus size: 32

  20898087 AGCTTTTGGT

  20898097 TTTTTCATGTTGGCAAAGAGTTGAACAATGGA
         1 TTTTTCATGTTGGCAAAGAGTTGAACAATGGA

                **     *
  20898129 TTTTTTGTGTTGTCAAAGAGTTGAACAATG
         1 TTTTTCATGTTGGCAAAGAGTTGAACAATG

  20898159 AAAATAGATG

Statistics
Matches: 27, Mismatches: 3, Indels: 0
   0.90         0.10       0.00

Matches are distributed among these distances:
  32   27  1.00

ACGTcount: A:0.29, C:0.08, G:0.24, T:0.39

Consensus pattern (32 bp):
TTTTTCATGTTGGCAAAGAGTTGAACAATGGA


Found at i:20898530 original size:4 final size:4

Alignment explanation

Indices: 20898513--20898555  Score: 77
Period size: 4  Copynumber: 10.8  Consensus size: 4

  20898503 ACAACGATCA

                      *
  20898513 TATC TATC TTTC TATC TATC TATC TATC TATC TATC TATC TAT
         1 TATC TATC TATC TATC TATC TATC TATC TATC TATC TATC TAT

  20898556 ATGTCGGTCT

Statistics
Matches: 37, Mismatches: 2, Indels: 0
   0.95         0.05       0.00

Matches are distributed among these distances:
   4   37  1.00

ACGTcount: A:0.23, C:0.23, G:0.00, T:0.53

Consensus pattern (4 bp):
TATC


Found at i:20902646 original size:53 final size:53

Alignment explanation

Indices: 20902582--20902698 Score: 209 Period size: 53 Copynumber: 2.2 Consensus size: 53 20902572 AAATTTTAAA * * 20902582 TTTTAAATTTGAATATTGATATGTTAATATGTATTTAAAAATATATTGTTATC 1 TTTTAAATTTGAATATTGATATGTTAATATGTATTTAAAAATATACTATTATC 20902635 TTTTAAATTTGAATATTGATATGTTAATATGTATTTAAAAATATACTATTATC 1 TTTTAAATTTGAATATTGATATGTTAATATGTATTTAAAAATATACTATTATC 20902688 TTTT-AATTTGA 1 TTTTAAATTTGA 20902699 TTGAAAGATG Statistics Matches: 62, Mismatches: 2, Indels: 1 0.95 0.03 0.02 Matches are distributed among these distances: 52 7 0.11 53 55 0.89 ACGTcount: A:0.38, C:0.03, G:0.09, T:0.51 Consensus pattern (53 bp): TTTTAAATTTGAATATTGATATGTTAATATGTATTTAAAAATATACTATTATC Found at i:20914543 original size:27 final size:27 Alignment explanation

Indices: 20914503--20914568  Score: 98
Period size: 27  Copynumber: 2.5  Consensus size: 27

  20914493 CAGCAGAGCT

  20914503 ACCAGT-ACAGTATATGTGGCAAAGCC
         1 ACCAGTAACAGTATATGTGGCAAAGCC

             *                   *
  20914529 ACGAGTAACAGTATATGTGGCAGAGCC
         1 ACCAGTAACAGTATATGTGGCAAAGCC

              *
  20914556 ACCCGTAACAGTA
         1 ACCAGTAACAGTA

  20914569 CTTCCTCCAT

Statistics
Matches: 35, Mismatches: 4, Indels: 1
   0.88         0.10       0.03

Matches are distributed among these distances:
  26    5  0.14
  27   30  0.86

ACGTcount: A:0.35, C:0.23, G:0.24, T:0.18

Consensus pattern (27 bp):
ACCAGTAACAGTATATGTGGCAAAGCC


Found at i:20914822 original size:27 final size:27

Alignment explanation

Indices: 20914771--20914822  Score: 77
Period size: 27  Copynumber: 1.9  Consensus size: 27

  20914761 ATGGTCATTT

                     *        **
  20914771 TACCCTACAAGGGTATTTCGGTAATCC
         1 TACCCTACAAAGGTATTTCAATAATCC

  20914798 TACCCTACAAAGGTATTTCAATAAT
         1 TACCCTACAAAGGTATTTCAATAAT

  20914823 TTCACAAACC

Statistics
Matches: 22, Mismatches: 3, Indels: 0
   0.88         0.12       0.00

Matches are distributed among these distances:
  27   22  1.00

ACGTcount: A:0.33, C:0.23, G:0.13, T:0.31

Consensus pattern (27 bp):
TACCCTACAAAGGTATTTCAATAATCC


Found at i:20914863 original size:27 final size:27

Alignment explanation

Indices: 20914809--20914945 Score: 107 Period size: 27 Copynumber: 5.0 Consensus size: 27 20914799 ACCCTACAAA ** * ** 20914809 GGTATTTCAATAATTTCACAAACCAGG 1 GGTATTTTGATAATTTTACAAATTAGG * 20914836 GTTATTTTGATAATTTTACAAATTAGG 1 GGTATTTTGATAATTTTACAAATTAGG * * * * 20914863 GGTGTTTCGGTAATTTTACAAATTAAG 1 GGTATTTTGATAATTTTACAAATTAGG * ** 20914890 GGTATTTT-AGTAATTTTACACA-CCGG 1 GGTATTTTGA-TAATTTTACAAATTAGG * 20914916 AGGTATTTTGATAATTTCACAAATTCAGG 1 -GGTATTTTGATAATTTTACAAATT-AGG 20914945 G 1 G 20914946 TCTCGGTGAC Statistics Matches: 83, Mismatches: 22, Indels: 9 0.73 0.19 0.08 Matches are distributed among these distances: 26 1 0.01 27 78 0.94 28 2 0.02 29 2 0.02 ACGTcount: A:0.32, C:0.11, G:0.18, T:0.39 Consensus pattern (27 bp): GGTATTTTGATAATTTTACAAATTAGG Found at i:20923256 original size:14 final size:14 Alignment explanation

Indices: 20923237--20923272  Score: 72
Period size: 14  Copynumber: 2.6  Consensus size: 14

  20923227 ATTACGTTAG

  20923237 AAATTAAAGTATAT
         1 AAATTAAAGTATAT

  20923251 AAATTAAAGTATAT
         1 AAATTAAAGTATAT

  20923265 AAATTAAA
         1 AAATTAAA

  20923273 TAAAAAATTT

Statistics
Matches: 22, Mismatches: 0, Indels: 0
   1.00         0.00       0.00

Matches are distributed among these distances:
  14   22  1.00

ACGTcount: A:0.61, C:0.00, G:0.06, T:0.33

Consensus pattern (14 bp):
AAATTAAAGTATAT


Found at i:20925073 original size:17 final size:17

Alignment explanation

Indices: 20925051--20925149 Score: 76 Period size: 17 Copynumber: 5.7 Consensus size: 17 20925041 AGGCCAGATC 20925051 GGGCATGTGGGCCACAA 1 GGGCATGTGGGCCACAA 20925068 GGGCATGTGGGCCCACACA 1 GGGCATGTGGG-CCACA-A * * * 20925087 -GGCATATGGGCCAGAC 1 GGGCATGTGGGCCACAA * * 20925103 GGGTATGTGGGCCAGACA 1 GGGCATGTGGGCCACA-A * * * 20925121 -GACGTGTGGGCCCACAC 1 GGGCATGTGGG-CCACAA 20925138 GGGCATGTGGGC 1 GGGCATGTGGGC 20925150 TCATAATTCT Statistics Matches: 63, Mismatches: 13, Indels: 12 0.72 0.15 0.14 Matches are distributed among these distances: 17 36 0.57 18 26 0.41 19 1 0.02 ACGTcount: A:0.20, C:0.25, G:0.41, T:0.13 Consensus pattern (17 bp): GGGCATGTGGGCCACAA Found at i:20925098 original size:35 final size:35 Alignment explanation

Indices: 20925052--20925149 Score: 117 Period size: 35 Copynumber: 2.8 Consensus size: 35 20925042 GGCCAGATCG * 20925052 GGCATGTGGGCCACAAGGGCATGTGGGCCCACACA 1 GGCATGTGGGCCACACGGGCATGTGGGCCCACACA * * * * 20925087 GGCATATGGGCCAGACGGGTATGTGGG-CCAGACA 1 GGCATGTGGGCCACACGGGCATGTGGGCCCACACA * * 20925121 GACGTGTGGGCCCACACGGGCATGTGGGC 1 GGCATGTGGG-CCACACGGGCATGTGGGC 20925150 TCATAATTCT Statistics Matches: 51, Mismatches: 10, Indels: 3 0.80 0.16 0.05 Matches are distributed among these distances: 34 13 0.25 35 38 0.75 ACGTcount: A:0.20, C:0.26, G:0.41, T:0.13 Consensus pattern (35 bp): GGCATGTGGGCCACACGGGCATGTGGGCCCACACA Found at i:20925114 original size:52 final size:53 Alignment explanation

Indices: 20925042--20925149 Score: 139 Period size: 52 Copynumber: 2.1 Consensus size: 53 20925032 ACACGAGCTA * 20925042 GGCCAGATCGGGCATGTGGGCCACA-AGGGCATGTGGGCCCACACAGGCATATG 1 GGCCAGATCGGGCATGTGGGCCACACA-GACATGTGGGCCCACACAGGCATATG * * * * * 20925095 GGCCAGA-CGGGTATGTGGGCCAGACAGACGTGTGGGCCCACACGGGCATGTG 1 GGCCAGATCGGGCATGTGGGCCACACAGACATGTGGGCCCACACAGGCATATG 20925147 GGC 1 GGC 20925150 TCATAATTCT Statistics Matches: 48, Mismatches: 6, Indels: 3 0.84 0.11 0.05 Matches are distributed among these distances: 52 40 0.83 53 8 0.17 ACGTcount: A:0.20, C:0.26, G:0.41, T:0.13 Consensus pattern (53 bp): GGCCAGATCGGGCATGTGGGCCACACAGACATGTGGGCCCACACAGGCATATG Found at i:20931473 original size:296 final size:296 Alignment explanation

Indices: 20930939--20931531 Score: 1062 Period size: 296 Copynumber: 2.0 Consensus size: 296 20930929 ATTCACATTA * 20930939 ATATCATTTGTATTTGTCCTTAATGGTTTTTGCACTCAAAACAAAATGTAAGCAAATATTTGCTC 1 ATATCATTTGTATTTGTCCTTAATGGTTTTTGCACTCAAAACAAAATGTAAGAAAATATTTGCTC * * 20931004 ATTGATTACCTAAATTTTAACTGATACTAAGTTGTATTACGTTGTCGGATCGGAATGTAAGAAGA 66 ATTGATTACCTAAATTTTAACTAATACTAAGTTGTATTACGTTGTCGGATCGAAATGTAAGAAGA 20931069 CAACTTGTATTAGTAGGTAATCTAAAAGGTCTATAGTCTAAAAGGACTATTATGTCGTCTATCAA 131 CAACTTGTATTAGTAGGTAATCTAAAAGGTCTATAGTCTAAAAGGACTATTATGTCGTCTATCAA * * * * 20931134 GTCCAAATAGGTGGATACCTTATCTTGGATATTAGAATGATTTACTCCCAGAAGATAGAGACATA 196 GTCCAAATAGGGGGATACATTATCTTGGATATCAGAATGATTTACTCCCAAAAGATAGAGACATA 20931199 AATATTATTTTCTAGACTAACAATACATTAGACATG 261 AATATTATTTTCTAGACTAACAATACATTAGACATG 20931235 ATATCATTTGTATTTGTCCTTAATGGTTTTTGCACTCAAAACAAAATGTAAGAAAATATTTGCTC 1 ATATCATTTGTATTTGTCCTTAATGGTTTTTGCACTCAAAACAAAATGTAAGAAAATATTTGCTC * * 20931300 ATTGATTACCTAAATTTTAACTAATATTAAGTTGTATTACGTTGTCTGATCGAAATGTAAGAAGA 66 ATTGATTACCTAAATTTTAACTAATACTAAGTTGTATTACGTTGTCGGATCGAAATGTAAGAAGA 20931365 CAACTTGTATTAGTAGGTAATCTAAAAGGTCTATAGTCTAAAAGGACTATTATGTCGTCTATCAA 131 CAACTTGTATTAGTAGGTAATCTAAAAGGTCTATAGTCTAAAAGGACTATTATGTCGTCTATCAA * * 20931430 GTCCAAAT-GGGGAGATGCATTATCTTGGATATCAGAGTGATTTACTCCCAAAAGATAGAGACAT 196 GTCCAAATAGGGG-GATACATTATCTTGGATATCAGAATGATTTACTCCCAAAAGATAGAGACAT * 20931494 AAATGTTATTTTCTAGACTAACAATACATTAGACATG 260 AAATATTATTTTCTAGACTAACAATACATTAGACATG 20931531 A 1 A 20931532 CCCAAGTTGA Statistics Matches: 284, Mismatches: 12, Indels: 2 0.95 0.04 0.01 Matches are distributed among these distances: 295 3 0.01 296 281 0.99 ACGTcount: A:0.36, C:0.13, G:0.16, T:0.35 Consensus pattern (296 bp): ATATCATTTGTATTTGTCCTTAATGGTTTTTGCACTCAAAACAAAATGTAAGAAAATATTTGCTC ATTGATTACCTAAATTTTAACTAATACTAAGTTGTATTACGTTGTCGGATCGAAATGTAAGAAGA CAACTTGTATTAGTAGGTAATCTAAAAGGTCTATAGTCTAAAAGGACTATTATGTCGTCTATCAA GTCCAAATAGGGGGATACATTATCTTGGATATCAGAATGATTTACTCCCAAAAGATAGAGACATA 
AATATTATTTTCTAGACTAACAATACATTAGACATG


Found at i:20932191 original size:13 final size:13

Alignment explanation

Indices: 20932173--20932204  Score: 55
Period size: 13  Copynumber: 2.5  Consensus size: 13

  20932163 TTTCCAGGAT

                   *
  20932173 CCACACAGGATGG
         1 CCACACAGAATGG

  20932186 CCACACAGAATGG
         1 CCACACAGAATGG

  20932199 CCACAC
         1 CCACAC

  20932205 GCCCGTGTGA

Statistics
Matches: 18, Mismatches: 1, Indels: 0
   0.95         0.05       0.00

Matches are distributed among these distances:
  13   18  1.00

ACGTcount: A:0.34, C:0.38, G:0.22, T:0.06

Consensus pattern (13 bp):
CCACACAGAATGG


Found at i:20933363 original size:18 final size:17

Alignment explanation

Indices: 20933340--20933373  Score: 50
Period size: 17  Copynumber: 1.9  Consensus size: 17

  20933330 ATTGCTTTTT

  20933340 ATTTTTGGTTTTAGTTTA
         1 ATTTTT-GTTTTAGTTTA

                      *
  20933358 ATTTTTGTTTTTGTTT
         1 ATTTTTGTTTTAGTTT

  20933374 TCTAATTTTT

Statistics
Matches: 15, Mismatches: 1, Indels: 1
   0.88         0.06       0.06

Matches are distributed among these distances:
  17    9  0.60
  18    6  0.40

ACGTcount: A:0.12, C:0.00, G:0.15, T:0.74

Consensus pattern (17 bp):
ATTTTTGTTTTAGTTTA


Found at i:20937483 original size:20 final size:20

Alignment explanation

Indices: 20937441--20937483  Score: 59
Period size: 20  Copynumber: 2.1  Consensus size: 20

  20937431 ATTTTCTTTG

                      *   *
  20937441 TTTTTATTTTATTTATTTAT
         1 TTTTTATTTTAGTTACTTAT

                             *
  20937461 TTTTTATTTTAGTTACTTCT
         1 TTTTTATTTTAGTTACTTAT

  20937481 TTT
         1 TTT

  20937484 CTTGGTGTTC

Statistics
Matches: 20, Mismatches: 3, Indels: 0
   0.87         0.13       0.00

Matches are distributed among these distances:
  20   20  1.00

ACGTcount: A:0.16, C:0.05, G:0.02, T:0.77

Consensus pattern (20 bp):
TTTTTATTTTAGTTACTTAT


Found at i:20937565 original size:15 final size:15

Alignment explanation

Indices: 20937522--20937569 Score: 53 Period size: 15 Copynumber: 3.3 Consensus size: 15 20937512 TTTTAGATTC * 20937522 TTTTATT-TTAGTAT 1 TTTTATTGTTATTAT * 20937536 TTTTATTTTTATTAT 1 TTTTATTGTTATTAT * * 20937551 TTTTCTTGTTCTTAT 1 TTTTATTGTTATTAT 20937566 TTTT 1 TTTT 20937570 TTCATTTTCC Statistics Matches: 29, Mismatches: 4, Indels: 1 0.85 0.12 0.03 Matches are distributed among these distances: 14 7 0.24 15 22 0.76 ACGTcount: A:0.15, C:0.04, G:0.04, T:0.77 Consensus pattern (15 bp): TTTTATTGTTATTAT Found at i:20939317 original size:6 final size:6 Alignment explanation

Indices: 20939220--20939295 Score: 91 Period size: 6 Copynumber: 12.5 Consensus size: 6 20939210 AGTTTATTTC * * 20939220 TTTATT TTTATT TTTATT TTTATT TTTATT TTTATG TTTATT GTTATTT 1 TTTATT TTTATT TTTATT TTTATT TTTATT TTTATT TTTATT TTTA-TT * * 20939269 TCTTATA TTTA-T TTTGTT TTTATT TTT 1 T-TTATT TTTATT TTTATT TTTATT TTT 20939296 TGAACTATAT Statistics Matches: 59, Mismatches: 8, Indels: 6 0.81 0.11 0.08 Matches are distributed among these distances: 5 3 0.05 6 49 0.83 7 4 0.07 8 3 0.05 ACGTcount: A:0.16, C:0.01, G:0.04, T:0.79 Consensus pattern (6 bp): TTTATT Found at i:20942013 original size:14 final size:14 Alignment explanation

Indices: 20941990--20942020  Score: 53
Period size: 14  Copynumber: 2.2  Consensus size: 14

  20941980 TGTTGCGTTC

                *
  20941990 TTAGTTAATTTAGA
         1 TTAGTAAATTTAGA

  20942004 TTAGTAAATTTAGA
         1 TTAGTAAATTTAGA

  20942018 TTA
         1 TTA

  20942021 AAAATAATCA

Statistics
Matches: 16, Mismatches: 1, Indels: 0
   0.94         0.06       0.00

Matches are distributed among these distances:
  14   16  1.00

ACGTcount: A:0.39, C:0.00, G:0.13, T:0.48

Consensus pattern (14 bp):
TTAGTAAATTTAGA


Found at i:20946101 original size:26 final size:26

Alignment explanation

Indices: 20946046--20946116  Score: 90
Period size: 26  Copynumber: 2.7  Consensus size: 26

  20946036 TCATCCTTAT

                   **               *
  20946046 TTTACCCCCAGTAAAATTTTGATAGT
         1 TTTACCCCTGGTAAAATTTTGATAGA

  20946072 TTTACCCCTGGTAAAATTTTGAT-GAA
         1 TTTACCCCTGGTAAAATTTTGATAG-A

                     *
  20946098 TTTACCCCTGATAAAATTT
         1 TTTACCCCTGGTAAAATTT

  20946117 CAAGAAAATA

Statistics
Matches: 40, Mismatches: 4, Indels: 2
   0.87         0.09       0.04

Matches are distributed among these distances:
  25    1  0.03
  26   39  0.98

ACGTcount: A:0.31, C:0.18, G:0.11, T:0.39

Consensus pattern (26 bp):
TTTACCCCTGGTAAAATTTTGATAGA


Found at i:20954301 original size:5 final size:5

Alignment explanation

Indices: 20954291--20954319  Score: 58
Period size: 5  Copynumber: 5.8  Consensus size: 5

  20954281 ATAACATTTT

  20954291 TATAA TATAA TATAA TATAA TATAA TATA
         1 TATAA TATAA TATAA TATAA TATAA TATA

  20954320 TTTATATTAA

Statistics
Matches: 24, Mismatches: 0, Indels: 0
   1.00         0.00       0.00

Matches are distributed among these distances:
   5   24  1.00

ACGTcount: A:0.59, C:0.00, G:0.00, T:0.41

Consensus pattern (5 bp):
TATAA


Found at i:20956364 original size:20 final size:20

Alignment explanation

Indices: 20956336--20956380  Score: 81
Period size: 20  Copynumber: 2.2  Consensus size: 20

  20956326 AGGTATTAGC

               *
  20956336 TTTTACAAGCTTTTAGAGAA
         1 TTTTGCAAGCTTTTAGAGAA

  20956356 TTTTGCAAGCTTTTAGAGAA
         1 TTTTGCAAGCTTTTAGAGAA

  20956376 TTTTG
         1 TTTTG

  20956381 TGTGGGTTAT

Statistics
Matches: 24, Mismatches: 1, Indels: 0
   0.96         0.04       0.00

Matches are distributed among these distances:
  20   24  1.00

ACGTcount: A:0.29, C:0.09, G:0.18, T:0.44

Consensus pattern (20 bp):
TTTTGCAAGCTTTTAGAGAA


Found at i:20957453 original size:9 final size:9

Alignment explanation

Indices: 20957439--20957487  Score: 53
Period size: 9  Copynumber: 5.4  Consensus size: 9

  20957429 GATTAAATCA

  20957439 CTTCCTCTT
         1 CTTCCTCTT

                  *
  20957448 CTTCCTCCT
         1 CTTCCTCTT

            *     *
  20957457 CCTCCTCCT
         1 CTTCCTCTT

            *
  20957466 CCTCCTCTT
         1 CTTCCTCTT

               *
  20957475 CTTCTTCTT
         1 CTTCCTCTT

  20957484 CTTC
         1 CTTC

  20957488 TTCTCTCGTT

Statistics
Matches: 35, Mismatches: 5, Indels: 0
   0.88         0.12       0.00

Matches are distributed among these distances:
   9   35  1.00

ACGTcount: A:0.00, C:0.51, G:0.00, T:0.49

Consensus pattern (9 bp):
CTTCCTCTT


Found at i:20960609 original size:133 final size:134

Alignment explanation

Indices: 20960370--20960631 Score: 312 Period size: 133 Copynumber: 2.0 Consensus size: 134 20960360 GTGTTTACTT * ** * ** * 20960370 CTTGTCTTACTACACCTATCAAAATGGCATTGAGATGGTACAACTTTTAGCCATATATTTAGGAT 1 CTTGTCTTACTACACCTATCAAAATGACATCAAGATGGTAAAACTTCAAGCCATATAATTAGGAT * * * * ** * * 20960435 GACGTGTATTTTTCATTTGGGAAATGTTCTCCTTTTTGGTATCCATGCTGTATATAAATCATGAC 66 CACGTGTATTTTTCATTTGGGAAATATTCGCCTTCTTGGTATCCACACTCTAGATAAATCATGAC 20960500 ATGA 131 ATGA * * * * 20960504 CTTGTCTTACTA-ACCTATCAAGATGACATCAAGGTGGTAAAACTTCAAGCCTTGTAATTAGGAT 1 CTTGTCTTACTACACCTATCAAAATGACATCAAGATGGTAAAACTTCAAGCCATATAATTAGGAT * * 20960568 CATGTGTAGTTTTT-ATTTGGGAAATATTGGCCTTCTTGGTATCCACACTCTAGATAAATCATGA 66 CACGTGTA-TTTTTCATTTGGGAAATATTCGCCTTCTTGGTATCCACACTCTAGATAAATCATGA 20960632 GAATGCATCT Statistics Matches: 106, Mismatches: 21, Indels: 3 0.82 0.16 0.02 Matches are distributed among these distances: 133 89 0.84 134 17 0.16 ACGTcount: A:0.29, C:0.17, G:0.18, T:0.37 Consensus pattern (134 bp): CTTGTCTTACTACACCTATCAAAATGACATCAAGATGGTAAAACTTCAAGCCATATAATTAGGAT CACGTGTATTTTTCATTTGGGAAATATTCGCCTTCTTGGTATCCACACTCTAGATAAATCATGAC ATGA Found at i:20971272 original size:85 final size:85 Alignment explanation

Indices: 20971127--20971297 Score: 209 Period size: 85 Copynumber: 2.0 Consensus size: 85 20971117 AACGCTCCAC * * * * * * * 20971127 ATACTTGTGTTGTTGCAAGTATGTTACAGGATCATCTGGAATTGGATTTTGACCTAATCTCTGAC 1 ATACTTGTGTTGTTGCAAGTATGTCACAAGATCATCTGAAATTGGATTCTAACATAATCTCTAAC * * * 20971192 ATTGTCTTACTTATTTTGAA 66 ATTATCTTACCTATGTTGAA * * * 20971212 ATACTTGTGTTGTTGCAGGTATGTCACAAGATCA-CTCGAAATTGGATTCTAAGATGATCTCTAA 1 ATACTTGTGTTGTTGCAAGTATGTCACAAGATCATCT-GAAATTGGATTCTAACATAATCTCTAA 20971276 CATTATCTTACCTATGTTGAA 65 CATTATCTTACCTATGTTGAA 20971297 A 1 A 20971298 GCGAGTCCTA Statistics Matches: 72, Mismatches: 13, Indels: 2 0.83 0.15 0.02 Matches are distributed among these distances: 84 2 0.03 85 70 0.97 ACGTcount: A:0.27, C:0.15, G:0.18, T:0.39 Consensus pattern (85 bp): ATACTTGTGTTGTTGCAAGTATGTCACAAGATCATCTGAAATTGGATTCTAACATAATCTCTAAC ATTATCTTACCTATGTTGAA Found at i:20972440 original size:5 final size:5 Alignment explanation

Indices: 20972432--20972483 Score: 59 Period size: 5 Copynumber: 9.8 Consensus size: 5 20972422 CCTAATCTAA * * 20972432 AAAAG AAAAG AAAAAG AAAAAG AAAAAG AGAAG AGAAG AAAAG AAAAG 1 AAAAG AAAAG -AAAAG -AAAAG -AAAAG AAAAG AAAAG AAAAG AAAAG 20972480 AAAA 1 AAAA 20972484 AGCCCTAACC Statistics Matches: 44, Mismatches: 2, Indels: 2 0.92 0.04 0.04 Matches are distributed among these distances: 5 27 0.61 6 17 0.39 ACGTcount: A:0.79, C:0.00, G:0.21, T:0.00 Consensus pattern (5 bp): AAAAG Found at i:20972446 original size:6 final size:6 Alignment explanation

Indices: 20972431--20972485 Score: 60 Period size: 6 Copynumber: 9.0 Consensus size: 6 20972421 ACCTAATCTA * 20972431 AAAAAG -AAAAG AAAAAG AAAAAG AAAAAG AGAAGAG AAGAAA- AGAAAAG 1 AAAAAG AAAAAG AAAAAG AAAAAG AAAAAG A-AAAAG AA-AAAG A-AAAAG 20972480 AAAAAG 1 AAAAAG 20972486 CCCTAACCCT Statistics Matches: 42, Mismatches: 2, Indels: 10 0.78 0.04 0.19 Matches are distributed among these distances: 5 5 0.12 6 28 0.67 7 9 0.21 ACGTcount: A:0.78, C:0.00, G:0.22, T:0.00 Consensus pattern (6 bp): AAAAAG Found at i:20976830 original size:25 final size:26 Alignment explanation

Indices: 20976802--20976850 Score: 73 Period size: 26 Copynumber: 1.9 Consensus size: 26 20976792 CAACAAAACT * 20976802 AAAATCTAACCT-AAAAAGAAAAGAG 1 AAAACCTAACCTAAAAAAGAAAAGAG * 20976827 AAAACCTAATCTAAAAAAGAAAAG 1 AAAACCTAACCTAAAAAAGAAAAG 20976851 TAAAAGAAAA Statistics Matches: 21, Mismatches: 2, Indels: 1 0.88 0.08 0.04 Matches are distributed among these distances: 25 10 0.48 26 11 0.52 ACGTcount: A:0.65, C:0.12, G:0.10, T:0.12 Consensus pattern (26 bp): AAAACCTAACCTAAAAAAGAAAAGAG Found at i:20976855 original size:25 final size:24 Alignment explanation

Indices: 20976796--20976855 Score: 66 Period size: 25 Copynumber: 2.4 Consensus size: 24 20976786 AGGAAACAAC * * 20976796 AAAACTAAAATCTAACCTAAAAAG 1 AAAAGTAAAACCTAACCTAAAAAG * * 20976820 AAAAGAGAAAACCTAATCTAAAAAAG 1 AAAAG-TAAAACCTAACCT-AAAAAG 20976846 AAAAGTAAAA 1 AAAAGTAAAA 20976856 GAAAAGAAAA Statistics Matches: 29, Mismatches: 5, Indels: 3 0.78 0.14 0.08 Matches are distributed among these distances: 24 4 0.14 25 14 0.48 26 11 0.38 ACGTcount: A:0.67, C:0.12, G:0.08, T:0.13 Consensus pattern (24 bp): AAAAGTAAAACCTAACCTAAAAAG Found at i:20979014 original size:79 final size:75 Alignment explanation

Indices: 20978915--20979201 Score: 307 Period size: 75 Copynumber: 3.8 Consensus size: 75 20978905 TTTTCATTGC * * * * * 20978915 TTTTA-TTTTTTGGTAAATTGTAAATAAGTATATATTATTGATGTAAAATATAGTAGGATATAAT 1 TTTTATTTTTTTCGTAGATTGTAAATAAGTATATATTATTGATATAAAATACAATA-GATATAAT 20978979 ATAAACAATTTTT 65 ATAAA-AA-TTTT * * * 20978992 TTTTCATTTTTTTCGTATATTGTAAATAAGTATATATTATTGATATAAAATACAATA-AAATAAA 1 TTTT-ATTTTTTTCGTAGATTGTAAATAAGTATATATTATTGATATAAAATACAATAGATATAAT * * 20979056 ATACAAATAATT 65 ATAAAAAT-TTT ** * * 20979068 TTTTAAATTTTTCGTAGATTGTAAATAAATATATATTATTGATATAAAATACAATAGAATACAAT 1 TTTTATTTTTTTCGTAGATTGTAAATAAGTATATATTATTGATATAAAATACAATAG-ATATAAT * 20979133 ACAAATAA-TTT 65 ATAAA-AATTTT * * 20979144 TTCTATTGTTTTT-GTAGATTGTAAATAAG--TATATTATTGATGTAAAATACAATAGATA 1 TTTTATT-TTTTTCGTAGATTGTAAATAAGTATATATTATTGATATAAAATACAATAGATA 20979202 AACTTTTTTT Statistics Matches: 179, Mismatches: 24, Indels: 18 0.81 0.11 0.08 Matches are distributed among these distances: 73 3 0.02 74 25 0.14 75 49 0.27 76 29 0.16 77 25 0.14 78 3 0.02 79 45 0.25 ACGTcount: A:0.43, C:0.04, G:0.09, T:0.44 Consensus pattern (75 bp): TTTTATTTTTTTCGTAGATTGTAAATAAGTATATATTATTGATATAAAATACAATAGATATAATA TAAAAATTTT Found at i:20984138 original size:16 final size:16 Alignment explanation

Indices: 20984119--20984152  Score: 50
Period size: 16  Copynumber: 2.1  Consensus size: 16

  20984109 AAATGAATGT

                *    *
  20984119 ATGTGTAAATGTGATA
         1 ATGTGCAAATATGATA

  20984135 ATGTGCAAATATGATA
         1 ATGTGCAAATATGATA

  20984151 AT
         1 AT

  20984153 TGGATATTGT

Statistics
Matches: 16, Mismatches: 2, Indels: 0
   0.89         0.11       0.00

Matches are distributed among these distances:
  16   16  1.00

ACGTcount: A:0.41, C:0.03, G:0.21, T:0.35

Consensus pattern (16 bp):
ATGTGCAAATATGATA


Found at i:20990109 original size:12 final size:13

Alignment explanation

Indices: 20990069--20990110 Score: 68 Period size: 14 Copynumber: 3.2 Consensus size: 13 20990059 ATTTCAACTT 20990069 TTATATATTATTA 1 TTATATATTATTA 20990082 TGTATATATTATTA 1 T-TATATATTATTA 20990096 TT-TATATTATTA 1 TTATATATTATTA 20990108 TTA 1 TTA 20990111 AGTTTTTCTT Statistics Matches: 27, Mismatches: 0, Indels: 4 0.87 0.00 0.13 Matches are distributed among these distances: 12 12 0.44 13 2 0.07 14 13 0.48 ACGTcount: A:0.36, C:0.00, G:0.02, T:0.62 Consensus pattern (13 bp): TTATATATTATTA Found at i:20996862 original size:50 final size:50 Alignment explanation

Indices: 20996803--20997366 Score: 341 Period size: 50 Copynumber: 11.3 Consensus size: 50 20996793 GTAGTGAAAC 20996803 AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA 1 AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA * * * * * ** * 20996853 AGATTAAAGCCAAAAT-AGCGAATCTTAC-TCCCCAAGCGA-TGCAGCGAAAC 1 AGATTAAAGCC-ACATCGGTGAATCTTGCTTCCCC-GGC-ATTGCAGTTAAAA * * * 20996903 AGATTAAAGCCACAACGGTGAATCTTGTTTCCCCGACATTGCAGTTAAAA 1 AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA * * * * * * * * * ** * 20996953 AGATTAAAGCTACAGCAGCGAATCTTACCTCTCAGGCAGTGCA-ACAGAAT 1 AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTA-AAA * * * ** * 20997003 AGATTAAAGCCACAACGTTGAATCTTGCCTCCATGACATTGCAGTTAAAA 1 AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA * * * * * * *** * 20997053 AGATTAAAGCTACAAT-AGCGAATCTTACTTCCCAGGCGA-TGTAGCGGAAC 1 AGATTAAAGCCAC-ATCGGTGAATCTTGCTTCCCCGGC-ATTGCAGTTAAAA * * 20997103 ATATTAAAGCCACAACGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA 1 AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA * * * * * * ** * ** 20997153 AGACTAAAGCCACATCGGCGAGTGTTACTTCCCTGGCAACGCAGTGAAGC 1 AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA * * * * 20997203 AGACTAAAGCCACAAT-GTTGAATCTTGCTTCCCCGACATTGTAGTTAAAA 1 AGATTAAAGCCAC-ATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA * * * * * * * * ** * ** 20997253 ATACTAAAGCCACAGCGGCGAGTGTTACTTCCCTGGCAACGCAGTGAAGC 1 AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA * * * * 20997303 AGACTAAAGCCACAACGGTGAATCTTGCTTTCCCGACATTGCAGTTAAAA 1 AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA * * 20997353 GGATTAAAGTCACA 1 AGATTAAAGCCACA 20997367 ATGACGAATC Statistics Matches: 368, Mismatches: 132, Indels: 28 0.70 0.25 0.05 Matches are distributed among these distances: 49 12 0.03 50 342 0.93 51 14 0.04 ACGTcount: A:0.34, C:0.24, G:0.20, T:0.23 Consensus pattern (50 bp): AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCGGCATTGCAGTTAAAA Found at i:20996947 original size:100 final size:100 Alignment explanation

Indices: 20996774--20997462 Score: 638 Period size: 100 Copynumber: 6.9 Consensus size: 100 20996764 AGAATTACAG * * * * 20996774 ATCTTA-TCTCCCAAGCGATGTAGTGAAACAGATTAAAGCCACATCGGTGAATCTTGCTTCCCCG 1 ATCTTACTC-CCCAGGCGATGCAGCGAAACAGATTAAAGCCACAACGGTGAATCTTGCTTCCCCG * * 20996838 GCATTGCAGTTAAAAAGATTAAAGCCAAAATAGCGA 65 ACATTGCAGTTAAAAAGATTAAAGCCACAATAGCGA * * 20996874 ATCTTACTCCCCAAGCGATGCAGCGAAACAGATTAAAGCCACAACGGTGAATCTTGTTTCCCCGA 1 ATCTTACTCCCCAGGCGATGCAGCGAAACAGATTAAAGCCACAACGGTGAATCTTGCTTCCCCGA * ** 20996939 CATTGCAGTTAAAAAGATTAAAGCTACAGCAGCGA 66 CATTGCAGTTAAAAAGATTAAAGCCACAATAGCGA * * * * * * 20996974 ATCTTAC-CTCTCAGGC-AGTGCAAC-AGAATAGATTAAAGCCACAACGTTGAATCTTGCCTCCA 1 ATCTTACTC-CCCAGGCGA-TGCAGCGA-AACAGATTAAAGCCACAACGGTGAATCTTGCTTCCC * * 20997036 TGACATTGCAGTTAAAAAGATTAAAGCTACAATAGCGA 63 CGACATTGCAGTTAAAAAGATTAAAGCCACAATAGCGA * * * * * 20997074 ATCTTACTTCCCAGGCGATGTAGCGGAACATATTAAAGCCACAACGGTGAATCTTGCTTCCCCGG 1 ATCTTACTCCCCAGGCGATGCAGCGAAACAGATTAAAGCCACAACGGTGAATCTTGCTTCCCCGA * * 20997139 CATTGCAGTTAAAAAGACTAAAGCCAC-ATCGGCGA 66 CATTGCAGTTAAAAAGATTAAAGCCACAAT-AGCGA * * * * * * * * * * * 20997174 GTGTTACTTCCCTGGCAACGCAGTGAAGCAGACTAAAGCCACAATGTTGAATCTTGCTTCCCCGA 1 ATCTTACTCCCCAGGCGATGCAGCGAAACAGATTAAAGCCACAACGGTGAATCTTGCTTCCCCGA * * * *** 20997239 CATTGTAGTTAAAAATACTAAAGCCACAGCGGCGA 66 CATTGCAGTTAAAAAGATTAAAGCCACAATAGCGA * * * * * * * * * * 20997274 GTGTTACTTCCCTGGCAACGCAGTGAAGCAGACTAAAGCCACAACGGTGAATCTTGCTTTCCCGA 1 ATCTTACTCCCCAGGCGATGCAGCGAAACAGATTAAAGCCACAACGGTGAATCTTGCTTCCCCGA * * 20997339 CATTGCAGTTAAAAGGATTAAAGTCACAAT-GACGA 66 CATTGCAGTTAAAAAGATTAAAGCCACAATAG-CGA * * * * * * * * * 20997374 ATCTTACTCTCTCA-GCTAGTGCA-CTAGAGCAGATTGAAGCTACAACGGCGAATCTTGATTCCT 1 ATCTTACTC-CCCAGGCGA-TGCAGCGA-AACAGATTAAAGCCACAACGGTGAATCTTGCTTCCC ** * 20997437 CGACATCACAGTTAAACAGATTAAAG 63 CGACATTGCAGTTAAAAAGATTAAAG 20997463 TAACGTGTTA Statistics Matches: 493, Mismatches: 83, Indels: 25 0.82 0.14 0.04 Matches are distributed among these distances: 99 6 0.01 100 428 0.87 101 59 0.12 
ACGTcount: A:0.33, C:0.24, G:0.20, T:0.23 Consensus pattern (100 bp): ATCTTACTCCCCAGGCGATGCAGCGAAACAGATTAAAGCCACAACGGTGAATCTTGCTTCCCCGA CATTGCAGTTAAAAAGATTAAAGCCACAATAGCGA Found at i:21000191 original size:5 final size:6 Alignment explanation

Indices: 21000140--21000196 Score: 60 Period size: 6 Copynumber: 9.2 Consensus size: 6 21000130 ACCAATTTAG * * * * 21000140 TTTATC TTTATT TTTCTT TTTATT TCTATT TGTAGTGT TTTATT TTTATT 1 TTTATT TTTATT TTTATT TTTATT TTTATT TTTA-T-T TTTATT TTTATT 21000190 TTTATT T 1 TTTATT T 21000197 GAATTCCAAT Statistics Matches: 43, Mismatches: 6, Indels: 4 0.81 0.11 0.08 Matches are distributed among these distances: 6 37 0.86 7 2 0.05 8 4 0.09 ACGTcount: A:0.14, C:0.05, G:0.05, T:0.75 Consensus pattern (6 bp): TTTATT Found at i:21000229 original size:3 final size:3 Alignment explanation

Indices: 21000221--21000289 Score: 111 Period size: 3 Copynumber: 23.0 Consensus size: 3 21000211 TTTTAATCCC *** 21000221 ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT CCC ATT ATT ATT 1 ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT 21000269 ATT ATT ATT ATT ATT ATT ATT 1 ATT ATT ATT ATT ATT ATT ATT 21000290 TCTTCTTTAT Statistics Matches: 60, Mismatches: 6, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 3 60 1.00 ACGTcount: A:0.32, C:0.04, G:0.00, T:0.64 Consensus pattern (3 bp): ATT Found at i:21000405 original size:23 final size:22 Alignment explanation

Indices: 21000378--21000437 Score: 59 Period size: 23 Copynumber: 2.6 Consensus size: 22 21000368 TGACATTACC * 21000378 TTTAAACTTATTATTATTATAAT 1 TTTAAA-TTATTATTACTATAAT * * 21000401 TTTAAAATGTAATA-TACTATTAT 1 TTT-AAAT-TATTATTACTATAAT 21000424 TTTAAATTATTATT 1 TTTAAATTATTATT 21000438 CACATGTTTT Statistics Matches: 30, Mismatches: 4, Indels: 7 0.73 0.10 0.17 Matches are distributed among these distances: 21 4 0.13 22 5 0.17 23 14 0.47 24 7 0.23 ACGTcount: A:0.40, C:0.03, G:0.02, T:0.55 Consensus pattern (22 bp): TTTAAATTATTATTACTATAAT Found at i:21004748 original size:6 final size:6 Alignment explanation

Indices: 21004716--21004796 Score: 76 Period size: 6 Copynumber: 13.7 Consensus size: 6 21004706 AGCTCAAAAA * * * * ** 21004716 AAAATC CAAATC CAAATC CAAATC AAAAAC AAAA-- AAGAAGA AAAATC 1 AAAATC AAAATC AAAATC AAAATC AAAATC AAAATC AA-AATC AAAATC * 21004763 AAAATC AAAATC AAAATC AAAATC AAAATG AAAA 1 AAAATC AAAATC AAAATC AAAATC AAAATC AAAA 21004797 AGAAAAAAGA Statistics Matches: 66, Mismatches: 6, Indels: 6 0.85 0.08 0.08 Matches are distributed among these distances: 4 2 0.03 5 2 0.03 6 60 0.91 7 2 0.03 ACGTcount: A:0.68, C:0.16, G:0.04, T:0.12 Consensus pattern (6 bp): AAAATC Found at i:21005675 original size:22 final size:21 Alignment explanation

Indices: 21005634--21005694 Score: 50 Period size: 22 Copynumber: 2.7 Consensus size: 21 21005624 GAAAAATAAA * * 21005634 AAAAATAAAAAAAGAGAAAAACG 1 AAAAAT-AAAAATGA-AAAAAGG * * 21005657 GAAAAATAAAAATGAAAAAATG 1 AAAAA-TAAAAATGAAAAAAGG 21005679 AAAAAGTAAAAATGAA 1 AAAAA-TAAAAATGAA 21005695 GAGGCTAAGG Statistics Matches: 32, Mismatches: 5, Indels: 3 0.80 0.12 0.08 Matches are distributed among these distances: 22 20 0.62 23 11 0.34 24 1 0.03 ACGTcount: A:0.75, C:0.02, G:0.13, T:0.10 Consensus pattern (21 bp): AAAAATAAAAATGAAAAAAGG Found at i:21005689 original size:14 final size:15 Alignment explanation

Indices: 21005634--21005694 Score: 54 Period size: 14 Copynumber: 4.1 Consensus size: 15 21005624 GAAAAATAAA * * 21005634 AAAAATAAAAAAAGAG 1 AAAAATGAAAAAA-TG * * 21005650 AAAAACGGAAAAAT- 1 AAAAATGAAAAAATG 21005664 AAAAATGAAAAAATG 1 AAAAATGAAAAAATG * 21005679 AAAAA-GTAAAAATG 1 AAAAATGAAAAAATG 21005693 AA 1 AA 21005695 GAGGCTAAGG Statistics Matches: 37, Mismatches: 7, Indels: 4 0.77 0.15 0.08 Matches are distributed among these distances: 14 22 0.59 15 5 0.14 16 10 0.27 ACGTcount: A:0.75, C:0.02, G:0.13, T:0.10 Consensus pattern (15 bp): AAAAATGAAAAAATG Found at i:21007069 original size:44 final size:44 Alignment explanation

Indices: 21006987--21007362 Score: 210 Period size: 44 Copynumber: 8.1 Consensus size: 44 21006977 ACCTTCAAGT * * * * 21006987 TGGAGCAGATTGAAAGCCAGAAATCTTATCTCCTTGAGATTACAG 1 TGGAGTAGATTG-AAGCTAGAAATCCTATCTCCCTGAGATTACAG * * 21007032 CGGAGTAGATTGAAGCTAGTAATCCTATCTCCCTGAGATTACAG 1 TGGAGTAGATTGAAGCTAGAAATCCTATCTCCCTGAGATTACAG ** ** * * 21007076 TGGAGCGGATT-AAAATA-AAGGATCTTATCTCTCTGA-AGTTACAG 1 TGGAGTAGATTGAAGCTAGAA--ATCCTATCTCCCTGAGA-TTACAG * * 21007120 TAGAGTAGATCACATCAGATTCAAGCCAGAAAT-CTATCTCCCTGAGATTACAG 1 T-G-G-AG------T-AGATTGAAGCTAGAAATCCTATCTCCCTGAGATTACAG * * 21007173 CGGAGTAGATTGAAGCTAGTAATCCTATCTCCCTGAGATTACAG 1 TGGAGTAGATTGAAGCTAGAAATCCTATCTCCCTGAGATTACAG * ** * * 21007217 TGGA-TCGGATT-AAAATA-AAGGATCTTATCTCTCTGA-AGTTACAG 1 TGGAGT-AGATTGAAGCTAGAA--ATCCTATCTCCCTGAGA-TTACAG * * 21007261 TAGAGTAGATCGCATTAGATTGAAGCCAGAAATCTTATCTCCCTGAGATTACAG 1 T---G--GA--G---TAGATTGAAGCTAGAAATCCTATCTCCCTGAGATTACAG * * * 21007315 TGGAGCAGATTGAAGCTAGTAATCCTATCTCCCTGAGATTATAG 1 TGGAGTAGATTGAAGCTAGAAATCCTATCTCCCTGAGATTACAG 21007359 TGGA 1 TGGA 21007363 ATGGATTAAA Statistics Matches: 254, Mismatches: 42, Indels: 71 0.69 0.11 0.19 Matches are distributed among these distances: 42 2 0.01 43 25 0.10 44 142 0.56 45 11 0.04 46 1 0.00 47 4 0.02 49 4 0.02 50 2 0.01 51 2 0.01 52 1 0.00 53 16 0.06 54 32 0.13 55 8 0.03 56 4 0.02 ACGTcount: A:0.32, C:0.18, G:0.22, T:0.28 Consensus pattern (44 bp): TGGAGTAGATTGAAGCTAGAAATCCTATCTCCCTGAGATTACAG Found at i:21007267 original size:141 final size:142 Alignment explanation

Indices: 21007000--21007420 Score: 747 Period size: 141 Copynumber: 3.0 Consensus size: 142 21006990 AGCAGATTGA * 21007000 AAGCCAGAAATCTTATCTCCTTGAGATTACAGCGGAGTAGATTGAAGCTAGTAATCCTATCTCCC 1 AAGCCAGAAATCTTATCTCCCTGAGATTACAGCGGAGTAGATTGAAGCTAGTAATCCTATCTCCC * 21007065 TGAGATTACAGTGGAGCGGATTAAAATAAAGGATCTTATCTCTCTGAAGTTACAGTAGAGTAGAT 66 TGAGATTACAGTGGATCGGATTAAAATAAAGGATCTTATCTCTCTGAAGTTACAGTAGAGTAGAT * 21007130 CACATCAGATTC 131 CGCATCAGATTC 21007142 AAGCCAGAAATC-TATCTCCCTGAGATTACAGCGGAGTAGATTGAAGCTAGTAATCCTATCTCCC 1 AAGCCAGAAATCTTATCTCCCTGAGATTACAGCGGAGTAGATTGAAGCTAGTAATCCTATCTCCC 21007206 TGAGATTACAGTGGATCGGATTAAAATAAAGGATCTTATCTCTCTGAAGTTACAGTAGAGTAGAT 66 TGAGATTACAGTGGATCGGATTAAAATAAAGGATCTTATCTCTCTGAAGTTACAGTAGAGTAGAT * * 21007271 CGCATTAGATTG 131 CGCATCAGATTC * * 21007283 AAGCCAGAAATCTTATCTCCCTGAGATTACAGTGGAGCAGATTGAAGCTAGTAATCCTATCTCCC 1 AAGCCAGAAATCTTATCTCCCTGAGATTACAGCGGAGTAGATTGAAGCTAGTAATCCTATCTCCC * 21007348 TGAGATTATAGTGGAAT-GGATTAAAATAAAGGATCTTATCTCTCTGAAGTTACAGTAGAGTAGA 66 TGAGATTACAGTGG-ATCGGATTAAAATAAAGGATCTTATCTCTCTGAAGTTACAGTAGAGTAGA 21007412 TCGCATCAG 130 TCGCATCAG 21007421 GTCTTATCTC Statistics Matches: 268, Mismatches: 9, Indels: 4 0.95 0.03 0.01 Matches are distributed among these distances: 141 136 0.51 142 130 0.49 143 2 0.01 ACGTcount: A:0.33, C:0.18, G:0.21, T:0.28 Consensus pattern (142 bp): AAGCCAGAAATCTTATCTCCCTGAGATTACAGCGGAGTAGATTGAAGCTAGTAATCCTATCTCCC TGAGATTACAGTGGATCGGATTAAAATAAAGGATCTTATCTCTCTGAAGTTACAGTAGAGTAGAT CGCATCAGATTC Found at i:21007577 original size:50 final size:50 Alignment explanation

Indices: 21007496--21008148 Score: 267 Period size: 50 Copynumber: 13.2 Consensus size: 50 21007486 GCAGTGGAAC * * * * * 21007496 AGATTAAAGCCACAAC-GATGAATTTTGCCTCCCTGACATTGCAATTAAAA 1 AGATTAAAGCTACAGCAG-TGAATCTTACCTCCCTGACATTGCAGTTAAAA * ** * ** * 21007546 AGATTAAAGCTACAGCAGCGAATCTTACCTCCCTGGTAGTGCAGTGGAAC 1 AGATTAAAGCTACAGCAGTGAATCTTACCTCCCTGACATTGCAGTTAAAA * * * * 21007596 AGATTAAAGCCACATCGGTGAATCTTGCCTCCCTGACATTGCAGTTAAAA 1 AGATTAAAGCTACAGCAGTGAATCTTACCTCCCTGACATTGCAGTTAAAA * * *** * 21007646 AGATTAAAGCTACAGCAGTGAATCTTA-CTCCCCAGGCGA-TGCAGCGGAAT 1 AGATTAAAGCTACAGCAGTGAATCTTACCT-CCCTGAC-ATTGCAGTTAAAA * * * * * 21007696 AGATTAAAGCCACATCGGTGAATCTTGCCTCCCTAACATTGCAGTTAAAA 1 AGATTAAAGCTACAGCAGTGAATCTTACCTCCCTGACATTGCAGTTAAAA * * * * * ** ** ** 21007746 AGATTAAAGCTACAACAGCGAATCTTACTTCTC-AAGCGGTGCAGTGGAGC 1 AGATTAAAGCTACAGCAGTGAATCTTACCTCCCTGA-CATTGCAGTTAAAA * * * * * * * 21007796 AGATTAAAGCCACATCGGTGAATCTTGCTTCCCCG--A---CA-TT--AC 1 AGATTAAAGCTACAGCAGTGAATCTTACCTCCCTGACATTGCAGTTAAAA * * * * * * * * ** * 21007838 AGATTAAAGCCACAGCGGCGAATCCTACTTCCTTGGCGA-TGCCGTGGAAC 1 AGATTAAAGCTACAGCAGTGAATCTTACCTCCCTGAC-ATTGCAGTTAAAA * * * * * *** 21007888 AGATTAAAGCCACAACGGTGAATCTT-GCTTCCTCGGTGTTGCAGTTAAAA 1 AGATTAAAGCTACAGCAGTGAATCTTACCTCCCT-GACATTGCAGTTAAAA * * * * ** ** * 21007938 AGATTAAAGCTACAACAGCGAATCTTACCTCCCAGACAGTATAGTGGAAC 1 AGATTAAAGCTACAGCAGTGAATCTTACCTCCCTGACATTGCAGTTAAAA * * * * * * 21007988 AGATTAAAGCCACAACGGTGAATCTTGCCTCCATGAAATTGCAGTTAAAA 1 AGATTAAAGCTACAGCAGTGAATCTTACCTCCCTGACATTGCAGTTAAAA * * * * * * * * * * 21008038 TGATTAAAGCTATAACGGCGAATCTTACATTCCCAGGCAGTGCAG-TAAAGC 1 AGATTAAAGCTACAGCAGTGAATCTTAC-CTCCCTGACATTGCAGTTAAA-A * * * * * 21008089 AGATTAAAGCCACAACGGTGAATCTTGCTTCCAC-GACATTGCAGTTAAAA 1 AGATTAAAGCTACAGCAGTGAATCTTACCTCC-CTGACATTGCAGTTAAAA 21008139 AGATTAAAGC 1 AGATTAAAGC 21008149 CACAATGGCG Statistics Matches: 445, Mismatches: 136, Indels: 44 0.71 0.22 0.07 Matches are distributed among these distances: 42 30 0.07 44 1 0.00 45 3 0.01 47 1 0.00 48 1 0.00 49 8 
0.02 50 355 0.80 51 46 0.10 ACGTcount: A:0.33, C:0.23, G:0.20, T:0.24 Consensus pattern (50 bp): AGATTAAAGCTACAGCAGTGAATCTTACCTCCCTGACATTGCAGTTAAAA Found at i:21007644 original size:100 final size:100 Alignment explanation

Indices: 21007479--21008166 Score: 774 Period size: 100 Copynumber: 7.0 Consensus size: 100 21007469 CTTACCACTT * * * * 21007479 AGGCGGTGCAGTGGAACAGATTAAAGCCACAACGATGAATTTTGCCTCCCTGACATTGCAATTAA 1 AGGCAGTGCAGTGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCCTGACATTGCAGTTAA * 21007544 AAAGATTAAAGCTACAGCAGCGAATCTTACCTCCC 66 AAAGATTAAAGCTACAACAGCGAATCTTACCTCCC * * * 21007579 TGGTAGTGCAGTGGAACAGATTAAAGCCACATCGGTGAATCTTGCCTCCCTGACATTGCAGTTAA 1 AGGCAGTGCAGTGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCCTGACATTGCAGTTAA * * 21007644 AAAGATTAAAGCTACAGCAGTGAATCTTA-CTCCCC 66 AAAGATTAAAGCTACAACAGCGAATCTTACCT-CCC * * * * 21007679 AGGC-GATGCAGCGGAATAGATTAAAGCCACATCGGTGAATCTTGCCTCCCTAACATTGCAGTTA 1 AGGCAG-TGCAGTGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCCTGACATTGCAGTTA * * 21007743 AAAAGATTAAAGCTACAACAGCGAATCTTACTTCTC 65 AAAAGATTAAAGCTACAACAGCGAATCTTACCTCCC * * * * * * 21007779 AAGCGGTGCAGTGGAGCAGATTAAAGCCACATCGGTGAATCTTGCTTCCCCG--A---CA-TT-- 1 AGGCAGTGCAGTGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCCTGACATTGCAGTTAA * * * * * * * 21007836 ACAGATTAAAGCCACAGCGGCGAATCCTACTTCCT 66 AAAGATTAAAGCTACAACAGCGAATCTTACCTCCC * * * *** 21007871 TGGC-GATGCCGTGGAACAGATTAAAGCCACAACGGTGAATCTTG-CTTCCTCGGTGTTGCAGTT 1 AGGCAG-TGCAGTGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCCT-GACATTGCAGTT 21007934 AAAAAGATTAAAGCTACAACAGCGAATCTTACCTCCC 64 AAAAAGATTAAAGCTACAACAGCGAATCTTACCTCCC * ** * * 21007971 AGACAGTATAGTGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCATGAAATTGCAGTTAA 1 AGGCAGTGCAGTGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCCTGACATTGCAGTTAA * * * * 21008036 AATGATTAAAGCTATAACGGCGAATCTTACATTCCC 66 AAAGATTAAAGCTACAACAGCGAATCTTAC-CTCCC * * 21008072 AGGCAGTGCAGT-AAAGCAGATTAAAGCCACAACGGTGAATCTTGCTTCCAC-GACATTGCAGTT 1 AGGCAGTGCAGTGGAA-CAGATTAAAGCCACAACGGTGAATCTTGCCTCC-CTGACATTGCAGTT * ** 21008135 AAAAAGATTAAAGCCACAATGGCGAATCTTAC 64 AAAAAGATTAAAGCTACAACAGCGAATCTTAC 21008167 TTTCCAAATG Statistics Matches: 497, Mismatches: 72, Indels: 37 0.82 0.12 0.06 Matches are distributed among these distances: 91 4 0.01 92 66 0.13 94 2 0.00 95 2 0.00 97 2 0.00 98 3 0.01 99 3 
0.01 100 324 0.65 101 91 0.18 ACGTcount: A:0.33, C:0.23, G:0.21, T:0.23 Consensus pattern (100 bp): AGGCAGTGCAGTGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCCTGACATTGCAGTTAA AAAGATTAAAGCTACAACAGCGAATCTTACCTCCC Found at i:21007999 original size:292 final size:292 Alignment explanation

Indices: 21007546--21008137 Score: 802 Period size: 292 Copynumber: 2.0 Consensus size: 292 21007536 GCAATTAAAA * * * * 21007546 AGATTAAAGCTACAGCAGCGAATCTTACCTCCCTGGTAGTGCAGTGGAACAGATTAAAGCCACAT 1 AGATTAAAGCCACAGCAGCGAATCCTACCTCCCTGGGAGTGCAGTGGAACAGATTAAAGCCACAA * * 21007611 CGGTGAATCTTGCCTCCCTGACATTGCAGTTAAAAAGATTAAAGCTACAGCAGTGAATCTTACTC 66 CGGTGAATCTTGCCTCCCTGACATTGCAGTTAAAAAGATTAAAGCTACAACAGCGAATCTTACTC * * * * 21007676 CCCAGGCGATGCAGCGGAATAGATTAAAGCCACATCGGTGAATCTTGCCTCCCT-AACATTGCAG 131 CCCAGACGAT-CAGCGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCATGAA-ATTGCAG * * ** 21007740 TTAAAAAGATTAAAGCTACAACAGCGAATCTTAC-TTCTCAAGCGGTGCAGTGGAGCAGATTAAA 194 TTAAAAAGATTAAAGCTACAACAGCGAATCTTACATTCCCAAGCAGTGCAGTAAAGCAGATTAAA * * 21007804 GCCACATCGGTGAATCTTGCTTCCCCGACATTAC 259 GCCACAACGGTGAATCTTGCTTCCACGACATTAC * * * * 21007838 AGATTAAAGCCACAGCGGCGAATCCTACTTCCTTGGCGA-TGCCGTGGAACAGATTAAAGCCACA 1 AGATTAAAGCCACAGCAGCGAATCCTACCTCCCTGG-GAGTGCAGTGGAACAGATTAAAGCCACA * *** 21007902 ACGGTGAATCTTG-CTTCCTCGGTGTTGCAGTTAAAAAGATTAAAGCTACAACAGCGAATCTTAC 65 ACGGTGAATCTTGCCTCCCT-GACATTGCAGTTAAAAAGATTAAAGCTACAACAGCGAATCTTAC * 21007966 -CTCCCAGACAGTAT-AGTGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCATGAAATTG 129 TC-CCCAGAC-G-ATCAGCGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCATGAAATTG * * * * 21008029 CAGTTAAAATGATTAAAGCTATAACGGCGAATCTTACATTCCCAGGCAGTGCAGTAAAGCAGATT 191 CAGTTAAAAAGATTAAAGCTACAACAGCGAATCTTACATTCCCAAGCAGTGCAGTAAAGCAGATT * 21008094 AAAGCCACAACGGTGAATCTTGCTTCCACGACATTGC 256 AAAGCCACAACGGTGAATCTTGCTTCCACGACATTAC 21008131 AG-TTAAA 1 AGATTAAA 21008138 AAGATTAAAG Statistics Matches: 263, Mismatches: 30, Indels: 14 0.86 0.10 0.05 Matches are distributed among these distances: 291 6 0.02 292 193 0.73 293 62 0.24 294 2 0.01 ACGTcount: A:0.32, C:0.23, G:0.21, T:0.23 Consensus pattern (292 bp): AGATTAAAGCCACAGCAGCGAATCCTACCTCCCTGGGAGTGCAGTGGAACAGATTAAAGCCACAA CGGTGAATCTTGCCTCCCTGACATTGCAGTTAAAAAGATTAAAGCTACAACAGCGAATCTTACTC CCCAGACGATCAGCGGAACAGATTAAAGCCACAACGGTGAATCTTGCCTCCATGAAATTGCAGTT 
AAAAAGATTAAAGCTACAACAGCGAATCTTACATTCCCAAGCAGTGCAGTAAAGCAGATTAAAGC CACAACGGTGAATCTTGCTTCCACGACATTAC Found at i:21008115 original size:101 final size:100 Alignment explanation

Indices: 21007885--21008168 Score: 383 Period size: 101 Copynumber: 2.8 Consensus size: 100 21007875 GATGCCGTGG *** 21007885 AACAGATTAAAGCCACAACGGTGAATCTTGCTTCC-TCGGTGTTGCAGTTAAAAAGATTAAAGCT 1 AACAGATTAAAGCCACAACGGTGAATCTTGCTTCCAT-GAAATTGCAGTTAAAAAGATTAAAGCT * * * * 21007949 ACAACAGCGAATCTTACCTCCCAGACAGTATAGTGG 65 ACAACGGCGAATCTTACTTCCCAGACAGTACAGTGA * * 21007985 AACAGATTAAAGCCACAACGGTGAATCTTGCCTCCATGAAATTGCAGTTAAAATGATTAAAGCTA 1 AACAGATTAAAGCCACAACGGTGAATCTTGCTTCCATGAAATTGCAGTTAAAAAGATTAAAGCTA * * * 21008050 TAACGGCGAATCTTACATTCCCAGGCAGTGCAGT-A 66 CAACGGCGAATCTTAC-TTCCCAGACAGTACAGTGA * * * 21008085 AAGCAGATTAAAGCCACAACGGTGAATCTTGCTTCCACGACATTGCAGTTAAAAAGATTAAAGCC 1 AA-CAGATTAAAGCCACAACGGTGAATCTTGCTTCCATGAAATTGCAGTTAAAAAGATTAAAGCT * 21008150 ACAATGGCGAATCTTACTT 65 ACAACGGCGAATCTTACTT 21008169 TCCAAATGTT Statistics Matches: 162, Mismatches: 19, Indels: 6 0.87 0.10 0.03 Matches are distributed among these distances: 100 76 0.47 101 86 0.53 ACGTcount: A:0.36, C:0.21, G:0.19, T:0.24 Consensus pattern (100 bp): AACAGATTAAAGCCACAACGGTGAATCTTGCTTCCATGAAATTGCAGTTAAAAAGATTAAAGCTA CAACGGCGAATCTTACTTCCCAGACAGTACAGTGA Found at i:21009591 original size:5 final size:6 Alignment explanation

Indices: 21009540--21009596 Score: 60 Period size: 6 Copynumber: 9.2 Consensus size: 6 21009530 ACCAATTTAG * * * * 21009540 TTTATC TTTATT TTTCTT TTTATT TCTATT TGTAGTGT TTTATT TTTATT 1 TTTATT TTTATT TTTATT TTTATT TTTATT TTTA-T-T TTTATT TTTATT 21009590 TTTATT T 1 TTTATT T 21009597 GAATTCCAAT Statistics Matches: 43, Mismatches: 6, Indels: 4 0.81 0.11 0.08 Matches are distributed among these distances: 6 37 0.86 7 2 0.05 8 4 0.09 ACGTcount: A:0.14, C:0.05, G:0.05, T:0.75 Consensus pattern (6 bp): TTTATT Found at i:21009630 original size:3 final size:3 Alignment explanation

Indices: 21009622--21009687 Score: 105 Period size: 3 Copynumber: 22.0 Consensus size: 3 21009612 TTTTAATCCC *** 21009622 ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT CCC ATT ATT ATT ATT ATT 1 ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT 21009670 ATT ATT ATT ATT ATT ATT 1 ATT ATT ATT ATT ATT ATT 21009688 TCTTCTTTAT Statistics Matches: 57, Mismatches: 6, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 3 57 1.00 ACGTcount: A:0.32, C:0.05, G:0.00, T:0.64 Consensus pattern (3 bp): ATT Found at i:21009803 original size:23 final size:22 Alignment explanation

Indices: 21009776--21009835 Score: 59 Period size: 23 Copynumber: 2.6 Consensus size: 22 21009766 TGACATTACC * 21009776 TTTAAACTTATTATTATTATAAT 1 TTTAAA-TTATTATTACTATAAT * * 21009799 TTTAAAATGTAATA-TACTATTAT 1 TTT-AAAT-TATTATTACTATAAT 21009822 TTTAAATTATTATT 1 TTTAAATTATTATT 21009836 CACATGTTTT Statistics Matches: 30, Mismatches: 4, Indels: 7 0.73 0.10 0.17 Matches are distributed among these distances: 21 4 0.13 22 5 0.17 23 14 0.47 24 7 0.23 ACGTcount: A:0.40, C:0.03, G:0.02, T:0.55 Consensus pattern (22 bp): TTTAAATTATTATTACTATAAT Found at i:21011097 original size:15 final size:15 Alignment explanation

Indices: 21011073--21011108 Score: 54 Period size: 15 Copynumber: 2.4 Consensus size: 15 21011063 GGTTTCGACA * 21011073 ATTATCATTAGAATT 1 ATTATTATTAGAATT * 21011088 ATTATTATTAGTATT 1 ATTATTATTAGAATT 21011103 ATTATT 1 ATTATT 21011109 TACATTAGTT Statistics Matches: 19, Mismatches: 2, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 15 19 1.00 ACGTcount: A:0.36, C:0.03, G:0.06, T:0.56 Consensus pattern (15 bp): ATTATTATTAGAATT Found at i:21011405 original size:12 final size:13 Alignment explanation

Indices: 21011365--21011406 Score: 68 Period size: 14 Copynumber: 3.2 Consensus size: 13 21011355 ATTTCAACTT 21011365 TTATATATTATTA 1 TTATATATTATTA 21011378 TGTATATATTATTA 1 T-TATATATTATTA 21011392 TT-TATATTATTA 1 TTATATATTATTA 21011404 TTA 1 TTA 21011407 AGTTTTTCTT Statistics Matches: 27, Mismatches: 0, Indels: 4 0.87 0.00 0.13 Matches are distributed among these distances: 12 12 0.44 13 2 0.07 14 13 0.48 ACGTcount: A:0.36, C:0.00, G:0.02, T:0.62 Consensus pattern (13 bp): TTATATATTATTA Found at i:21016855 original size:18 final size:18 Alignment explanation

Indices: 21016833--21016868 Score: 56 Period size: 17 Copynumber: 2.1 Consensus size: 18 21016823 TAAGTTATAT * 21016833 TAAAAATTTTAAT-TTAA 1 TAAAAACTTTAATGTTAA 21016850 TAAAAACTTTAATGTTAA 1 TAAAAACTTTAATGTTAA 21016868 T 1 T 21016869 GGTAATAAAA Statistics Matches: 17, Mismatches: 1, Indels: 1 0.89 0.05 0.05 Matches are distributed among these distances: 17 12 0.71 18 5 0.29 ACGTcount: A:0.50, C:0.03, G:0.03, T:0.44 Consensus pattern (18 bp): TAAAAACTTTAATGTTAA Found at i:21018436 original size:19 final size:19 Alignment explanation

Indices: 21018394--21018436 Score: 52 Period size: 19 Copynumber: 2.3 Consensus size: 19 21018384 TTTTTTATTT * 21018394 AATA-TTAATATAACTATA 1 AATATTTAATATAAATATA ** 21018412 TTTATTTAATATAAATATA 1 AATATTTAATATAAATATA 21018431 AATATT 1 AATATT 21018437 AAATTTATTG Statistics Matches: 19, Mismatches: 5, Indels: 1 0.76 0.20 0.04 Matches are distributed among these distances: 18 2 0.11 19 17 0.89 ACGTcount: A:0.51, C:0.02, G:0.00, T:0.47 Consensus pattern (19 bp): AATATTTAATATAAATATA Found at i:21025459 original size:20 final size:20 Alignment explanation

Indices: 21025434--21025474 Score: 82 Period size: 20 Copynumber: 2.0 Consensus size: 20 21025424 GTAAGCATGA 21025434 ACCTAAGTAAATTTGAAGCT 1 ACCTAAGTAAATTTGAAGCT 21025454 ACCTAAGTAAATTTGAAGCT 1 ACCTAAGTAAATTTGAAGCT 21025474 A 1 A 21025475 TCTTCTCCTG Statistics Matches: 21, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 20 21 1.00 ACGTcount: A:0.41, C:0.15, G:0.15, T:0.29 Consensus pattern (20 bp): ACCTAAGTAAATTTGAAGCT Found at i:21025941 original size:56 final size:56 Alignment explanation

Indices: 21025829--21025942 Score: 165 Period size: 56 Copynumber: 2.0 Consensus size: 56 21025819 TTACGAGCTC * ** * * 21025829 AATCTTTGGTTTTGAGGAGATTGTTAGCCTCTACACAAATGGTGAGAACTTCGAAG 1 AATCTTTGGTTTTGAGGAGATTATTAGCCTCTACACAAATGACGAGAACATAGAAG * * 21025885 AATCTTTGGTTTTGAGGATATTATTAGCCTCTACACAAATGACGAGGACATAGAAG 1 AATCTTTGGTTTTGAGGAGATTATTAGCCTCTACACAAATGACGAGAACATAGAAG 21025941 AA 1 AA 21025943 ATAAAGAAAG Statistics Matches: 51, Mismatches: 7, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 56 51 1.00 ACGTcount: A:0.32, C:0.14, G:0.23, T:0.31 Consensus pattern (56 bp): AATCTTTGGTTTTGAGGAGATTATTAGCCTCTACACAAATGACGAGAACATAGAAG Found at i:21031014 original size:7 final size:7 Alignment explanation

Indices: 21031002--21031036 Score: 61 Period size: 7 Copynumber: 5.0 Consensus size: 7 21030992 AAATTTGTGT 21031002 GGTTTAG 1 GGTTTAG 21031009 GGTTTAG 1 GGTTTAG 21031016 GGTTTAG 1 GGTTTAG * 21031023 GGTTCAG 1 GGTTTAG 21031030 GGTTTAG 1 GGTTTAG 21031037 AAAAAGAACA Statistics Matches: 26, Mismatches: 2, Indels: 0 0.93 0.07 0.00 Matches are distributed among these distances: 7 26 1.00 ACGTcount: A:0.14, C:0.03, G:0.43, T:0.40 Consensus pattern (7 bp): GGTTTAG Found at i:21034649 original size:15 final size:15 Alignment explanation

Indices: 21034626--21034658 Score: 57 Period size: 15 Copynumber: 2.2 Consensus size: 15 21034616 TTTTTATGCC 21034626 ATTGATGTATTTATT 1 ATTGATGTATTTATT * 21034641 ATTGTTGTATTTATT 1 ATTGATGTATTTATT 21034656 ATT 1 ATT 21034659 ATAGCATTTA Statistics Matches: 17, Mismatches: 1, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 15 17 1.00 ACGTcount: A:0.24, C:0.00, G:0.12, T:0.64 Consensus pattern (15 bp): ATTGATGTATTTATT Found at i:21040870 original size:26 final size:26 Alignment explanation

Indices: 21040841--21040893 Score: 106 Period size: 26 Copynumber: 2.0 Consensus size: 26 21040831 CTATGCTCAG 21040841 AACCAACTTTATTCTTATCCATTGTT 1 AACCAACTTTATTCTTATCCATTGTT 21040867 AACCAACTTTATTCTTATCCATTGTT 1 AACCAACTTTATTCTTATCCATTGTT 21040893 A 1 A 21040894 TGACCTTTTT Statistics Matches: 27, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 26 27 1.00 ACGTcount: A:0.28, C:0.23, G:0.04, T:0.45 Consensus pattern (26 bp): AACCAACTTTATTCTTATCCATTGTT Found at i:21041737 original size:44 final size:43 Alignment explanation

Indices: 21041684--21042443 Score: 470 Period size: 44 Copynumber: 17.5 Consensus size: 43 21041674 AGATGGCAAG 21041684 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCGATAA 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCC-ATAA * * * 21041728 TCCTATCTCCCTGAAGTTGCAGTGGAGCGGATTAAAACC-TCAGA 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCAT-A-A * * * * * 21041772 TCTTATCTCTCTGAAGTTGCAG-AGAGCAGA-T--CGCAACTAG 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCA-TAA * * 21041812 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGTAAG-CA-AG 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGA-TTA--AAGCCATAA * * * * * 21041856 TCTTATC-CTCTTGAAGTTGCAGTGGGGCAGACTGAAGATGGCA-AG 1 TCTTATCTC-CCTGAAGTTGCAGTGGAGCAGA-TTAA-A-GCCATAA * * 21041901 TCTTATCTCCCTGAAGCTGTAGTGGAGCAGATTAAAGCCGATAA 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCC-ATAA * * * 21041945 TCCTATCTCCCTAAAGTTGCAGTGGAGCAGATTAAAACC-TCAGA 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCAT-A-A * * * * *** * * 21041989 TCTTATCTCTCTGAAGTTGCAG-AGAACAAATCGCA-TC-TAG 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCATAA * * * * 21042029 TTTTATCTCTCTGAAGTTGCAGTGGAGCAGAATAAGTAAG-CA-AG 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAG-ATTA--AAGCCATAA * * * * 21042073 TCTTATC-CTCCTGAAGTTGCAGTGGGGCAGACTGAAGATGGCA-AG 1 TCTTATCTC-CCTGAAGTTGCAGTGGAGCAGA-TTAA-A-GCCATAA 21042118 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCGATAA 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCC-ATAA * * * * 21042162 TCCTATCTCCCTGAAGTTGCAGTGGAGCGGATTAAAACCTTAGA 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCATA-A * * *** * * 21042206 TCTTATCTCTCTGAAGTTGCAG-AGAGCAGATCGCA-TC-TAG 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCATAA * * * * 21042246 TTTTATCTCCCTGAAGTTGTAGTGGAGCAGACTAAGTAAG-CA-AG 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGA-TTA--AAGCCATAA * * * * * 21042290 TCTTATC-CTCCTGAAGTTGTAGTGGGGCAGACTGAAGATGGCA-AG 1 TCTTATCTC-CCTGAAGTTGCAGTGGAGCAGA-TTAA-A-GCCATAA * * * 21042335 TCGTATCTCCCTGAAGTTGCAGTGGAGCAAATTAAAGTCGATAA 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAG-CCATAA * * * * * 21042379 TCCTATCTCCCTAAAGTTGTAGTGGAGCGGATTAAAGCCTTAGA 1 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCATA-A * 21042423 
TCTTATCTCTCTGAAGTTGCA 1 TCTTATCTCCCTGAAGTTGCA 21042444 AAGAGTAGAT Statistics Matches: 568, Mismatches: 99, Indels: 98 0.74 0.13 0.13 Matches are distributed among these distances: 40 62 0.11 41 22 0.04 42 18 0.03 43 41 0.07 44 330 0.58 45 90 0.16 46 4 0.01 47 1 0.00 ACGTcount: A:0.28, C:0.20, G:0.24, T:0.28 Consensus pattern (43 bp): TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCATAA Found at i:21041925 original size:89 final size:86 Alignment explanation

Indices: 21041810--21042363 Score: 404 Period size: 89 Copynumber: 6.4 Consensus size: 86 21041800 GATCGCAACT * 21041810 AGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGTAAGCAAGTCTTATCCTCTTGAAGTTG 1 AGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAG-AAGCAAGTCTTAT-CTCCTGAAGTTG * 21041875 CAGTGGGGCAGACTGAAGATGGCA 64 CAGTGGAGCAGA-TGAAGATGGCA * * * * * * 21041899 AGTCTTATCTCCCTGAAGCTGTAGTGGAGCAGA-TTA-AAGCCGATAATCCTATCTCCCTAAAGT 1 AGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGAAG-C-A-AGTCTTATCT-CCTGAAGT * * *** 21041962 TGCAGTGGAGCAGATTAAAACCTC- 62 TGCAGTGGAGCAGATGAAGATGGCA * * * * * * * * * 21041986 AGATCTTATCTCTCTGAAGTTGCAG-AGAACA-AAT-CGCATCTAGTTTTATCTCTCTGAAGTTG 1 AG-TCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGAAGCAAGTCTTATCTC-CTGAAGTTG * 21042048 CAGTGGAGCAGAAT-AAG-TAAGCA 64 CAGTGGAGCAG-ATGAAGAT-GGCA * * 21042071 AGTCTTATC-CTCCTGAAGTTGCAGTGGGGCAGACTGAAGATGGCAAGTCTTATCTCCCTGAAGT 1 AGTCTTATCTC-CCTGAAGTTGCAGTGGAGCAGACT-AAGA-AGCAAGTCTTATCT-CCTGAAGT * * 21042135 TGCAGTGGAGCAGATTAA-A-GCCGA 62 TGCAGTGGAGCAGATGAAGATGGC-A * * * * * * 21042159 TAATCCTATCTCCCTGAAGTTGCAGTGGAGCGGATTAA-AACCTTAGATCTTATCTCTCTGAAGT 1 -AGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGAAGC-AAG-TCTTATCTC-CTGAAGT * * * 21042223 TGCAG-AGAGCAGATCG--CAT--CT 62 TGCAGTGGAGCAGAT-GAAGATGGCA * * 21042244 AGTTTTATCTCCCTGAAGTTGTAGTGGAGCAGACTAAGTAAGCAAGTCTTATCCTCCTGAAGTTG 1 AGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAG-AAGCAAGTCTTAT-CTCCTGAAGTTG * * 21042309 TAGTGGGGCAGACTGAAGATGGCA 64 CAGTGGAGCAGA-TGAAGATGGCA * 21042333 AGTCGTATCTCCCTGAAGTTGCAGTGGAGCA 1 AGTCTTATCTCCCTGAAGTTGCAGTGGAGCA 21042364 AATTAAAGTC Statistics Matches: 357, Mismatches: 70, Indels: 76 0.71 0.14 0.15 Matches are distributed among these distances: 83 2 0.01 84 96 0.27 85 19 0.05 86 13 0.04 87 25 0.07 88 55 0.15 89 145 0.41 90 2 0.01 ACGTcount: A:0.28, C:0.20, G:0.25, T:0.27 Consensus pattern (86 bp): AGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGAAGCAAGTCTTATCTCCTGAAGTTGCA GTGGAGCAGATGAAGATGGCA Found at i:21042046 original size:173 final size:172 Alignment explanation

Indices: 21041858--21042183 Score: 401 Period size: 173 Copynumber: 1.9 Consensus size: 172 21041848 TAAGCAAGTC * * * * 21041858 TTATCCTCTTGAAGTTGCAGTGGGGCAGACTGAAG-ATGGCAAGTCTTAT-CTCCCTGAAGCTGT 1 TTATCCTCTTGAAGTTGCAGTGGAGCAGAAT-AAGTA-AGCAAGTCTTATCCT-CCTGAAGCTGC * * * 21041921 AGTGGAGCAGATTAAAGCCGATAA-TCCTATCTCCCTAAAGTTGCAGTGGAGCAGATTAAAACC- 63 AGTGGAGCAGACTAAAGACGACAAGTCCTATCTCCCTAAAGTTGCAGTGGAGCAGATTAAAACCA * * 21041984 TCAGATCTTATCTCTCTGAAGTTGCAGAGAACAAATCGCATCTAGTT 128 T-A-ATCCTATCTCCCTGAAGTTGCAGAGAACAAATCGCATCTAGTT * 21042031 TTAT-CTCTCTGAAGTTGCAGTGGAGCAGAATAAGTAAGCAAGTCTTATCCTCCTGAAGTTGCAG 1 TTATCCTCT-TGAAGTTGCAGTGGAGCAGAATAAGTAAGCAAGTCTTATCCTCCTGAAGCTGCAG * * * * * * * 21042095 TGGGGCAGACTGAAGATGGCAAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCGAT 65 TGGAGCAGACTAAAGACGACAAGTCCTATCTCCCTAAAGTTGCAGTGGAGCAGATTAAAACC-AT 21042160 AATCCTATCTCCCTGAAGTTGCAG 129 AATCCTATCTCCCTGAAGTTGCAG 21042184 TGGAGCGGAT Statistics Matches: 130, Mismatches: 17, Indels: 12 0.82 0.11 0.08 Matches are distributed among these distances: 172 44 0.34 173 84 0.65 174 1 0.01 175 1 0.01 ACGTcount: A:0.28, C:0.21, G:0.24, T:0.27 Consensus pattern (172 bp): TTATCCTCTTGAAGTTGCAGTGGAGCAGAATAAGTAAGCAAGTCTTATCCTCCTGAAGCTGCAGT GGAGCAGACTAAAGACGACAAGTCCTATCTCCCTAAAGTTGCAGTGGAGCAGATTAAAACCATAA TCCTATCTCCCTGAAGTTGCAGAGAACAAATCGCATCTAGTT Found at i:21042047 original size:217 final size:217 Alignment explanation

Indices: 21041672--21042497 Score: 1409 Period size: 217 Copynumber: 3.8 Consensus size: 217 21041662 GAAGCAGATC 21041672 GAAGATGGCAAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCGATAATCCTATCTC 1 GAAGATGGCAAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCGATAATCCTATCTC * 21041737 CCTGAAGTTGCAGTGGAGCGGATTAAAACCTCAGATCTTATCTCTCTGAAGTTGCAGAGAGCAGA 66 CCTAAAGTTGCAGTGGAGCGGATTAAAACCTCAGATCTTATCTCTCTGAAGTTGCAGAGAGCAGA * * 21041802 TCGCAACTAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGTAAGCAAGTCTTATCCTCT 131 TCGCATCTAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGTAAGCAAGTCTTATCCTCC 21041867 TGAAGTTGCAGTGGGGCAGACT 196 TGAAGTTGCAGTGGGGCAGACT * * 21041889 GAAGATGGCAAGTCTTATCTCCCTGAAGCTGTAGTGGAGCAGATTAAAGCCGATAATCCTATCTC 1 GAAGATGGCAAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCGATAATCCTATCTC * * * 21041954 CCTAAAGTTGCAGTGGAGCAGATTAAAACCTCAGATCTTATCTCTCTGAAGTTGCAGAGAACAAA 66 CCTAAAGTTGCAGTGGAGCGGATTAAAACCTCAGATCTTATCTCTCTGAAGTTGCAGAGAGCAGA * * * 21042019 TCGCATCTAGTTTTATCTCTCTGAAGTTGCAGTGGAGCAGAATAAGTAAGCAAGTCTTATCCTCC 131 TCGCATCTAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGTAAGCAAGTCTTATCCTCC 21042084 TGAAGTTGCAGTGGGGCAGACT 196 TGAAGTTGCAGTGGGGCAGACT 21042106 GAAGATGGCAAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCGATAATCCTATCTC 1 GAAGATGGCAAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCGATAATCCTATCTC * * 21042171 CCTGAAGTTGCAGTGGAGCGGATTAAAACCTTAGATCTTATCTCTCTGAAGTTGCAGAGAGCAGA 66 CCTAAAGTTGCAGTGGAGCGGATTAAAACCTCAGATCTTATCTCTCTGAAGTTGCAGAGAGCAGA * * 21042236 TCGCATCTAGTTTTATCTCCCTGAAGTTGTAGTGGAGCAGACTAAGTAAGCAAGTCTTATCCTCC 131 TCGCATCTAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGTAAGCAAGTCTTATCCTCC * 21042301 TGAAGTTGTAGTGGGGCAGACT 196 TGAAGTTGCAGTGGGGCAGACT * * * 21042323 GAAGATGGCAAGTCGTATCTCCCTGAAGTTGCAGTGGAGCAAATTAAAGTCGATAATCCTATCTC 1 GAAGATGGCAAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCGATAATCCTATCTC * * * * * 21042388 CCTAAAGTTGTAGTGGAGCGGATTAAAGCCTTAGATCTTATCTCTCTGAAGTTGCAAAGAGTAGA 66 CCTAAAGTTGCAGTGGAGCGGATTAAAACCTCAGATCTTATCTCTCTGAAGTTGCAGAGAGCAGA * * * 21042453 TCGCATTTAGTCTTATCTCCTTGAAGTTGCAGCGGAGCAGACTAA 131 
TCGCATCTAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAA 21042498 AATAGCAAAT Statistics Matches: 574, Mismatches: 35, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 217 574 1.00 ACGTcount: A:0.28, C:0.20, G:0.24, T:0.27 Consensus pattern (217 bp): GAAGATGGCAAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGATTAAAGCCGATAATCCTATCTC CCTAAAGTTGCAGTGGAGCGGATTAAAACCTCAGATCTTATCTCTCTGAAGTTGCAGAGAGCAGA TCGCATCTAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACTAAGTAAGCAAGTCTTATCCTCC TGAAGTTGCAGTGGGGCAGACT Found at i:21044444 original size:15 final size:15 Alignment explanation

Indices: 21044421--21044453 Score: 57 Period size: 15 Copynumber: 2.2 Consensus size: 15 21044411 TTTTTATGCC 21044421 ATTGATGTATTTATT 1 ATTGATGTATTTATT * 21044436 ATTGTTGTATTTATT 1 ATTGATGTATTTATT 21044451 ATT 1 ATT 21044454 ATAGCATTTG Statistics Matches: 17, Mismatches: 1, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 15 17 1.00 ACGTcount: A:0.24, C:0.00, G:0.12, T:0.64 Consensus pattern (15 bp): ATTGATGTATTTATT Found at i:21050678 original size:19 final size:18 Alignment explanation

Indices: 21050609--21050674 Score: 55 Period size: 19 Copynumber: 3.5 Consensus size: 18 21050599 TTTATTTTAT 21050609 TATA-TTAATATTAAAATTA 1 TATATTTAATA-T-AAATTA * * 21050628 TACATTTTAATAGAATATTA 1 TATA-TTTAATATAA-ATTA 21050648 -ATATATTAATATAAATTA 1 TATAT-TTAATATAAATTA 21050666 TATATTTAA 1 TATATTTAA 21050675 ATATTTTGTT Statistics Matches: 38, Mismatches: 4, Indels: 11 0.72 0.08 0.21 Matches are distributed among these distances: 18 9 0.24 19 19 0.50 20 4 0.11 21 6 0.16 ACGTcount: A:0.50, C:0.02, G:0.02, T:0.47 Consensus pattern (18 bp): TATATTTAATATAAATTA Found at i:21050693 original size:33 final size:33 Alignment explanation

Indices: 21050651--21050714 Score: 85 Period size: 33 Copynumber: 1.9 Consensus size: 33 21050641 AATATTAATA 21050651 TATTAATATAAATTATA-TATTTAAATATTTTGT 1 TATTAATATAAA-TATAGTATTTAAATATTTTGT * * * 21050684 TATTTATATAAATATAGTTTTTAATTATTTT 1 TATTAATATAAATATAGTATTTAAATATTTT 21050715 AGTAAATATA Statistics Matches: 27, Mismatches: 3, Indels: 2 0.84 0.09 0.06 Matches are distributed among these distances: 32 4 0.15 33 23 0.85 ACGTcount: A:0.39, C:0.00, G:0.03, T:0.58 Consensus pattern (33 bp): TATTAATATAAATATAGTATTTAAATATTTTGT Found at i:21050977 original size:13 final size:13 Alignment explanation

Indices: 21050959--21050983 Score: 50 Period size: 13 Copynumber: 1.9 Consensus size: 13 21050949 GAAGAGTAAT 21050959 AAAAACAAAAAAC 1 AAAAACAAAAAAC 21050972 AAAAACAAAAAA 1 AAAAACAAAAAA 21050984 AAGGAATACA Statistics Matches: 12, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 13 12 1.00 ACGTcount: A:0.88, C:0.12, G:0.00, T:0.00 Consensus pattern (13 bp): AAAAACAAAAAAC Found at i:21051477 original size:110 final size:110 Alignment explanation

Indices: 21051284--21051501 Score: 400 Period size: 110 Copynumber: 2.0 Consensus size: 110 21051274 ATTGAGTCAA ** 21051284 TTAGTTCGCTAAAGCGTATTATCCTCCTATTTATGGGTTGGGGAGGAACTATGAGTAATTCTAGA 1 TTAGTTCGCTAAAGCACATTATCCTCCTATTTATGGGTTGGGGAGGAACTATGAGTAATTCTAGA * 21051349 CATTGTGTCAAAAAGAGCAGATATGATCAAAACATATAATGAGTC 66 CATTGTGTCAAAAAGAGCAGATATAATCAAAACATATAATGAGTC * 21051394 TTAGTTCGCTAAAGCACATTATCCTCCTATTTATGGGTTGGGGAGGAACTATGAGTAGTTCTAGA 1 TTAGTTCGCTAAAGCACATTATCCTCCTATTTATGGGTTGGGGAGGAACTATGAGTAATTCTAGA 21051459 CATTGTGTCAAAAAGAGCAGATATAATCAAAACATATAATGAG 66 CATTGTGTCAAAAAGAGCAGATATAATCAAAACATATAATGAG 21051502 ATTATTAAAA Statistics Matches: 104, Mismatches: 4, Indels: 0 0.96 0.04 0.00 Matches are distributed among these distances: 110 104 1.00 ACGTcount: A:0.34, C:0.14, G:0.22, T:0.30 Consensus pattern (110 bp): TTAGTTCGCTAAAGCACATTATCCTCCTATTTATGGGTTGGGGAGGAACTATGAGTAATTCTAGA CATTGTGTCAAAAAGAGCAGATATAATCAAAACATATAATGAGTC Found at i:21057529 original size:20 final size:20 Alignment explanation

Indices: 21057501--21057556 Score: 60 Period size: 20 Copynumber: 2.8 Consensus size: 20 21057491 CGCATAGTTA * * 21057501 CATAACATATCATGT-GATGT 1 CATATCATATCATGTCGA-AT * 21057521 CATATCATATCATATCGAAT 1 CATATCATATCATGTCGAAT * 21057541 CATATCATAGCATGTC 1 CATATCATATCATGTC 21057557 CTACTCCCGT Statistics Matches: 30, Mismatches: 5, Indels: 2 0.81 0.14 0.05 Matches are distributed among these distances: 20 28 0.93 21 2 0.07 ACGTcount: A:0.36, C:0.20, G:0.11, T:0.34 Consensus pattern (20 bp): CATATCATATCATGTCGAAT Found at i:21061334 original size:17 final size:17 Alignment explanation

Indices: 21061312--21061346 Score: 70 Period size: 17 Copynumber: 2.1 Consensus size: 17 21061302 CAAGTGGAAG 21061312 TATGATTAATTAATTGT 1 TATGATTAATTAATTGT 21061329 TATGATTAATTAATTGT 1 TATGATTAATTAATTGT 21061346 T 1 T 21061347 TTGTAGGGGT Statistics Matches: 18, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 17 18 1.00 ACGTcount: A:0.34, C:0.00, G:0.11, T:0.54 Consensus pattern (17 bp): TATGATTAATTAATTGT Found at i:21066343 original size:22 final size:22 Alignment explanation

Indices: 21066298--21066344 Score: 60 Period size: 22 Copynumber: 2.1 Consensus size: 22 21066288 ACTGGTTGCT * 21066298 ATGTCGCAACACGGAATGCCAA 1 ATGTCGCAACACGGAATGCAAA * 21066320 ATGTCGCAACATTGGAA-GCAAA 1 ATGTCGCAACA-CGGAATGCAAA 21066342 ATG 1 ATG 21066345 GAGAAATCAC Statistics Matches: 22, Mismatches: 2, Indels: 2 0.85 0.08 0.08 Matches are distributed among these distances: 22 18 0.82 23 4 0.18 ACGTcount: A:0.38, C:0.21, G:0.23, T:0.17 Consensus pattern (22 bp): ATGTCGCAACACGGAATGCAAA Found at i:21071915 original size:15 final size:15 Alignment explanation

Indices: 21071895--21071925 Score: 62 Period size: 15 Copynumber: 2.1 Consensus size: 15 21071885 CTTCAGGTTC 21071895 CTTTTCAGTTAATTT 1 CTTTTCAGTTAATTT 21071910 CTTTTCAGTTAATTT 1 CTTTTCAGTTAATTT 21071925 C 1 C 21071926 CTTTAATGTT Statistics Matches: 16, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 15 16 1.00 ACGTcount: A:0.19, C:0.16, G:0.06, T:0.58 Consensus pattern (15 bp): CTTTTCAGTTAATTT Found at i:21072569 original size:22 final size:22 Alignment explanation

Indices: 21072544--21072585 Score: 59 Period size: 22 Copynumber: 1.9 Consensus size: 22 21072534 TGTTTTATCA 21072544 TAATATAAAT-TATTTACATTAC 1 TAAT-TAAATATATTTACATTAC * 21072566 TAATTAATTATATTTACATT 1 TAATTAAATATATTTACATT 21072586 TAGATTAACT Statistics Matches: 18, Mismatches: 1, Indels: 2 0.86 0.05 0.10 Matches are distributed among these distances: 21 4 0.22 22 14 0.78 ACGTcount: A:0.43, C:0.07, G:0.00, T:0.50 Consensus pattern (22 bp): TAATTAAATATATTTACATTAC Found at i:21075158 original size:36 final size:36 Alignment explanation

Indices: 21075117--21075195 Score: 97 Period size: 36 Copynumber: 2.2 Consensus size: 36 21075107 TCATATGATT * 21075117 TACATA-CAACTCGCATAGTTCCTAGGGTCTGCATGA 1 TACATATCAA-TCGCATAGTTCCTAGGATCTGCATGA * * * 21075153 TACATATGAATCGCATAGTTTCTAGGATTTGCATGA 1 TACATATCAATCGCATAGTTCCTAGGATCTGCATGA * 21075189 TGCATAT 1 TACATAT 21075196 GACTCGTGTA Statistics Matches: 37, Mismatches: 5, Indels: 2 0.84 0.11 0.05 Matches are distributed among these distances: 36 35 0.95 37 2 0.05 ACGTcount: A:0.29, C:0.19, G:0.19, T:0.33 Consensus pattern (36 bp): TACATATCAATCGCATAGTTCCTAGGATCTGCATGA Found at i:21075196 original size:36 final size:36 Alignment explanation

Indices: 21075127--21075197 Score: 106 Period size: 36 Copynumber: 2.0 Consensus size: 36 21075117 TACATACAAC * 21075127 TCGCATAGTTCCTAGGGTCTGCATGATACATATGAA 1 TCGCATAGTTCCTAGGATCTGCATGATACATATGAA * * * 21075163 TCGCATAGTTTCTAGGATTTGCATGATGCATATGA 1 TCGCATAGTTCCTAGGATCTGCATGATACATATGA 21075198 CTCGTGTATA Statistics Matches: 31, Mismatches: 4, Indels: 0 0.89 0.11 0.00 Matches are distributed among these distances: 36 31 1.00 ACGTcount: A:0.27, C:0.17, G:0.23, T:0.34 Consensus pattern (36 bp): TCGCATAGTTCCTAGGATCTGCATGATACATATGAA Found at i:21075280 original size:25 final size:25 Alignment explanation

Indices: 21075242--21075297 Score: 103 Period size: 25 Copynumber: 2.2 Consensus size: 25 21075232 TGGTGGTTCA * 21075242 CATGTAATACTCAACAAGTGACTCG 1 CATGTAATAATCAACAAGTGACTCG 21075267 CATGTAATAATCAACAAGTGACTCG 1 CATGTAATAATCAACAAGTGACTCG 21075292 CATGTA 1 CATGTA 21075298 CAACAAATTA Statistics Matches: 30, Mismatches: 1, Indels: 0 0.97 0.03 0.00 Matches are distributed among these distances: 25 30 1.00 ACGTcount: A:0.38, C:0.21, G:0.16, T:0.25 Consensus pattern (25 bp): CATGTAATAATCAACAAGTGACTCG Found at i:21082883 original size:14 final size:15 Alignment explanation

Indices: 21082859--21082891 Score: 50 Period size: 14 Copynumber: 2.3 Consensus size: 15 21082849 TCCCAAGCGA 21082859 GGAAGGACCTTATGT 1 GGAAGGACCTTATGT * 21082874 GGAA-GACCTTATTT 1 GGAAGGACCTTATGT 21082888 GGAA 1 GGAA 21082892 AAACGTCGAA Statistics Matches: 17, Mismatches: 1, Indels: 1 0.89 0.05 0.05 Matches are distributed among these distances: 14 13 0.76 15 4 0.24 ACGTcount: A:0.30, C:0.12, G:0.30, T:0.27 Consensus pattern (15 bp): GGAAGGACCTTATGT Found at i:21084250 original size:44 final size:44 Alignment explanation

Indices: 21084169--21084570 Score: 321 Period size: 44 Copynumber: 9.1 Consensus size: 44 21084159 GCAGATCAGT * * * * 21084169 GAAGATAGCAGATCCTGTCTTTCTATATTGGTAGCGAAGTGGATC 1 GAAGAT-GCAGATCTTGTCTTCCCATATTGGTAGCGAAGTAGATC * * * 21084214 GAAGATGCAGATCTTATCTTCCCATACTGGTGGCGAAGTAGATC 1 GAAGATGCAGATCTTGTCTTCCCATATTGGTAGCGAAGTAGATC * *** * * * 21084258 GAAGAAAGCAGATCTTGTCTTCATGTATTAGCA-TGAAGTAGATC 1 GAAG-ATGCAGATCTTGTCTTCCCATATTGGTAGCGAAGTAGATC * * * * 21084302 GAAGATAGCGGGTCTTGTCTTCCTATATTGGTAG-GAAGTGGATC 1 GAAGAT-GCAGATCTTGTCTTCCCATATTGGTAGCGAAGTAGATC * * * 21084346 GAAGATGCAGATCTTGTCTTTCCATACTGGTGGCGAAGTAGATC 1 GAAGATGCAGATCTTGTCTTCCCATATTGGTAGCGAAGTAGATC * **** * * 21084390 GAAGAAAT-CAGATCTTATCTTTATGTATTGG-CGTGAAGTAGATC 1 GAAG--ATGCAGATCTTGTCTTCCCATATTGGTAGCGAAGTAGATC * * * * * 21084434 GAAGATAGTAGGTCATGTCTTCCTATATTGGTAG-GAAGTGGATC 1 GAAGAT-GCAGATCTTGTCTTCCCATATTGGTAGCGAAGTAGATC * * * 21084478 AAAGATGCAGATCTTGTCTTCCCATATTGGTGGTGAAGTAGATC 1 GAAGATGCAGATCTTGTCTTCCCATATTGGTAGCGAAGTAGATC * * *** * 21084522 GAAGAAAGCAGAACTTGTCTTCATGTATTGG-AGTGAAGTAGATC 1 GAAG-ATGCAGATCTTGTCTTCCCATATTGGTAGCGAAGTAGATC 21084566 GAAGA 1 GAAGA 21084571 CACCAGCCTT Statistics Matches: 284, Mismatches: 62, Indels: 24 0.77 0.17 0.06 Matches are distributed among these distances: 42 2 0.01 43 45 0.16 44 170 0.60 45 65 0.23 46 2 0.01 ACGTcount: A:0.29, C:0.14, G:0.27, T:0.30 Consensus pattern (44 bp): GAAGATGCAGATCTTGTCTTCCCATATTGGTAGCGAAGTAGATC Found at i:21084416 original size:132 final size:131 Alignment explanation

Indices: 21084169--21084570 Score: 615 Period size: 132 Copynumber: 3.0 Consensus size: 131 21084159 GCAGATCAGT * * * * 21084169 GAAGATAGCAGATCCTGTCTTTCTATATTGGTAGCGAAGTGGATCGAAGATGCAGATCTTATCTT 1 GAAGATAGCAGGTCATGTCTTCCTATATTGGTAG-GAAGTGGATCGAAGATGCAGATCTTGTCTT 21084234 CCCATACTGGTGGCGAAGTAGATCGAAGAAAGCAGATCTTGTCTTCATGTATTAGCATGAAGTAG 65 CCCATACTGGTGGCGAAGTAGATCGAAGAAAGCAGATCTTGTCTTCATGTATT-GCATGAAGTAG 21084299 ATC 129 ATC * * * 21084302 GAAGATAGCGGGTCTTGTCTTCCTATATTGGTAGGAAGTGGATCGAAGATGCAGATCTTGTCTTT 1 GAAGATAGCAGGTCATGTCTTCCTATATTGGTAGGAAGTGGATCGAAGATGCAGATCTTGTCTTC * * * * 21084367 CCATACTGGTGGCGAAGTAGATCGAAGAAATCAGATCTTATCTTTATGTATTGGCGTGAAGTAGA 66 CCATACTGGTGGCGAAGTAGATCGAAGAAAGCAGATCTTGTCTTCATGTATT-GCATGAAGTAGA 21084432 TC 130 TC * * 21084434 GAAGATAGTAGGTCATGTCTTCCTATATTGGTAGGAAGTGGATCAAAGATGCAGATCTTGTCTTC 1 GAAGATAGCAGGTCATGTCTTCCTATATTGGTAGGAAGTGGATCGAAGATGCAGATCTTGTCTTC * * * * 21084499 CCATATTGGTGGTGAAGTAGATCGAAGAAAGCAGAACTTGTCTTCATGTATTGGAGTGAAGTAGA 66 CCATACTGGTGGCGAAGTAGATCGAAGAAAGCAGATCTTGTCTTCATGTATTGCA-TGAAGTAGA 21084564 TC 130 TC 21084566 GAAGA 1 GAAGA 21084571 CACCAGCCTT Statistics Matches: 244, Mismatches: 24, Indels: 3 0.90 0.09 0.01 Matches are distributed among these distances: 131 1 0.00 132 213 0.87 133 30 0.12 ACGTcount: A:0.29, C:0.14, G:0.27, T:0.30 Consensus pattern (131 bp): GAAGATAGCAGGTCATGTCTTCCTATATTGGTAGGAAGTGGATCGAAGATGCAGATCTTGTCTTC CCATACTGGTGGCGAAGTAGATCGAAGAAAGCAGATCTTGTCTTCATGTATTGCATGAAGTAGAT C Found at i:21084740 original size:88 final size:88 Alignment explanation

Indices: 21084556--21085007 Score: 629 Period size: 88 Copynumber: 5.1 Consensus size: 88 21084546 GTATTGGAGT * * * * * * * 21084556 GAAGTAGATCGAAGACACCAGCCTTGTCTTCTTGGGTTGCAGCGGAGCAGGCT-AAAATAGCAAA 1 GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGAGTTGTAGCGGAGCAGGCTAAAAATAGCAGA * * 21084620 TCTTGCCTTCTTGCACCGATAGC 66 TCTTGCCTTCCTGCACCGACAGC * * * * 21084643 GAAGTAGATCAAAGACACCAGCTTTGCCTCCCTGAGTTGTAGCGGAGCAGGCTAAAAATAGAAGA 1 GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGAGTTGTAGCGGAGCAGGCTAAAAATAGCAGA * 21084708 TATTGCCTTCCTGCACCGACAGC 66 TCTTGCCTTCCTGCACCGACAGC * * 21084731 GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGGGTTGTAGCGGAGCAGGCTATAAATAGCAGA 1 GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGAGTTGTAGCGGAGCAGGCTAAAAATAGCAGA 21084796 TCTTGCCTTCCTGCACCGACAGC 66 TCTTGCCTTCCTGCACCGACAGC * ** * * * 21084819 GAAACAGATCGAAGACACCAGCCTTGCCTCCCT-AGGTTACAGTGGAGCAGGTTAAAAAATAGCG 1 GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGA-GTTGTAGCGGAGCAGGCT-AAAAATAGCA * 21084883 GATCTTGCCTTCTTGCACCGACAGC 64 GATCTTGCCTTCCTGCACCGACAGC * * 21084908 GAAGCAGATCGAAGACACCAGCCTTGCCTCTCTGAGTTGTAGCGGAGCAGGTTAAAAATAGCAGA 1 GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGAGTTGTAGCGGAGCAGGCTAAAAATAGCAGA * * 21084973 TCTTGCTTTCCTGCACCGTCAGC 66 TCTTGCCTTCCTGCACCGACAGC 21084996 GAAGCAGATCGA 1 GAAGCAGATCGA 21085008 TAACCCCAAC Statistics Matches: 324, Mismatches: 37, Indels: 7 0.88 0.10 0.02 Matches are distributed among these distances: 87 46 0.14 88 199 0.61 89 78 0.24 90 1 0.00 ACGTcount: A:0.28, C:0.26, G:0.25, T:0.21 Consensus pattern (88 bp): GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGAGTTGTAGCGGAGCAGGCTAAAAATAGCAGA TCTTGCCTTCCTGCACCGACAGC Found at i:21084935 original size:177 final size:175 Alignment explanation

Indices: 21084556--21085007 Score: 636 Period size: 177 Copynumber: 2.6 Consensus size: 175 21084546 GTATTGGAGT * * * * * * 21084556 GAAGTAGATCGAAGACACCAGCCTTGTCTTCTTGGGTTGCAGCGGAGCAGGCTAAAATAGCAAAT 1 GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGGGTTGTAGCGGAGCAGGCTAAAATAGCAGAT * * * * * ** 21084621 CTTGCCTTCTTGCACCGATAGCGAAGTAGATCAAAGACACCAGCTTTGCCTCCCTGAGTTGTAGC 66 CTTGCCTTCCTGCACCGACAGCGAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGAGTTACAGC 21084686 GGAGCAGGCTAAAAATAGAAGATATTGCCTTCCTGCACCGACAGC 131 GGAGCAGGCTAAAAATAGAAGATATTGCCTTCCTGCACCGACAGC 21084731 GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGGGTTGTAGCGGAGCAGGCTATAAATAGCAGA 1 GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGGGTTGTAGCGGAGCAGGCTA-AAATAGCAGA * 21084796 TCTTGCCTTCCTGCACCGACAGCGAAACAGATCGAAGACACCAGCCTTGCCTCCCT-AGGTTACA 65 TCTTGCCTTCCTGCACCGACAGCGAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGA-GTTACA * * ** * * 21084860 GTGGAGCAGGTTAAAAAATAGCGGATCTTGCCTTCTTGCACCGACAGC 129 GCGGAGCAGGCT-AAAAATAGAAGATATTGCCTTCCTGCACCGACAGC * * * 21084908 GAAGCAGATCGAAGACACCAGCCTTGCCTCTCTGAGTTGTAGCGGAGCAGGTTAAAAATAGCAGA 1 GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGGGTTGTAGCGGAGCAGGCT-AAAATAGCAGA * * 21084973 TCTTGCTTTCCTGCACCGTCAGCGAAGCAGATCGA 65 TCTTGCCTTCCTGCACCGACAGCGAAGCAGATCGA 21085008 TAACCCCAAC Statistics Matches: 247, Mismatches: 26, Indels: 6 0.89 0.09 0.02 Matches are distributed among these distances: 175 50 0.20 176 73 0.30 177 123 0.50 178 1 0.00 ACGTcount: A:0.28, C:0.26, G:0.25, T:0.21 Consensus pattern (175 bp): GAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGGGTTGTAGCGGAGCAGGCTAAAATAGCAGAT CTTGCCTTCCTGCACCGACAGCGAAGCAGATCGAAGACACCAGCCTTGCCTCCCTGAGTTACAGC GGAGCAGGCTAAAAATAGAAGATATTGCCTTCCTGCACCGACAGC Found at i:21085068 original size:44 final size:44 Alignment explanation

Indices: 21085019--21085183 Score: 201 Period size: 44 Copynumber: 3.8 Consensus size: 44 21085009 AACCCCAACC 21085019 CTATCTCCCTGGTCAGCAGTGGAATAGGTTGAAGATTGAGAATT 1 CTATCTCCCTGGTCAGCAGTGGAATAGGTTGAAGATTGAGAATT * * * 21085063 CTATCTCCCTGGGCAGCAGTGGAATAGATTGAAGATTGTAG--GT 1 CTATCTCCCTGGTCAGCAGTGGAATAGGTTGAAGATTG-AGAATT * * * 21085106 CTAATCTCCCTAGTCAGCAGTGGAATAGGTTGAAGATTGTGAATC 1 CT-ATCTCCCTGGTCAGCAGTGGAATAGGTTGAAGATTGAGAATT ** * 21085151 CTATCTCCCTGAG-CAATAGTGGAGTAGGTTGAA 1 CTATCTCCCTG-GTCAGCAGTGGAATAGGTTGAA 21085184 AATAGTAGAT Statistics Matches: 103, Mismatches: 13, Indels: 10 0.82 0.10 0.08 Matches are distributed among these distances: 43 4 0.04 44 94 0.91 45 5 0.05 ACGTcount: A:0.27, C:0.17, G:0.27, T:0.28 Consensus pattern (44 bp): CTATCTCCCTGGTCAGCAGTGGAATAGGTTGAAGATTGAGAATT Found at i:21085172 original size:88 final size:88 Alignment explanation

Indices: 21085021--21085183 Score: 254 Period size: 88 Copynumber: 1.9 Consensus size: 88 21085011 CCCCAACCCT * * * * 21085021 ATCTCCCTGGTCAGCAGTGGAATAGGTTGAAGATTGAGAATTCTATCTCCCTGGGCAGCAGTGGA 1 ATCTCCCTAGTCAGCAGTGGAATAGGTTGAAGATTGAGAATCCTATCTCCCTGAGCAACAGTGGA 21085086 ATAGATTGAAGATTGTAGGTCTA 66 ATAGATTGAAGATTGTAGGTCTA * * 21085109 ATCTCCCTAGTCAGCAGTGGAATAGGTTGAAGATTGTGAATCCTATCTCCCTGAGCAATAGTGGA 1 ATCTCCCTAGTCAGCAGTGGAATAGGTTGAAGATTGAGAATCCTATCTCCCTGAGCAACAGTGGA * * 21085174 GTAGGTTGAA 66 ATAGATTGAA 21085184 AATAGTAGAT Statistics Matches: 67, Mismatches: 8, Indels: 0 0.89 0.11 0.00 Matches are distributed among these distances: 88 67 1.00 ACGTcount: A:0.28, C:0.17, G:0.28, T:0.28 Consensus pattern (88 bp): ATCTCCCTAGTCAGCAGTGGAATAGGTTGAAGATTGAGAATCCTATCTCCCTGAGCAACAGTGGA ATAGATTGAAGATTGTAGGTCTA Found at i:21085557 original size:101 final size:101 Alignment explanation

Indices: 21085382--21085573 Score: 251 Period size: 101 Copynumber: 1.9 Consensus size: 101 21085372 ACAGTGGAGC * * * 21085382 AGATTGAAGCCGCAACGGCAAATCTTACTCCCCTGGCGGTGTAATGGAACAGATTGAAGCTACGA 1 AGATTGAAGCCGCAACGGCAAATCTTACTCCCCTGGCAGTGTAACGGAACAGATTGAAGCCACGA * * * 21085447 CAGTGAATCTTTTTTCCTCAACATTGCAAATTTAAA 66 CAGCGAATCTTGTTTCCCCAACATTGCAAATTTAAA * * * * * * 21085483 AGATTGAAGCCGCAACGGCGAATTTTACTTCCCTGGCATTGTAGCGGAGCAGATTGAAGCCACGA 1 AGATTGAAGCCGCAACGGCAAATCTTACTCCCCTGGCAGTGTAACGGAACAGATTGAAGCCACGA * 21085548 C-GACGAATCTTGTTTCCCCGACATTG 66 CAG-CGAATCTTGTTTCCCCAACATTG 21085574 TAGATGGAAA Statistics Matches: 77, Mismatches: 13, Indels: 2 0.84 0.14 0.02 Matches are distributed among these distances: 100 1 0.01 101 76 0.99 ACGTcount: A:0.29, C:0.23, G:0.22, T:0.26 Consensus pattern (101 bp): AGATTGAAGCCGCAACGGCAAATCTTACTCCCCTGGCAGTGTAACGGAACAGATTGAAGCCACGA CAGCGAATCTTGTTTCCCCAACATTGCAAATTTAAA Found at i:21086309 original size:21 final size:21 Alignment explanation

Indices: 21086284--21086338 Score: 78 Period size: 21 Copynumber: 2.7 Consensus size: 21 21086274 ATCTTCAAGG * 21086284 ATATGTAATCTTAGATATGAT 1 ATATGCAATCTTAGATATGAT * 21086305 ATATGCAATCTTGGATATG-- 1 ATATGCAATCTTAGATATGAT 21086324 ATATGCAATCTTAGA 1 ATATGCAATCTTAGA 21086339 AGATATGATT Statistics Matches: 31, Mismatches: 3, Indels: 2 0.86 0.08 0.06 Matches are distributed among these distances: 19 14 0.45 21 17 0.55 ACGTcount: A:0.36, C:0.09, G:0.16, T:0.38 Consensus pattern (21 bp): ATATGCAATCTTAGATATGAT Found at i:21088070 original size:17 final size:17 Alignment explanation

Indices: 21088050--21088115 Score: 71 Period size: 17 Copynumber: 3.9 Consensus size: 17 21088040 CAGACATAAA * 21088050 AGTTCGCCAGTTATAGG 1 AGTTCGCCAGTAATAGG * 21088067 AGTTTGCCAGTAATA-G 1 AGTTCGCCAGTAATAGG * ** 21088083 TGTTCGCCAGGCATAGG 1 AGTTCGCCAGTAATAGG * 21088100 AGTTTGCCAGTAATAG 1 AGTTCGCCAGTAATAG 21088116 TGTTCGTCAG Statistics Matches: 38, Mismatches: 10, Indels: 2 0.76 0.20 0.04 Matches are distributed among these distances: 16 12 0.32 17 26 0.68 ACGTcount: A:0.26, C:0.17, G:0.29, T:0.29 Consensus pattern (17 bp): AGTTCGCCAGTAATAGG Found at i:21088088 original size:33 final size:33 Alignment explanation

Indices: 21088051--21088139 Score: 133 Period size: 33 Copynumber: 2.7 Consensus size: 33 21088041 AGACATAAAA ** 21088051 GTTCGCCAGTTATAGGAGTTTGCCAGTAATAGT 1 GTTCGCCAGACATAGGAGTTTGCCAGTAATAGT * 21088084 GTTCGCCAGGCATAGGAGTTTGCCAGTAATAGT 1 GTTCGCCAGACATAGGAGTTTGCCAGTAATAGT * * 21088117 GTTCGTCAGACATTGGAGTTTGC 1 GTTCGCCAGACATAGGAGTTTGC 21088140 TAGACTTAGA Statistics Matches: 51, Mismatches: 5, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 33 51 1.00 ACGTcount: A:0.22, C:0.17, G:0.29, T:0.31 Consensus pattern (33 bp): GTTCGCCAGACATAGGAGTTTGCCAGTAATAGT Found at i:21090401 original size:26 final size:26 Alignment explanation

Indices: 21090340--21090389 Score: 73 Period size: 26 Copynumber: 1.9 Consensus size: 26 21090330 CATGTATGTG * 21090340 AAAATATGTATGAATTATAAAATGGA 1 AAAATATATATGAATTATAAAATGGA * * 21090366 AAAATGTATATGTATTATAAAATG 1 AAAATATATATGAATTATAAAATG 21090390 TAAATTTATA Statistics Matches: 21, Mismatches: 3, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 26 21 1.00 ACGTcount: A:0.52, C:0.00, G:0.14, T:0.34 Consensus pattern (26 bp): AAAATATATATGAATTATAAAATGGA Found at i:21090533 original size:3 final size:3 Alignment explanation

Indices: 21090525--21090564 Score: 55 Period size: 3 Copynumber: 13.3 Consensus size: 3 21090515 GTATATATGT * 21090525 ATA ATA ATA ATA ATA ATA ATA GTA ATA TATA AT- ATA ATA A 1 ATA ATA ATA ATA ATA ATA ATA ATA ATA -ATA ATA ATA ATA A 21090565 CAAAAATAAC Statistics Matches: 33, Mismatches: 2, Indels: 4 0.85 0.05 0.10 Matches are distributed among these distances: 2 2 0.06 3 28 0.85 4 3 0.09 ACGTcount: A:0.62, C:0.00, G:0.03, T:0.35 Consensus pattern (3 bp): ATA Found at i:21091580 original size:28 final size:29 Alignment explanation

Indices: 21091540--21091594 Score: 69 Period size: 28 Copynumber: 1.9 Consensus size: 29 21091530 GTTGAAATTG * 21091540 GCTATTGGGTTTTAATT-TT-ATTGGGCCT 1 GCTATTGGATTTT-ATTATTGATTGGGCCT * 21091568 GCTATTGTATTTTATTATTGATTGGGC 1 GCTATTGGATTTTATTATTGATTGGGC 21091595 AAGGCAAATA Statistics Matches: 23, Mismatches: 2, Indels: 3 0.82 0.07 0.11 Matches are distributed among these distances: 27 3 0.13 28 13 0.57 29 7 0.30 ACGTcount: A:0.16, C:0.09, G:0.24, T:0.51 Consensus pattern (29 bp): GCTATTGGATTTTATTATTGATTGGGCCT Found at i:21092582 original size:88 final size:88 Alignment explanation

Indices: 21092490--21092660 Score: 234 Period size: 88 Copynumber: 1.9 Consensus size: 88 21092480 GGACCTACTA * * * * 21092490 TCTTTGATCTACTTCACGTCAGTACATGAAGACACGGTCTGTTTTCTTCGACCTACTCCACCACC 1 TCTTTGATCTACTTCACGCCAGTACATGAAGACAAGATCTGCTTTCTTCGACCTACTCCACCACC * 21092555 AGTATGGGGAGACAAGATCTGTT 66 AGTATGGGAAGACAAGATCTGTT * * * * * * 21092578 TCTTTGATCTACTTCATGCCAGTACATGAAGACAAGATCTGCTTTCTTTGATCTGCTTCGCCACC 1 TCTTTGATCTACTTCACGCCAGTACATGAAGACAAGATCTGCTTTCTTCGACCTACTCCACCACC * 21092643 AGTATGGGAAGGCAAGAT 66 AGTATGGGAAGACAAGAT 21092661 ATGTATATTC Statistics Matches: 71, Mismatches: 12, Indels: 0 0.86 0.14 0.00 Matches are distributed among these distances: 88 71 1.00 ACGTcount: A:0.25, C:0.25, G:0.20, T:0.31 Consensus pattern (88 bp): TCTTTGATCTACTTCACGCCAGTACATGAAGACAAGATCTGCTTTCTTCGACCTACTCCACCACC AGTATGGGAAGACAAGATCTGTT Found at i:21092624 original size:44 final size:43 Alignment explanation

Indices: 21092490--21092839 Score: 205 Period size: 44 Copynumber: 7.9 Consensus size: 43 21092480 GGACCTACTA ** * * * 21092490 TCTTTGATCTACTTCACGTCAGTACATGAAGACACGGTCTGTTT 1 TCTTTGATCTACTTCACACCAGTACAGGAAGACAAGATCTG-TT * * * ** * 21092534 TCTTCGACCTACTCCACCACCAGTATGGGGAGACAAGATCTGTT 1 TCTTTGATCTACTTCA-CACCAGTACAGGAAGACAAGATCTGTT ** * 21092578 TCTTTGATCTACTTCATGCCAGTACATGAAGACAAGATCTGCTT 1 TCTTTGATCTACTTCACACCAGTACAGGAAGACAAGATCTG-TT * * ** * * * 21092622 TCTTTGATCTGCTTCGCCACCAGTATGGGAAGGCAAGATATGTA 1 TCTTTGATCTACTTC-ACACCAGTACAGGAAGACAAGATCTGTT * * * * * * * * * * 21092666 TATTCGATCCACTTCGCTACCAATATAGGAAGACATGACCTACTA 1 TCTTTGATCTACTTCAC-ACCAGTACAGGAAGACAAGATCT-GTT * * * * 21092711 TCTTTGATCTACTTCACGCCAGTACATGAAGACACGGTCTGTTT 1 TCTTTGATCTACTTCACACCAGTACAGGAAGACAAGATCTG-TT * * ** ** * 21092755 TCTTTGACCTACTCCACCACCAGTATGGGGGGACAAGATCTGGTA 1 TCTTTGATCTACTTCA-CACCAGTACAGGAAGACAAGATCT-GTT * * * * 21092800 TCTTCGATCTACTTCACGCCAATACATGAAGACAAGATCT 1 TCTTTGATCTACTTCACACCAGTACAGGAAGACAAGATCT 21092840 TTTGTATTCA Statistics Matches: 222, Mismatches: 76, Indels: 16 0.71 0.24 0.05 Matches are distributed among these distances: 43 20 0.09 44 121 0.55 45 80 0.36 46 1 0.00 ACGTcount: A:0.27, C:0.25, G:0.19, T:0.30 Consensus pattern (43 bp): TCTTTGATCTACTTCACACCAGTACAGGAAGACAAGATCTGTT Found at i:21092772 original size:221 final size:222 Alignment explanation

Indices: 21092407--21092839 Score: 733 Period size: 221 Copynumber: 2.0 Consensus size: 222 21092397 ATAATCTTCA * * 21092407 ATCTACTTCGCCGCCAGTATGGGAAGATAAAATATGTATATTCGATCCACTTCGCTACCAATATA 1 ATCTACTTCGCCACCAGTATGGGAAGACAAAATATGTATATTCGATCCACTTCGCTACCAATATA * * 21092472 GGAAGATAGGACCTACTATCTTTGATCTACTTCACGTCAGTACATGAAGACACGGTCTGTTTTCT 66 GGAAGACAGGACCTACTATCTTTGATCTACTTCACGCCAGTACATGAAGACACGGTCTGTTTTCT * * * * 21092537 TCGACCTACTCCACCACCAGTATGGGGAGACAAGATCT-GTTTCTTTGATCTACTTCATGCCAGT 131 TCGACCTACTCCACCACCAGTATGGGGAGACAAGATCTGGTATCTTCGATCTACTTCACGCCAAT 21092601 ACATGAAGACAAGATCTGCTTTCTTTG 196 ACATGAAGACAAGATCTGCTTTCTTTG * * * 21092628 ATCTGCTTCGCCACCAGTATGGGAAGGCAAGATATGTATATTCGATCCACTTCGCTACCAATATA 1 ATCTACTTCGCCACCAGTATGGGAAGACAAAATATGTATATTCGATCCACTTCGCTACCAATATA * 21092693 GGAAGACATGACCTACTATCTTTGATCTACTTCACGCCAGTACATGAAGACACGGTCTGTTTTCT 66 GGAAGACAGGACCTACTATCTTTGATCTACTTCACGCCAGTACATGAAGACACGGTCTGTTTTCT * * 21092758 TTGACCTACTCCACCACCAGTATGGGGGGACAAGATCTGGTATCTTCGATCTACTTCACGCCAAT 131 TCGACCTACTCCACCACCAGTATGGGGAGACAAGATCTGGTATCTTCGATCTACTTCACGCCAAT 21092823 ACATGAAGACAAGATCT 196 ACATGAAGACAAGATCT 21092840 TTTGTATTCA Statistics Matches: 197, Mismatches: 14, Indels: 1 0.93 0.07 0.00 Matches are distributed among these distances: 221 158 0.80 222 39 0.20 ACGTcount: A:0.28, C:0.24, G:0.18, T:0.29 Consensus pattern (222 bp): ATCTACTTCGCCACCAGTATGGGAAGACAAAATATGTATATTCGATCCACTTCGCTACCAATATA GGAAGACAGGACCTACTATCTTTGATCTACTTCACGCCAGTACATGAAGACACGGTCTGTTTTCT TCGACCTACTCCACCACCAGTATGGGGAGACAAGATCTGGTATCTTCGATCTACTTCACGCCAAT ACATGAAGACAAGATCTGCTTTCTTTG Found at i:21092822 original size:133 final size:133 Alignment explanation

Indices: 21092578--21092833 Score: 336 Period size: 133 Copynumber: 1.9 Consensus size: 133 21092568 AAGATCTGTT * * * * * 21092578 TCTTTGATCTACTTCATGCCAGTACATGAAGACAAGATCTGCTTTCTTTGATCTGCTTCGCCACC 1 TCTTTGATCTACTTCACGCCAGTACATGAAGACAAGATCTGCTTTCTTTGACCTACTCCACCACC * * 21092643 AGTATGGGAAGGCAAGATATGTATATTCGATCCACTTCGCTACCAATATAGGAAGACATGACCTA 66 AGTATGGGAAGGCAAGATATGTATATTCGATCCACTTCACTACCAATACAGGAAGACATGACCTA 21092708 CTA 131 CTA * * * 21092711 TCTTTGATCTACTTCACGCCAGTACATGAAGACACGGTCTGTTTTCTTTGACCTACTCCACCACC 1 TCTTTGATCTACTTCACGCCAGTACATGAAGACAAGATCTGCTTTCTTTGACCTACTCCACCACC * * * * * * 21092776 AGTATGGG-GGGACAAGATCTGGTATCTTCGATCTACTTCAC-GCCAATACATGAAGACA 66 AGTATGGGAAGG-CAAGATAT-GTATATTCGATCCACTTCACTACCAATACAGGAAGACA 21092834 AGATCTTTTG Statistics Matches: 105, Mismatches: 16, Indels: 4 0.84 0.13 0.03 Matches are distributed among these distances: 132 2 0.02 133 86 0.82 134 17 0.16 ACGTcount: A:0.27, C:0.25, G:0.18, T:0.29 Consensus pattern (133 bp): TCTTTGATCTACTTCACGCCAGTACATGAAGACAAGATCTGCTTTCTTTGACCTACTCCACCACC AGTATGGGAAGGCAAGATATGTATATTCGATCCACTTCACTACCAATACAGGAAGACATGACCTA CTA Found at i:21099012 original size:26 final size:26 Alignment explanation

Indices: 21098951--21099000 Score: 73 Period size: 26 Copynumber: 1.9 Consensus size: 26 21098941 CATGTATGTG * 21098951 AAAATATGTATGAATTATAAAATGGA 1 AAAATATATATGAATTATAAAATGGA * * 21098977 AAAATGTATATGTATTATAAAATG 1 AAAATATATATGAATTATAAAATG 21099001 TAAATTTATA Statistics Matches: 21, Mismatches: 3, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 26 21 1.00 ACGTcount: A:0.52, C:0.00, G:0.14, T:0.34 Consensus pattern (26 bp): AAAATATATATGAATTATAAAATGGA Found at i:21099153 original size:3 final size:3 Alignment explanation

Indices: 21099145--21099196 Score: 79 Period size: 3 Copynumber: 17.3 Consensus size: 3 21099135 GTATATATGT * 21099145 ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA GTA ATA TATA AT- 1 ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA -ATA ATA 21099190 ATA ATA A 1 ATA ATA A 21099197 CAAAAATAAC Statistics Matches: 45, Mismatches: 2, Indels: 4 0.88 0.04 0.08 Matches are distributed among these distances: 2 2 0.04 3 40 0.89 4 3 0.07 ACGTcount: A:0.63, C:0.00, G:0.02, T:0.35 Consensus pattern (3 bp): ATA Found at i:21102450 original size:37 final size:37 Alignment explanation

Indices: 21102409--21102554 Score: 139 Period size: 37 Copynumber: 3.9 Consensus size: 37 21102399 GATTCCCTGC * 21102409 AAAAGATGGAGAACAATAGTAAAGATTCTCTGCAAAG 1 AAAAGACGGAGAACAATAGTAAAGATTCTCTGCAAAG ** * * * 21102446 AAAAGACGGAGATTAGTAATAAAGATTCTCTGCAAAAAAA 1 AAAAGACGGAGAACAATAGTAAAGATTCTCTGC---AAAG ** * * * * 21102486 AAAAGACATAGATCAACAGTAAAGGTTCTATGCAAAG 1 AAAAGACGGAGAACAATAGTAAAGATTCTCTGCAAAG * * 21102523 AAAAGACGGAAAACAATAGTAAAGATTTTCTG 1 AAAAGACGGAGAACAATAGTAAAGATTCTCTG 21102555 TAAAAGTTTT Statistics Matches: 83, Mismatches: 23, Indels: 6 0.74 0.21 0.05 Matches are distributed among these distances: 37 55 0.66 40 28 0.34 ACGTcount: A:0.50, C:0.11, G:0.19, T:0.20 Consensus pattern (37 bp): AAAAGACGGAGAACAATAGTAAAGATTCTCTGCAAAG Found at i:21108432 original size:18 final size:18 Alignment explanation

Indices: 21108385--21108432 Score: 55 Period size: 18 Copynumber: 2.7 Consensus size: 18 21108375 TAAAATTTTA 21108385 ATTTATAATT-TTTTTATG 1 ATTTA-AATTATTTTTATG * 21108403 ATTTTAAA-TATTTTTATT 1 A-TTTAAATTATTTTTATG 21108421 ATTTAAATTATT 1 ATTTAAATTATT 21108433 GAAGATATAT Statistics Matches: 26, Mismatches: 1, Indels: 6 0.79 0.03 0.18 Matches are distributed among these distances: 17 7 0.27 18 15 0.58 19 4 0.15 ACGTcount: A:0.33, C:0.00, G:0.02, T:0.65 Consensus pattern (18 bp): ATTTAAATTATTTTTATG Found at i:21109977 original size:9 final size:9 Alignment explanation

Indices: 21109965--21109995 Score: 53 Period size: 9 Copynumber: 3.4 Consensus size: 9 21109955 ATGAAACGAA 21109965 TCGAGTTAT 1 TCGAGTTAT * 21109974 TCGAGTTAA 1 TCGAGTTAT 21109983 TCGAGTTAT 1 TCGAGTTAT 21109992 TCGA 1 TCGA 21109996 ATCAACTCGA Statistics Matches: 20, Mismatches: 2, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 9 20 1.00 ACGTcount: A:0.26, C:0.13, G:0.23, T:0.39 Consensus pattern (9 bp): TCGAGTTAT Found at i:21110593 original size:17 final size:17 Alignment explanation

Indices: 21110573--21110655 Score: 64 Period size: 17 Copynumber: 4.8 Consensus size: 17 21110563 ATTTATATCT 21110573 ATTATTTATATTATTAA 1 ATTATTTATATTATTAA * * 21110590 ATTAAATTTCACATT-TTAT 1 ATT--ATTT-ATATTATTAA * 21110609 ATTATTTATATTATTGA 1 ATTATTTATATTATTAA * 21110626 ATTATTTAGTCA-TATTGA 1 ATTATTTA-T-ATTATTAA 21110644 A-TATTTATATTA 1 ATTATTTATATTA 21110656 AAATTGAATT Statistics Matches: 54, Mismatches: 5, Indels: 15 0.73 0.07 0.20 Matches are distributed among these distances: 15 1 0.02 16 7 0.13 17 23 0.43 18 8 0.15 19 11 0.20 20 4 0.07 ACGTcount: A:0.37, C:0.04, G:0.04, T:0.55 Consensus pattern (17 bp): ATTATTTATATTATTAA Found at i:21111003 original size:5 final size:5 Alignment explanation

Indices: 21110993--21111049 Score: 71 Period size: 5 Copynumber: 11.2 Consensus size: 5 21110983 GAATAAATAT * * 21110993 ATTCG ATTCG ATTCG ATTCG ATTCG ATTCG AATTCC ATCTC- ACTCG ATTCG 1 ATTCG ATTCG ATTCG ATTCG ATTCG ATTCG -ATTCG AT-TCG ATTCG ATTCG 21111044 ATTCG A 1 ATTCG A 21111050 GAAAACTTCA Statistics Matches: 46, Mismatches: 3, Indels: 6 0.84 0.05 0.11 Matches are distributed among these distances: 4 2 0.04 5 38 0.83 6 6 0.13 ACGTcount: A:0.23, C:0.25, G:0.16, T:0.37 Consensus pattern (5 bp): ATTCG Found at i:21112586 original size:13 final size:13 Alignment explanation

Indices: 21112568--21112592 Score: 50 Period size: 13 Copynumber: 1.9 Consensus size: 13 21112558 ATATAGGGGA 21112568 AAATGTACCATTT 1 AAATGTACCATTT 21112581 AAATGTACCATT 1 AAATGTACCATT 21112593 AACGAATTTT Statistics Matches: 12, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 13 12 1.00 ACGTcount: A:0.40, C:0.16, G:0.08, T:0.36 Consensus pattern (13 bp): AAATGTACCATTT Found at i:21124562 original size:46 final size:46 Alignment explanation

Indices: 21124491--21124675 Score: 275 Period size: 46 Copynumber: 4.0 Consensus size: 46 21124481 CATGGCAAAG * * * 21124491 TGTGGAACGTAGGTATGGTAATTCAG-GAAAGAAAACCATATTGAGT 1 TGTGAAACGTAGGTATGGTATTTCAGTG-AAGAAAACCATACTGAGT * 21124537 TGTGGAACGTAGGTATGGTATTTCAGTGAAGAAAACCATACTGAGT 1 TGTGAAACGTAGGTATGGTATTTCAGTGAAGAAAACCATACTGAGT * * 21124583 TGTGAAACGTAGGTATGGTATTTTAGT-AACGAAAACCATACAGAGT 1 TGTGAAACGTAGGTATGGTATTTCAGTGAA-GAAAACCATACTGAGT * 21124629 TGTGAAAAGTAGGTATGGTATTTCAGTGAAGAAAACCATACTGAGT 1 TGTGAAACGTAGGTATGGTATTTCAGTGAAGAAAACCATACTGAGT 21124675 T 1 T 21124676 ATAGCATTGT Statistics Matches: 128, Mismatches: 8, Indels: 6 0.90 0.06 0.04 Matches are distributed among these distances: 45 2 0.02 46 123 0.96 47 3 0.02 ACGTcount: A:0.36, C:0.10, G:0.26, T:0.28 Consensus pattern (46 bp): TGTGAAACGTAGGTATGGTATTTCAGTGAAGAAAACCATACTGAGT Found at i:21125904 original size:17 final size:17 Alignment explanation

Indices: 21125850--21125896 Score: 58 Period size: 17 Copynumber: 2.8 Consensus size: 17 21125840 TCTGGATTGG * * * 21125850 TTTATTGATTTTTATCA 1 TTTATTTATTTTTCTTA * 21125867 TTTGTTTATTTTTCTTA 1 TTTATTTATTTTTCTTA 21125884 TTTATTTATTTTT 1 TTTATTTATTTTT 21125897 GCTATTTAAT Statistics Matches: 25, Mismatches: 5, Indels: 0 0.83 0.17 0.00 Matches are distributed among these distances: 17 25 1.00 ACGTcount: A:0.17, C:0.04, G:0.04, T:0.74 Consensus pattern (17 bp): TTTATTTATTTTTCTTA Found at i:21133676 original size:24 final size:24 Alignment explanation

Indices: 21133644--21133698  Score: 83
Period size: 24  Copynumber: 2.2  Consensus size: 24

21133634 CAGACGGATG

             *
21133644 GGGGGAAAACTAAGTTAGGGATTT
       1 GGGGAAAAACTAAGTTAGGGATTT

                            *
21133668 GGGGAAAAACTAAGTTAGGTATTT
       1 GGGGAAAAACTAAGTTAGGGATTT

21133692 GGAGGAA
       1 GG-GGAA

21133699 TAAGGGTTGG

Statistics
Matches: 28,  Mismatches: 2,  Indels: 1
         0.90             0.06        0.03

Matches are distributed among these distances:
 24   24  0.86
 25    4  0.14

ACGTcount: A:0.36, C:0.04, G:0.36, T:0.24

Consensus pattern (24 bp):
GGGGAAAAACTAAGTTAGGGATTT

Found at i:21138678 original size:20 final size:20

Alignment explanation

Indices: 21138653--21138712  Score: 75
Period size: 20  Copynumber: 3.0  Consensus size: 20

21138643 CTTGTACAAG

                          *
21138653 ATTTACACTTCGGTGCCTCT
       1 ATTTACACTTCGGTGCCCCT

                     *
21138673 ATTTACACTTCGATGCCCCT
       1 ATTTACACTTCGGTGCCCCT

         * *            *
21138693 GTATACACTTCGGTGTCCCT
       1 ATTTACACTTCGGTGCCCCT

21138713 GTTTGTACAT

Statistics
Matches: 34,  Mismatches: 6,  Indels: 0
         0.85             0.15        0.00

Matches are distributed among these distances:
 20   34  1.00

ACGTcount: A:0.17, C:0.32, G:0.15, T:0.37

Consensus pattern (20 bp):
ATTTACACTTCGGTGCCCCT

Found at i:21142804 original size:33 final size:33

Alignment explanation

Indices: 21142734--21142809  Score: 86
Period size: 33  Copynumber: 2.4  Consensus size: 33

21142724 CCCATACAAG

                                      *
21142734 CGTGT-GGCCCACACGCCCACATTGACCTTGCC
       1 CGTGTGGGCCCACACGCCCACATTGACCTAGCC

                   *               *
21142766 CGTGTGGGCCTACACGCCC-CATTTGGCCTAGCC
       1 CGTGTGGGCCCACACGCCCACA-TTGACCTAGCC

         *
21142799 TGTGT-GGCCCA
       1 CGTGTGGGCCCA

21142810 TACAGCCACA

Statistics
Matches: 37,  Mismatches: 5,  Indels: 4
         0.80             0.11        0.09

Matches are distributed among these distances:
 32   12  0.32
 33   25  0.68

ACGTcount: A:0.13, C:0.39, G:0.26, T:0.21

Consensus pattern (33 bp):
CGTGTGGGCCCACACGCCCACATTGACCTAGCC

Found at i:21147727 original size:15 final size:15

Alignment explanation

Indices: 21147709--21147743  Score: 52
Period size: 15  Copynumber: 2.3  Consensus size: 15

21147699 GTATTATATA

                     *
21147709 ATAATGACAATAGTT
       1 ATAATGACAATAATT

                *
21147724 ATAATGATAATAATT
       1 ATAATGACAATAATT

21147739 ATAAT
       1 ATAAT

21147744 AATAGTACAG

Statistics
Matches: 18,  Mismatches: 2,  Indels: 0
         0.90             0.10        0.00

Matches are distributed among these distances:
 15   18  1.00

ACGTcount: A:0.51, C:0.03, G:0.09, T:0.37

Consensus pattern (15 bp):
ATAATGACAATAATT

Found at i:21154690 original size:13 final size:13

Alignment explanation

Indices: 21154672--21154696  Score: 50
Period size: 13  Copynumber: 1.9  Consensus size: 13

21154662 CTAAATCATA

21154672 ATTTTAGTAAAAT
       1 ATTTTAGTAAAAT

21154685 ATTTTAGTAAAA
       1 ATTTTAGTAAAA

21154697 CCCCTTTTTG

Statistics
Matches: 12,  Mismatches: 0,  Indels: 0
         1.00             0.00        0.00

Matches are distributed among these distances:
 13   12  1.00

ACGTcount: A:0.48, C:0.00, G:0.08, T:0.44

Consensus pattern (13 bp):
ATTTTAGTAAAAT

Done.