Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Chr12 ID=Chr12-JGI_221_v2.0

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 35429946
ACGTcount: A:0.33, C:0.16, G:0.16, T:0.33

Warning! 509281 characters in sequence are not A, C, G, or T


File 70 of 120

Found at i:21030909 original size:10 final size:11

Alignment explanation

Indices: 21030891--21030923 Score: 50 Period size: 10 Copynumber: 3.1 Consensus size: 11 21030881 TCTCCTTCCT 21030891 TTCTTTCTTTC 1 TTCTTTCTTTC 21030902 TT-TTTCTTTC 1 TTCTTTCTTTC * 21030912 TTCTTTTTTTC 1 TTCTTTCTTTC 21030923 T 1 T 21030924 ATTAATTTGC Statistics Matches: 20, Mismatches: 1, Indels: 2 0.87 0.04 0.09 Matches are distributed among these distances: 10 10 0.50 11 10 0.50 ACGTcount: A:0.00, C:0.21, G:0.00, T:0.79 Consensus pattern (11 bp): TTCTTTCTTTC Found at i:21030920 original size:14 final size:14 Alignment explanation

Indices: 21030890--21030919 Score: 53 Period size: 13 Copynumber: 2.2 Consensus size: 14 21030880 CTCTCCTTCC 21030890 TTTCTTTCTTTCTT 1 TTTCTTTCTTTCTT 21030904 TTTCTTTC-TTCTT 1 TTTCTTTCTTTCTT 21030917 TTT 1 TTT 21030920 TTCTATTAAT Statistics Matches: 16, Mismatches: 0, Indels: 1 0.94 0.00 0.06 Matches are distributed among these distances: 13 8 0.50 14 8 0.50 ACGTcount: A:0.00, C:0.20, G:0.00, T:0.80 Consensus pattern (14 bp): TTTCTTTCTTTCTT Found at i:21033102 original size:26 final size:27 Alignment explanation

Indices: 21033057--21033108 Score: 79 Period size: 26 Copynumber: 2.0 Consensus size: 27 21033047 CACATGTATA * 21033057 TATATTTTTTTTAAACGAATATTTTCC 1 TATATTTTTTTTAAACGAATACTTTCC * 21033084 TATA-TTTTTTTAAATGAATACTTTC 1 TATATTTTTTTTAAACGAATACTTTC 21033109 ATATTTATAT Statistics Matches: 23, Mismatches: 2, Indels: 1 0.88 0.08 0.04 Matches are distributed among these distances: 26 19 0.83 27 4 0.17 ACGTcount: A:0.31, C:0.10, G:0.04, T:0.56 Consensus pattern (27 bp): TATATTTTTTTTAAACGAATACTTTCC Found at i:21033973 original size:98 final size:98 Alignment explanation

Indices: 21033804--21033997 Score: 334 Period size: 98 Copynumber: 2.0 Consensus size: 98 21033794 NNNNNNNNNN * 21033804 AAAGGAAATATTCCGAGTTTGAGATTCTAAAGGAATTGTGCCCTAACGTATTGGGTGTGATTTCT 1 AAAGGAAATATTCCGAGTTTGAGATTCTAAAGGAATCGTGCCCTAACGTATTGGGTGTGATTTCT * * 21033869 TAAATCTTGAATGAGTGGATGTTCTTTTAAAAT 66 TAAATCTTGAACGAGTGGATGTTCCTTTAAAAT * * 21033902 AAAGGAAATATTCCGAGTTTGGGATTCTAAAGGGATCGTGCCCTAACGTATTGGGTGTGATTTCT 1 AAAGGAAATATTCCGAGTTTGAGATTCTAAAGGAATCGTGCCCTAACGTATTGGGTGTGATTTCT * 21033967 TAAATCTTGGACGAGTGGATGTTCCTTTAAA 66 TAAATCTTGAACGAGTGGATGTTCCTTTAAA 21033998 GTTTTATTGT Statistics Matches: 90, Mismatches: 6, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 98 90 1.00 ACGTcount: A:0.29, C:0.12, G:0.24, T:0.35 Consensus pattern (98 bp): AAAGGAAATATTCCGAGTTTGAGATTCTAAAGGAATCGTGCCCTAACGTATTGGGTGTGATTTCT TAAATCTTGAACGAGTGGATGTTCCTTTAAAAT Found at i:21039767 original size:132 final size:132 Alignment explanation

Indices: 21039590--21040102 Score: 796 Period size: 132 Copynumber: 3.9 Consensus size: 132 21039580 ACAGTGAAGT * * * * 21039590 AGATCGAAAATGGCGGATTTTACCTCCTTGTGGTTACAGTGGAGTACATTGAAGCTAGTAATTCT 1 AGATCGAAGATGGCGGATTTTACCTCCCTGTGGTTACAGTGGAGTACATTGAAACCAGTAATTCT 21039655 ACTTCCCTGGGCAACAGTGGAATAGATTGAAGATTTCAGATCTTATCTCCCTAAGCAGTAGTGGA 66 ACTTCCCTGGGCAACAGTGGAATAGATTGAAGATTTCAGATCTTATCTCCCTAAGCAGTAGTGGA 21039720 GC 131 GC ** * * * 21039722 AGATCGAAGATGGCAAATTTGACCTCCCTATGGTTACAGTAGAGTACATTGAAACCAGTAATTCT 1 AGATCGAAGATGGCGGATTTTACCTCCCTGTGGTTACAGTGGAGTACATTGAAACCAGTAATTCT * 21039787 ACTTCCCTGGACAACAGTGGAATAGATTGAAGATTTCAGATCTTATCTCCCTAAGCAGTAGTGGA 66 ACTTCCCTGGGCAACAGTGGAATAGATTGAAGATTTCAGATCTTATCTCCCTAAGCAGTAGTGGA 21039852 GC 131 GC * 21039854 AGATCGAAGATGGCGGATTTTACCTCCCTGTGGTTACAGTGGAGTACATT-AAAGCCAGTAACTC 1 AGATCGAAGATGGCGGATTTTACCTCCCTGTGGTTACAGTGGAGTACATTGAAA-CCAGTAATTC * 21039918 TACTTCCCTGGGCAACAATGGAATAGATTGAAGATTTCAGATCTTATCTCCCTAAGCAGTAGTGG 65 TACTTCCCTGGGCAACAGTGGAATAGATTGAAGATTTCAGATCTTATCTCCCTAAGCAGTAGTGG 21039983 AGC 130 AGC * * * 21039986 AGATCGAAGATGGCGGATTTTACCTCCTTGTGGTTATAGTGGAGTACATTGAAACCAATAATTCT 1 AGATCGAAGATGGCGGATTTTACCTCCCTGTGGTTACAGTGGAGTACATTGAAACCAGTAATTCT * * ** * * * 21040051 ACTTCCCTAGTCAGTAGTGGAATAGGTTGAAGATTGTAAG-CCTTATCTCCCT 66 ACTTCCCTGGGCAACAGTGGAATAGATTGAAGATT-TCAGATCTTATCTCCCT 21040103 GAAATTGCAG Statistics Matches: 348, Mismatches: 30, Indels: 6 0.91 0.08 0.02 Matches are distributed among these distances: 131 3 0.01 132 339 0.97 133 6 0.02 ACGTcount: A:0.29, C:0.19, G:0.23, T:0.29 Consensus pattern (132 bp): AGATCGAAGATGGCGGATTTTACCTCCCTGTGGTTACAGTGGAGTACATTGAAACCAGTAATTCT ACTTCCCTGGGCAACAGTGGAATAGATTGAAGATTTCAGATCTTATCTCCCTAAGCAGTAGTGGA GC Found at i:21042456 original size:98 final size:98 Alignment explanation

Indices: 21042337--21042531 Score: 336 Period size: 98 Copynumber: 2.0 Consensus size: 98 21042327 AGATTTTTTT * 21042337 TAAAGGAAATATTCCGAGTTTGAGATTCTAAAGGAATTGTGCCCTAACATATTGGGTGTGATTTC 1 TAAAGGAAATATTCCGAGTTTGAGATTCTAAAGGAATCGTGCCCTAACATATTGGGTGTGATTTC * * 21042402 TTAAATCTTGGATGAGTGGATGTTCTTTTAAAA 66 TTAAATCTTGGACGAGTGGATGTTCCTTTAAAA * * * 21042435 TAAAGGAAATATTCCGAGTTTGGGATTCTAAAGGGATCGTGCCCTAACGTATTGGGTGTGATTTC 1 TAAAGGAAATATTCCGAGTTTGAGATTCTAAAGGAATCGTGCCCTAACATATTGGGTGTGATTTC 21042500 TTAAATCTTGGACGAGTGGATGTTCCTTTAAA 66 TTAAATCTTGGACGAGTGGATGTTCCTTTAAA 21042532 GTTTTATTGT Statistics Matches: 91, Mismatches: 6, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 98 91 1.00 ACGTcount: A:0.29, C:0.12, G:0.24, T:0.35 Consensus pattern (98 bp): TAAAGGAAATATTCCGAGTTTGAGATTCTAAAGGAATCGTGCCCTAACATATTGGGTGTGATTTC TTAAATCTTGGACGAGTGGATGTTCCTTTAAAA Found at i:21043134 original size:26 final size:26 Alignment explanation

Indices: 21043100--21043152 Score: 72 Period size: 28 Copynumber: 2.0 Consensus size: 26 21043090 TGAAAAGTAG 21043100 TTTTAAAATT-TTATCTCTTAGAAATA 1 TTTTAAAATTATTATCTCTTA-AAATA * 21043126 TTTTGAAATTAATTATCTCTTAAAATA 1 TTTTAAAATT-ATTATCTCTTAAAATA 21043153 AACATTATTT Statistics Matches: 24, Mismatches: 1, Indels: 3 0.86 0.04 0.11 Matches are distributed among these distances: 26 9 0.38 27 5 0.21 28 10 0.42 ACGTcount: A:0.40, C:0.08, G:0.04, T:0.49 Consensus pattern (26 bp): TTTTAAAATTATTATCTCTTAAAATA Found at i:21043505 original size:3 final size:3 Alignment explanation

Indices: 21043497--21043544 Score: 66 Period size: 3 Copynumber: 17.0 Consensus size: 3 21043487 TAGTTTACCC * 21043497 TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA T-C TAA -AA T-A TAA TAA 1 TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA 21043542 TAA 1 TAA 21043545 ATTCTAATAC Statistics Matches: 40, Mismatches: 2, Indels: 6 0.83 0.04 0.12 Matches are distributed among these distances: 2 5 0.12 3 35 0.88 ACGTcount: A:0.65, C:0.02, G:0.00, T:0.33 Consensus pattern (3 bp): TAA Found at i:21055977 original size:19 final size:19 Alignment explanation

Indices: 21055949--21055987 Score: 60 Period size: 19 Copynumber: 2.1 Consensus size: 19 21055939 AATAAACACC * * 21055949 AAAAATTTATTTTTAAAAT 1 AAAAAATTATTTTAAAAAT 21055968 AAAAAATTATTTTAAAAAT 1 AAAAAATTATTTTAAAAAT 21055987 A 1 A 21055988 TTTTAAAATT Statistics Matches: 18, Mismatches: 2, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 19 18 1.00 ACGTcount: A:0.59, C:0.00, G:0.00, T:0.41 Consensus pattern (19 bp): AAAAAATTATTTTAAAAAT Found at i:21059948 original size:23 final size:23 Alignment explanation

Indices: 21059922--21060001 Score: 74 Period size: 23 Copynumber: 3.5 Consensus size: 23 21059912 AGGCATCATT 21059922 GAATACGCCACACAGGCCTTACA 1 GAATACGCCACACAGGCCTTACA * * * * 21059945 GAATATGCCACTA-AGACTTTGCA 1 GAATACGCCAC-ACAGGCCTTACA * * 21059968 G-ATCATGCCACACAGGCCTTATA 1 GAAT-ACGCCACACAGGCCTTACA 21059991 GAATACGCCAC 1 GAATACGCCAC 21060002 CAAGACTTTG Statistics Matches: 44, Mismatches: 9, Indels: 8 0.72 0.15 0.13 Matches are distributed among these distances: 22 3 0.07 23 38 0.86 24 3 0.07 ACGTcount: A:0.34, C:0.30, G:0.17, T:0.19 Consensus pattern (23 bp): GAATACGCCACACAGGCCTTACA Found at i:21059980 original size:46 final size:46 Alignment explanation

Indices: 21059926--21060036 Score: 168 Period size: 46 Copynumber: 2.4 Consensus size: 46 21059916 ATCATTGAAT * * 21059926 ACGCCACACAGGCCTTACAGAATATGCCACTAAGACTTTGCAGATC 1 ACGCCACACAGGCCTTACAGAATACGCCACCAAGACTTTGCAGATC * * 21059972 ATGCCACACAGGCCTTATAGAATACGCCACCAAGACTTTGCAGATC 1 ACGCCACACAGGCCTTACAGAATACGCCACCAAGACTTTGCAGATC * * 21060018 ACGCCACATAGGCATTACA 1 ACGCCACACAGGCCTTACA 21060037 AAGAGTTACG Statistics Matches: 57, Mismatches: 8, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 46 57 1.00 ACGTcount: A:0.33, C:0.31, G:0.17, T:0.19 Consensus pattern (46 bp): ACGCCACACAGGCCTTACAGAATACGCCACCAAGACTTTGCAGATC Found at i:21060021 original size:23 final size:23 Alignment explanation

Indices: 21059951--21060024 Score: 64 Period size: 23 Copynumber: 3.2 Consensus size: 23 21059941 TACAGAATAT * * 21059951 GCCACTAAGACTTTGCAGATCAT 1 GCCACCAAGACTTTGCAGATCAC * * 21059974 GCCACACAGGCCTTAT--AGAAT-AC 1 GCCAC-CAAGACTT-TGCAG-ATCAC 21059997 GCCACCAAGACTTTGCAGATCAC 1 GCCACCAAGACTTTGCAGATCAC 21060020 GCCAC 1 GCCAC 21060025 ATAGGCATTA Statistics Matches: 39, Mismatches: 6, Indels: 12 0.68 0.11 0.21 Matches are distributed among these distances: 21 1 0.03 22 8 0.21 23 22 0.56 24 7 0.18 25 1 0.03 ACGTcount: A:0.31, C:0.32, G:0.18, T:0.19 Consensus pattern (23 bp): GCCACCAAGACTTTGCAGATCAC Found at i:21064793 original size:16 final size:17 Alignment explanation

Indices: 21064774--21064805 Score: 57 Period size: 16 Copynumber: 1.9 Consensus size: 17 21064764 TTTTGGGTAC 21064774 AATTTTTTT-TAATTTT 1 AATTTTTTTATAATTTT 21064790 AATTTTTTTATAATTT 1 AATTTTTTTATAATTT 21064806 ATATACTTTT Statistics Matches: 15, Mismatches: 0, Indels: 1 0.94 0.00 0.06 Matches are distributed among these distances: 16 9 0.60 17 6 0.40 ACGTcount: A:0.28, C:0.00, G:0.00, T:0.72 Consensus pattern (17 bp): AATTTTTTTATAATTTT Found at i:21064853 original size:9 final size:9 Alignment explanation

Indices: 21064794--21064860 Score: 55 Period size: 9 Copynumber: 7.2 Consensus size: 9 21064784 AATTTTAATT 21064794 TTTTTATAA 1 TTTTTATAA * * 21064803 TTTATATAC 1 TTTTTATAA 21064812 TTTTTA-AA 1 TTTTTATAA * 21064820 GTTTTTAAGAA 1 -TTTTT-ATAA * * 21064831 TTTTAAATAT 1 TTTT-TATAA 21064841 TTTTTATAA 1 TTTTTATAA 21064850 TTTTTATAA 1 TTTTTATAA 21064859 TT 1 TT 21064861 AGGACTAAAA Statistics Matches: 45, Mismatches: 9, Indels: 8 0.73 0.15 0.13 Matches are distributed among these distances: 8 1 0.02 9 31 0.69 10 11 0.24 11 2 0.04 ACGTcount: A:0.34, C:0.01, G:0.03, T:0.61 Consensus pattern (9 bp): TTTTTATAA Found at i:21072409 original size:43 final size:42 Alignment explanation

Indices: 21072337--21072564 Score: 282 Period size: 42 Copynumber: 5.4 Consensus size: 42 21072327 ACGTCTCTAA * 21072337 CTTTTGCGGCGCTTACAGGAAAAAACGCCGCTAAAGATCATGTT 1 CTTTAGCGGCGCTT--AGGAAAAAACGCCGCTAAAGATCATGTT * * 21072381 CTTTAGCGGCGCTAAGGAAATAAACGCCGCTAAAGATCCTGTT 1 CTTTAGCGGCGCTTAGGAAA-AAACGCCGCTAAAGATCATGTT * * * 21072424 CTTTAGCGGCGCTTATGAAAAAACGCCGCTAAAAATCCTGTT 1 CTTTAGCGGCGCTTAGGAAAAAACGCCGCTAAAGATCATGTT * * 21072466 CTTTAGCGGCGCTTAGAAAAAAACGCCGCT-AAGAGTAATGTT 1 CTTTAGCGGCGCTTAGGAAAAAACGCCGCTAAAGA-TCATGTT * ** 21072508 CTATAGCGGCGCTTAGTCAAAAACGCCGCTAAA-AGT-AGTGTT 1 CTTTAGCGGCGCTTAGGAAAAAACGCCGCTAAAGA-TCA-TGTT 21072550 CTTTAGCGGCGCTTA 1 CTTTAGCGGCGCTTA 21072565 TTTCACAAAC Statistics Matches: 165, Mismatches: 15, Indels: 10 0.87 0.08 0.05 Matches are distributed among these distances: 41 4 0.02 42 108 0.65 43 41 0.25 44 12 0.07 ACGTcount: A:0.30, C:0.22, G:0.23, T:0.25 Consensus pattern (42 bp): CTTTAGCGGCGCTTAGGAAAAAACGCCGCTAAAGATCATGTT Found at i:21073006 original size:21 final size:21 Alignment explanation

Indices: 21072980--21073048 Score: 75 Period size: 21 Copynumber: 3.3 Consensus size: 21 21072970 GCTTCTTTCG * * 21072980 CTGCTTTTCTCGCTGCGGCCT 1 CTGCTTCTCTCGCTGCAGCCT ** 21073001 CTGCTTCTCTCGCTATAGCCT 1 CTGCTTCTCTCGCTGCAGCCT * ** 21073022 CTGCTTCCCTCGCTGCCTCCT 1 CTGCTTCTCTCGCTGCAGCCT 21073043 CTGCTT 1 CTGCTT 21073049 TAAGTTGTTG Statistics Matches: 39, Mismatches: 9, Indels: 0 0.81 0.19 0.00 Matches are distributed among these distances: 21 39 1.00 ACGTcount: A:0.03, C:0.42, G:0.17, T:0.38 Consensus pattern (21 bp): CTGCTTCTCTCGCTGCAGCCT Found at i:21074096 original size:40 final size:41 Alignment explanation

Indices: 21073979--21074102 Score: 223 Period size: 41 Copynumber: 3.0 Consensus size: 41 21073969 TCTTCAAGGT * * 21073979 CCTGAACATTAGCGGCGCTTATTCAAAAACGCCGCTAAAGA 1 CCTGAGCATTAGCGGCGCTTATTCAGAAACGCCGCTAAAGA 21074020 CCTGAGCATTAGCGGCGCTTATTCAGAAACGCCGCTAAAGA 1 CCTGAGCATTAGCGGCGCTTATTCAGAAACGCCGCTAAAGA 21074061 CCTGAGCATTAGCGGCGCTT-TTCAGAAACGCCGCTAAAGA 1 CCTGAGCATTAGCGGCGCTTATTCAGAAACGCCGCTAAAGA 21074101 CC 1 CC 21074103 CCAAAAACTC Statistics Matches: 81, Mismatches: 2, Indels: 1 0.96 0.02 0.01 Matches are distributed among these distances: 40 22 0.27 41 59 0.73 ACGTcount: A:0.30, C:0.28, G:0.23, T:0.19 Consensus pattern (41 bp): CCTGAGCATTAGCGGCGCTTATTCAGAAACGCCGCTAAAGA Found at i:21074915 original size:27 final size:27 Alignment explanation

Indices: 21074857--21074913 Score: 73 Period size: 27 Copynumber: 2.1 Consensus size: 27 21074847 TTCTAAAAGG 21074857 TAAATATAAAAATATTTTAAAACTTTT 1 TAAATATAAAAATATTTTAAAACTTTT * * 21074884 TAAATAT-AATATATTTTTATAA-TTTT 1 TAAATATAAAAATA-TTTTAAAACTTTT 21074910 TAAA 1 TAAA 21074914 ATTAGAGCTA Statistics Matches: 27, Mismatches: 2, Indels: 3 0.84 0.06 0.09 Matches are distributed among these distances: 26 13 0.48 27 14 0.52 ACGTcount: A:0.49, C:0.02, G:0.00, T:0.49 Consensus pattern (27 bp): TAAATATAAAAATATTTTAAAACTTTT Found at i:21075248 original size:12 final size:12 Alignment explanation

Indices: 21075221--21075260 Score: 66 Period size: 12 Copynumber: 3.5 Consensus size: 12 21075211 CCTTGCTTGC 21075221 AAATT-ATATC- 1 AAATTAATATCA 21075231 AAATTAATATCA 1 AAATTAATATCA 21075243 AAATTAATATCA 1 AAATTAATATCA 21075255 AAATTA 1 AAATTA 21075261 TTATTATTCT Statistics Matches: 28, Mismatches: 0, Indels: 2 0.93 0.00 0.07 Matches are distributed among these distances: 10 5 0.18 11 5 0.18 12 18 0.64 ACGTcount: A:0.57, C:0.07, G:0.00, T:0.35 Consensus pattern (12 bp): AAATTAATATCA Found at i:21076367 original size:5 final size:5 Alignment explanation

Indices: 21076359--21076394 Score: 51 Period size: 5 Copynumber: 7.8 Consensus size: 5 21076349 TGTTTTTGGG 21076359 GGGGT GGGG- GGGG- GGGGT GGGG- GGGGT GGGGT GGGG 1 GGGGT GGGGT GGGGT GGGGT GGGGT GGGGT GGGGT GGGG 21076395 GAGGGAGGAC Statistics Matches: 29, Mismatches: 0, Indels: 4 0.88 0.00 0.12 Matches are distributed among these distances: 4 12 0.41 5 17 0.59 ACGTcount: A:0.00, C:0.00, G:0.89, T:0.11 Consensus pattern (5 bp): GGGGT Found at i:21076374 original size:13 final size:13 Alignment explanation

Indices: 21076356--21076395 Score: 71 Period size: 13 Copynumber: 3.0 Consensus size: 13 21076346 TTTTGTTTTT 21076356 GGGGGGGTGGGGG 1 GGGGGGGTGGGGG 21076369 GGGGGGGTGGGGG 1 GGGGGGGTGGGGG 21076382 GGGTGGGGTGGGGG 1 GGG-GGGGTGGGGG 21076396 AGGGAGGACC Statistics Matches: 26, Mismatches: 0, Indels: 1 0.96 0.00 0.04 Matches are distributed among these distances: 13 16 0.62 14 10 0.38 ACGTcount: A:0.00, C:0.00, G:0.90, T:0.10 Consensus pattern (13 bp): GGGGGGGTGGGGG Found at i:21076375 original size:14 final size:14 Alignment explanation

Indices: 21076356--21076395 Score: 64 Period size: 14 Copynumber: 2.9 Consensus size: 14 21076346 TTTTGTTTTT 21076356 GGGGGGGT-GGGGG 1 GGGGGGGTGGGGGG 21076369 GGGGGGGTGGGGGG 1 GGGGGGGTGGGGGG * 21076383 GGTGGGGTGGGGG 1 GGGGGGGTGGGGG 21076396 AGGGAGGACC Statistics Matches: 25, Mismatches: 1, Indels: 1 0.93 0.04 0.04 Matches are distributed among these distances: 13 8 0.32 14 17 0.68 ACGTcount: A:0.00, C:0.00, G:0.90, T:0.10 Consensus pattern (14 bp): GGGGGGGTGGGGGG Found at i:21081200 original size:13 final size:13 Alignment explanation

Indices: 21081167--21081200 Score: 50 Period size: 13 Copynumber: 2.6 Consensus size: 13 21081157 TTTAAAAAAT * 21081167 AAAAAAATTTGTG 1 AAAAAAAATTGTG * 21081180 AGAAAAAATTGTG 1 AAAAAAAATTGTG 21081193 AAAAAAAA 1 AAAAAAAA 21081201 ACAAAAGATG Statistics Matches: 18, Mismatches: 3, Indels: 0 0.86 0.14 0.00 Matches are distributed among these distances: 13 18 1.00 ACGTcount: A:0.65, C:0.00, G:0.15, T:0.21 Consensus pattern (13 bp): AAAAAAAATTGTG Found at i:21082101 original size:12 final size:12 Alignment explanation

Indices: 21082086--21082129 Score: 54 Period size: 12 Copynumber: 3.7 Consensus size: 12 21082076 AATTGAGATT 21082086 GAGAAAGAAAAA 1 GAGAAAGAAAAA * 21082098 GAGAAAGAAAAT 1 GAGAAAGAAAAA * 21082110 GAGAGAACAAAAA 1 GAGA-AAGAAAAA 21082123 GA-AAAGA 1 GAGAAAGA 21082130 GGTTCGAGTA Statistics Matches: 27, Mismatches: 4, Indels: 3 0.79 0.12 0.09 Matches are distributed among these distances: 11 3 0.11 12 16 0.59 13 8 0.30 ACGTcount: A:0.70, C:0.02, G:0.25, T:0.02 Consensus pattern (12 bp): GAGAAAGAAAAA Found at i:21087191 original size:5 final size:5 Alignment explanation

Indices: 21087183--21087211 Score: 58 Period size: 5 Copynumber: 5.8 Consensus size: 5 21087173 CATTCATTAC 21087183 ATTTT ATTTT ATTTT ATTTT ATTTT ATTT 1 ATTTT ATTTT ATTTT ATTTT ATTTT ATTT 21087212 GCATATTAGT Statistics Matches: 24, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 5 24 1.00 ACGTcount: A:0.21, C:0.00, G:0.00, T:0.79 Consensus pattern (5 bp): ATTTT Found at i:21089464 original size:54 final size:52 Alignment explanation

Indices: 21089403--21089587 Score: 246 Period size: 54 Copynumber: 3.5 Consensus size: 52 21089393 CCATGACAAG * * 21089403 TGAAGACCATATAGTGTAAGGCATAGACGTTTGATATGGCATCATATGGGGAA 1 TGAAGACCATATAGTGTAAGGCCTAG-CGTTTGCTATGGCATCATATGGGGAA * * 21089456 CTGAAGACCATATAGTGTAAGGCCCAGCTATTTGCTATGGCATCATATGGGGAAA 1 -TGAAGACCATATAGTGTAAGGCCTAGC-GTTTGCTATGGCATCATATGGGG-AA * 21089511 TGAAGACCATATAGTGTAAGGCCTACCCGTTTGCTATGGCATCATAT-GGGAA 1 TGAAGACCATATAGTGTAAGGCCTA-GCGTTTGCTATGGCATCATATGGGGAA * * * 21089563 AGAAGACCACATAGTGTAAGACCTA 1 TGAAGACCATATAGTGTAAGGCCTA 21089588 TTTTGGGACT Statistics Matches: 118, Mismatches: 10, Indels: 8 0.87 0.07 0.06 Matches are distributed among these distances: 52 24 0.20 53 4 0.03 54 87 0.74 55 3 0.03 ACGTcount: A:0.33, C:0.17, G:0.25, T:0.25 Consensus pattern (52 bp): TGAAGACCATATAGTGTAAGGCCTAGCGTTTGCTATGGCATCATATGGGGAA Found at i:21089600 original size:52 final size:51 Alignment explanation

Indices: 21089404--21089611 Score: 231 Period size: 54 Copynumber: 3.9 Consensus size: 51 21089394 CATGACAAGT * * * * 21089404 GAAGACCATATAGTGTAAGGCATAGACGTTTGATATGGCATCATATGGGGAACT 1 GAAGACCATATAGTGTAAGGCCT--ACTTTTGCTATGGCATCATATGGGGAA-A * 21089458 GAAGACCATATAGTGTAAGGCCCAGCTATTTGCTATGGCATCATATGGGGAAA 1 GAAGACCATATAGTGTAAGGCCTA-CT-TTTGCTATGGCATCATATGGGGAAA * 21089511 TGAAGACCATATAGTGTAAGGCCTACCCGTTTGCTATGGCATCATAT-GGGAAA 1 -GAAGACCATATAGTGTAAGGCCTA--CTTTTGCTATGGCATCATATGGGGAAA * * 21089564 GAAGACCACATAGTGTAAGACCTA-TTTTGGGACTATGGCATCATATGG 1 GAAGACCATATAGTGTAAGGCCTACTTTT--G-CTATGGCATCATATGG 21089612 AGAGAAGATG Statistics Matches: 135, Mismatches: 11, Indels: 17 0.83 0.07 0.10 Matches are distributed among these distances: 49 3 0.02 51 1 0.01 52 37 0.27 53 8 0.06 54 85 0.63 55 1 0.01 ACGTcount: A:0.32, C:0.16, G:0.26, T:0.26 Consensus pattern (51 bp): GAAGACCATATAGTGTAAGGCCTACTTTTGCTATGGCATCATATGGGGAAA Found at i:21089818 original size:31 final size:29 Alignment explanation

Indices: 21089724--21089822 Score: 153 Period size: 29 Copynumber: 3.3 Consensus size: 29 21089714 AAAAGTGATA 21089724 CCTTTGTGGCTGAATCTGTTATATGTGAG 1 CCTTTGTGGCTGAATCTGTTATATGTGAG * * * 21089753 CCTTTGTGGCTAAATCTATTTTATGTGAG 1 CCTTTGTGGCTGAATCTGTTATATGTGAG 21089782 CCTTTGTGGCTGAATCTGTTATATGTGGAAG 1 CCTTTGTGGCTGAATCTGTTATATGT-G-AG 21089813 CCTTTGTGGC 1 CCTTTGTGGC 21089823 CGTTCTTTGT Statistics Matches: 62, Mismatches: 6, Indels: 2 0.89 0.09 0.03 Matches are distributed among these distances: 29 49 0.79 30 1 0.02 31 12 0.19 ACGTcount: A:0.17, C:0.15, G:0.26, T:0.41 Consensus pattern (29 bp): CCTTTGTGGCTGAATCTGTTATATGTGAG Found at i:21089847 original size:30 final size:29 Alignment explanation

Indices: 21089724--21089847 Score: 115 Period size: 29 Copynumber: 4.2 Consensus size: 29 21089714 AAAAGTGATA * * 21089724 CCTTTGTGGCTGAATCTGTTATATGTGAG 1 CCTTTGTGGCCGAATCTGTTACATGTGAG ** * ** 21089753 CCTTTGTGGCTAAATCTATTTTATGTGAG 1 CCTTTGTGGCCGAATCTGTTACATGTGAG * * 21089782 CCTTTGTGGCTGAATCTGTTATATGTGGAAG 1 CCTTTGTGGCCGAATCTGTTACATGT-G-AG * 21089813 CCTTTGTGGCCG-TTCTTTGTTACATGTGAG 1 CCTTTGTGGCCGAATC--TGTTACATGTGAG 21089843 CCTTT 1 CCTTT 21089848 ATGGTGATAT Statistics Matches: 82, Mismatches: 9, Indels: 7 0.84 0.09 0.07 Matches are distributed among these distances: 29 49 0.60 30 10 0.12 31 14 0.17 32 9 0.11 ACGTcount: A:0.16, C:0.16, G:0.25, T:0.43 Consensus pattern (29 bp): CCTTTGTGGCCGAATCTGTTACATGTGAG Found at i:21097397 original size:25 final size:25 Alignment explanation

Indices: 21097342--21097424 Score: 87 Period size: 25 Copynumber: 3.3 Consensus size: 25 21097332 GAGTTATAAA * * 21097342 CGGTAAGCTCATACGAGCTAAATAA 1 CGGTAAGCTCATATGAGCTAAATAT * * 21097367 CAGTAAGCTCATGTGAGCTAAATAT 1 CGGTAAGCTCATATGAGCTAAATAT * * * 21097392 TGGTAAGCTC-TCTCGAGCTGAATAT 1 CGGTAAGCTCATAT-GAGCTAAATAT 21097417 CGGTAAGC 1 CGGTAAGC 21097425 CCTCTCAAGC Statistics Matches: 48, Mismatches: 9, Indels: 2 0.81 0.15 0.03 Matches are distributed among these distances: 24 2 0.04 25 46 0.96 ACGTcount: A:0.33, C:0.19, G:0.23, T:0.25 Consensus pattern (25 bp): CGGTAAGCTCATATGAGCTAAATAT Found at i:21097429 original size:25 final size:25 Alignment explanation

Indices: 21097381--21097451 Score: 83 Period size: 25 Copynumber: 2.9 Consensus size: 25 21097371 AAGCTCATGT * * * 21097381 GAGCTAAATATTGGTAAGCTCTCTC 1 GAGCTGAATATCGGTAAGCCCTCTC 21097406 GAGCTGAATATCGGTAAGCCCTCTC 1 GAGCTGAATATCGGTAAGCCCTCTC * 21097431 AAGCTGAGA-ATCGGTAA-CCCT 1 GAGCTGA-ATATCGGTAAGCCCT 21097452 AATGACATGT Statistics Matches: 41, Mismatches: 4, Indels: 3 0.85 0.08 0.06 Matches are distributed among these distances: 24 4 0.10 25 36 0.88 26 1 0.02 ACGTcount: A:0.28, C:0.24, G:0.23, T:0.25 Consensus pattern (25 bp): GAGCTGAATATCGGTAAGCCCTCTC Found at i:21105483 original size:5 final size:5 Alignment explanation

Indices: 21105459--21105493 Score: 52 Period size: 5 Copynumber: 6.6 Consensus size: 5 21105449 ACTTAAACTA 21105459 AAAATG AAAAG AAGAAG AAAAG AAAAG AAAAG AAA 1 AAAA-G AAAAG AA-AAG AAAAG AAAAG AAAAG AAA 21105494 CCCTAACCAT Statistics Matches: 28, Mismatches: 0, Indels: 3 0.90 0.00 0.10 Matches are distributed among these distances: 5 19 0.68 6 9 0.32 ACGTcount: A:0.77, C:0.00, G:0.20, T:0.03 Consensus pattern (5 bp): AAAAG Found at i:21107336 original size:29 final size:29 Alignment explanation

Indices: 21107304--21107404 Score: 105 Period size: 29 Copynumber: 3.5 Consensus size: 29 21107294 AATTTTCGTA * 21107304 AAAAGGACGCCTTTGTGGCTATCTCTGTT 1 AAAAGGAAGCCTTTGTGGCTATCTCTGTT * * * * 21107333 AAAAGGAAACCTTTGTGG-TGGTTTCTATT 1 AAAAGGAAGCCTTTGTGGCT-ATCTCTGTT * * * 21107362 AAAAGTAAGCCTTTGTGGCGATGTCTGTT 1 AAAAGGAAGCCTTTGTGGCTATCTCTGTT * 21107391 AAAAGAAAGCCTTT 1 AAAAGGAAGCCTTT 21107405 ATAGCGAACC Statistics Matches: 58, Mismatches: 12, Indels: 4 0.78 0.16 0.05 Matches are distributed among these distances: 28 1 0.02 29 57 0.98 ACGTcount: A:0.28, C:0.15, G:0.24, T:0.34 Consensus pattern (29 bp): AAAAGGAAGCCTTTGTGGCTATCTCTGTT Found at i:21114101 original size:22 final size:22 Alignment explanation

Indices: 21114066--21114118 Score: 79 Period size: 22 Copynumber: 2.4 Consensus size: 22 21114056 GAACGGTAAT * 21114066 GATACCATCTATGAGAAATATC 1 GATACCATCCATGAGAAATATC * 21114088 GATACTATCCATGAGAAATATC 1 GATACCATCCATGAGAAATATC * 21114110 GATATCATC 1 GATACCATC 21114119 GAATGAGTAT Statistics Matches: 27, Mismatches: 4, Indels: 0 0.87 0.13 0.00 Matches are distributed among these distances: 22 27 1.00 ACGTcount: A:0.40, C:0.19, G:0.13, T:0.28 Consensus pattern (22 bp): GATACCATCCATGAGAAATATC Found at i:21118068 original size:19 final size:19 Alignment explanation

Indices: 21118044--21118080 Score: 65 Period size: 19 Copynumber: 1.9 Consensus size: 19 21118034 TATTTTTTAT 21118044 TTTTTTAATTTTAAAATAC 1 TTTTTTAATTTTAAAATAC * 21118063 TTTTTTATTTTTAAAATA 1 TTTTTTAATTTTAAAATA 21118081 AATTTTTGGT Statistics Matches: 17, Mismatches: 1, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 19 17 1.00 ACGTcount: A:0.35, C:0.03, G:0.00, T:0.62 Consensus pattern (19 bp): TTTTTTAATTTTAAAATAC Found at i:21118086 original size:19 final size:19 Alignment explanation

Indices: 21118045--21118087 Score: 59 Period size: 19 Copynumber: 2.3 Consensus size: 19 21118035 ATTTTTTATT ** 21118045 TTTTTAATTTTAAAATACT 1 TTTTTAATTTTAAAATAAA * 21118064 TTTTTATTTTTAAAATAAA 1 TTTTTAATTTTAAAATAAA 21118083 TTTTT 1 TTTTT 21118088 GGTGTTTATT Statistics Matches: 21, Mismatches: 3, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 19 21 1.00 ACGTcount: A:0.35, C:0.02, G:0.00, T:0.63 Consensus pattern (19 bp): TTTTTAATTTTAAAATAAA Found at i:21119519 original size:14 final size:13 Alignment explanation

Indices: 21119480--21119521 Score: 52 Period size: 11 Copynumber: 3.3 Consensus size: 13 21119470 TTATATATAT * 21119480 ATATATTAAAAT- 1 ATATATAAAAATA 21119492 -TATATAAAAATA 1 ATATATAAAAATA 21119504 ATATATAAAAATTA 1 ATATATAAAAA-TA 21119518 ATAT 1 ATAT 21119522 GGACAGGCCG Statistics Matches: 26, Mismatches: 1, Indels: 4 0.84 0.03 0.13 Matches are distributed among these distances: 11 10 0.38 13 10 0.38 14 6 0.23 ACGTcount: A:0.62, C:0.00, G:0.00, T:0.38 Consensus pattern (13 bp): ATATATAAAAATA Found at i:21121300 original size:18 final size:18 Alignment explanation

Indices: 21121285--21121320 Score: 72 Period size: 18 Copynumber: 2.0 Consensus size: 18 21121275 GGTTTGAATT 21121285 ATTGGAACTTCATGAAGA 1 ATTGGAACTTCATGAAGA 21121303 ATTGGAACTTCATGAAGA 1 ATTGGAACTTCATGAAGA 21121321 CTTATTCTTT Statistics Matches: 18, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 18 18 1.00 ACGTcount: A:0.39, C:0.11, G:0.22, T:0.28 Consensus pattern (18 bp): ATTGGAACTTCATGAAGA Found at i:21121304 original size:21 final size:21 Alignment explanation

Indices: 21121280--21121326 Score: 64 Period size: 18 Copynumber: 2.4 Consensus size: 21 21121270 TTTTAGGTTT 21121280 GAATTATTGGAACTTCATGAA 1 GAATTATTGGAACTTCATGAA 21121301 G-A--ATTGGAACTTCATGAA 1 GAATTATTGGAACTTCATGAA * 21121319 GACTTATT 1 GAATTATT 21121327 CTTTCAACTA Statistics Matches: 22, Mismatches: 1, Indels: 6 0.76 0.03 0.21 Matches are distributed among these distances: 18 17 0.77 20 1 0.05 21 4 0.18 ACGTcount: A:0.36, C:0.11, G:0.19, T:0.34 Consensus pattern (21 bp): GAATTATTGGAACTTCATGAA Found at i:21132871 original size:20 final size:20 Alignment explanation

Indices: 21132846--21132883 Score: 60 Period size: 20 Copynumber: 1.9 Consensus size: 20 21132836 ACGAGCTCAA 21132846 TGAGCTGAA-TTGAGCTCGTG 1 TGAGCT-AACTTGAGCTCGTG 21132866 TGAGCTAACTTGAGCTCG 1 TGAGCTAACTTGAGCTCG 21132884 AATGAACTGA Statistics Matches: 17, Mismatches: 0, Indels: 2 0.89 0.00 0.11 Matches are distributed among these distances: 19 2 0.12 20 15 0.88 ACGTcount: A:0.21, C:0.18, G:0.32, T:0.29 Consensus pattern (20 bp): TGAGCTAACTTGAGCTCGTG Found at i:21134797 original size:7 final size:6 Alignment explanation

Indices: 21134709--21134848 Score: 93 Period size: 6 Copynumber: 23.0 Consensus size: 6 21134699 TCAATCTCAA * * * ** ** 21134709 TTTCTT TTTCAAT TTTCTTT TCTTCGT TTTCTT TTTCTC TCACTT TTTCGA 1 TTTCTT TTTC-TT TTTC-TT T-TTCTT TTTCTT TTTCTT TTTCTT TTTCTT *** * * * 21134760 TTTCTT TTTC-T TTTGAA TTTCTT TTTCTT TTTCGT TTTCTA TTTCTA 1 TTTCTT TTTCTT TTTCTT TTTCTT TTTCTT TTTCTT TTTCTT TTTCTT ** * * 21134807 TTTCTT TTTCAC TTTCTA TTTCTT TTTATT TTTCTT TTTCTT 1 TTTCTT TTTCTT TTTCTT TTTCTT TTTCTT TTTCTT TTTCTT 21134849 CATTTTTGTT Statistics Matches: 100, Mismatches: 31, Indels: 6 0.73 0.23 0.04 Matches are distributed among these distances: 5 4 0.04 6 84 0.84 7 9 0.09 8 3 0.03 ACGTcount: A:0.08, C:0.18, G:0.03, T:0.71 Consensus pattern (6 bp): TTTCTT Found at i:21138233 original size:26 final size:26 Alignment explanation

Indices: 21138153--21138235 Score: 68 Period size: 26 Copynumber: 3.2 Consensus size: 26 21138143 GATGAACACG * 21138153 TGTGTAGTACTATGTGAAGGCTACTA 1 TGTGTAGTACTAAGTGAAGGCTACTA * * 21138179 CGTGTA-T-CGATAAAT-AATGG-TCAC-A 1 TGTGTAGTAC--TAAGTGAA-GGCT-ACTA 21138204 TGTGTAGTACTAAGTGAAGGCTACTA 1 TGTGTAGTACTAAGTGAAGGCTACTA 21138230 TGTGTA 1 TGTGTA 21138236 CTGAGAAGTT Statistics Matches: 43, Mismatches: 5, Indels: 18 0.65 0.08 0.27 Matches are distributed among these distances: 24 1 0.02 25 18 0.42 26 23 0.53 27 1 0.02 ACGTcount: A:0.30, C:0.12, G:0.25, T:0.33 Consensus pattern (26 bp): TGTGTAGTACTAAGTGAAGGCTACTA Found at i:21138325 original size:102 final size:102 Alignment explanation

Indices: 21138149--21138490 Score: 412 Period size: 102 Copynumber: 3.4 Consensus size: 102 21138139 TATCGATGAA * 21138149 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGATAAA-TAATGGTCACATGTGTAGTA 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGGTAAACT-ATGGTCACATGTGTAGTA * * 21138213 CTAAGTGAAGGCTACTATGTGTACTGAGAAGTTTTGAG- 65 CTAAGTGAAGGCTACTATGTGTACTGAAAAGCTTTG-GT * * * * * 21138251 CACGTGTGTAGTACTGTGTGAAAGTTACTACGTGTATCGGTAAACTATGGTTACATGTGTGGTAC 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGGTAAACTATGGTCACATGTGTAGTAC 21138316 TAAGTGAAGGCTACTATG-GATACTGAAAAGCTTTGGT 66 TAAGTGAAGGCTACTATGTG-TACTGAAAAGCTTTGGT * * * * * 21138353 CACGTATGTAGTACTATGTGAAGGCTACTACGTG-AGCCGTAAAACT-TGATCACGTGTGTAGTA 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGGT-AAACTATGGTCACATGTGTAGTA * * * * 21138416 CTATGTGAAGGCTACTACGTGAACTGTAAAA-C--TGAT 65 CTAAGTGAAGGCTACTATGTGTACTG-AAAAGCTTTGGT * * 21138452 CTCGTGTGTTGTACTATGTGAAGGCTACTACGTGTATCG 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGTATCG 21138491 AATGATAAAA Statistics Matches: 206, Mismatches: 27, Indels: 16 0.83 0.11 0.06 Matches are distributed among these distances: 99 34 0.17 100 2 0.01 101 41 0.20 102 128 0.62 103 1 0.00 ACGTcount: A:0.28, C:0.15, G:0.26, T:0.31 Consensus pattern (102 bp): CACGTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGGTAAACTATGGTCACATGTGTAGTAC TAAGTGAAGGCTACTATGTGTACTGAAAAGCTTTGGT Found at i:21138471 original size:49 final size:50 Alignment explanation

Indices: 21138149--21138485 Score: 292 Period size: 51 Copynumber: 6.7 Consensus size: 50 21138139 TATCGATGAA * * * 21138149 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGTA-TCGATAAATA-ATGGT 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGAACT-G-TAAA-ACTTGAT * * * * ** * 21138200 CACATGTGTAGTACTAAGTGAAGGCTACTATGTGTACTG-AGAAGTTTTGAG 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGAACTGTA-AA-ACTTGAT * * * * * * 21138251 CACGTGTGTAGTACTGTGTGAAAGTTACTACGTGTATCGGT-AAACTATGGT 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTG-AACTGTAAAACT-TGAT * * * * * * 21138302 TACATGTGTGGTACTAAGTGAAGGCTACTATG-GATACTG-AAAAGCTTTGGT 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGA-ACTGTAAAA-C-TTGAT * * * 21138353 CACGTATGTAGTACTATGTGAAGGCTACTACGTGAGCCGTAAAACTTGAT 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGAACTGTAAAACTTGAT 21138403 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGAACTGTAAAAC-TGAT 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTGAACTGTAAAACTTGAT * * 21138452 CTCGTGTGTTGTACTATGTGAAGGCTACTACGTG 1 CACGTGTGTAGTACTATGTGAAGGCTACTACGTG 21138486 TATCGAATGA Statistics Matches: 229, Mismatches: 45, Indels: 26 0.76 0.15 0.09 Matches are distributed among these distances: 49 38 0.17 50 55 0.24 51 126 0.55 52 10 0.04 ACGTcount: A:0.28, C:0.15, G:0.26, T:0.31 Consensus pattern (50 bp): CACGTGTGTAGTACTATGTGAAGGCTACTACGTGAACTGTAAAACTTGAT Found at i:21145824 original size:51 final size:51 Alignment explanation

Indices: 21145763--21146101 Score: 309 Period size: 51 Copynumber: 6.7 Consensus size: 51 21145753 CGATGAACAA * * 21145763 GTGTGTAGTACTGTGTGAAGGCTACTACGTGTATCGATAAATAATGGTCAC 1 GTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGATAAATATTGGTCAC * * * * * * * 21145814 ATGTGTAGTACTAAGTGAAGGCTACTATGTGTACCGAGAAGTTTTGAG-CAC 1 GTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGATAAATATTG-GTCAC * * * * 21145865 GTGTGTAGTACTGTGTGAAAGCTACTACGTGTATCGGTAAACTA-TGGTTAC 1 GTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGATAAA-TATTGGTCAC * * * * * * ** 21145916 ATGTGTGGTACTAAGTGAAGGCTGCTATG-GATACCGA-AAAGCTTGGTCAC 1 GTGTGTAGTACTATGTGAAGGCTACTACGTG-TATCGATAAATATTGGTCAC * * * * 21145966 GTATGTAGTACTATGTGAAGGCTACTACGTG-AGCCATAAA-ACTTGATCAC 1 GTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGATAAATA-TTGGTCAC * * * * 21146016 GTGTGTAGTACTATGTGAAGGCTACTACGTG-AACTG-TAAA-ACTGATCTC 1 GTGTGTAGTACTATGTGAAGGCTACTACGTGTATC-GATAAATATTGGTCAC 21146065 GTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGA 1 GTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGA 21146102 ATGATAAAAA Statistics Matches: 229, Mismatches: 48, Indels: 23 0.76 0.16 0.08 Matches are distributed among these distances: 49 41 0.18 50 83 0.36 51 103 0.45 52 2 0.01 ACGTcount: A:0.28, C:0.15, G:0.27, T:0.30 Consensus pattern (51 bp): GTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGATAAATATTGGTCAC Found at i:21145944 original size:102 final size:101 Alignment explanation

Indices: 21145763--21146101 Score: 379 Period size: 102 Copynumber: 3.4 Consensus size: 101 21145753 CGATGAACAA 21145763 GTGTGTAGTACTGTGTGAAGGCTACTACGTGTATCGATAAATAATGGTCACATGTGTAGTACTAA 1 GTGTGTAGTACTGTGTGAAGGCTACTACGTGTATCGATAAATAATGGTCACATGTGTAGTACTAA * * 21145828 GTGAAGGCTACTATGTG-TACCGAGAAGTTTTGAG-CAC 66 GTGAAGGCTACTATG-GATACCGAAAAG-CTTG-GTCAC * * * * 21145865 GTGTGTAGTACTGTGTGAAAGCTACTACGTGTATCGGTAAACT-ATGGTTACATGTGTGGTACTA 1 GTGTGTAGTACTGTGTGAAGGCTACTACGTGTATCGATAAA-TAATGGTCACATGTGTAGTACTA * 21145929 AGTGAAGGCTGCTATGGATACCGAAAAGCTTGGTCAC 65 AGTGAAGGCTACTATGGATACCGAAAAGCTTGGTCAC * * * * * * * 21145966 GTATGTAGTACTATGTGAAGGCTACTACGTG-AGCCATAAA-ACTTGATCACGTGTGTAGTACTA 1 GTGTGTAGTACTGTGTGAAGGCTACTACGTGTATCGATAAATA-ATGGTCACATGTGTAGTACTA * * * * * 21146029 TGTGAAGGCTACTACGTGA-ACTGTAAAA-C-TGATCTC 65 AGTGAAGGCTACTATG-GATACCG-AAAAGCTTGGTCAC * 21146065 GTGTGTAGTACTATGTGAAGGCTACTACGTGTATCGA 1 GTGTGTAGTACTGTGTGAAGGCTACTACGTGTATCGA 21146102 ATGATAAAAA Statistics Matches: 202, Mismatches: 27, Indels: 18 0.82 0.11 0.07 Matches are distributed among these distances: 99 35 0.17 100 43 0.21 101 41 0.20 102 82 0.41 103 1 0.00 ACGTcount: A:0.28, C:0.15, G:0.27, T:0.30 Consensus pattern (101 bp): GTGTGTAGTACTGTGTGAAGGCTACTACGTGTATCGATAAATAATGGTCACATGTGTAGTACTAA GTGAAGGCTACTATGGATACCGAAAAGCTTGGTCAC Found at i:21148505 original size:28 final size:27 Alignment explanation

Indices: 21148469--21148521 Score: 70 Period size: 28 Copynumber: 1.9 Consensus size: 27 21148459 CGGCATTGAC 21148469 TAAGACATAATAAAAACTTGTAATAGT 1 TAAGACATAATAAAAACTTGTAATAGT * * * 21148496 TAAGAACATATTAATAAGTTGTAATA 1 TAAG-ACATAATAAAAACTTGTAATA 21148522 ATACTTTCTA Statistics Matches: 22, Mismatches: 3, Indels: 1 0.85 0.12 0.04 Matches are distributed among these distances: 27 4 0.18 28 18 0.82 ACGTcount: A:0.51, C:0.06, G:0.11, T:0.32 Consensus pattern (27 bp): TAAGACATAATAAAAACTTGTAATAGT Found at i:21148647 original size:20 final size:20 Alignment explanation

Indices: 21148613--21148651 Score: 60 Period size: 20 Copynumber: 1.9 Consensus size: 20 21148603 ATAAGCTAAA ** 21148613 TTGAGCTCGTGTGAGCTGAC 1 TTGAGCTCGAATGAGCTGAC 21148633 TTGAGCTCGAATGAGCTGA 1 TTGAGCTCGAATGAGCTGA 21148652 ACCACATGGA Statistics Matches: 17, Mismatches: 2, Indels: 0 0.89 0.11 0.00 Matches are distributed among these distances: 20 17 1.00 ACGTcount: A:0.21, C:0.18, G:0.33, T:0.28 Consensus pattern (20 bp): TTGAGCTCGAATGAGCTGAC Found at i:21152431 original size:50 final size:50 Alignment explanation

Indices: 21152306--21152496 Score: 150 Period size: 50 Copynumber: 3.8 Consensus size: 50 21152296 GATAATAACA * * * * ** * * * * 21152306 TGCCAAAGCTATGTCCCAAACATAGTCTTACATGGGATGTTTCTTGT-AC 1 TGCCAACGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCATATCAG * * * ** * 21152355 TGCCAATGCCATATCCCAGATATGGTCTTACATGGGAGTTCTCATATCGG 1 TGCCAACGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCATATCAG * * * * ** * 21152405 TTCCCATGTCATGTCCCAGACATGGTCTTACGGGGGACCTCTCATCTCAG 1 TGCCAACGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCATATCAG * * 21152455 TGCCAACGCCATGTCTCAGACATGGTCTTACATGGGATCTCT 1 TGCCAACGCCATGTCCCAGACATGGTCTTACATGGGACCTCT 21152497 TTACCCAAAT Statistics Matches: 109, Mismatches: 32, Indels: 1 0.77 0.23 0.01 Matches are distributed among these distances: 49 36 0.33 50 73 0.67 ACGTcount: A:0.23, C:0.27, G:0.21, T:0.30 Consensus pattern (50 bp): TGCCAACGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCATATCAG Found at i:21154866 original size:50 final size:50 Alignment explanation

Indices: 21154799--21154996 Score: 216 Period size: 50 Copynumber: 3.8 Consensus size: 50 21154789 TTTCTTGTAC * * ** 21154799 TGCCAATGCCATATCCCAGATATGGTCTTACATGGGAGTTCTCATATAGG 1 TGCCAATGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCATATAGG * *** * * 21154849 TGCCCATGCCATGTCCCAGACATGGTCTTATGGGGGACCTCTCATCTCGG 1 TGCCAATGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCATATAGG * * 21154899 TGCCAATGCCATGTCCCAGACATGGTCTTACATGAGACCTCTCATAATCTCAATGA 1 TGCCAATGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCAT-A--T--A-GG * * 21154955 TGCCAATGCCATGTCCCAGACATGTTCTTCCATGGGACCTCT 1 TGCCAATGCCATGTCCCAGACATGGTCTTACATGGGACCTCT 21154997 TTACCCAAAT Statistics Matches: 121, Mismatches: 21, Indels: 6 0.82 0.14 0.04 Matches are distributed among these distances: 50 80 0.66 53 1 0.01 56 40 0.33 ACGTcount: A:0.23, C:0.29, G:0.21, T:0.28 Consensus pattern (50 bp): TGCCAATGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCATATAGG Found at i:21173097 original size:25 final size:24 Alignment explanation

Indices: 21173043--21173110 Score: 66 Period size: 24 Copynumber: 2.8 Consensus size: 24 21173033 GGCTATATAA 21173043 CGGGAAGCTCATAAGAGCCTAAAT 1 CGGGAAGCTCATAAGAGCCTAAAT * * * 21173067 CAGGAAGTTCATAAGGAGCCTTTAAT 1 CGGGAAGCTCATAA-GAGCC-TAAAT ** 21173093 -GGGAAGCTCCGAAGAGCC 1 CGGGAAGCTCATAAGAGCC 21173111 ATTAATCAGA Statistics Matches: 35, Mismatches: 7, Indels: 4 0.76 0.15 0.09 Matches are distributed among these distances: 24 17 0.49 25 14 0.40 26 4 0.11 ACGTcount: A:0.34, C:0.21, G:0.28, T:0.18 Consensus pattern (24 bp): CGGGAAGCTCATAAGAGCCTAAAT Found at i:21173148 original size:25 final size:25 Alignment explanation

Indices: 21173105--21173212 Score: 98 Period size: 25 Copynumber: 4.4 Consensus size: 25 21173095 GAAGCTCCGA * 21173105 AGAGCCAT-TAATC-AGAAGTTCCAG 1 AGAGCCATAT-ATCGAGAAGTTCAAG * 21173129 ATAGCCATATATCGAGAAGTTCAAG 1 AGAGCCATATATCGAGAAGTTCAAG * * * 21173154 CGAGCCATATATTGGGAAGTTCAAG 1 AGAGCCATATATCGAGAAGTTCAAG * * * 21173179 CGAGCCA-A-ATCGAGAAGCTCTAG 1 AGAGCCATATATCGAGAAGTTCAAG * 21173202 ATAGCCATATA 1 AGAGCCATATA 21173213 ACAGGACGCT Statistics Matches: 68, Mismatches: 12, Indels: 7 0.78 0.14 0.08 Matches are distributed among these distances: 23 16 0.24 24 12 0.18 25 40 0.59 ACGTcount: A:0.37, C:0.19, G:0.22, T:0.21 Consensus pattern (25 bp): AGAGCCATATATCGAGAAGTTCAAG Found at i:21175834 original size:11 final size:11 Alignment explanation

Indices: 21175818--21175851 Score: 52 Period size: 11 Copynumber: 3.2 Consensus size: 11 21175808 AGGCATAAAG 21175818 ATGTTCACACC 1 ATGTTCACACC * 21175829 ATGTTCA-GCC 1 ATGTTCACACC 21175839 ATGTTCACACC 1 ATGTTCACACC 21175850 AT 1 AT 21175852 ATGGCTCTTC Statistics Matches: 20, Mismatches: 2, Indels: 2 0.83 0.08 0.08 Matches are distributed among these distances: 10 9 0.45 11 11 0.55 ACGTcount: A:0.26, C:0.32, G:0.12, T:0.29 Consensus pattern (11 bp): ATGTTCACACC Found at i:21178613 original size:25 final size:24 Alignment explanation

Indices: 21178556--21178631 Score: 66 Period size: 24 Copynumber: 3.1 Consensus size: 24 21178546 CCTGGCTATA 21178556 TAACGGGAAGCT-CATAAGAGCC-T 1 TAACGGGAAGCTCCA-AAGAGCCAT * * * * 21178579 AAATCAGGAAGTTCACAAAGAGCCTT 1 TAA-CGGGAAGCTC-CAAAGAGCCAT * 21178605 TAACGGGAAGCTCCGAAGAGCCAT 1 TAACGGGAAGCTCCAAAGAGCCAT 21178629 TAA 1 TAA 21178632 TCAGAAGTTC Statistics Matches: 41, Mismatches: 8, Indels: 7 0.73 0.14 0.12 Matches are distributed among these distances: 23 2 0.05 24 19 0.46 25 15 0.37 26 5 0.12 ACGTcount: A:0.38, C:0.21, G:0.24, T:0.17 Consensus pattern (24 bp): TAACGGGAAGCTCCAAAGAGCCAT Found at i:21178661 original size:49 final size:50 Alignment explanation

Indices: 21178553--21178663 Score: 129 Period size: 49 Copynumber: 2.3 Consensus size: 50 21178543 TATCCTGGCT * 21178553 ATATAACGGGAAGCTCATAAGAGCCTAAATCAGGAAGTTCACAAAGAGCC 1 ATATAACGGGAAGCTCAGAAGAGCCTAAATCAGGAAGTTCACAAAGAGCC * * * * * 21178603 -TTTAACGGGAAGCTCCGAAGAGCCATTAATCA-GAAGTTC-CAGATAGCC 1 ATATAACGGGAAGCTCAGAAGAGCC-TAAATCAGGAAGTTCACAAAGAGCC * 21178651 ATATATCGGGAAG 1 ATATAACGGGAAG 21178664 TTCAAACGAG Statistics Matches: 51, Mismatches: 8, Indels: 5 0.80 0.12 0.08 Matches are distributed among these distances: 48 7 0.14 49 38 0.75 50 6 0.12 ACGTcount: A:0.38, C:0.20, G:0.23, T:0.19 Consensus pattern (50 bp): ATATAACGGGAAGCTCAGAAGAGCCTAAATCAGGAAGTTCACAAAGAGCC Found at i:21178665 original size:25 final size:26 Alignment explanation

Indices: 21178585--21178704 Score: 85 Period size: 25 Copynumber: 4.8 Consensus size: 26 21178575 GCCTAAATCA * * 21178585 GGAAGTTCACAAA-GAGCC-TTTAACG 1 GGAAGTTC-CAAACGAGCCATATATCG * * 21178610 GGAAGCTCCGAA-GAGCCAT-TAATC- 1 GGAAGTTCCAAACGAGCCATAT-ATCG * * * 21178634 AGAAGTTCCAGA-TAGCCATATATCG 1 GGAAGTTCCAAACGAGCCATATATCG * 21178659 GGAAGTT-CAAACGAGCCATATATTG 1 GGAAGTTCCAAACGAGCCATATATCG * 21178684 GGAAGTT-CAAGCGAGCCATAT 1 GGAAGTTCCAAACGAGCCATAT 21178705 CAAGAAGCTC Statistics Matches: 77, Mismatches: 13, Indels: 10 0.77 0.13 0.10 Matches are distributed among these distances: 24 29 0.38 25 48 0.62 ACGTcount: A:0.35, C:0.20, G:0.24, T:0.21 Consensus pattern (26 bp): GGAAGTTCCAAACGAGCCATATATCG Found at i:21190226 original size:22 final size:22 Alignment explanation

Indices: 21190198--21190240 Score: 68 Period size: 22 Copynumber: 2.0 Consensus size: 22 21190188 CATCTTTTCC ** 21190198 TTTTATTTTTCAAGATATTTAA 1 TTTTATTTCCCAAGATATTTAA 21190220 TTTTATTTCCCAAGATATTTA 1 TTTTATTTCCCAAGATATTTA 21190241 GACAAAAGGG Statistics Matches: 19, Mismatches: 2, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 22 19 1.00 ACGTcount: A:0.30, C:0.09, G:0.05, T:0.56 Consensus pattern (22 bp): TTTTATTTCCCAAGATATTTAA Found at i:21191912 original size:34 final size:34 Alignment explanation

Indices: 21191851--21191918 Score: 86 Period size: 34 Copynumber: 2.0 Consensus size: 34 21191841 TCACAGTTGA * 21191851 ACAGTCTTGGGCCTAAGCCATTTTCAATATCAGTG 1 ACAGTCTTGGGCCTAAGCCATTCTCAATA-CAGTG * 21191886 ACAGT-TTGGGCCTTAGCCCA-TCTCAATACAGTG 1 ACAGTCTTGGGCCTAAG-CCATTCTCAATACAGTG 21191919 TCAAAAATGC Statistics Matches: 30, Mismatches: 2, Indels: 4 0.83 0.06 0.11 Matches are distributed among these distances: 33 5 0.17 34 17 0.57 35 8 0.27 ACGTcount: A:0.25, C:0.25, G:0.21, T:0.29 Consensus pattern (34 bp): ACAGTCTTGGGCCTAAGCCATTCTCAATACAGTG Found at i:21195323 original size:21 final size:20 Alignment explanation

Indices: 21195297--21195360 Score: 57 Period size: 21 Copynumber: 3.3 Consensus size: 20 21195287 CATAATTCTA 21195297 TACTTAGCTAAGACAACACTT 1 TACTTAGCT-AGACAACACTT * 21195318 TACTTA-CT---CATAC-CTA 1 TACTTAGCTAGACA-ACACTT 21195334 TACTTAGCTATGACAACACTT 1 TACTTAGCTA-GACAACACTT 21195355 TACTTA 1 TACTTA 21195361 CTCATATCTA Statistics Matches: 34, Mismatches: 2, Indels: 14 0.68 0.04 0.28 Matches are distributed among these distances: 16 10 0.29 17 4 0.12 20 4 0.12 21 16 0.47 ACGTcount: A:0.34, C:0.25, G:0.06, T:0.34 Consensus pattern (20 bp): TACTTAGCTAGACAACACTT Found at i:21195349 original size:37 final size:37 Alignment explanation

Indices: 21195294--21195375 Score: 146 Period size: 37 Copynumber: 2.2 Consensus size: 37 21195284 ACACATAATT 21195294 CTATACTTAGCTAAGACAACACTTTACTTACTCATAC 1 CTATACTTAGCTAAGACAACACTTTACTTACTCATAC * * 21195331 CTATACTTAGCTATGACAACACTTTACTTACTCATAT 1 CTATACTTAGCTAAGACAACACTTTACTTACTCATAC 21195368 CTATACTT 1 CTATACTT 21195376 TGCCAAATGG Statistics Matches: 43, Mismatches: 2, Indels: 0 0.96 0.04 0.00 Matches are distributed among these distances: 37 43 1.00 ACGTcount: A:0.33, C:0.26, G:0.05, T:0.37 Consensus pattern (37 bp): CTATACTTAGCTAAGACAACACTTTACTTACTCATAC Found at i:21202907 original size:31 final size:31 Alignment explanation

Indices: 21202869--21202940 Score: 90 Period size: 33 Copynumber: 2.3 Consensus size: 31 21202859 AGGCCTTTTG 21202869 GCGGCGCTAAAAAGCGCAGCAAAAAGAATTT 1 GCGGCGCTAAAAAGCGCAGCAAAAAGAATTT ** * * 21202900 GCGGCGCTTTTGAAAGCGCTGCAAAAAGTATTT 1 GCGGCGC--TAAAAAGCGCAGCAAAAAGAATTT 21202933 GCGGCGCT 1 GCGGCGCT 21202941 TTTGAAAGCG Statistics Matches: 35, Mismatches: 4, Indels: 4 0.81 0.09 0.09 Matches are distributed among these distances: 31 8 0.23 33 27 0.77 ACGTcount: A:0.31, C:0.21, G:0.29, T:0.19 Consensus pattern (31 bp): GCGGCGCTAAAAAGCGCAGCAAAAAGAATTT Found at i:21202924 original size:33 final size:33 Alignment explanation

Indices: 21202879--21202959 Score: 144 Period size: 33 Copynumber: 2.5 Consensus size: 33 21202869 GCGGCGCTAA * 21202879 AAAGCGCAGCAAAAAGAATTTGCGGCGCTTTTG 1 AAAGCGCTGCAAAAAGAATTTGCGGCGCTTTTG * 21202912 AAAGCGCTGCAAAAAGTATTTGCGGCGCTTTTG 1 AAAGCGCTGCAAAAAGAATTTGCGGCGCTTTTG 21202945 AAAGCGCTGCAAAAA 1 AAAGCGCTGCAAAAA 21202960 ATGCCGCTGA Statistics Matches: 46, Mismatches: 2, Indels: 0 0.96 0.04 0.00 Matches are distributed among these distances: 33 46 1.00 ACGTcount: A:0.35, C:0.19, G:0.26, T:0.21 Consensus pattern (33 bp): AAAGCGCTGCAAAAAGAATTTGCGGCGCTTTTG Found at i:21203070 original size:43 final size:43 Alignment explanation

Indices: 21203007--21203143 Score: 229 Period size: 43 Copynumber: 3.2 Consensus size: 43 21202997 TTATAGAAAT * * 21203007 AAACGCCGCTAAAGGTCATGTTCTTTAGCGGCACTTTTCCCGC 1 AAACGCCGCTAAAGGCCATGTTCTTTAGCGGCGCTTTTCCCGC * 21203050 AAACGCCGCTAAAGGCCATGTTCTTTAGCGGCGCTTTTTCCGC 1 AAACGCCGCTAAAGGCCATGTTCTTTAGCGGCGCTTTTCCCGC * 21203093 AAACGCCGCTAAAGGCCATGTTCTTTAGCGGCGCTTTTCCCAC 1 AAACGCCGCTAAAGGCCATGTTCTTTAGCGGCGCTTTTCCCGC * 21203136 AAAAGCCG 1 AAACGCCG 21203144 TTATTGTTTG Statistics Matches: 88, Mismatches: 6, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 43 88 1.00 ACGTcount: A:0.22, C:0.31, G:0.22, T:0.26 Consensus pattern (43 bp): AAACGCCGCTAAAGGCCATGTTCTTTAGCGGCGCTTTTCCCGC Found at i:21203701 original size:25 final size:26 Alignment explanation

Indices: 21203650--21203711 Score: 90 Period size: 25 Copynumber: 2.4 Consensus size: 26 21203640 TTTTACAATG * * 21203650 ATTAAGTTATGTAAATAATATGAGTT 1 ATTAAGTTATGCAAATAATATAAGTT * 21203676 ATTAAGTTATGCATAT-ATATAAGTT 1 ATTAAGTTATGCAAATAATATAAGTT 21203701 ATTAAGTTATG 1 ATTAAGTTATG 21203712 TAAGTTATTA Statistics Matches: 33, Mismatches: 3, Indels: 1 0.89 0.08 0.03 Matches are distributed among these distances: 25 19 0.58 26 14 0.42 ACGTcount: A:0.40, C:0.02, G:0.15, T:0.44 Consensus pattern (26 bp): ATTAAGTTATGCAAATAATATAAGTT Found at i:21203715 original size:17 final size:17 Alignment explanation

Indices: 21203650--21203721 Score: 58 Period size: 17 Copynumber: 4.2 Consensus size: 17 21203640 TTTTACAATG * * 21203650 ATTAAGTTATGTAAATA 1 ATTAAGTTATGTAAGTT * 21203667 ATATGAGTTAT-TAAGTT 1 AT-TAAGTTATGTAAGTT ** * 21203684 ATGCA-TATATATAAGTT 1 ATTAAGT-TATGTAAGTT 21203701 ATTAAGTTATGTAAGTT 1 ATTAAGTTATGTAAGTT 21203718 ATTA 1 ATTA 21203722 TGTGAGTTAT Statistics Matches: 43, Mismatches: 8, Indels: 8 0.73 0.14 0.14 Matches are distributed among these distances: 15 1 0.02 16 4 0.09 17 30 0.70 18 8 0.19 ACGTcount: A:0.40, C:0.01, G:0.14, T:0.44 Consensus pattern (17 bp): ATTAAGTTATGTAAGTT Found at i:21203836 original size:17 final size:17 Alignment explanation

Indices: 21203814--21203873 Score: 68 Period size: 17 Copynumber: 3.5 Consensus size: 17 21203804 ATATATTTTA 21203814 AAATTATTAAGTTATGT 1 AAATTATTAAGTTATGT ** * 21203831 AAATTATGCGAGTTGT-T 1 AAATTAT-TAAGTTATGT * 21203848 AAGTTATTAAGTTATGT 1 AAATTATTAAGTTATGT 21203865 AAATTATTA 1 AAATTATTA 21203874 TGTGAATTAT Statistics Matches: 33, Mismatches: 8, Indels: 4 0.73 0.18 0.09 Matches are distributed among these distances: 16 5 0.15 17 23 0.70 18 5 0.15 ACGTcount: A:0.38, C:0.02, G:0.15, T:0.45 Consensus pattern (17 bp): AAATTATTAAGTTATGT Found at i:21203896 original size:46 final size:46 Alignment explanation

Indices: 21203841--21203997 Score: 145 Period size: 46 Copynumber: 3.3 Consensus size: 46 21203831 AAATTATGCG * 21203841 AGTTGTTAAGTTATTAAGTTATGTAAATTATTATGTGAATTATATA 1 AGTTATTAAGTTATTAAGTTATGTAAATTATTATGTGAATTATATA * * * * * 21203887 AGTTATTAAGTTA-TATGGTATGACATATATATATGTATTATGTAAATTATGTG 1 AGTTATTAAGTTATTA-AGT-T---ATGTA-A-AT-TATTATGTGAATTATATA * * * * 21203940 AGTTGTTAAGTTGTTCAGTTATGTAAGTTATTATGTGAATTATATA 1 AGTTATTAAGTTATTAAGTTATGTAAATTATTATGTGAATTATATA 21203986 AGTTATTAAGTT 1 AGTTATTAAGTT 21203998 GGACATTTTT Statistics Matches: 86, Mismatches: 16, Indels: 18 0.72 0.13 0.15 Matches are distributed among these distances: 45 2 0.02 46 40 0.47 47 2 0.02 48 1 0.01 49 4 0.05 50 4 0.05 51 1 0.01 52 3 0.03 53 28 0.33 54 1 0.01 ACGTcount: A:0.34, C:0.01, G:0.17, T:0.47 Consensus pattern (46 bp): AGTTATTAAGTTATTAAGTTATGTAAATTATTATGTGAATTATATA Found at i:21206408 original size:31 final size:31 Alignment explanation

Indices: 21206365--21206424 Score: 111 Period size: 31 Copynumber: 1.9 Consensus size: 31 21206355 ACAGCCGCCC * 21206365 CAATTTATTGGCTAAGCCTAACAATGCCTAA 1 CAATGTATTGGCTAAGCCTAACAATGCCTAA 21206396 CAATGTATTGGCTAAGCCTAACAATGCCT 1 CAATGTATTGGCTAAGCCTAACAATGCCT 21206425 CCGCATCGGC Statistics Matches: 28, Mismatches: 1, Indels: 0 0.97 0.03 0.00 Matches are distributed among these distances: 31 28 1.00 ACGTcount: A:0.33, C:0.23, G:0.15, T:0.28 Consensus pattern (31 bp): CAATGTATTGGCTAAGCCTAACAATGCCTAA Found at i:21207999 original size:15 final size:15 Alignment explanation

Indices: 21207977--21208023 Score: 60 Period size: 15 Copynumber: 3.2 Consensus size: 15 21207967 TTTTAATCCC 21207977 TAAACCCCTAACCCT 1 TAAACCCCTAACCCT * * 21207992 TTAACCCCTAAACCT 1 TAAACCCCTAACCCT * 21208007 TAAATCCC-AACCCT 1 TAAACCCCTAACCCT 21208021 TAA 1 TAA 21208024 TCCATAATCC Statistics Matches: 27, Mismatches: 5, Indels: 1 0.82 0.15 0.03 Matches are distributed among these distances: 14 8 0.30 15 19 0.70 ACGTcount: A:0.36, C:0.40, G:0.00, T:0.23 Consensus pattern (15 bp): TAAACCCCTAACCCT Found at i:21208000 original size:23 final size:22 Alignment explanation

Indices: 21207960--21208005 Score: 65 Period size: 23 Copynumber: 2.0 Consensus size: 22 21207950 NNNNNNNNAT * * 21207960 CCCTAACTTTTAATCCCTAAAC 1 CCCTAACCTTTAACCCCTAAAC 21207982 CCCTAACCCTTTAACCCCTAAAC 1 CCCTAA-CCTTTAACCCCTAAAC 21208005 C 1 C 21208006 TTAAATCCCA Statistics Matches: 21, Mismatches: 2, Indels: 1 0.88 0.08 0.04 Matches are distributed among these distances: 22 6 0.29 23 15 0.71 ACGTcount: A:0.30, C:0.43, G:0.00, T:0.26 Consensus pattern (22 bp): CCCTAACCTTTAACCCCTAAAC Found at i:21208043 original size:15 final size:15 Alignment explanation

Indices: 21207970--21208055 Score: 54 Period size: 15 Copynumber: 5.9 Consensus size: 15 21207960 CCCTAACTTT * 21207970 TAATCCCTAAACCCC 1 TAATCCCTAAACCCA * * 21207985 TAA-CCCTTTAACCCC 1 TAATCCC-TAAACCCA * * 21208000 TAAACCTTAAATCCC- 1 TAATCCCTAAA-CCCA * * 21208015 -AA-CCCTTAATCCA 1 TAATCCCTAAACCCA 21208028 TAATCCCTAAACCCA 1 TAATCCCTAAACCCA * 21208043 TACTCCCTAAACC 1 TAATCCCTAAACC 21208056 TTAAAATATG Statistics Matches: 56, Mismatches: 9, Indels: 12 0.73 0.12 0.16 Matches are distributed among these distances: 12 2 0.04 13 5 0.09 14 7 0.12 15 37 0.66 16 5 0.09 ACGTcount: A:0.35, C:0.42, G:0.00, T:0.23 Consensus pattern (15 bp): TAATCCCTAAACCCA Found at i:21208630 original size:73 final size:72 Alignment explanation

Indices: 21208549--21208688 Score: 217 Period size: 73 Copynumber: 1.9 Consensus size: 72 21208539 GCTTTCTTAT * * 21208549 AAACGCTGCTAAATCCCCGAAAGCTCAGAAAACGGCGTCATTGGGCTTAGGTTTTTTTGCGGCGC 1 AAACGCTGCTAAATCCCCGAAAGCTCAGAAAACGACGTCATTGGGATTA-GTTTTTTTGCGGCGC 21208614 TTTACGAA 65 TTTACGAA ** * * 21208622 AAACGCTGCTAAATCCCCGAAAGCTTGGGAAACGACGTCGTTGGGATTAGTTTTTTTGCGGCGCT 1 AAACGCTGCTAAATCCCCGAAAGCTCAGAAAACGACGTCATTGGGATTAGTTTTTTTGCGGCGCT 21208687 TT 66 TT 21208689 CTCAAAAACG Statistics Matches: 61, Mismatches: 6, Indels: 1 0.90 0.09 0.01 Matches are distributed among these distances: 72 18 0.30 73 43 0.70 ACGTcount: A:0.24, C:0.22, G:0.26, T:0.28 Consensus pattern (72 bp): AAACGCTGCTAAATCCCCGAAAGCTCAGAAAACGACGTCATTGGGATTAGTTTTTTTGCGGCGCT TTACGAA Found at i:21208635 original size:114 final size:113 Alignment explanation

Indices: 21208303--21208638 Score: 564 Period size: 114 Copynumber: 2.9 Consensus size: 113 21208293 TAAAATATTT ** 21208303 TTAGCGGCGCTTTTCCAAAAACGCCGCTAAATCCCCGAAAGCTCAGAAAACGGCGTCGTTGGGCT 1 TTAGCGGCGC-TTTCTTAAAACGCCGCTAAATCCCCGAAAGCTCAGAAAACGGCGTCGTTGGGCT * 21208368 TAGGTTTTTTTGCGGCGCTTTCCCAAAAACGCCGCTAAAGCCCTGAGCA 65 TAGGTTTTTTTGCGGCGCTTTCCCAAAAACGCCGCTAAATCCCTGAGCA * 21208417 TGAGCGGCGCTTTCTTCAAAACGCCGCTAAATCCCCGAAAGCTCAGAAAACGGCGTCGTTGGGCT 1 TTAGCGGCGCTTTCTT-AAAACGCCGCTAAATCCCCGAAAGCTCAGAAAACGGCGTCGTTGGGCT 21208482 TAGGTTTTTTTGCGGCGCTTTCCCAAAAACGCCGCTAAATCCCTGAGCA 65 TAGGTTTTTTTGCGGCGCTTTCCCAAAAACGCCGCTAAATCCCTGAGCA * * 21208531 TTAGCGGCGCTTTCTTATAAACGCTGCTAAATCCCCGAAAGCTCAGAAAACGGCGTCATTGGGCT 1 TTAGCGGCGCTTTCTTA-AAACGCCGCTAAATCCCCGAAAGCTCAGAAAACGGCGTCGTTGGGCT * * * 21208596 TAGGTTTTTTTGCGGCGCTTTACGAAAAACGCTGCTAAATCCC 65 TAGGTTTTTTTGCGGCGCTTTCCCAAAAACGCCGCTAAATCCC 21208639 CGAAAGCTTG Statistics Matches: 210, Mismatches: 10, Indels: 4 0.94 0.04 0.02 Matches are distributed among these distances: 113 5 0.02 114 205 0.98 ACGTcount: A:0.24, C:0.28, G:0.23, T:0.25 Consensus pattern (113 bp): TTAGCGGCGCTTTCTTAAAACGCCGCTAAATCCCCGAAAGCTCAGAAAACGGCGTCGTTGGGCTT AGGTTTTTTTGCGGCGCTTTCCCAAAAACGCCGCTAAATCCCTGAGCA Found at i:21210675 original size:65 final size:65 Alignment explanation

Indices: 21210593--21210714 Score: 217 Period size: 65 Copynumber: 1.9 Consensus size: 65 21210583 ACACGCTCAG * * * 21210593 GTGCTTAAACCGTGTACAAATCGAAAATAGGGTCACATGGTCGTGTCCCTAGGCCGTTTAATCGT 1 GTGCTGAAACCGTGTACAAATCGAAAATAGGGTCACATAGCCGTGTCCCTAGGCCGTTTAATCGT 21210658 GTGCTGAAACCGTGTACAAATCGAAAATAGGGTCACATAGCCGTGTCCCTAGGCCGT 1 GTGCTGAAACCGTGTACAAATCGAAAATAGGGTCACATAGCCGTGTCCCTAGGCCGT 21210715 GTAACTAACT Statistics Matches: 54, Mismatches: 3, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 65 54 1.00 ACGTcount: A:0.27, C:0.23, G:0.25, T:0.25 Consensus pattern (65 bp): GTGCTGAAACCGTGTACAAATCGAAAATAGGGTCACATAGCCGTGTCCCTAGGCCGTTTAATCGT Found at i:21211762 original size:13 final size:13 Alignment explanation

Indices: 21211739--21211808 Score: 77 Period size: 13 Copynumber: 5.3 Consensus size: 13 21211729 TCATACATCT 21211739 ATTTCCATATAACC 1 ATTT-CATATAACC * 21211753 AATTCATATAACC 1 ATTTCATATAACC ** * 21211766 ATTTCATGCAATC 1 ATTTCATATAACC 21211779 ATTTCATATAACC 1 ATTTCATATAACC ** 21211792 ATTTTGTATAACC 1 ATTTCATATAACC 21211805 ATTT 1 ATTT 21211809 GATTTAAATA Statistics Matches: 46, Mismatches: 10, Indels: 1 0.81 0.18 0.02 Matches are distributed among these distances: 13 43 0.93 14 3 0.07 ACGTcount: A:0.36, C:0.21, G:0.03, T:0.40 Consensus pattern (13 bp): ATTTCATATAACC Found at i:21211822 original size:39 final size:38 Alignment explanation

Indices: 21211738--21211834 Score: 95 Period size: 39 Copynumber: 2.5 Consensus size: 38 21211728 TTCATACATC * 21211738 TATTTCCATATAACCAATTCATATAACCATTTCATGCAA 1 TATTT-CATATAACCAATTCATATAACCATTTCATGAAA * ** * * 21211777 TCATTTCATATAACCATTTTGTATAACCATTTGATTTAAA 1 T-ATTTCATATAACCAATTCATATAACCATTTCA-TGAAA * * 21211817 TATTTCATTTAAACAATT 1 TATTTCATATAACCAATT 21211835 AACACATAAT Statistics Matches: 47, Mismatches: 9, Indels: 4 0.78 0.15 0.07 Matches are distributed among these distances: 39 39 0.83 40 8 0.17 ACGTcount: A:0.37, C:0.18, G:0.03, T:0.42 Consensus pattern (38 bp): TATTTCATATAACCAATTCATATAACCATTTCATGAAA Found at i:21212646 original size:18 final size:18 Alignment explanation

Indices: 21212623--21212661 Score: 53 Period size: 18 Copynumber: 2.2 Consensus size: 18 21212613 GCTTCCTTAA * 21212623 ATTTTT-AATGTTTTATTC 1 ATTTTTAAAT-TTTAATTC 21212641 ATTTTTAAATTTTAATTC 1 ATTTTTAAATTTTAATTC 21212659 ATT 1 ATT 21212662 AATAATAACA Statistics Matches: 19, Mismatches: 1, Indels: 2 0.86 0.05 0.09 Matches are distributed among these distances: 18 16 0.84 19 3 0.16 ACGTcount: A:0.28, C:0.05, G:0.03, T:0.64 Consensus pattern (18 bp): ATTTTTAAATTTTAATTC Found at i:21213001 original size:14 final size:14 Alignment explanation

Indices: 21212982--21213019 Score: 67 Period size: 14 Copynumber: 2.7 Consensus size: 14 21212972 TTTAGGGACA 21212982 AACCATTTGTACCT 1 AACCATTTGTACCT 21212996 AACCATTTGTACCT 1 AACCATTTGTACCT * 21213010 AACTATTTGT 1 AACCATTTGT 21213020 TCACTTAGAA Statistics Matches: 23, Mismatches: 1, Indels: 0 0.96 0.04 0.00 Matches are distributed among these distances: 14 23 1.00 ACGTcount: A:0.29, C:0.24, G:0.08, T:0.39 Consensus pattern (14 bp): AACCATTTGTACCT Found at i:21214128 original size:28 final size:28 Alignment explanation

Indices: 21214054--21214130 Score: 100 Period size: 28 Copynumber: 2.8 Consensus size: 28 21214044 TAGTACAGTA * * * * * 21214054 TGGGCCTTAGACCAAAACAATAACAATG 1 TGGGCCTTAGCCCAATACAGTAATAGTG 21214082 TGGGCCTTAGCCCAATACAGTAATAGTG 1 TGGGCCTTAGCCCAATACAGTAATAGTG * 21214110 TGGGCCTTAGCCCAGTACAGT 1 TGGGCCTTAGCCCAATACAGT 21214131 CCAATAATGC Statistics Matches: 43, Mismatches: 6, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 28 43 1.00 ACGTcount: A:0.31, C:0.23, G:0.23, T:0.22 Consensus pattern (28 bp): TGGGCCTTAGCCCAATACAGTAATAGTG Found at i:21214442 original size:27 final size:27 Alignment explanation

Indices: 21214412--21214625 Score: 218 Period size: 27 Copynumber: 7.9 Consensus size: 27 21214402 TCATCTCCTA ** * 21214412 GGGGTATAACAGTCATTTTACCCTATG 1 GGGGTATTTCAGTCATTTTACCCTCTG * * * 21214439 GGGGTATTTTAGTCATTTTACCTTTTG 1 GGGGTATTTCAGTCATTTTACCCTCTG ** * * 21214466 GGGGTATTTTGGTCATTTTACCCTTTT 1 GGGGTATTTCAGTCATTTTACCCTCTG * * * * 21214493 AGGGTATTTCGGTCATTTTACCATCCG 1 GGGGTATTTCAGTCATTTTACCCTCTG * 21214520 GGGGTATTTCAGTTATTTTACCCTAC-G 1 GGGGTATTTCAGTCATTTTACCCT-CTG * * 21214547 GGGGTATTTCAGTCATTTGACCCTCTA 1 GGGGTATTTCAGTCATTTTACCCTCTG * 21214574 GGGGTATTTC-GATCATTTTACCCTCTA 1 GGGGTATTTCAG-TCATTTTACCCTCTG 21214601 GGGGTATTTC-GATCATTTTACCCTC 1 GGGGTATTTCAG-TCATTTTACCCTC 21214626 CAGGGTATTT Statistics Matches: 162, Mismatches: 22, Indels: 6 0.85 0.12 0.03 Matches are distributed among these distances: 26 2 0.01 27 159 0.98 28 1 0.01 ACGTcount: A:0.18, C:0.19, G:0.21, T:0.42 Consensus pattern (27 bp): GGGGTATTTCAGTCATTTTACCCTCTG Found at i:21215351 original size:5 final size:5 Alignment explanation

Indices: 21215334--21215365 Score: 50 Period size: 5 Copynumber: 6.8 Consensus size: 5 21215324 GAATGAAGAG 21215334 GAGAA GA-AA -AGAA GAGAA GAGAA GAGAA GAGA 1 GAGAA GAGAA GAGAA GAGAA GAGAA GAGAA GAGA 21215366 GAGGGTTCAA Statistics Matches: 25, Mismatches: 0, Indels: 4 0.86 0.00 0.14 Matches are distributed among these distances: 3 1 0.04 4 4 0.16 5 20 0.80 ACGTcount: A:0.62, C:0.00, G:0.38, T:0.00 Consensus pattern (5 bp): GAGAA Found at i:21217372 original size:43 final size:42 Alignment explanation

Indices: 21217326--21217622 Score: 333 Period size: 43 Copynumber: 6.9 Consensus size: 42 21217316 AGCTAAAAGT * * * 21217326 CATGACCTTTAGCGGTGCTTCTCCCACAAACGCCGCTATAGAT 1 CATGACCTTTAGCGGCGCTT-TCCCACAAACGCCGCTAAAGAA * * * * * 21217369 CATGACTTTTAGCGACGCTTTTCCCACAAACGCTGCTATAGAT 1 CATGACCTTTAGCGGCGC-TTTCCCACAAACGCCGCTAAAGAA * * * 21217412 CATGACTTTTAGCGGTGCTTTTCCCACAAACGCCGCTATAGAA 1 CATGACCTTTAGCGGCGC-TTTCCCACAAACGCCGCTAAAGAA * * * 21217455 CATGAGCTTTAGCGGCACTTTACCACAAACGCCGCTAAAGAA 1 CATGACCTTTAGCGGCGCTTTCCCACAAACGCCGCTAAAGAA * * 21217497 CATGACCTTTAGCGTCGCTTTACCAACAAACGCCGCTAAAGAA 1 CATGACCTTTAGCGGCGCTTT-CCCACAAACGCCGCTAAAGAA * * * 21217540 CATGATCTTTAGCGGCACTTTACCCACAAATGCCGCTAAAGAA 1 CATGACCTTTAGCGGCGCTTT-CCCACAAACGCCGCTAAAGAA * * ** 21217583 CATGATCTTTAGCGGCACTTTTATCACAAACGCCGCTAAA 1 CATGACCTTTAGCGGCGC-TTTCCCACAAACGCCGCTAAA 21217623 AGTATGGTTC Statistics Matches: 224, Mismatches: 27, Indels: 6 0.87 0.11 0.02 Matches are distributed among these distances: 42 40 0.18 43 179 0.80 44 5 0.02 ACGTcount: A:0.29, C:0.29, G:0.17, T:0.25 Consensus pattern (42 bp): CATGACCTTTAGCGGCGCTTTCCCACAAACGCCGCTAAAGAA Found at i:21217501 original size:128 final size:128 Alignment explanation

Indices: 21217326--21217622 Score: 380 Period size: 128 Copynumber: 2.3 Consensus size: 128 21217316 AGCTAAAAGT ** * * * * * 21217326 CATGACCTTTAGCGGTGCTTCTCCCACAAACGCCGCTATAGATCATGACTTTTAGCGACGCTTTT 1 CATGACCTTTAGCGGCACTT-TACCACAAACGCCGCTAAAGAACATGACCTTTAGCGACGCTTTA * * * * ** * * 21217391 CCCACAAACGCTGCTATAGATCATGA-CTTTTAGCGGTGCTTTTCCCACAAACGCCGCTATAGAA 65 CCAACAAACGCCGCTAAAGAACATGATC-TTTAGCGGCACTTTACCCACAAACGCCGCTAAAGAA * * 21217455 CATGAGCTTTAGCGGCACTTTACCACAAACGCCGCTAAAGAACATGACCTTTAGCGTCGCTTTAC 1 CATGACCTTTAGCGGCACTTTACCACAAACGCCGCTAAAGAACATGACCTTTAGCGACGCTTTAC * 21217520 CAACAAACGCCGCTAAAGAACATGATCTTTAGCGGCACTTTACCCACAAATGCCGCTAAAGAA 66 CAACAAACGCCGCTAAAGAACATGATCTTTAGCGGCACTTTACCCACAAACGCCGCTAAAGAA * * 21217583 CATGATCTTTAGCGGCACTTTTATCACAAACGCCGCTAAA 1 CATGACCTTTAGCGGCAC-TTTACCACAAACGCCGCTAAA 21217623 AGTATGGTTC Statistics Matches: 146, Mismatches: 20, Indels: 4 0.86 0.12 0.02 Matches are distributed among these distances: 128 108 0.74 129 38 0.26 ACGTcount: A:0.29, C:0.29, G:0.17, T:0.25 Consensus pattern (128 bp): CATGACCTTTAGCGGCACTTTACCACAAACGCCGCTAAAGAACATGACCTTTAGCGACGCTTTAC CAACAAACGCCGCTAAAGAACATGATCTTTAGCGGCACTTTACCCACAAACGCCGCTAAAGAA Found at i:21219553 original size:86 final size:84 Alignment explanation

Indices: 21219378--21219643 Score: 291 Period size: 86 Copynumber: 3.1 Consensus size: 84 21219368 TTAACACAAA * * * * * * * 21219378 TCAATCTTACAACAGTTTATGTGAAACAAATTAGCAAAAATCAACT-ATTCATAGTTCATATATT 1 TCAATTTTATAACAATTTATATGCAACAAATTAGCAAAAATCAA-TGGTTCA-AGTTCATGTATT ** * * 21219442 TACTGACAATAGGGCTAAATCT 64 TACCAACAATA-GACTAAATTT * * 21219464 TCAATTTCTATAACGATTTATATGCAACAAATTAACAAAAATCAATGGTTCAAGTTCATGTATTT 1 TCAATTT-TATAACAATTTATATGCAACAAATTAGCAAAAATCAATGGTTCAAGTTCATGTATTT * * 21219529 ACCAAAAATTAGACTAATTTT 65 ACCAACAA-TAGACTAAATTT * * * 21219550 TCAATATTTATAACAACTTATATGCAACAAATTAGCAAAAATAAATGGTTCAAGTTCATGCATTT 1 TCAAT-TTTATAACAATTTATATGCAACAAATTAGCAAAAATCAATGGTTCAAGTTCATGTATTT * 21219615 ACCAACAACCAGACTAAATTT 65 ACCAACAA-TAGACTAAATTT 21219636 TCAATTTT 1 TCAATTTT 21219644 CAATTTTTAA Statistics Matches: 152, Mismatches: 24, Indels: 9 0.82 0.13 0.05 Matches are distributed among these distances: 85 3 0.02 86 110 0.72 87 39 0.26 ACGTcount: A:0.42, C:0.16, G:0.09, T:0.34 Consensus pattern (84 bp): TCAATTTTATAACAATTTATATGCAACAAATTAGCAAAAATCAATGGTTCAAGTTCATGTATTTA CCAACAATAGACTAAATTT Found at i:21219750 original size:30 final size:30 Alignment explanation

Indices: 21219714--21219784 Score: 124 Period size: 30 Copynumber: 2.4 Consensus size: 30 21219704 ATTTAAGGAG 21219714 ATATAAAAAATCATCATTTCAACAATTTAC 1 ATATAAAAAATCATCATTTCAACAATTTAC 21219744 ATATAAAAAATCATCATTTCAACAATTTAC 1 ATATAAAAAATCATCATTTCAACAATTTAC ** 21219774 ATGCAAAAAAT 1 ATATAAAAAAT 21219785 TAGCAAAAAT Statistics Matches: 39, Mismatches: 2, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 30 39 1.00 ACGTcount: A:0.52, C:0.15, G:0.01, T:0.31 Consensus pattern (30 bp): ATATAAAAAATCATCATTTCAACAATTTAC Found at i:21220149 original size:168 final size:168 Alignment explanation

Indices: 21219871--21220192 Score: 459 Period size: 168 Copynumber: 1.9 Consensus size: 168 21219861 TTTTAAATGC * ** * * * 21219871 AACAAATTAGCACAAATTAACAATTCAAGTTCATATATTCACCAAAAACCAGACTAAATTTTCAA 1 AACAAATTAGCAAAAACCAACAATTCAAGTGCATATATGCACCAAAAACCAGACTAAATTTCCAA * ** * * 21219936 TTTTTATAAAATAAGAGGATGGAAGTCTAACAAATTAAAGTTTCCAATGAAATTCAGACTTACAC 66 TTTTTACAAAATAAGAGGATCAAAGTCTAACAAATTAAAGTTTCCAATGAAATTCAAAATTACAC 21220001 CATTAAGTAAATAAGAGTTTTAAACTATTGAAGATCAA 131 CATTAAGTAAATAAGAGTTTTAAACTATTGAAGATCAA ** * 21220039 AACAAATTAGCAAAAACCAACTGTTCAAGTGCATGTATGCACCAAAAA-CAGGA-TCAAATTTCC 1 AACAAATTAGCAAAAACCAACAATTCAAGTGCATATATGCACCAAAAACCA-GACT-AAATTTCC * * * 21220102 AATTTTTGCAAATTAAGAGGATCAAATTCTAACAAATTAAAGTTTCCAATGAAATTCAAAATTAC 64 AATTTTTACAAAATAAGAGGATCAAAGTCTAACAAATTAAAGTTTCCAATGAAATTCAAAATTAC 21220167 ACCATTAAGTAAATAAGAGTTTTAAA 129 ACCATTAAGTAAATAAGAGTTTTAAA 21220193 TTACCTGAGA Statistics Matches: 135, Mismatches: 17, Indels: 4 0.87 0.11 0.03 Matches are distributed among these distances: 167 3 0.02 168 132 0.98 ACGTcount: A:0.46, C:0.15, G:0.11, T:0.28 Consensus pattern (168 bp): AACAAATTAGCAAAAACCAACAATTCAAGTGCATATATGCACCAAAAACCAGACTAAATTTCCAA TTTTTACAAAATAAGAGGATCAAAGTCTAACAAATTAAAGTTTCCAATGAAATTCAAAATTACAC CATTAAGTAAATAAGAGTTTTAAACTATTGAAGATCAA Found at i:21223611 original size:31 final size:31 Alignment explanation

Indices: 21223480--21223610 Score: 122 Period size: 31 Copynumber: 4.3 Consensus size: 31 21223470 GTATATATTA 21223480 TATACATTGCTGAAATTTATTTGTATATGGT 1 TATACATTGCTGAAATTTATTTGTATATGGT * * ** * * * 21223511 TATCCATTGTTGAAATAGAATTGTATGTGGA 1 TATACATTGCTGAAATTTATTTGTATATGGT * * * ** 21223542 TATA-AGTTGCCGAAA-TAATTCGTATATAAT 1 TATACA-TTGCTGAAATTTATTTGTATATGGT * 21223572 TATACATTACTGAAATTTATTTGTATATGGT 1 TATACATTGCTGAAATTTATTTGTATATGGT 21223603 TATACATT 1 TATACATT 21223611 ATCGAAGTTG Statistics Matches: 73, Mismatches: 24, Indels: 6 0.71 0.23 0.06 Matches are distributed among these distances: 30 19 0.26 31 54 0.74 ACGTcount: A:0.34, C:0.08, G:0.15, T:0.44 Consensus pattern (31 bp): TATACATTGCTGAAATTTATTTGTATATGGT Found at i:21223643 original size:92 final size:92 Alignment explanation

Indices: 21223470--21223671 Score: 242 Period size: 92 Copynumber: 2.2 Consensus size: 92 21223460 TAATTTAAAT * * * * * * 21223470 GTATATATTATATACATTGCTGAAATTTATTTGTATATGGTTATCCATTGTTGAAATAGAATTGT 1 GTATATAAT-TATACATTACTGAAATTTATTTGTATATGGTTATACATTATCGAAATAGAATTAT ** * * 21223535 ATGTGGATATAAGTTGCCGAAATAATTC 65 ATAAGAAAATAAGTTGCCGAAATAATTC * * * 21223563 GTATATAATTATACATTACTGAAATTTATTTGTATATGGTTATACATTATCGAAGTTGATTTATA 1 GTATATAATTATACATTACTGAAATTTATTTGTATATGGTTATACATTATCGAAATAGAATTATA * * * 21223628 TAAGAAAATAGGTTGCCGAATTGATTC 66 TAAGAAAATAAGTTGCCGAAATAATTC * 21223655 GTATATGATTATACATT 1 GTATATAATTATACATT 21223672 GTCGAATTTG Statistics Matches: 92, Mismatches: 17, Indels: 1 0.84 0.15 0.01 Matches are distributed among these distances: 92 84 0.91 93 8 0.09 ACGTcount: A:0.35, C:0.07, G:0.16, T:0.42 Consensus pattern (92 bp): GTATATAATTATACATTACTGAAATTTATTTGTATATGGTTATACATTATCGAAATAGAATTATA TAAGAAAATAAGTTGCCGAAATAATTC Found at i:21223684 original size:31 final size:31 Alignment explanation

Indices: 21223559--21223771 Score: 101 Period size: 31 Copynumber: 6.9 Consensus size: 31 21223549 TGCCGAAATA * * 21223559 ATTCGTATATAATTATACATT-ACTGAAATTT- 1 ATTCGTATATGATTATACATTGTC-G-AATTTG * * * * 21223590 ATTTGTATATGGTTATACATTATCGAAGTTG 1 ATTCGTATATGATTATACATTGTCGAATTTG ** * ** ** * 21223621 ATTTATATAAGAAAATAGGTTGCCGAA-TTG 1 ATTCGTATATGATTATACATTGTCGAATTTG 21223651 ATTCGTATATGATTATACATTGTCGAATTTG 1 ATTCGTATATGATTATACATTGTCGAATTTG *** ** * * * 21223682 ATTCACCTCCGGTTATACATTG-CCAAATTG 1 ATTCGTATATGATTATACATTGTCGAATTTG * * ** * * 21223712 ATTTGTGTATGATTATATGTTGCCAAATTTG 1 ATTCGTATATGATTATACATTGTCGAATTTG * * * 21223743 ATTCGTATATGAATATAAATTGTTGAATT 1 ATTCGTATATGATTATACATTGTCGAATT 21223772 GACTTGAGCA Statistics Matches: 129, Mismatches: 49, Indels: 8 0.69 0.26 0.04 Matches are distributed among these distances: 30 45 0.35 31 83 0.64 32 1 0.01 ACGTcount: A:0.32, C:0.10, G:0.15, T:0.42 Consensus pattern (31 bp): ATTCGTATATGATTATACATTGTCGAATTTG Found at i:21223696 original size:92 final size:92 Alignment explanation

Indices: 21223600--21223771 Score: 211 Period size: 92 Copynumber: 1.9 Consensus size: 92 21223590 ATTTGTATAT * * * * * 21223600 GGTTATACATTATCGAAGTTGATTTATATAAGAAAATAGGTTGCCGAA-TTGATTCGTATATGAT 1 GGTTATACATT-GCCAAATTGATTTATATAAGAAAATAGGTTGCCAAATTTGATTCGTATATGAA * 21223664 TATACATTGTCGAATTTGATTCACCTCC 65 TATAAATTGTCGAATTTGATTCACCTCC * * * ** * 21223692 GGTTATACATTGCCAAATTGATTTGTGTATGATTATATGTTGCCAAATTTGATTCGTATATGAAT 1 GGTTATACATTGCCAAATTGATTTATATAAGAAAATAGGTTGCCAAATTTGATTCGTATATGAAT * 21223757 ATAAATTGTTGAATT 66 ATAAATTGTCGAATT 21223772 GACTTGAGCA Statistics Matches: 66, Mismatches: 13, Indels: 2 0.81 0.16 0.02 Matches are distributed among these distances: 91 26 0.39 92 40 0.61 ACGTcount: A:0.31, C:0.10, G:0.17, T:0.41 Consensus pattern (92 bp): GGTTATACATTGCCAAATTGATTTATATAAGAAAATAGGTTGCCAAATTTGATTCGTATATGAAT ATAAATTGTCGAATTTGATTCACCTCC Found at i:21224030 original size:42 final size:42 Alignment explanation

Indices: 21223891--21224248 Score: 457 Period size: 42 Copynumber: 8.7 Consensus size: 42 21223881 AGAACTTCGG * 21223891 ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATATAATT 1 ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATATGATT 21223933 -CGTGTAAGACCATGTCTGGGACATTGGCATCG-ATATGAGATT 1 ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATAT--GATT 21223975 ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATATGATT 1 ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATATGATT 21224017 ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATATGATT 1 ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATATGATT 21224059 -CGTGTAAGACCATGTCTGGGA-ATTGGCATCG-ATATGAGATT 1 ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATAT--GATT 21224100 ACGTGTAAGACCATGTCT--G-----GGCATCGTATATGATT 1 ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATATGATT 21224135 -CGTGTAAGACCATGTCTGGGACATTGGCATCG-ATATGAGATT 1 ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATAT--GATT 21224177 -CGTGTAAGACCATAG-CTGGG-CTATTGGCATCGATATATGA-T 1 ACGTGTAAGACCAT-GTCTGGGAC-ATTGGCATCG-TATATGATT * * * 21224218 AGCATGTAAGACCATATCTGGGATA-TGGCAT 1 A-CGTGTAAGACCATGTCTGGGACATTGGCAT 21224249 TGTGCGAGTT Statistics Matches: 287, Mismatches: 4, Indels: 50 0.84 0.01 0.15 Matches are distributed among these distances: 34 17 0.06 35 4 0.01 36 8 0.03 37 4 0.01 39 4 0.01 40 19 0.07 41 66 0.23 42 106 0.37 43 51 0.18 44 8 0.03 ACGTcount: A:0.27, C:0.16, G:0.27, T:0.30 Consensus pattern (42 bp): ACGTGTAAGACCATGTCTGGGACATTGGCATCGTATATGATT Found at i:21224130 original size:76 final size:76 Alignment explanation

Indices: 21224043--21224197 Score: 278 Period size: 76 Copynumber: 2.0 Consensus size: 76 21224033 CTGGGACATT 21224043 GGCATCGTATATGATTCGTGTAAGACCATGTCTGGGA-ATTGGCATCGATATGAGATTACGTGTA 1 GGCATCGTATATGATTCGTGTAAGACCATGTCTGGGACATTGGCATCGATATGAGATT-CGTGTA 21224107 AGACCAT-GTCTG 65 AGACCATAG-CTG 21224119 GGCATCGTATATGATTCGTGTAAGACCATGTCTGGGACATTGGCATCGATATGAGATTCGTGTAA 1 GGCATCGTATATGATTCGTGTAAGACCATGTCTGGGACATTGGCATCGATATGAGATTCGTGTAA 21224184 GACCATAGCTG 66 GACCATAGCTG 21224195 GGC 1 GGC 21224198 TATTGGCATC Statistics Matches: 77, Mismatches: 0, Indels: 4 0.95 0.00 0.05 Matches are distributed among these distances: 76 56 0.73 77 21 0.27 ACGTcount: A:0.26, C:0.17, G:0.28, T:0.29 Consensus pattern (76 bp): GGCATCGTATATGATTCGTGTAAGACCATGTCTGGGACATTGGCATCGATATGAGATTCGTGTAA GACCATAGCTG Found at i:21224141 original size:34 final size:34 Alignment explanation

Indices: 21224043--21224154 Score: 136 Period size: 34 Copynumber: 3.1 Consensus size: 34 21224033 CTGGGACATT 21224043 GGCATCGTATATGATTCGTGTAAGACCATGTCTGGG 1 GGCATCGTATATGATTCGTGTAAGACCATGTCT--G 21224079 AATTGGCATCG-ATATGAGATTACGTGTAAGACCATGTCTG 1 ----GGCATCGTATAT--GATT-CGTGTAAGACCATGTCTG 21224119 GGCATCGTATATGATTCGTGTAAGACCATGTCTG 1 GGCATCGTATATGATTCGTGTAAGACCATGTCTG 21224153 GG 1 GG 21224155 ACATTGGCAT Statistics Matches: 68, Mismatches: 0, Indels: 14 0.83 0.00 0.17 Matches are distributed among these distances: 34 20 0.29 35 4 0.06 36 7 0.10 37 4 0.06 39 4 0.06 40 8 0.12 41 4 0.06 42 17 0.25 ACGTcount: A:0.25, C:0.16, G:0.29, T:0.30 Consensus pattern (34 bp): GGCATCGTATATGATTCGTGTAAGACCATGTCTG Found at i:21224741 original size:21 final size:22 Alignment explanation

Indices: 21224714--21224793 Score: 83 Period size: 21 Copynumber: 3.6 Consensus size: 22 21224704 CTAATTAATC * 21224714 TAAACCCTAAACCCCTAACCCC 1 TAAACCCTAAACCCCTAACCCT 21224736 T-AACCCTTAAGTACCCCTAACCCT 1 TAAACCC-TAA--ACCCCTAACCCT ** 21224760 TAAACTTTAAACCCC-AACCCT 1 TAAACCCTAAACCCCTAACCCT * 21224781 TAAACCATAAACC 1 TAAACCCTAAACC 21224794 ATAATCCCTA Statistics Matches: 49, Mismatches: 5, Indels: 9 0.78 0.08 0.14 Matches are distributed among these distances: 21 22 0.45 22 9 0.18 24 15 0.31 25 3 0.06 ACGTcount: A:0.38, C:0.41, G:0.01, T:0.20 Consensus pattern (22 bp): TAAACCCTAAACCCCTAACCCT Found at i:21224769 original size:24 final size:22 Alignment explanation

Indices: 21224714--21224793 Score: 90 Period size: 24 Copynumber: 3.6 Consensus size: 22 21224704 CTAATTAATC * * 21224714 TAAACCCTAAACCCCTAACCCC 1 TAAACCTTAAACCCCTAACCCT * 21224736 TAACCCTTAAGTACCCCTAACCCT 1 TAAACCTTAA--ACCCCTAACCCT * 21224760 TAAACTTTAAACCCC-AACCCT 1 TAAACCTTAAACCCCTAACCCT * 21224781 TAAACCATAAACC 1 TAAACCTTAAACC 21224794 ATAATCCCTA Statistics Matches: 49, Mismatches: 7, Indels: 5 0.80 0.11 0.08 Matches are distributed among these distances: 21 17 0.35 22 13 0.27 24 19 0.39 ACGTcount: A:0.38, C:0.41, G:0.01, T:0.20 Consensus pattern (22 bp): TAAACCTTAAACCCCTAACCCT Found at i:21224810 original size:15 final size:15 Alignment explanation

Indices: 21224775--21224822 Score: 62 Period size: 15 Copynumber: 3.3 Consensus size: 15 21224765 TTTAAACCCC * * 21224775 AACCCTTAAACCATA 1 AACCCATAATCCATA * 21224790 AA-CCATAATCCCTA 1 AACCCATAATCCATA 21224804 AACCCATAATCCATA 1 AACCCATAATCCATA 21224819 AACC 1 AACC 21224823 TTAAGATAGT Statistics Matches: 28, Mismatches: 4, Indels: 2 0.82 0.12 0.06 Matches are distributed among these distances: 14 11 0.39 15 17 0.61 ACGTcount: A:0.46, C:0.35, G:0.00, T:0.19 Consensus pattern (15 bp): AACCCATAATCCATA Found at i:21225050 original size:255 final size:243 Alignment explanation

Indices: 21224640--21225201 Score: 840 Period size: 255 Copynumber: 2.3 Consensus size: 243 21224630 GTTTAAATAA * * 21224640 TAAAATTAATATTATCTCTTTTACAATTATATAAGAAATTATTTAATATATAAATTAAAAAACAC 1 TAAAATTAATACTATCTCTTTTACGATTATATAAGAAATTATTTAATATATAAATTAAAAAACAC * * * 21224705 TAATTAATCTAAACCCTAAACCCCTAACCCCTAACCCTTAAGTACCCCTAACCCTTAAACTTTAA 66 TAATTAATCTAAACCCTAAACCCCTAACCCCTAACCATTAAGAACCCCTAACCCTTAAACCTTAA 21224770 ACCCCAACCCTTAAACCATAAACCATAATCCCTAAACCCATAATCCATAAACCTTAAGATAGTAA 131 ACCCCAACCCTTAAACCATAAACCATAATCCCTAAACCCATAATCCATAAACCTTAAGATAGTAA * ** 21224835 CTCCTAAACCTTAAACCCTAAACTATAATGATAATTAATTTAATGTTT 196 CTCCTAAACCTTAAACCCTAAACTATAATGATAATTAATTCAACATTT * 21224883 TAAAATTAATACTATCTCTTTTACGATTATATAAGAAATTATTTAATATATAAATTAAATAACAC 1 TAAAATTAATACTATCTCTTTTACGATTATATAAGAAATTATTTAATATATAAATTAAAAAACAC * * * 21224948 TAATTAATCT-AACCCTTAAACCCTTATCCCCTATCCCCTAAACCATAAATCAGAAACTCCTAAC 66 TAATTAATCTAAACCC-T-AA-----A-CCCCTAACCCCT-AACCAT---TAAG-AACCCCTAAC * 21225012 TCC-TAAACCTTAAACCCCAACCCTTAAACCATAAACCATAATCCCTAAACCCATGATCCATAAA 118 -CCTTAAACCTTAAACCCCAACCCTTAAACCATAAACCATAATCCCTAAACCCATAATCCATAAA * 21225076 CCTTAAGATAGTAACTCCTAAACCTTAAACCTTAAACTATAATGATAATTAATTCAACATTT 182 CCTTAAGATAGTAACTCCTAAACCTTAAACCCTAAACTATAATGATAATTAATTCAACATTT * * 21225138 TAAAATTAATACTATCTCTTTTACGATTATATAAGAAATTATTTAATACATAAACTAAAAAACA 1 TAAAATTAATACTATCTCTTTTACGATTATATAAGAAATTATTTAATATATAAATTAAAAAACA 21225202 ATCAATAATG Statistics Matches: 288, Mismatches: 17, Indels: 16 0.90 0.05 0.05 Matches are distributed among these distances: 242 5 0.02 243 73 0.25 244 2 0.01 249 1 0.00 250 11 0.04 251 5 0.02 254 3 0.01 255 186 0.65 256 2 0.01 ACGTcount: A:0.44, C:0.23, G:0.03, T:0.31 Consensus pattern (243 bp): TAAAATTAATACTATCTCTTTTACGATTATATAAGAAATTATTTAATATATAAATTAAAAAACAC TAATTAATCTAAACCCTAAACCCCTAACCCCTAACCATTAAGAACCCCTAACCCTTAAACCTTAA ACCCCAACCCTTAAACCATAAACCATAATCCCTAAACCCATAATCCATAAACCTTAAGATAGTAA CTCCTAAACCTTAAACCCTAAACTATAATGATAATTAATTCAACATTT Found at i:21225060 original size:21 final size:20 Alignment explanation

Indices: 21225013--21225062 Score: 55 Period size: 21 Copynumber: 2.4 Consensus size: 20 21225003 ACTCCTAACT * * 21225013 CCTAAACCTTAAACCCCAAC 1 CCTAAACCATAAACCACAAC * 21225033 CCTTAAACCATAAACCATAATC 1 CC-TAAACCATAAACCACAA-C 21225055 CCTAAACC 1 CCTAAACC 21225063 CATGATCCAT Statistics Matches: 25, Mismatches: 3, Indels: 3 0.81 0.10 0.10 Matches are distributed among these distances: 20 2 0.08 21 20 0.80 22 3 0.12 ACGTcount: A:0.42, C:0.40, G:0.00, T:0.18 Consensus pattern (20 bp): CCTAAACCATAAACCACAAC Found at i:21225065 original size:15 final size:15 Alignment explanation

Indices: 21225030--21225077 Score: 53 Period size: 15 Copynumber: 3.3 Consensus size: 15 21225020 CTTAAACCCC * * 21225030 AACCCTTAAACCATA 1 AACCCATAATCCATA * 21225045 AA-CCATAATCCCTA 1 AACCCATAATCCATA * 21225059 AACCCATGATCCATA 1 AACCCATAATCCATA 21225074 AACC 1 AACC 21225078 TTAAGATAGT Statistics Matches: 27, Mismatches: 5, Indels: 2 0.79 0.15 0.06 Matches are distributed among these distances: 14 11 0.41 15 16 0.59 ACGTcount: A:0.44, C:0.35, G:0.02, T:0.19 Consensus pattern (15 bp): AACCCATAATCCATA Found at i:21225487 original size:41 final size:41 Alignment explanation

Indices: 21225425--21225700 Score: 250 Period size: 41 Copynumber: 6.7 Consensus size: 41 21225415 TTTTCGCAGA * ** * * * 21225425 GCTTTTTCAAAAATGCCGCTAAAGCCCCGAGAATTAGCGGC 1 GCTTCTTCAAAAACACCGCTAAAGCACCGAGCATTAGCGAC * * 21225466 GTTTCTTCAAAAACATCGCTAAAGCACCGAGCATTAGCGAC 1 GCTTCTTCAAAAACACCGCTAAAGCACCGAGCATTAGCGAC * * * 21225507 GTTTCTTCAAAAACGCCGCTAAAGCACCGAGCATTCGCGAC 1 GCTTCTTCAAAAACACCGCTAAAGCACCGAGCATTAGCGAC * * ** 21225548 GCTTCTTCGAAAACGCCGCTAAAGCACCGAGCATTAGCGGT 1 GCTTCTTCAAAAACACCGCTAAAGCACCGAGCATTAGCGAC * * * * 21225589 GCTTCTTCGAAAACATCGCTAAAGCATCGAGCATTAG-TAGC 1 GCTTCTTCAAAAACACCGCTAAAGCACCGAGCATTAGCGA-C ** * * ** 21225630 GCTT-TTCCAAAAATGCCCCTAAAGTACCGAGCATTAGCGGT 1 GCTTCTT-CAAAAACACCGCTAAAGCACCGAGCATTAGCGAC * * ** * 21225671 ACTTTTTCAAAAATGCCGCTAAAGCCCCGA 1 GCTTCTTCAAAAACACCGCTAAAGCACCGA 21225701 AAGGTAAGAA Statistics Matches: 195, Mismatches: 36, Indels: 8 0.82 0.15 0.03 Matches are distributed among these distances: 40 2 0.01 41 191 0.98 42 2 0.01 ACGTcount: A:0.30, C:0.28, G:0.20, T:0.22 Consensus pattern (41 bp): GCTTCTTCAAAAACACCGCTAAAGCACCGAGCATTAGCGAC Found at i:21225568 original size:82 final size:82 Alignment explanation

Indices: 21225430--21225700 Score: 280 Period size: 82 Copynumber: 3.3 Consensus size: 82 21225420 GCAGAGCTTT * * * * * ** 21225430 TTCAAAAATGCCGCTAAAGCCCCGAGAATTAGCGGCGTTTCTTCAAAAACATCGCTAAAGCACCG 1 TTCAAAAACGCCGCTAAAGCACCGAGCATTAGCGACGCTTCTTCAAAAACGCCGCTAAAGCACCG 21225495 AGCATTAGCGACGTTTC 66 AGCATTAGCGACGTTTC * * 21225512 TTCAAAAACGCCGCTAAAGCACCGAGCATTCGCGACGCTTCTTCGAAAACGCCGCTAAAGCACCG 1 TTCAAAAACGCCGCTAAAGCACCGAGCATTAGCGACGCTTCTTCAAAAACGCCGCTAAAGCACCG ** * 21225577 AGCATTAGCGGTGCTTC 66 AGCATTAGCGACGTTTC * ** * * * * * 21225594 TTCGAAAACATCGCTAAAGCATCGAGCATTAG-TAGCGCTT-TTCCAAAAATGCCCCTAAAGTAC 1 TTCAAAAACGCCGCTAAAGCACCGAGCATTAGCGA-CGCTTCTT-CAAAAACGCCGCTAAAGCAC 21225657 CGAGCATTAGCGGTAC-TTT- 64 CGAGCATTAGC-G-ACGTTTC * * 21225676 TTCAAAAATGCCGCTAAAGCCCCGA 1 TTCAAAAACGCCGCTAAAGCACCGA 21225701 AAGGTAAGAA Statistics Matches: 154, Mismatches: 31, Indels: 8 0.80 0.16 0.04 Matches are distributed among these distances: 81 3 0.02 82 148 0.96 83 3 0.02 ACGTcount: A:0.31, C:0.28, G:0.20, T:0.21 Consensus pattern (82 bp): TTCAAAAACGCCGCTAAAGCACCGAGCATTAGCGACGCTTCTTCAAAAACGCCGCTAAAGCACCG AGCATTAGCGACGTTTC Found at i:21225641 original size:123 final size:122 Alignment explanation

Indices: 21225417--21225700 Score: 358 Period size: 123 Copynumber: 2.3 Consensus size: 122 21225407 TAAAATGTTT * 21225417 TTCGCAGAGCTTTTTCAAAAATGCCGCTAAAGCCCCGAGAATTAGCGGCGTTTCTTCAAAAACAT 1 TTCGC-GAGCTTTTTCAAAAATGCCGCTAAAGCCCCGAGAATTAGCGGCGCTTCTTCAAAAACAT * * * 21225482 CGCTAAAGCACCGAGCATTAGCGACGTTTCTTCAAAAACGCCGCTAAAGCACCGAGCA 65 CGCTAAAGCACCGAGCATTAGAGACGTTTCTCCAAAAACGCCCCTAAAGCACCGAGCA * * * * * * * 21225540 TTCGCGACGCTTCTTCGAAAACGCCGCTAAAGCACCGAGCATTAGCGGTGCTTCTTCGAAAACAT 1 TTCGCGA-GCTTTTTCAAAAATGCCGCTAAAGCCCCGAGAATTAGCGGCGCTTCTTCAAAAACAT * * * 21225605 CGCTAAAGCATCGAGCATTAGTAG-CGCTTT-TCCAAAAATGCCCCTAAAGTACCGAGCA 65 CGCTAAAGCACCGAGCATTAG-AGACG-TTTCTCCAAAAACGCCCCTAAAGCACCGAGCA * 21225663 TTAGCG-GTACTTTTTCAAAAATGCCGCTAAAGCCCCGA 1 TTCGCGAG--CTTTTTCAAAAATGCCGCTAAAGCCCCGA 21225701 AAGGTAAGAA Statistics Matches: 137, Mismatches: 19, Indels: 10 0.83 0.11 0.06 Matches are distributed among these distances: 121 1 0.01 122 2 0.01 123 130 0.95 124 4 0.03 ACGTcount: A:0.30, C:0.28, G:0.20, T:0.22 Consensus pattern (122 bp): TTCGCGAGCTTTTTCAAAAATGCCGCTAAAGCCCCGAGAATTAGCGGCGCTTCTTCAAAAACATC GCTAAAGCACCGAGCATTAGAGACGTTTCTCCAAAAACGCCCCTAAAGCACCGAGCA Found at i:21225793 original size:40 final size:40 Alignment explanation

Indices: 21225749--21225861 Score: 117 Period size: 40 Copynumber: 2.8 Consensus size: 40 21225739 GACGATTTTT * * 21225749 GAAAAAATGCCGCTAATGC-TCATTTTCAGCGGCGTGTTCC 1 GAAAAAGTGCCGCTAATGCTTGATTTT-AGCGGCGTGTTCC * 21225789 G-AAAAGTGCCG-TAAATGCTTGATCTTTAGCGGCGTTTTCC 1 GAAAAAGTGCCGCT-AATGCTTGAT-TTTAGCGGCGTGTTCC * * 21225829 -ATAAAGCGCCGCTAATGCTTGATTTTTAGCGGC 1 GAAAAAGTGCCGCTAATGCTTGA-TTTTAGCGGC 21225862 ATTTTTTGTC Statistics Matches: 62, Mismatches: 5, Indels: 12 0.78 0.06 0.15 Matches are distributed among these distances: 38 1 0.02 39 14 0.23 40 42 0.68 41 5 0.08 ACGTcount: A:0.24, C:0.22, G:0.24, T:0.30 Consensus pattern (40 bp): GAAAAAGTGCCGCTAATGCTTGATTTTAGCGGCGTGTTCC Found at i:21227504 original size:22 final size:20 Alignment explanation

Indices: 21227478--21227518 Score: 55 Period size: 20 Copynumber: 1.9 Consensus size: 20 21227468 GCTATTTCAG 21227478 TTAAATTCATGTCGAAAAATAA 1 TTAAATT-ATGT-GAAAAATAA * 21227500 TTAAATTATGTTAAAAATA 1 TTAAATTATGTGAAAAATA 21227519 TGAACATGAT Statistics Matches: 18, Mismatches: 1, Indels: 2 0.86 0.05 0.10 Matches are distributed among these distances: 20 7 0.39 21 4 0.22 22 7 0.39 ACGTcount: A:0.51, C:0.05, G:0.07, T:0.37 Consensus pattern (20 bp): TTAAATTATGTGAAAAATAA Found at i:21236385 original size:27 final size:27 Alignment explanation

Indices: 21236347--21236404 Score: 116 Period size: 27 Copynumber: 2.1 Consensus size: 27 21236337 ATGTCTGTAA 21236347 TACGGTCTTAGATGGTTTCCAATCCAG 1 TACGGTCTTAGATGGTTTCCAATCCAG 21236374 TACGGTCTTAGATGGTTTCCAATCCAG 1 TACGGTCTTAGATGGTTTCCAATCCAG 21236401 TACG 1 TACG 21236405 ACTTCAATTT Statistics Matches: 31, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 27 31 1.00 ACGTcount: A:0.22, C:0.22, G:0.22, T:0.33 Consensus pattern (27 bp): TACGGTCTTAGATGGTTTCCAATCCAG Found at i:21239867 original size:28 final size:28 Alignment explanation

Indices: 21239823--21239899 Score: 120 Period size: 28 Copynumber: 2.8 Consensus size: 28 21239813 CCTATACAGT 21239823 AACAGT-ACAGTGTGGGCCTTAGCCCAA 1 AACAGTAACAGTGTGGGCCTTAGCCCAA * 21239850 AACAGTAACAATGTGGGCCTTAGCCCAA 1 AACAGTAACAGTGTGGGCCTTAGCCCAA * * 21239878 TACAGTAATAGTGTGGGCCTTA 1 AACAGTAACAGTGTGGGCCTTA 21239900 CCCTAGTACA Statistics Matches: 45, Mismatches: 4, Indels: 1 0.90 0.08 0.02 Matches are distributed among these distances: 27 6 0.13 28 39 0.87 ACGTcount: A:0.31, C:0.22, G:0.25, T:0.22 Consensus pattern (28 bp): AACAGTAACAGTGTGGGCCTTAGCCCAA Found at i:21239909 original size:28 final size:28 Alignment explanation

Indices: 21239824--21239912 Score: 119 Period size: 28 Copynumber: 3.2 Consensus size: 28 21239814 CTATACAGTA * 21239824 ACAGT-ACAGTGTGGGCCTTAGCCCAAA 1 ACAGTAACAGTGTGGGCCTTAGCCCAAT * 21239851 ACAGTAACAATGTGGGCCTTAGCCCAAT 1 ACAGTAACAGTGTGGGCCTTAGCCCAAT * * 21239879 ACAGTAATAGTGTGGGCCTTA-CCCTAGT 1 ACAGTAACAGTGTGGGCCTTAGCCC-AAT 21239907 ACAGTA 1 ACAGTA 21239913 TAATATTGCA Statistics Matches: 55, Mismatches: 5, Indels: 3 0.87 0.08 0.05 Matches are distributed among these distances: 27 8 0.15 28 47 0.85 ACGTcount: A:0.30, C:0.24, G:0.24, T:0.22 Consensus pattern (28 bp): ACAGTAACAGTGTGGGCCTTAGCCCAAT Found at i:21240275 original size:27 final size:26 Alignment explanation

Indices: 21240256--21240477 Score: 150 Period size: 27 Copynumber: 8.2 Consensus size: 26 21240246 GGTATACCAA 21240256 TCATTTTACCTGATGAGGGTATTTCAG 1 TCATTTTACCT-ATGAGGGTATTTCAG * 21240283 TCATTTTACCCTTTGAGGGTATTTCAG 1 TCATTTTA-CCTATGAGGGTATTTCAG * 21240310 TCATTTTACC-ATTCGGGGGTATTTC-G 1 TCATTTTACCTA-T-GAGGGTATTTCAG * * ** 21240336 ATCATTTTACCCATCAGGGGTATTTTGG 1 -TCATTTTACCTATGA-GGGTATTTCAG * * 21240364 TCATTTTACCCTCTAAGGGTATTTC-G 1 TCATTTTA-CCTATGAGGGTATTTCAG * 21240390 ATCATTTTACCT-TACAGAGGTATTTCAG 1 -TCATTTTACCTAT-GAG-GGTATTTCAG * * 21240418 TCATTTTACCT-TGCGGGGATATTT-TG 1 TCATTTTACCTATG-AGGG-TATTTCAG ** * 21240444 ATCATTTTACTCTCCGGGGGTATTTCAG 1 -TCATTTTAC-CTATGAGGGTATTTCAG 21240472 TCATTT 1 TCATTT 21240478 GCCCACGAAT Statistics Matches: 161, Mismatches: 16, Indels: 36 0.76 0.08 0.17 Matches are distributed among these distances: 25 1 0.01 26 13 0.08 27 129 0.80 28 17 0.11 29 1 0.01 ACGTcount: A:0.20, C:0.18, G:0.19, T:0.42 Consensus pattern (26 bp): TCATTTTACCTATGAGGGTATTTCAG Found at i:21240330 original size:54 final size:53 Alignment explanation

Indices: 21240272--21240477 Score: 204 Period size: 54 Copynumber: 3.8 Consensus size: 53 21240262 TACCTGATGA * 21240272 GGGTATTTC-AGTCATTTTACCCTTTGAGGGTATTTCAGTCATTTTACCATTCGG 1 GGGTATTTCGA-TCATTTTACCCTTTCAGGGTATTTCAGTCATTTTACC-TTCGG * ** * ** 21240326 GGGTATTTCGATCATTTTACCC-ATCAGGGGTATTTTGGTCATTTTACCCTCTAA 1 GGGTATTTCGATCATTTTACCCTTTCA-GGGTATTTCAGTCATTTTACCTTC-GG * 21240380 GGGTATTTCGATCATTTTA-CCTTACAGAGGTATTTCAGTCATTTTACCTTGCGG 1 GGGTATTTCGATCATTTTACCCTTTCAG-GGTATTTCAGTCATTTTACCTT-CGG * * * * * 21240434 GGATATTTTGATCATTTTACTC-TCCGGGGGTATTTCAGTCATTT 1 GGGTATTTCGATCATTTTACCCTTTC-AGGGTATTTCAGTCATTT 21240478 GCCCACGAAT Statistics Matches: 125, Mismatches: 19, Indels: 16 0.78 0.12 0.10 Matches are distributed among these distances: 53 7 0.06 54 114 0.91 55 4 0.03 ACGTcount: A:0.19, C:0.18, G:0.20, T:0.42 Consensus pattern (53 bp): GGGTATTTCGATCATTTTACCCTTTCAGGGTATTTCAGTCATTTTACCTTCGG Found at i:21240367 original size:81 final size:80 Alignment explanation

Indices: 21240255--21240477 Score: 231 Period size: 81 Copynumber: 2.8 Consensus size: 80 21240245 GGGTATACCA * * ** * 21240255 ATCATTTTACCTGATGA-GGGTATTTCAGTCATTTTACCCTTTGAGGGTATTTCAGTCATTTTAC 1 ATCATTTTACC-CATCAGGGGTATTTTGGTCATTTTACCCTCTGAGGGTATTTCAGTCATTTTAC * * 21240319 CATT-CGGGGGTATTTC 65 C-TTACAGAGGTATTTC * 21240335 GATCATTTTACCCATCAGGGGTATTTTGGTCATTTTACCCTCTAAGGGTATTTC-GATCATTTTA 1 -ATCATTTTACCCATCAGGGGTATTTTGGTCATTTTACCCTCTGAGGGTATTTCAG-TCATTTTA 21240399 CCTTACAGAGGTATTTC 64 CCTTACAGAGGTATTTC * * * * * 21240416 AGTCATTTTA-CCTTGC-GGGGATATTTTGATCATTTTACTCTCCGGGGGTATTTCAGTCATTT 1 A-TCATTTTACCCAT-CAGGGG-TATTTTGGTCATTTTACCCTCTGAGGGTATTTCAGTCATTT 21240478 GCCCACGAAT Statistics Matches: 121, Mismatches: 14, Indels: 14 0.81 0.09 0.09 Matches are distributed among these distances: 80 14 0.12 81 106 0.88 82 1 0.01 ACGTcount: A:0.20, C:0.18, G:0.19, T:0.42 Consensus pattern (80 bp): ATCATTTTACCCATCAGGGGTATTTTGGTCATTTTACCCTCTGAGGGTATTTCAGTCATTTTACC TTACAGAGGTATTTC Found at i:21242033 original size:24 final size:24 Alignment explanation

Indices: 21241974--21242034 Score: 68 Period size: 24 Copynumber: 2.5 Consensus size: 24 21241964 CATTCTATTA * * * 21241974 GCCTTTATGGCATATTTTTATTTG 1 GCCTTTATGGCATATTCTGATTGG * * 21241998 ACCTTTAGGGCATATTCTGATTGG 1 GCCTTTATGGCATATTCTGATTGG * 21242022 GCCTTCATGGCAT 1 GCCTTTATGGCAT 21242035 TTTGTTAGCC Statistics Matches: 29, Mismatches: 8, Indels: 0 0.78 0.22 0.00 Matches are distributed among these distances: 24 29 1.00 ACGTcount: A:0.18, C:0.18, G:0.21, T:0.43 Consensus pattern (24 bp): GCCTTTATGGCATATTCTGATTGG Found at i:21242092 original size:19 final size:20 Alignment explanation

Indices: 21242029--21242092 Score: 62 Period size: 19 Copynumber: 3.3 Consensus size: 20 21242019 TGGGCCTTCA * 21242029 TGGCATTTTGTTAGCCTTTG 1 TGGCATTATGTTAGCCTTTG * * * 21242049 TGGAAATCT-TTAGCCGTTT- 1 TGGCATTATGTTAGCC-TTTG 21242068 TGGCATTATGTT-GCCTTTG 1 TGGCATTATGTTAGCCTTTG 21242087 TGGCAT 1 TGGCAT 21242093 ACTCTGTATA Statistics Matches: 35, Mismatches: 6, Indels: 7 0.73 0.12 0.15 Matches are distributed among these distances: 18 3 0.09 19 21 0.60 20 11 0.31 ACGTcount: A:0.14, C:0.16, G:0.25, T:0.45 Consensus pattern (20 bp): TGGCATTATGTTAGCCTTTG Found at i:21261118 original size:16 final size:15 Alignment explanation

Indices: 21261077--21261115 Score: 60 Period size: 16 Copynumber: 2.5 Consensus size: 15 21261067 TGATGACTTT 21261077 CTAGGGTTGTCAAGC 1 CTAGGGTTGTCAAGC * 21261092 CTAAGGATTGTCAAGC 1 CT-AGGGTTGTCAAGC 21261108 CTAGGGTT 1 CTAGGGTT 21261116 TGTTTCACAT Statistics Matches: 21, Mismatches: 2, Indels: 2 0.84 0.08 0.08 Matches are distributed among these distances: 15 7 0.33 16 14 0.67 ACGTcount: A:0.23, C:0.18, G:0.31, T:0.28 Consensus pattern (15 bp): CTAGGGTTGTCAAGC Found at i:21265889 original size:25 final size:25 Alignment explanation

Indices: 21265861--21265910 Score: 82 Period size: 25 Copynumber: 2.0 Consensus size: 25 21265851 TAAAGGGGAC 21265861 TAAGGCATTTCACATGAATTTTTAA 1 TAAGGCATTTCACATGAATTTTTAA ** 21265886 TAAGGCATTTCGTATGAATTTTTAA 1 TAAGGCATTTCACATGAATTTTTAA 21265911 AGTATGTTTC Statistics Matches: 23, Mismatches: 2, Indels: 0 0.92 0.08 0.00 Matches are distributed among these distances: 25 23 1.00 ACGTcount: A:0.34, C:0.10, G:0.14, T:0.42 Consensus pattern (25 bp): TAAGGCATTTCACATGAATTTTTAA Found at i:21267994 original size:19 final size:22 Alignment explanation

Indices: 21267968--21268017 Score: 70 Period size: 22 Copynumber: 2.4 Consensus size: 22 21267958 GAACTAGATT 21267968 AGGTTAAAA-GGG-TG-TTTTA 1 AGGTTAAAAGGGGATGTTTTTA * 21267987 ATGTTAAAAGGGGATGTTTTTA 1 AGGTTAAAAGGGGATGTTTTTA 21268009 AGGTTAAAA 1 AGGTTAAAA 21268018 ATCGAGCATA Statistics Matches: 26, Mismatches: 2, Indels: 3 0.84 0.06 0.10 Matches are distributed among these distances: 19 8 0.31 20 3 0.12 21 2 0.08 22 13 0.50 ACGTcount: A:0.36, C:0.00, G:0.28, T:0.36 Consensus pattern (22 bp): AGGTTAAAAGGGGATGTTTTTA Found at i:21274635 original size:18 final size:18 Alignment explanation

Indices: 21274612--21274646 Score: 54 Period size: 18 Copynumber: 1.9 Consensus size: 18 21274602 CCACTTTCCA 21274612 CTTTTAAGTTT-TTTTTAT 1 CTTTTAA-TTTATTTTTAT 21274630 CTTTTAATTTATTTTTA 1 CTTTTAATTTATTTTTA 21274647 CAATTTTATT Statistics Matches: 16, Mismatches: 0, Indels: 2 0.89 0.00 0.11 Matches are distributed among these distances: 17 3 0.19 18 13 0.81 ACGTcount: A:0.20, C:0.06, G:0.03, T:0.71 Consensus pattern (18 bp): CTTTTAATTTATTTTTAT Found at i:21292015 original size:51 final size:52 Alignment explanation

Indices: 21291847--21292019 Score: 287 Period size: 52 Copynumber: 3.3 Consensus size: 52 21291837 ACTAATGACT 21291847 CAAGTGTGGTACTATGTGAAGGCCACTTTGTGAAGAAAGATAGCTTTGGTCA 1 CAAGTGTGGTACTATGTGAAGGCCACTTTGTGAAGAAAGATAGCTTTGGTCA * * 21291899 CCAGTGTGGTACTATGTGAAGGCCACTTTGTGAAGAAATATAGCTTTGGTCA 1 CAAGTGTGGTACTATGTGAAGGCCACTTTGTGAAGAAAGATAGCTTTGGTCA * 21291951 CAAGTGTGGTACTATGTGAAGGCCACTTTGTGAAG-AAGGTAGCTTTGG-CTA 1 CAAGTGTGGTACTATGTGAAGGCCACTTTGTGAAGAAAGATAGCTTTGGTC-A * 21292002 CAAGGGTGGTACTATGTG 1 CAAGTGTGGTACTATGTG 21292020 CAAGCCATCG Statistics Matches: 114, Mismatches: 6, Indels: 3 0.93 0.05 0.02 Matches are distributed among these distances: 50 1 0.01 51 29 0.25 52 84 0.74 ACGTcount: A:0.27, C:0.14, G:0.30, T:0.29 Consensus pattern (52 bp): CAAGTGTGGTACTATGTGAAGGCCACTTTGTGAAGAAAGATAGCTTTGGTCA Found at i:21294118 original size:20 final size:20 Alignment explanation

Indices: 21294095--21294136 Score: 57 Period size: 20 Copynumber: 2.1 Consensus size: 20 21294085 ATTGATACCA 21294095 CTTAAAGAAAATATCGATAC 1 CTTAAAGAAAATATCGATAC * ** 21294115 CTTAATGTCAATATCGATAC 1 CTTAAAGAAAATATCGATAC 21294135 CT 1 CT 21294137 GCGGGCATTT Statistics Matches: 19, Mismatches: 3, Indels: 0 0.86 0.14 0.00 Matches are distributed among these distances: 20 19 1.00 ACGTcount: A:0.40, C:0.19, G:0.10, T:0.31 Consensus pattern (20 bp): CTTAAAGAAAATATCGATAC Found at i:21314486 original size:20 final size:20 Alignment explanation

Indices: 21314461--21314693 Score: 80 Period size: 20 Copynumber: 11.7 Consensus size: 20 21314451 TCTTGAATAA 21314461 GTGCTCCTGATAGTACTACG 1 GTGCTCCTGATAGTACTACG * * 21314481 GTGCTCCTGGA-AATACTTTC- 1 GTGCTCCT-GATAGTAC-TACG * ** 21314501 GTACTCCTGATAACACTACG 1 GTGCTCCTGATAGTACTACG * * *** 21314521 GTGCTCCTGACAATACTTTT 1 GTGCTCCTGATAGTACTACG * * * 21314541 GTACTCTTGATAGCACTACG 1 GTGCTCCTGATAGTACTACG * * * 21314561 GTGCTCCTGACAGTAGTTTC- 1 GTGCTCCTGATAGTA-CTACG * * * * 21314581 ATACTCCTGATAGCACTATG 1 GTGCTCCTGATAGTACTACG * * * *** 21314601 GTGCTCCTAACAGTTCTTTT 1 GTGCTCCTGATAGTACTACG * * 21314621 GTACTCCT-ATTAGCACTACG 1 GTGCTCCTGA-TAGTACTACG * * 21314641 GTGCTCTTGATAGTACTTTC- 1 GTGCTCCTGATAGTAC-TACG * * * 21314661 GTACTCCTGTTAGCACTACG 1 GTGCTCCTGATAGTACTACG * 21314681 GTGCTCTTGATAG 1 GTGCTCCTGATAG 21314694 CATATTCAAG Statistics Matches: 144, Mismatches: 59, Indels: 20 0.65 0.26 0.09 Matches are distributed among these distances: 19 8 0.06 20 127 0.88 21 9 0.06 ACGTcount: A:0.21, C:0.25, G:0.19, T:0.34 Consensus pattern (20 bp): GTGCTCCTGATAGTACTACG Found at i:21314693 original size:40 final size:40 Alignment explanation

Indices: 21314464--21314686 Score: 279 Period size: 40 Copynumber: 5.6 Consensus size: 40 21314454 TGAATAAGTG * * 21314464 CTCCTGATAGTACTACGGTGCTCCTGGA-AATACTTTCGTA 1 CTCCTGATAGCACTACGGTGCTCCT-GACAGTACTTTCGTA * * * 21314504 CTCCTGATAACACTACGGTGCTCCTGACAATACTTTTGTA 1 CTCCTGATAGCACTACGGTGCTCCTGACAGTACTTTCGTA * * * 21314544 CTCTTGATAGCACTACGGTGCTCCTGACAGTAGTTTCATA 1 CTCCTGATAGCACTACGGTGCTCCTGACAGTACTTTCGTA * * * * 21314584 CTCCTGATAGCACTATGGTGCTCCTAACAGTTCTTTTGTA 1 CTCCTGATAGCACTACGGTGCTCCTGACAGTACTTTCGTA * * 21314624 CTCCT-ATTAGCACTACGGTGCTCTTGATAGTACTTTCGTA 1 CTCCTGA-TAGCACTACGGTGCTCCTGACAGTACTTTCGTA * 21314664 CTCCTGTTAGCACTACGGTGCTC 1 CTCCTGATAGCACTACGGTGCTC 21314687 TTGATAGCAT Statistics Matches: 157, Mismatches: 23, Indels: 6 0.84 0.12 0.03 Matches are distributed among these distances: 39 3 0.02 40 154 0.98 ACGTcount: A:0.21, C:0.26, G:0.18, T:0.34 Consensus pattern (40 bp): CTCCTGATAGCACTACGGTGCTCCTGACAGTACTTTCGTA Done.