Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: chr5

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 39371879
ACGTcount: A:0.31, C:0.16, G:0.16, T:0.31

Warning! 2227820 characters in sequence are not A, C, G, or T
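
Note on the "Parameters:" line above: the seven integers appear to follow the order of the TRF command line (Match, Mismatch, Delta, PM, PI, Minscore, MaxPeriod). The reported Pmatch=0.80 and Pindel=0.10 agree with PM=80 and PI=10 under that reading. The sketch below is only an illustration of how such a line could be split into named fields. The names TrfParams and parse_params are hypothetical helpers, not part of TRF itself.

```python
from typing import NamedTuple


class TrfParams(NamedTuple):
    """Hypothetical container; field order assumed to match the TRF command line."""
    match: int       # alignment weight for a match
    mismatch: int    # penalty for a mismatch
    delta: int       # penalty for an indel
    pm: int          # match probability, in percent
    pi: int          # indel probability, in percent
    minscore: int    # minimum alignment score to report
    maxperiod: int   # maximum period size to report


def parse_params(line: str) -> TrfParams:
    """Parse a 'Parameters: ...' report line into named integer fields."""
    prefix = "Parameters:"
    if not line.startswith(prefix):
        raise ValueError("not a Parameters line")
    values = [int(tok) for tok in line[len(prefix):].split()]
    if len(values) != 7:
        raise ValueError("expected 7 integers, got %d" % len(values))
    return TrfParams(*values)


params = parse_params("Parameters: 2 7 7 80 10 50 1000")
print(params)
```

Dividing pm and pi by 100 gives the probabilities printed on the "Pmatch=...,Pindel=..." line of this report.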


File 54 of 103

Found at i:20557319 original size:2 final size:2

Alignment explanation

Indices: 20557312--20557344  Score: 66
Period size: 2  Copynumber: 16.5  Consensus size: 2

20557302 ACATACATAC

20557312 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A
       1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A

20557345 NGTATAGACA

Statistics
Matches: 31,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  2   31  1.00

ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48

Consensus pattern (2 bp):
AT


Found at i:20558400 original size:15 final size:15

Alignment explanation

Indices: 20558380--20558442 Score: 51 Period size: 15 Copynumber: 4.2 Consensus size: 15 20558370 TTCTCTTTAG 20558380 GTTTATATATTATAA 1 GTTTATATATTATAA * * 20558395 GTTTATATA-AACAAA 1 GTTTATATATTA-TAA 20558410 GTTATATATATTATAA 1 GTT-TATATATTATAA * 20558426 -TTAATA-ATTTATAA 1 GTTTATATA-TTATAA 20558440 GTT 1 GTT 20558443 ATAATTTTGT Statistics Matches: 38, Mismatches: 5, Indels: 10 0.72 0.09 0.19 Matches are distributed among these distances: 13 1 0.03 14 10 0.26 15 18 0.47 16 8 0.21 17 1 0.03 ACGTcount: A:0.44, C:0.02, G:0.06, T:0.48 Consensus pattern (15 bp): GTTTATATATTATAA Found at i:20558439 original size:14 final size:14 Alignment explanation

Indices: 20558408--20558449 Score: 50 Period size: 14 Copynumber: 2.9 Consensus size: 14 20558398 TATATAAACA 20558408 AAGTTATATATATTAT 1 AAGTTATA-AT-TTAT 20558424 AA-TTAATAATTTAT 1 AAGTT-ATAATTTAT 20558438 AAGTTATAATTT 1 AAGTTATAATTT 20558450 TGTAATGTTA Statistics Matches: 24, Mismatches: 0, Indels: 6 0.80 0.00 0.20 Matches are distributed among these distances: 14 13 0.54 15 6 0.25 16 5 0.21 ACGTcount: A:0.45, C:0.00, G:0.05, T:0.50 Consensus pattern (14 bp): AAGTTATAATTTAT Found at i:20560956 original size:22 final size:22 Alignment explanation

Indices: 20560925--20560994 Score: 63 Period size: 21 Copynumber: 3.2 Consensus size: 22 20560915 TGAGTTGTGC * 20560925 ATTGCGTGACAAGGTTTG-AGGA 1 ATTGCTTGACAA-GTTTGTAGGA * * 20560947 ATTGCTTGACTAGTTTGTAGGC 1 ATTGCTTGACAAGTTTGTAGGA * * * 20560969 ATTG-TTGACTAGTTTTTAGTA 1 ATTGCTTGACAAGTTTGTAGGA 20560990 ATTGC 1 ATTGC 20560995 GTTGCTCTAT Statistics Matches: 40, Mismatches: 6, Indels: 4 0.80 0.12 0.08 Matches are distributed among these distances: 21 23 0.57 22 17 0.43 ACGTcount: A:0.23, C:0.10, G:0.27, T:0.40 Consensus pattern (22 bp): ATTGCTTGACAAGTTTGTAGGA Found at i:20560993 original size:21 final size:22 Alignment explanation

Indices: 20560938--20560994 Score: 73 Period size: 21 Copynumber: 2.7 Consensus size: 22 20560928 GCGTGACAAG 20560938 GTTTG-AGGAATTGCTTGACTA 1 GTTTGTAGGAATTGCTTGACTA * 20560959 GTTTGTAGGCATTG-TTGACTA 1 GTTTGTAGGAATTGCTTGACTA * * 20560980 GTTTTTAGTAATTGC 1 GTTTGTAGGAATTGC 20560995 GTTGCTCTAT Statistics Matches: 30, Mismatches: 4, Indels: 3 0.81 0.11 0.08 Matches are distributed among these distances: 21 23 0.77 22 7 0.23 ACGTcount: A:0.21, C:0.09, G:0.26, T:0.44 Consensus pattern (22 bp): GTTTGTAGGAATTGCTTGACTA Found at i:20567131 original size:28 final size:29 Alignment explanation

Indices: 20567090--20567154 Score: 69 Period size: 30 Copynumber: 2.2 Consensus size: 29 20567080 GTCTATGGGC * 20567090 AATTTTTCAATTTT-TGGGGCAAAAATGT 1 AATTTTTCAATTTTAAGGGGCAAAAATGT * * * 20567118 AATTTTTGACTTTTACAGGGGCAAAAGTGT 1 AATTTTTCAATTTTA-AGGGGCAAAAATGT * 20567148 AAATTTT 1 AATTTTT 20567155 AAAAATTACT Statistics Matches: 30, Mismatches: 5, Indels: 2 0.81 0.14 0.05 Matches are distributed among these distances: 28 12 0.40 30 18 0.60 ACGTcount: A:0.32, C:0.08, G:0.18, T:0.42 Consensus pattern (29 bp): AATTTTTCAATTTTAAGGGGCAAAAATGT Found at i:20573984 original size:19 final size:19 Alignment explanation

Indices: 20573960--20573997  Score: 67
Period size: 19  Copynumber: 2.0  Consensus size: 19

20573950 CCATTTGACT

                    *
20573960 TATTGGAGTATGATTTGAG
       1 TATTGGAGTATAATTTGAG

20573979 TATTGGAGTATAATTTGAG
       1 TATTGGAGTATAATTTGAG

20573998 GGTTTGGCAG

Statistics
Matches: 18,  Mismatches: 1, Indels: 0
        0.95            0.05        0.00

Matches are distributed among these distances:
 19   18  1.00

ACGTcount: A:0.29, C:0.00, G:0.29, T:0.42

Consensus pattern (19 bp):
TATTGGAGTATAATTTGAG


Found at i:20585539 original size:22 final size:21

Alignment explanation

Indices: 20585510--20585550 Score: 55 Period size: 22 Copynumber: 1.9 Consensus size: 21 20585500 GGACAAGTGC * 20585510 ATGGTGAAAATTTGAAAACAA 1 ATGGTGAAAATTTAAAAACAA * 20585531 ATGGATGAAATTTTAAAAAC 1 ATGG-TGAAAATTTAAAAAC 20585551 GATGCAATGG Statistics Matches: 17, Mismatches: 2, Indels: 1 0.85 0.10 0.05 Matches are distributed among these distances: 21 4 0.24 22 13 0.76 ACGTcount: A:0.51, C:0.05, G:0.17, T:0.27 Consensus pattern (21 bp): ATGGTGAAAATTTAAAAACAA Found at i:20594505 original size:2 final size:2 Alignment explanation

Indices: 20594498--20594551 Score: 92 Period size: 2 Copynumber: 27.0 Consensus size: 2 20594488 TGCAACAAGG 20594498 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT ANT -T 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A-T AT 20594540 AT AT AT AT AT AT 1 AT AT AT AT AT AT 20594552 GTTAGAGGTA Statistics Matches: 50, Mismatches: 0, Indels: 4 0.93 0.00 0.07 Matches are distributed among these distances: 1 1 0.02 2 48 0.96 3 1 0.02 ACGTcount: A:0.48, C:0.00, G:0.00, T:0.50 Consensus pattern (2 bp): AT Found at i:20596924 original size:2 final size:2 Alignment explanation

Indices: 20596914--20596952 Score: 51 Period size: 2 Copynumber: 19.5 Consensus size: 2 20596904 CATAACACAC * * * 20596914 AT AT CT AT AT AT GT AT AT GT AT AT AT AT AT AT AT AT AT A 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A 20596953 CCCATTATCA Statistics Matches: 31, Mismatches: 6, Indels: 0 0.84 0.16 0.00 Matches are distributed among these distances: 2 31 1.00 ACGTcount: A:0.44, C:0.03, G:0.05, T:0.49 Consensus pattern (2 bp): AT Found at i:20603456 original size:42 final size:42 Alignment explanation

Indices: 20603396--20603509 Score: 158 Period size: 42 Copynumber: 2.7 Consensus size: 42 20603386 CAACTCATAT * * 20603396 GCACTCCCCACT-AGCACATGGTCGGGTCAGCATCGGCTTATGC 1 GCACT-CCCACTCA-CACATAGTCGGGTCAACATCGGCTTATGC * 20603439 GCACTCCCACTCACACATAGCCGGGTCAACATCGGCTTATGC 1 GCACTCCCACTCACACATAGTCGGGTCAACATCGGCTTATGC * * 20603481 GCACTCCTACTCGCACATAGTCGGGTCAA 1 GCACTCCCACTCACACATAGTCGGGTCAA 20603510 TATCATTACA Statistics Matches: 64, Mismatches: 6, Indels: 3 0.88 0.08 0.04 Matches are distributed among these distances: 42 58 0.91 43 6 0.09 ACGTcount: A:0.22, C:0.36, G:0.22, T:0.20 Consensus pattern (42 bp): GCACTCCCACTCACACATAGTCGGGTCAACATCGGCTTATGC Found at i:20605990 original size:21 final size:21 Alignment explanation

Indices: 20605973--20606017  Score: 72
Period size: 21  Copynumber: 2.1  Consensus size: 21

20605963 AATATGTTTC

20605973 CAATGTATCGATACATGTTCA
       1 CAATGTATCGATACATGTTCA

         *                *
20605994 TAATGTATCGATACATGATCA
       1 CAATGTATCGATACATGTTCA

20606015 CAA
       1 CAA

20606018 AACATGAGTT

Statistics
Matches: 21,  Mismatches: 3, Indels: 0
        0.88            0.12        0.00

Matches are distributed among these distances:
 21   21  1.00

ACGTcount: A:0.38, C:0.18, G:0.13, T:0.31

Consensus pattern (21 bp):
CAATGTATCGATACATGTTCA


Found at i:20614702 original size:20 final size:20

Alignment explanation

Indices: 20614677--20614716  Score: 62
Period size: 20  Copynumber: 2.0  Consensus size: 20

20614667 AACATTCTTC

                  *    *
20614677 AACTTTAATCGATTGAAATA
       1 AACTTTAATAGATTAAAATA

20614697 AACTTTAATAGATTAAAATA
       1 AACTTTAATAGATTAAAATA

20614717 TCTTCGAATT

Statistics
Matches: 18,  Mismatches: 2, Indels: 0
        0.90            0.10        0.00

Matches are distributed among these distances:
 20   18  1.00

ACGTcount: A:0.50, C:0.07, G:0.07, T:0.35

Consensus pattern (20 bp):
AACTTTAATAGATTAAAATA


Found at i:20616728 original size:36 final size:36

Alignment explanation

Indices: 20616636--20616728 Score: 118 Period size: 36 Copynumber: 2.6 Consensus size: 36 20616626 TTCTAGGACT * * 20616636 TGGGCTATCATTCATTCACACTCACATCACAATAGG 1 TGGGCTATCATTCATCCACATTCACATCACAATAGG * * 20616672 TGGGCTA-CTAATCATCCACATTCACATCATAATAGG 1 TGGGCTATC-ATTCATCCACATTCACATCACAATAGG 20616708 TGGG-TATGCATTCATCCACAT 1 TGGGCTAT-CATTCATCCACAT 20616729 AGGCAATAAC Statistics Matches: 49, Mismatches: 5, Indels: 6 0.82 0.08 0.10 Matches are distributed among these distances: 35 3 0.06 36 45 0.92 37 1 0.02 ACGTcount: A:0.30, C:0.26, G:0.15, T:0.29 Consensus pattern (36 bp): TGGGCTATCATTCATCCACATTCACATCACAATAGG Found at i:20620384 original size:34 final size:34 Alignment explanation

Indices: 20620346--20620418 Score: 87 Period size: 33 Copynumber: 2.2 Consensus size: 34 20620336 GGCTGCTGAT * 20620346 TTTTGAAGG-GAAAATAGAGCAGCTTAGCTACTAA 1 TTTTGAAGGAG-AAATAGAGCAGCTTAACTACTAA ** * 20620380 TTTTG-AGGAGAAATAGAGTTGCTTAACTACTGA 1 TTTTGAAGGAGAAATAGAGCAGCTTAACTACTAA 20620413 TTTTGA 1 TTTTGA 20620419 TGTGGATATG Statistics Matches: 33, Mismatches: 4, Indels: 4 0.80 0.10 0.10 Matches are distributed among these distances: 33 27 0.82 34 6 0.18 ACGTcount: A:0.34, C:0.10, G:0.23, T:0.33 Consensus pattern (34 bp): TTTTGAAGGAGAAATAGAGCAGCTTAACTACTAA Found at i:20620616 original size:5 final size:5 Alignment explanation

Indices: 20620577--20620649 Score: 85 Period size: 5 Copynumber: 14.2 Consensus size: 5 20620567 TCGGCTGGAG * * 20620577 AGAAA AGGAAG AGAAA GAGAGAA AGAAA A-AAA AGAAA AGAAA AAAAA 1 AGAAA A-GAAA AGAAA -AGA-AA AGAAA AGAAA AGAAA AGAAA AGAAA * 20620624 AGAAA AGAAG AGAAA AGAAA AGAAA A 1 AGAAA AGAAA AGAAA AGAAA AGAAA A 20620650 ATATATATAT Statistics Matches: 58, Mismatches: 6, Indels: 8 0.81 0.08 0.11 Matches are distributed among these distances: 4 4 0.07 5 42 0.72 6 10 0.17 7 2 0.03 ACGTcount: A:0.77, C:0.00, G:0.23, T:0.00 Consensus pattern (5 bp): AGAAA Found at i:20620720 original size:25 final size:25 Alignment explanation

Indices: 20620666--20620722 Score: 73 Period size: 25 Copynumber: 2.3 Consensus size: 25 20620656 ATATGCATAT * 20620666 AAATGGACATGGCCCAATAAGGAAAG 1 AAATGGGCATGGCCCAATAA-GAAAG 20620692 -AATGGGCATGGCCCAATTAA-AAAG 1 AAATGGGCATGGCCCAA-TAAGAAAG 20620716 AAATGGG 1 AAATGGG 20620723 AGAAGGCCCT Statistics Matches: 28, Mismatches: 1, Indels: 5 0.82 0.03 0.15 Matches are distributed among these distances: 24 4 0.14 25 21 0.75 26 3 0.11 ACGTcount: A:0.44, C:0.14, G:0.28, T:0.14 Consensus pattern (25 bp): AAATGGGCATGGCCCAATAAGAAAG Found at i:20627127 original size:16 final size:18 Alignment explanation

Indices: 20627096--20627134 Score: 55 Period size: 16 Copynumber: 2.3 Consensus size: 18 20627086 AGGGAGAATA 20627096 AAAGAGAAACGAAAGAAC 1 AAAGAGAAACGAAAGAAC * 20627114 AAAG-GAAA-GAAAGAAG 1 AAAGAGAAACGAAAGAAC 20627130 AAAGA 1 AAAGA 20627135 AGTAGCCGAG Statistics Matches: 19, Mismatches: 1, Indels: 3 0.83 0.04 0.13 Matches are distributed among these distances: 16 11 0.58 17 4 0.21 18 4 0.21 ACGTcount: A:0.69, C:0.05, G:0.26, T:0.00 Consensus pattern (18 bp): AAAGAGAAACGAAAGAAC Found at i:20627710 original size:15 final size:15 Alignment explanation

Indices: 20627675--20627725 Score: 59 Period size: 15 Copynumber: 3.4 Consensus size: 15 20627665 AAGAAAAGTT 20627675 AAGAAAATAA-ATGAA 1 AAGAAAATAAGAT-AA * 20627690 ACGAAAATAAGATAA 1 AAGAAAATAAGATAA * * 20627705 AAGAAAAGAAGAGAA 1 AAGAAAATAAGATAA 20627720 AAGAAA 1 AAGAAA 20627726 GAAAAATATA Statistics Matches: 31, Mismatches: 4, Indels: 2 0.84 0.11 0.05 Matches are distributed among these distances: 15 29 0.94 16 2 0.06 ACGTcount: A:0.73, C:0.02, G:0.18, T:0.08 Consensus pattern (15 bp): AAGAAAATAAGATAA Found at i:20627801 original size:25 final size:25 Alignment explanation

Indices: 20627745--20627803 Score: 77 Period size: 25 Copynumber: 2.4 Consensus size: 25 20627735 ATATATGCAT * 20627745 ATAAATGGACATGGCCCAATAAGGAA 1 ATAAATGGGCATGGCCCAATAA-GAA 20627771 AT-AATGGGCATGGCCCAATTAA-AA 1 ATAAATGGGCATGGCCCAA-TAAGAA 20627795 ATAAATGGG 1 ATAAATGGG 20627804 AGAAGTCCAT Statistics Matches: 30, Mismatches: 1, Indels: 5 0.83 0.03 0.14 Matches are distributed among these distances: 24 4 0.13 25 21 0.70 26 5 0.17 ACGTcount: A:0.44, C:0.14, G:0.24, T:0.19 Consensus pattern (25 bp): ATAAATGGGCATGGCCCAATAAGAA Found at i:20630928 original size:19 final size:19 Alignment explanation

Indices: 20630888--20630929 Score: 50 Period size: 19 Copynumber: 2.2 Consensus size: 19 20630878 AGACCTAAGG * * 20630888 AATAAATGAATGATGCTAA 1 AATAAATGAATGAAGATAA 20630907 AATAAATGAA-GAAGATAGA 1 AATAAATGAATGAAGATA-A 20630926 AATA 1 AATA 20630930 CTGATAATTT Statistics Matches: 20, Mismatches: 2, Indels: 2 0.83 0.08 0.08 Matches are distributed among these distances: 18 5 0.25 19 15 0.75 ACGTcount: A:0.60, C:0.02, G:0.17, T:0.21 Consensus pattern (19 bp): AATAAATGAATGAAGATAA Found at i:20631880 original size:43 final size:43 Alignment explanation

Indices: 20631812--20631997 Score: 255 Period size: 43 Copynumber: 4.3 Consensus size: 43 20631802 TTGTGTGATA * * * * * 20631812 ATGATGACCCAGCTACGTGCGGTGTAAGAATGCACATGAGTTG 1 ATGATGACCCAGCTATGTGCAGTGTAGGAGTGCACATGAGCTG * * 20631855 ATGATGACTCAGCTATGTGCAGTGTAGGAGTGCACATGAGTTG 1 ATGATGACCCAGCTATGTGCAGTGTAGGAGTGCACATGAGCTG * * 20631898 ATGATGACCCAGCTATGTGTAGTGTAGGAGTGCACGTGAGCTG 1 ATGATGACCCAGCTATGTGCAGTGTAGGAGTGCACATGAGCTG * * * 20631941 ATGATGACCTAGCCATGTGCAGTGTAGGAGTGCACATGAACTG 1 ATGATGACCCAGCTATGTGCAGTGTAGGAGTGCACATGAGCTG * 20631984 ATGATGACCTAGCT 1 ATGATGACCCAGCT 20631998 GAGGGAGAGT Statistics Matches: 128, Mismatches: 15, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 43 128 1.00 ACGTcount: A:0.26, C:0.17, G:0.31, T:0.25 Consensus pattern (43 bp): ATGATGACCCAGCTATGTGCAGTGTAGGAGTGCACATGAGCTG Found at i:20635313 original size:17 final size:18 Alignment explanation

Indices: 20635291--20635324  Score: 52
Period size: 18  Copynumber: 1.9  Consensus size: 18

20635281 AACCCTAGGA

20635291 AGAG-AAGAAAGAAAAAG
       1 AGAGAAAGAAAGAAAAAG

                 *
20635308 AGAGAAAGGAAGAAAAA
       1 AGAGAAAGAAAGAAAAA

20635325 AGCTTGGTTT

Statistics
Matches: 15,  Mismatches: 1, Indels: 1
        0.88            0.06        0.06

Matches are distributed among these distances:
 17    4  0.27
 18   11  0.73

ACGTcount: A:0.71, C:0.00, G:0.29, T:0.00

Consensus pattern (18 bp):
AGAGAAAGAAAGAAAAAG


Found at i:20643947 original size:18 final size:18

Alignment explanation

Indices: 20643920--20643977  Score: 71
Period size: 18  Copynumber: 3.2  Consensus size: 18

20643910 TTGTTGTTTG

              *    *
20643920 GCCTATTTGGACTACTTA
       1 GCCTACTTGGGCTACTTA

                    *
20643938 GCCTACTTGGGGTACTTA
       1 GCCTACTTGGGCTACTTA

                *     *
20643956 GCCTACTAGGGCTGCTTA
       1 GCCTACTTGGGCTACTTA

20643974 GCCT
       1 GCCT

20643978 TCTCAACCTA

Statistics
Matches: 34,  Mismatches: 6, Indels: 0
        0.85            0.15        0.00

Matches are distributed among these distances:
 18   34  1.00

ACGTcount: A:0.17, C:0.26, G:0.24, T:0.33

Consensus pattern (18 bp):
GCCTACTTGGGCTACTTA


Found at i:20646912 original size:19 final size:21

Alignment explanation

Indices: 20646874--20646913 Score: 57 Period size: 19 Copynumber: 2.0 Consensus size: 21 20646864 CCACTCATGC * 20646874 CACATTGTACATTAAATCAAG 1 CACATTGTACAATAAATCAAG 20646895 CACATTG-ACAAT-AATCAAG 1 CACATTGTACAATAAATCAAG 20646914 TTTTCTCAAT Statistics Matches: 18, Mismatches: 1, Indels: 2 0.86 0.05 0.10 Matches are distributed among these distances: 19 7 0.39 20 4 0.22 21 7 0.39 ACGTcount: A:0.45, C:0.20, G:0.10, T:0.25 Consensus pattern (21 bp): CACATTGTACAATAAATCAAG Found at i:20647129 original size:37 final size:37 Alignment explanation

Indices: 20647065--20647212 Score: 147 Period size: 37 Copynumber: 3.8 Consensus size: 37 20647055 TAGCCAGAGT * * 20647065 CATCATGTGC-ACCCCCACATCATGCACAACTAATGC 1 CATCATGTGCTCCCCCCACATCACGCACAACTAATGC * 20647101 CATCATGTGCTCCCCCCACATCCCGCACATGGCACATATAT-C 1 CATCATGTGCTCCCCCCACATCACGCACA----AC-TA-ATGC * * 20647143 ATCATCATCGTGTTCACCCCCACATCGCGCACAACTAATGC 1 --CATCAT-GTGCTC-CCCCCACATCACGCACAACTAATGC 20647184 CATCATGTGCTCCCCCCACATCACGCACA 1 CATCATGTGCTCCCCCCACATCACGCACA 20647213 TGGCATGTAT Statistics Matches: 93, Mismatches: 7, Indels: 23 0.76 0.06 0.19 Matches are distributed among these distances: 36 10 0.11 37 31 0.33 38 5 0.05 39 6 0.06 40 2 0.02 41 5 0.05 42 5 0.05 43 2 0.02 44 6 0.06 45 5 0.05 46 16 0.17 ACGTcount: A:0.26, C:0.42, G:0.11, T:0.20 Consensus pattern (37 bp): CATCATGTGCTCCCCCCACATCACGCACAACTAATGC Found at i:20647227 original size:39 final size:39 Alignment explanation

Indices: 20647101--20647228 Score: 98 Period size: 37 Copynumber: 3.2 Consensus size: 39 20647091 CAACTAATGC * * 20647101 CATCATGTGCTCCCCCCACATCCCGCACATGGCACATATATCAT 1 CATCATGTGCTCCCCCCACATCACGCACATGG--C--ATGT-AT * * **** * 20647145 CATCATCGTGTTCACCCCCACATCGCGCACAACTAATG--C 1 CATCAT-GTGCTC-CCCCCACATCACGCACATGGCATGTAT 20647184 CATCATGTGCTCCCCCCACATCACGCACATGGCATGTAT 1 CATCATGTGCTCCCCCCACATCACGCACATGGCATGTAT 20647223 CATCAT 1 CATCAT 20647229 CATCATGTGC Statistics Matches: 65, Mismatches: 15, Indels: 13 0.70 0.16 0.14 Matches are distributed among these distances: 37 19 0.29 38 5 0.08 39 12 0.18 42 2 0.03 44 6 0.09 45 5 0.08 46 16 0.25 ACGTcount: A:0.25, C:0.40, G:0.12, T:0.23 Consensus pattern (39 bp): CATCATGTGCTCCCCCCACATCACGCACATGGCATGTAT Found at i:20647283 original size:83 final size:82 Alignment explanation

Indices: 20647064--20647281 Score: 348 Period size: 83 Copynumber: 2.7 Consensus size: 82 20647054 ATAGCCAGAG * * 20647064 TCATCATGTGCACCCCCACATCATGCACAACTAATGCCATCATGTGCTCCCCCCACATCCCGCAC 1 TCATCATGTGCACCCCCACATCGTGCACAACTAATGCCATCATGTGCTCCCCCCACATCACGCAC 20647129 ATGGCACATATATCATCA 66 ATGG-ACATATATCATCA * * * 20647147 TCATCGTGTTCACCCCCACATCGCGCACAACTAATGCCATCATGTGCTCCCCCCACATCACGCAC 1 TCATCATGTGCACCCCCACATCGTGCACAACTAATGCCATCATGTGCTCCCCCCACATCACGCAC * 20647212 ATGG-CATGTATCATCA 66 ATGGACATATATCATCA * * 20647228 TCATCATGTGCACCTCCACGTCGTGCACAACTAATGCCATCATGTGCTCCCCCC 1 TCATCATGTGCACCCCCACATCGTGCACAACTAATGCCATCATGTGCTCCCCCC 20647282 CCCCCCCACT Statistics Matches: 124, Mismatches: 11, Indels: 2 0.91 0.08 0.01 Matches are distributed among these distances: 81 60 0.48 83 64 0.52 ACGTcount: A:0.25, C:0.40, G:0.13, T:0.22 Consensus pattern (82 bp): TCATCATGTGCACCCCCACATCGTGCACAACTAATGCCATCATGTGCTCCCCCCACATCACGCAC ATGGACATATATCATCA Found at i:20649337 original size:21 final size:22 Alignment explanation

Indices: 20649312--20649352  Score: 66
Period size: 21  Copynumber: 1.9  Consensus size: 22

20649302 CCTAGGTCGA

                    *
20649312 AAAATTCAAGGGAA-AAGTGTG
       1 AAAATTCAAGGAAACAAGTGTG

20649333 AAAATTCAAGGAAACAAGTG
       1 AAAATTCAAGGAAACAAGTG

20649353 ACAAAAGGTA

Statistics
Matches: 18,  Mismatches: 1, Indels: 1
        0.90            0.05        0.05

Matches are distributed among these distances:
 21   13  0.72
 22    5  0.28

ACGTcount: A:0.51, C:0.07, G:0.24, T:0.17

Consensus pattern (22 bp):
AAAATTCAAGGAAACAAGTGTG


Found at i:20649358 original size:22 final size:21

Alignment explanation

Indices: 20649312--20649358 Score: 58 Period size: 21 Copynumber: 2.2 Consensus size: 21 20649302 CCTAGGTCGA * ** 20649312 AAAATTCAAGGGAAAAGTGTG 1 AAAATTCAAGGAAAAAGTGAC 20649333 AAAATTCAAGGAAACAAGTGAC 1 AAAATTCAAGGAAA-AAGTGAC 20649355 AAAA 1 AAAA 20649359 GGTAAGATTT Statistics Matches: 22, Mismatches: 3, Indels: 1 0.85 0.12 0.04 Matches are distributed among these distances: 21 13 0.59 22 9 0.41 ACGTcount: A:0.55, C:0.09, G:0.21, T:0.15 Consensus pattern (21 bp): AAAATTCAAGGAAAAAGTGAC Found at i:20651642 original size:11 final size:11 Alignment explanation

Indices: 20651628--20651657  Score: 60
Period size: 11  Copynumber: 2.7  Consensus size: 11

20651618 CAAAATAAAA

20651628 TATTAATATTT
       1 TATTAATATTT

20651639 TATTAATATTT
       1 TATTAATATTT

20651650 TATTAATA
       1 TATTAATA

20651658 AAATATTATT

Statistics
Matches: 19,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
 11   19  1.00

ACGTcount: A:0.40, C:0.00, G:0.00, T:0.60

Consensus pattern (11 bp):
TATTAATATTT


Found at i:20656263 original size:37 final size:37

Alignment explanation

Indices: 20656222--20656331 Score: 139 Period size: 37 Copynumber: 3.0 Consensus size: 37 20656212 ATAATGTTCT * ** 20656222 ATGATGATAATGAGGTGTGCGAGTAGGGAGTGCACAC 1 ATGATGATATTGAATTGTGCGAGTAGGGAGTGCACAC * * 20656259 ATGATGACATTGAATTGTGCGAGTAGGGAGTGCACAT 1 ATGATGATATTGAATTGTGCGAGTAGGGAGTGCACAC * * * * 20656296 ATGGTGGTATTGACTTGTGCGTGTAGGGAGTGCACA 1 ATGATGATATTGAATTGTGCGAGTAGGGAGTGCACA 20656332 AGAGCTGAGG Statistics Matches: 63, Mismatches: 10, Indels: 0 0.86 0.14 0.00 Matches are distributed among these distances: 37 63 1.00 ACGTcount: A:0.26, C:0.11, G:0.36, T:0.26 Consensus pattern (37 bp): ATGATGATATTGAATTGTGCGAGTAGGGAGTGCACAC Found at i:20666714 original size:21 final size:21 Alignment explanation

Indices: 20666690--20666740  Score: 102
Period size: 21  Copynumber: 2.4  Consensus size: 21

20666680 ATTTATTTTA

20666690 CTTACACGTTACAAAAGATTT
       1 CTTACACGTTACAAAAGATTT

20666711 CTTACACGTTACAAAAGATTT
       1 CTTACACGTTACAAAAGATTT

20666732 CTTACACGT
       1 CTTACACGT

20666741 GTTTCTATAA

Statistics
Matches: 30,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
 21   30  1.00

ACGTcount: A:0.35, C:0.22, G:0.10, T:0.33

Consensus pattern (21 bp):
CTTACACGTTACAAAAGATTT


Found at i:20669905 original size:37 final size:36

Alignment explanation

Indices: 20669860--20669996 Score: 112 Period size: 37 Copynumber: 3.5 Consensus size: 36 20669850 ATCGTGTGCA * 20669860 CCCCCACATCGCGCACAACTAATGCCATCATGTTCT 1 CCCCCACATCGCGCACAACTAATGCCATCATGTCCT * * * 20669896 CCCTCCACATCACGCACATGGCACATATATCATCATCATCATGTCCA 1 CCC-CCACATCGCGCACA----AC-TA-AT--GC--CATCATGTCCT * * 20669943 CCCCCACATTGCGCACAACTAATGCCATCATGTGCT 1 CCCCCACATCGCGCACAACTAATGCCATCATGTCCT 20669979 CTCCCCACATCGCGCACA 1 C-CCCCACATCGCGCACA 20669997 TGGCATATAT Statistics Matches: 79, Mismatches: 10, Indels: 23 0.71 0.09 0.21 Matches are distributed among these distances: 36 13 0.16 37 28 0.35 38 1 0.01 40 2 0.03 41 4 0.05 42 4 0.05 43 2 0.03 45 1 0.01 46 12 0.15 47 12 0.15 ACGTcount: A:0.26, C:0.42, G:0.11, T:0.21 Consensus pattern (36 bp): CCCCCACATCGCGCACAACTAATGCCATCATGTCCT Found at i:20670054 original size:81 final size:80 Alignment explanation

Indices: 20669848--20670094 Score: 334 Period size: 81 Copynumber: 3.0 Consensus size: 80 20669838 ATAGCCGGAG * * 20669848 TCATCGTGTGCACCCCCACATCGCGCACAACTAATGCCATCATGTTCTCCCTCCACATCACGCAC 1 TCATCGTGTGCACCCCCACATCGCGCACAACTAATGCCATCATGTGCTCCC-CCACATCGCGCAC 20669913 ATGGCACATATATCATCA 65 ATGG--CATATATCATCA * * * 20669931 TCATCATGTCCACCCCCACATTGCGCACAACTAATGCCATCATGTGCTCTCCCCACATCGCGCAC 1 TCATCGTGTGCACCCCCACATCGCGCACAACTAATGCCATCATGTGCTC-CCCCACATCGCGCAC 20669996 ATGGCATATATCATCA 65 ATGGCATATATCATCA * * * * * 20670012 TCATCGTGTGCACCTCCACATCACACAAAACTAATGCCATCATGTGCTCCCCCCACATTGCGCAC 1 TCATCGTGTGCACCCCCACATCGCGCACAACTAATGCCATCATGTGCT-CCCCCACATCGCGCAC * 20670077 ATAGGCA-ATTTCATCA 65 AT-GGCATATATCATCA 20670093 TC 1 TC 20670095 CACACTCCCT Statistics Matches: 147, Mismatches: 14, Indels: 8 0.87 0.08 0.05 Matches are distributed among these distances: 81 79 0.54 82 5 0.03 83 61 0.41 84 2 0.01 ACGTcount: A:0.27, C:0.38, G:0.12, T:0.23 Consensus pattern (80 bp): TCATCGTGTGCACCCCCACATCGCGCACAACTAATGCCATCATGTGCTCCCCCACATCGCGCACA TGGCATATATCATCA Found at i:20676574 original size:2 final size:2 Alignment explanation

Indices: 20676567--20676609 Score: 52 Period size: 2 Copynumber: 21.0 Consensus size: 2 20676557 TTATAAGTTA * 20676567 AT AT AT AT AT AT GAT AT -T AA AT AT ACT AT AT AT AT AT AT AT AT 1 AT AT AT AT AT AT -AT AT AT AT AT AT A-T AT AT AT AT AT AT AT AT 20676610 TAATATGTTA Statistics Matches: 36, Mismatches: 2, Indels: 6 0.82 0.05 0.14 Matches are distributed among these distances: 1 1 0.03 2 31 0.86 3 4 0.11 ACGTcount: A:0.49, C:0.02, G:0.02, T:0.47 Consensus pattern (2 bp): AT Found at i:20677089 original size:23 final size:23 Alignment explanation

Indices: 20677040--20677098 Score: 66 Period size: 23 Copynumber: 2.5 Consensus size: 23 20677030 AAATATATAT * * * 20677040 AATAAATAATTTAGTTAATATAT 1 AATAAATAATATAGTTAATAGAG 20677063 AATAAATAATATA-TTATATAGAG 1 AATAAATAATATAGTTA-ATAGAG 20677086 AATAAAATAATAT 1 AAT-AAATAATAT 20677099 TTAATTTATT Statistics Matches: 31, Mismatches: 3, Indels: 3 0.84 0.08 0.08 Matches are distributed among these distances: 22 3 0.10 23 19 0.61 24 9 0.29 ACGTcount: A:0.58, C:0.00, G:0.05, T:0.37 Consensus pattern (23 bp): AATAAATAATATAGTTAATAGAG Found at i:20677483 original size:23 final size:24 Alignment explanation

Indices: 20677449--20677506 Score: 73 Period size: 24 Copynumber: 2.4 Consensus size: 24 20677439 TGAACGACAG * 20677449 CTAAAGCTTTC-TTTGTTGTGCCAA 1 CTAAAG-TTTCTTTTGTCGTGCCAA * * 20677473 CTAAATTTTCTTTTGTCGTGCCGA 1 CTAAAGTTTCTTTTGTCGTGCCAA 20677497 CTAAAGTTTC 1 CTAAAGTTTC 20677507 CTTTACCTTG Statistics Matches: 29, Mismatches: 4, Indels: 2 0.83 0.11 0.06 Matches are distributed among these distances: 23 4 0.14 24 25 0.86 ACGTcount: A:0.21, C:0.21, G:0.16, T:0.43 Consensus pattern (24 bp): CTAAAGTTTCTTTTGTCGTGCCAA Found at i:20681070 original size:22 final size:21 Alignment explanation

Indices: 20681041--20681081 Score: 55 Period size: 22 Copynumber: 1.9 Consensus size: 21 20681031 GGACAAGTGT * 20681041 ATGGTGAAAATTTGAAAACAA 1 ATGGTGAAAATTTAAAAACAA * 20681062 ATGGATGAAATTTTAAAAAC 1 ATGG-TGAAAATTTAAAAAC 20681082 GGACCAATGG Statistics Matches: 17, Mismatches: 2, Indels: 1 0.85 0.10 0.05 Matches are distributed among these distances: 21 4 0.24 22 13 0.76 ACGTcount: A:0.51, C:0.05, G:0.17, T:0.27 Consensus pattern (21 bp): ATGGTGAAAATTTAAAAACAA Found at i:20695816 original size:21 final size:19 Alignment explanation

Indices: 20695790--20695830 Score: 55 Period size: 21 Copynumber: 2.1 Consensus size: 19 20695780 AATTTAAAAC 20695790 AAAATAACATAAAATTTAACT 1 AAAATAAC-TAAAA-TTAACT * 20695811 AAAATAACTTAAATTAACT 1 AAAATAACTAAAATTAACT 20695830 A 1 A 20695831 CTCAACATGT Statistics Matches: 19, Mismatches: 1, Indels: 2 0.86 0.05 0.09 Matches are distributed among these distances: 19 7 0.37 20 4 0.21 21 8 0.42 ACGTcount: A:0.61, C:0.10, G:0.00, T:0.29 Consensus pattern (19 bp): AAAATAACTAAAATTAACT Found at i:20696981 original size:23 final size:23 Alignment explanation

Indices: 20696937--20696981 Score: 56 Period size: 23 Copynumber: 2.0 Consensus size: 23 20696927 AGTAGGACTA * 20696937 AGGAAAAATGAATGATGAAAATG 1 AGGAAAAATGAATGAAGAAAATG * 20696960 AGGATAAAATG-ATGAAGGAAAT 1 AGGA-AAAATGAATGAAGAAAAT 20696982 AGAAATAATG Statistics Matches: 19, Mismatches: 2, Indels: 2 0.83 0.09 0.09 Matches are distributed among these distances: 23 13 0.68 24 6 0.32 ACGTcount: A:0.56, C:0.00, G:0.27, T:0.18 Consensus pattern (23 bp): AGGAAAAATGAATGAAGAAAATG Found at i:20697251 original size:18 final size:21 Alignment explanation

Indices: 20697206--20697248  Score: 77
Period size: 21  Copynumber: 2.0  Consensus size: 21

20697196 GCTATAAATA

20697206 TCATTTTCCAGCAGCTTCTTC
       1 TCATTTTCCAGCAGCTTCTTC

                 *
20697227 TCATTTTCTAGCAGCTTCTTC
       1 TCATTTTCCAGCAGCTTCTTC

20697248 T
       1 T

20697249 TTTTCTTCCT

Statistics
Matches: 21,  Mismatches: 1, Indels: 0
        0.95            0.05        0.00

Matches are distributed among these distances:
 21   21  1.00

ACGTcount: A:0.14, C:0.30, G:0.09, T:0.47

Consensus pattern (21 bp):
TCATTTTCCAGCAGCTTCTTC


Found at i:20700160 original size:40 final size:38

Alignment explanation

Indices: 20700067--20700172 Score: 167 Period size: 38 Copynumber: 2.7 Consensus size: 38 20700057 ATAGTGCCAG * * 20700067 TGTGCACGATGTGGGGGGATGTACACGATGGCACTGAT 1 TGTGCACGATGTGGGGGGATGTACATGATGGCACCGAT * 20700105 TGTGCACGATGTGGGGGGATGCACATGATGGCACCGATT 1 TGTGCACGATGTGGGGGGATGTACATGATGGCACCGA-T 20700144 ATGTGCACGATGTGGGGGGATGTACATGA 1 -TGTGCACGATGTGGGGGGATGTACATGA 20700173 GCCGGGTGTG Statistics Matches: 62, Mismatches: 4, Indels: 2 0.91 0.06 0.03 Matches are distributed among these distances: 38 34 0.55 39 1 0.02 40 27 0.44 ACGTcount: A:0.22, C:0.15, G:0.40, T:0.24 Consensus pattern (38 bp): TGTGCACGATGTGGGGGGATGTACATGATGGCACCGAT Found at i:20701376 original size:43 final size:43 Alignment explanation

Indices: 20701315--20701444 Score: 129 Period size: 43 Copynumber: 3.0 Consensus size: 43 20701305 TTTTGTGCGA * 20701315 TAATGATGACCCAGCTATGTTCGGTGTGGAAGTACACATGAGC 1 TAATGATGACCCAGCTATGTGCGGTGTGGAAGTACACATGAGC * * * * * 20701358 TAATGATGACTCAGCTATGTGCTGTGTGGGAGTGCACATGAGT 1 TAATGATGACCCAGCTATGTGCGGTGTGGAAGTACACATGAGC * * * * * 20701401 TGATGGTGAACCGGCTATGTGCGAGTAG-GG-AGTGCACATGAGC 1 TAATGATGACCCAGCTATGTGCG-GT-GTGGAAGTACACATGAGC 20701444 T 1 T 20701445 GAGTGTGGCG Statistics Matches: 72, Mismatches: 13, Indels: 4 0.81 0.15 0.04 Matches are distributed among these distances: 43 67 0.93 44 4 0.06 45 1 0.01 ACGTcount: A:0.25, C:0.16, G:0.33, T:0.26 Consensus pattern (43 bp): TAATGATGACCCAGCTATGTGCGGTGTGGAAGTACACATGAGC Found at i:20711277 original size:38 final size:40 Alignment explanation

Indices: 20711235--20711337 Score: 129 Period size: 38 Copynumber: 2.6 Consensus size: 40 20711225 ATAGTGCCCG * 20711235 TGTGCACGATGTGGGGGGATGTACACGATGG-CACTGAT- 1 TGTGCACGATGTGGGGGGATGTACACGATGGTCACGGATA * * ** * 20711273 TGTGCACGATGTGAGGGGATGCACACGATGGTTTCGGCTA 1 TGTGCACGATGTGGGGGGATGTACACGATGGTCACGGATA * 20711313 TGTGCACAATGTGGGGGGATGTACA 1 TGTGCACGATGTGGGGGGATGTACA 20711338 TGAGCCGGGT Statistics Matches: 54, Mismatches: 9, Indels: 2 0.83 0.14 0.03 Matches are distributed among these distances: 38 29 0.54 39 3 0.06 40 22 0.41 ACGTcount: A:0.21, C:0.16, G:0.39, T:0.24 Consensus pattern (40 bp): TGTGCACGATGTGGGGGGATGTACACGATGGTCACGGATA Found at i:20715295 original size:30 final size:30 Alignment explanation

Indices: 20715261--20715325  Score: 94
Period size: 30  Copynumber: 2.2  Consensus size: 30

20715251 AATTTAATGG

                  *       *       *
20715261 GAAATGCTTTTCAAACAGAAAATTCTATTA
       1 GAAATGCTTCTCAAACAAAAAATTCCATTA

                     *
20715291 GAAATGCTTCTCCAACAAAAAATTCCATTA
       1 GAAATGCTTCTCAAACAAAAAATTCCATTA

20715321 GAAAT
       1 GAAAT

20715326 TTTCAATGAA

Statistics
Matches: 31,  Mismatches: 4, Indels: 0
        0.89            0.11        0.00

Matches are distributed among these distances:
 30   31  1.00

ACGTcount: A:0.45, C:0.17, G:0.09, T:0.29

Consensus pattern (30 bp):
GAAATGCTTCTCAAACAAAAAATTCCATTA


Found at i:20715372 original size:24 final size:24

Alignment explanation

Indices: 20715309--20715372 Score: 67 Period size: 24 Copynumber: 2.7 Consensus size: 24 20715299 TCTCCAACAA * * * 20715309 AAAATTCCATTAGAAATTTTCAAT 1 AAAATTCCGTTGGAAATTTTCAAC * * 20715333 GAAATTTCG-TGAGAAATTTTCAAC 1 AAAATTCCGTTG-GAAATTTTCAAC 20715357 AAAATTCCGTTGGAAA 1 AAAATTCCGTTGGAAA 20715373 AGCATTTTCA Statistics Matches: 31, Mismatches: 7, Indels: 4 0.74 0.17 0.10 Matches are distributed among these distances: 23 1 0.03 24 28 0.90 25 2 0.06 ACGTcount: A:0.42, C:0.12, G:0.12, T:0.33 Consensus pattern (24 bp): AAAATTCCGTTGGAAATTTTCAAC Found at i:20715401 original size:28 final size:28 Alignment explanation

Indices: 20715348--20715401 Score: 65 Period size: 28 Copynumber: 1.9 Consensus size: 28 20715338 TTCGTGAGAA ** 20715348 ATTTTCAACAAAATTCCGTTGGAAAAGC 1 ATTTTCAACAAAATTCCGTTCAAAAAGC * 20715376 ATTTTCAAAGAAAATTCCG-TCAAAAA 1 ATTTTC-AACAAAATTCCGTTCAAAAA 20715402 TTCCATTGAA Statistics Matches: 22, Mismatches: 3, Indels: 2 0.81 0.11 0.07 Matches are distributed among these distances: 28 11 0.50 29 11 0.50 ACGTcount: A:0.44, C:0.17, G:0.11, T:0.28 Consensus pattern (28 bp): ATTTTCAACAAAATTCCGTTCAAAAAGC Found at i:20720115 original size:42 final size:42 Alignment explanation

Indices: 20720067--20720352 Score: 383 Period size: 42 Copynumber: 6.8 Consensus size: 42 20720057 TGTCGACTCA * * * 20720067 GCTATGTGCGAGTAGGAGTGTACATAAGCCGTTGTTGACCCG 1 GCTATGTGCGAGTGGGAGTGCACATAAGCCGTTGCTGACCCG * 20720109 ACTATGTGCGAGTGGGAGTGCACATAAGCCGTTGCTGACCCG 1 GCTATGTGCGAGTGGGAGTGCACATAAGCCGTTGCTGACCCG * * * 20720151 ACTATGTGAGAGTGGGGGTGCACATAAGCCGTTGCTGACCCG 1 GCTATGTGCGAGTGGGAGTGCACATAAGCCGTTGCTGACCCG * * * * 20720193 GCTATGTGCTAGTGGGAGTGCACATAAGCCATTGTTGACCAG 1 GCTATGTGCGAGTGGGAGTGCACATAAGCCGTTGCTGACCCG * * * * 20720235 GCTCTGTGCGAGTGAGAGTGCACATAAGCCGTTGCTAACCTG 1 GCTATGTGCGAGTGGGAGTGCACATAAGCCGTTGCTGACCCG *** 20720277 GCTATGTGCGAGTAAAAGTGCACATAAGCCGTTGCTGACCCG 1 GCTATGTGCGAGTGGGAGTGCACATAAGCCGTTGCTGACCCG * * * 20720319 GCTATATGCGAGTGGGAGTACGCATAAGCCGTTG 1 GCTATGTGCGAGTGGGAGTGCACATAAGCCGTTG 20720353 TATATGAGAT Statistics Matches: 213, Mismatches: 31, Indels: 0 0.87 0.13 0.00 Matches are distributed among these distances: 42 213 1.00 ACGTcount: A:0.23, C:0.21, G:0.33, T:0.23 Consensus pattern (42 bp): GCTATGTGCGAGTGGGAGTGCACATAAGCCGTTGCTGACCCG Found at i:20723847 original size:42 final size:42 Alignment explanation

Indices: 20723781--20723986 Score: 281 Period size: 42 Copynumber: 4.9 Consensus size: 42 20723771 TGATGTGGAT * 20723781 CCCG-CTATGTGCGAGTGGAAGTGCACATAAGCCGTTGCTGA 1 CCCGACTATGTGCGAGTGGAAGTGCACATAAGCCATTGCTGA * 20723822 CCCGACTATGTGCGAGTAGG-GGTGCACATAAGCCATTGCTGA 1 CCCGACTATGTGCGAGT-GGAAGTGCACATAAGCCATTGCTGA ** * * 20723864 CCCGGTTATGTGCGAGTGGAACTGCACATAAGCCATTGGTGA 1 CCCGACTATGTGCGAGTGGAAGTGCACATAAGCCATTGCTGA * * 20723906 CCCGACTATATGCGAGTGGAACTGCACATAAGCCATTGCTGA 1 CCCGACTATGTGCGAGTGGAAGTGCACATAAGCCATTGCTGA * * * * 20723948 CCCGACTATGTGCGAGTGGGAGTACGCATAAGCCGTTGC 1 CCCGACTATGTGCGAGTGGAAGTGCACATAAGCCATTGC 20723987 ATATGAGATG Statistics Matches: 145, Mismatches: 17, Indels: 5 0.87 0.10 0.03 Matches are distributed among these distances: 41 6 0.04 42 137 0.94 43 2 0.01 ACGTcount: A:0.24, C:0.24, G:0.30, T:0.22 Consensus pattern (42 bp): CCCGACTATGTGCGAGTGGAAGTGCACATAAGCCATTGCTGA Found at i:20726038 original size:40 final size:38 Alignment explanation

Indices: 20725977--20726079 Score: 127 Period size: 38 Copynumber: 2.7 Consensus size: 38 20725967 ACTCGGCTCA * * 20725977 TGTACATCTCCCCACATCGTGCACATAGCCG-GCACCATCG 1 TGTACATCCCCCCACATCGTGCACA-AGCAGTG--CCATCG * * * 20726017 TGTGCATCCCCCCATATCGTGCACAATCAGTGCCATCG 1 TGTACATCCCCCCACATCGTGCACAAGCAGTGCCATCG 20726055 TGTACATCCCCCCACATCGTGCACA 1 TGTACATCCCCCCACATCGTGCACA 20726080 CGGGCACTAT Statistics Matches: 55, Mismatches: 7, Indels: 4 0.83 0.11 0.06 Matches are distributed among these distances: 38 29 0.53 39 3 0.05 40 23 0.42 ACGTcount: A:0.22, C:0.40, G:0.17, T:0.21 Consensus pattern (38 bp): TGTACATCCCCCCACATCGTGCACAAGCAGTGCCATCG Found at i:20726072 original size:19 final size:19 Alignment explanation

Indices: 20725977--20726075 Score: 56 Period size: 19 Copynumber: 5.1 Consensus size: 19 20725967 ACTCGGCTCA * 20725977 TGTACATCTCCCCACATCG 1 TGTACATCCCCCCACATCG * * ** 20725996 TGCACATAGCCGGCACCATCG 1 TGTACAT-CCCCCCA-CATCG * * 20726017 TGTGCATCCCCCCATATCG 1 TGTACATCCCCCCACATCG * **** 20726036 TGCACAATCAGTGC-CATCG 1 TGTAC-ATCCCCCCACATCG 20726055 TGTACATCCCCCCACATCG 1 TGTACATCCCCCCACATCG 20726074 TG 1 TG 20726076 CACACGGGCA Statistics Matches: 53, Mismatches: 23, Indels: 8 0.63 0.27 0.10 Matches are distributed among these distances: 18 4 0.08 19 28 0.53 20 11 0.21 21 10 0.19 ACGTcount: A:0.21, C:0.39, G:0.17, T:0.22 Consensus pattern (19 bp): TGTACATCCCCCCACATCG Found at i:20728630 original size:16 final size:16 Alignment explanation

Indices: 20728594--20728624 Score: 53 Period size: 16 Copynumber: 1.9 Consensus size: 16 20728584 ATAGTACAAT * 20728594 ATATGTGTATGTATGC 1 ATATATGTATGTATGC 20728610 ATATATGTATGTATG 1 ATATATGTATGTATG 20728625 TGTATACATA Statistics Matches: 14, Mismatches: 1, Indels: 0 0.93 0.07 0.00 Matches are distributed among these distances: 16 14 1.00 ACGTcount: A:0.29, C:0.03, G:0.23, T:0.45 Consensus pattern (16 bp): ATATATGTATGTATGC Found at i:20728734 original size:20 final size:20 Alignment explanation

Indices: 20728592--20728735 Score: 58 Period size: 20 Copynumber: 7.3 Consensus size: 20 20728582 CAATAGTACA * ** 20728592 ATATATGTGTATGTATGCAT 1 ATATATATGTATGTATATAT * * * 20728612 ATATGTATGTATGTGTATAC 1 ATATATATGTATGTATATAT * ** * * 20728632 ATACATACATATATATCTAT 1 ATATATATGTATGTATATAT * * 20728652 ATATATATGTATGTGTGTAT 1 ATATATATGTATGTATATAT * * ** * 20728672 GTGTATACATA-G-ATACAT 1 ATATATATGTATGTATATAT * * * * 20728690 TTGTATGTGTATGTGTATAT 1 ATATATATGTATGTATATAT * 20728710 ATATATATGTATGTATGTAT 1 ATATATATGTATGTATATAT * 20728730 GTATAT 1 ATATAT 20728736 GGTTAAAGTA Statistics Matches: 83, Mismatches: 39, Indels: 4 0.66 0.31 0.03 Matches are distributed among these distances: 18 10 0.12 19 2 0.02 20 71 0.86 ACGTcount: A:0.33, C:0.05, G:0.17, T:0.46 Consensus pattern (20 bp): ATATATATGTATGTATATAT Found at i:20729955 original size:8 final size:8 Alignment explanation

Indices: 20729912--20729955 Score: 52 Period size: 8 Copynumber: 5.2 Consensus size: 8 20729902 ACATATACAC 20729912 ATACATAT 1 ATACATAT * 20729920 ATACATACAA 1 ATACAT--AT * 20729930 ACACATAT 1 ATACATAT 20729938 ATACATAT 1 ATACATAT 20729946 ATACATAT 1 ATACATAT 20729954 AT 1 AT 20729956 GTATGTGTAT Statistics Matches: 30, Mismatches: 4, Indels: 4 0.79 0.11 0.11 Matches are distributed among these distances: 8 24 0.80 10 6 0.20 ACGTcount: A:0.52, C:0.16, G:0.00, T:0.32 Consensus pattern (8 bp): ATACATAT Found at i:20729955 original size:18 final size:18 Alignment explanation

Indices: 20729902--20729955 Score: 54 Period size: 18 Copynumber: 2.9 Consensus size: 18 20729892 ACACACACGC * 20729902 ACATATACACATACATATAT 1 ACATATAAACAT--ATATAT * * 20729922 ACATACAAACACATATAT 1 ACATATAAACATATATAT * 20729940 ACATATATACATATAT 1 ACATATAAACATATAT 20729956 GTATGTGTAT Statistics Matches: 28, Mismatches: 6, Indels: 2 0.78 0.17 0.06 Matches are distributed among these distances: 18 19 0.68 20 9 0.32 ACGTcount: A:0.52, C:0.19, G:0.00, T:0.30 Consensus pattern (18 bp): ACATATAAACATATATAT Found at i:20740846 original size:6 final size:6 Alignment explanation

Indices: 20740831--20740891 Score: 50 Period size: 6 Copynumber: 10.2 Consensus size: 6 20740821 TATACATCCG * * * * * * 20740831 TATACA TATATG TATATA TATATA TATATA CATCTA TATACA TAGATA 1 TATATA TATATA TATATA TATATA TATATA TATATA TATATA TATATA * * 20740879 CATACA TATATA T 1 TATATA TATATA T 20740892 GTACGTAAGC Statistics Matches: 40, Mismatches: 15, Indels: 0 0.73 0.27 0.00 Matches are distributed among these distances: 6 40 1.00 ACGTcount: A:0.46, C:0.10, G:0.03, T:0.41 Consensus pattern (6 bp): TATATA Found at i:20740852 original size:4 final size:4 Alignment explanation

Indices: 20740831--20740891 Score: 50 Period size: 4 Copynumber: 15.2 Consensus size: 4 20740821 TATACATCCG * * * * * * 20740831 TATA CATA TATG TATA TATA TATA TATA TACA TCTA TATA CATA GATA 1 TATA TATA TATA TATA TATA TATA TATA TATA TATA TATA TATA TATA * * 20740879 CATA CATA TATA T 1 TATA TATA TATA T 20740892 GTACGTAAGC Statistics Matches: 45, Mismatches: 12, Indels: 0 0.79 0.21 0.00 Matches are distributed among these distances: 4 45 1.00 ACGTcount: A:0.46, C:0.10, G:0.03, T:0.41 Consensus pattern (4 bp): TATA Found at i:20740868 original size:10 final size:10 Alignment explanation

Indices: 20740843--20740891 Score: 53 Period size: 10 Copynumber: 4.7 Consensus size: 10 20740833 TACATATATG * 20740843 TATATATATA 1 TATATATACA 20740853 TATATATACA 1 TATATATACA * 20740863 TCTATATACA 1 TATATATACA * 20740873 TAGATACATACA 1 T--ATATATACA 20740885 TATATAT 1 TATATAT 20740892 GTACGTAAGC Statistics Matches: 32, Mismatches: 5, Indels: 4 0.78 0.12 0.10 Matches are distributed among these distances: 10 24 0.75 12 8 0.25 ACGTcount: A:0.47, C:0.10, G:0.02, T:0.41 Consensus pattern (10 bp): TATATATACA Found at i:20767460 original size:23 final size:23 Alignment explanation

Indices: 20767427--20767484 Score: 73 Period size: 23 Copynumber: 2.5 Consensus size: 23 20767417 TATGTATACC * * * 20767427 CATCATTATCATGCACATCATC- 1 CATCATCATCATACACACCATCA 20767449 CTATCATCATCATACACACCATCA 1 C-ATCATCATCATACACACCATCA 20767473 CATCATCATCAT 1 CATCATCATCAT 20767485 CACCAGCTCA Statistics Matches: 31, Mismatches: 3, Indels: 3 0.84 0.08 0.08 Matches are distributed among these distances: 22 1 0.03 23 29 0.94 24 1 0.03 ACGTcount: A:0.34, C:0.34, G:0.02, T:0.29 Consensus pattern (23 bp): CATCATCATCATACACACCATCA Found at i:20767530 original size:43 final size:43 Alignment explanation

Indices: 20767481--20767603 Score: 140 Period size: 43 Copynumber: 2.9 Consensus size: 43 20767471 CACATCATCA * * ** 20767481 TCATCACCAGCTCATGCGCACTCCCTACTCGCACATGGCTGGG 1 TCATCACCGGCTTATGCGCACTCCCTACTCGCACATAACTGGG * * * * 20767524 TCATCACCGGGTTATGCGCACTTCTTACTCGTACATAACTGGG 1 TCATCACCGGCTTATGCGCACTCCCTACTCGCACATAACTGGG * * 20767567 TCATC-CTTGGCTTATGCGCACTCCCTACTCACACATA 1 TCATCAC-CGGCTTATGCGCACTCCCTACTCGCACATA 20767604 GCCGAGTCAA Statistics Matches: 65, Mismatches: 14, Indels: 2 0.80 0.17 0.02 Matches are distributed among these distances: 42 1 0.02 43 64 0.98 ACGTcount: A:0.20, C:0.35, G:0.18, T:0.27 Consensus pattern (43 bp): TCATCACCGGCTTATGCGCACTCCCTACTCGCACATAACTGGG Found at i:20772439 original size:35 final size:35 Alignment explanation

Indices: 20772217--20772437 Score: 176 Period size: 35 Copynumber: 6.3 Consensus size: 35 20772207 TTGTGGCTAA * * 20772217 TTAATTAAATCATTGGGCCTAATTAATTAAATT-G 1 TTAATTAAGTCATTGGGCCTAATTAATTAAATTAC * 20772251 TTAATTAAGTCATTGGGCCTTATTAATTAAATTAC 1 TTAATTAAGTCATTGGGCCTAATTAATTAAATTAC ** * * * * * 20772286 ACAATCATGTCATTGGGTCTAATTGATTAAATTTGC 1 TTAATTAAGTCATTGGGCCTAATTAATTAAA-TTAC * * * ** 20772322 TTAATAAAGTCATTGGGCTTAATTGATTAAAATTGT 1 TTAATTAAGTCATTGGGCCTAATTAATT-AAATTAC * ** * * 20772358 TTAATTAAATTGTTAGGCTTAATTAATTAAATT-C 1 TTAATTAAGTCATTGGGCCTAATTAATTAAATTAC * * * * 20772392 TTTAATTAAGTTATTGGGCCTGATTGATTAAATTAA 1 -TTAATTAAGTCATTGGGCCTAATTAATTAAATTAC * 20772428 ATAATTAAGT 1 TTAATTAAGT 20772438 TAACTTGGCC Statistics Matches: 149, Mismatches: 33, Indels: 9 0.78 0.17 0.05 Matches are distributed among these distances: 34 31 0.21 35 65 0.44 36 50 0.34 37 3 0.02 ACGTcount: A:0.36, C:0.08, G:0.14, T:0.43 Consensus pattern (35 bp): TTAATTAAGTCATTGGGCCTAATTAATTAAATTAC Found at i:20773934 original size:18 final size:18 Alignment explanation

Indices: 20773911--20773950 Score: 62 Period size: 18 Copynumber: 2.2 Consensus size: 18 20773901 CTTTTCATAT 20773911 TATGAAAATTTTGTGAAA 1 TATGAAAATTTTGTGAAA * * 20773929 TATGAAACTTTTGTGAAT 1 TATGAAAATTTTGTGAAA 20773947 TATG 1 TATG 20773951 TCATGTTGAC Statistics Matches: 20, Mismatches: 2, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 18 20 1.00 ACGTcount: A:0.38, C:0.03, G:0.17, T:0.42 Consensus pattern (18 bp): TATGAAAATTTTGTGAAA Found at i:20775526 original size:2 final size:2 Alignment explanation

Indices: 20775519--20775560 Score: 84 Period size: 2 Copynumber: 21.0 Consensus size: 2 20775509 TAGGATGCTG 20775519 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT 20775561 NNNNNNNNNN Statistics Matches: 40, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 40 1.00 ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50 Consensus pattern (2 bp): AT Found at i:20775865 original size:2 final size:2 Alignment explanation

Indices: 20775858--20775892 Score: 70 Period size: 2 Copynumber: 17.5 Consensus size: 2 20775848 GGTGTTTATG 20775858 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 20775893 TTATGCAATA Statistics Matches: 33, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 33 1.00 ACGTcount: A:0.49, C:0.00, G:0.00, T:0.51 Consensus pattern (2 bp): TA Found at i:20779743 original size:2 final size:2 Alignment explanation

Indices: 20779736--20779770 Score: 61 Period size: 2 Copynumber: 17.5 Consensus size: 2 20779726 ATTATGTTGG * 20779736 TA TA TA TA TA GA TA TA TA TA TA TA TA TA TA TA TA T 1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 20779771 GTTGTGCATG Statistics Matches: 31, Mismatches: 2, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 2 31 1.00 ACGTcount: A:0.49, C:0.00, G:0.03, T:0.49 Consensus pattern (2 bp): TA Found at i:20779928 original size:38 final size:40 Alignment explanation

Indices: 20779886--20779991 Score: 135 Period size: 38 Copynumber: 2.7 Consensus size: 40 20779876 ATAGTGCCCG * * 20779886 TGTGCACGATGTGGGGGGATGTACATGATGGCACT-GAT- 1 TGTGCACGATGTGGGGGGATGCACATGATGACACTAGATA * ** * 20779924 TGTGCACGATGTGGGGGTATGCACATGATGACTTTAGCTA 1 TGTGCACGATGTGGGGGGATGCACATGATGACACTAGATA * 20779964 TGTACACGATGTGGGGGGATGCACATGA 1 TGTGCACGATGTGGGGGGATGCACATGA 20779992 GCTGGATGTG Statistics Matches: 58, Mismatches: 8, Indels: 2 0.85 0.12 0.03 Matches are distributed among these distances: 38 30 0.52 39 2 0.03 40 26 0.45 ACGTcount: A:0.23, C:0.14, G:0.37, T:0.26 Consensus pattern (40 bp): TGTGCACGATGTGGGGGGATGCACATGATGACACTAGATA Found at i:20783383 original size:32 final size:32 Alignment explanation

Indices: 20783310--20783383 Score: 87 Period size: 33 Copynumber: 2.3 Consensus size: 32 20783300 GAGATGGGGT 20783310 CTTAACCATTGCCAGTCCAAGGGGTAATGAGGG 1 CTTAACCATTGCCAGTCCAAGGGGTAAT-AGGG * ** * 20783343 CTTCACCATTTTCAGTCCAAGGTGT-ATATGGG 1 CTTAACCATTGCCAGTCCAAGGGGTAATA-GGG 20783375 CTTAACCAT 1 CTTAACCAT 20783384 CACAAACTTA Statistics Matches: 35, Mismatches: 5, Indels: 3 0.81 0.12 0.07 Matches are distributed among these distances: 31 1 0.03 32 13 0.37 33 21 0.60 ACGTcount: A:0.26, C:0.23, G:0.23, T:0.28 Consensus pattern (32 bp): CTTAACCATTGCCAGTCCAAGGGGTAATAGGG Found at i:20785358 original size:42 final size:42 Alignment explanation

Indices: 20785312--20785392 Score: 126 Period size: 42 Copynumber: 1.9 Consensus size: 42 20785302 GTGAACCCGA * * 20785312 CTATGTACGAGTAGGAGTGTGCATACGCCGATGATGACCTGG 1 CTATGTACGAGTAGGAGTGCGCATAAGCCGATGATGACCTGG ** 20785354 CTATGTGTGAGTAGGAGTGCGCATAAGCCGATGATGACC 1 CTATGTACGAGTAGGAGTGCGCATAAGCCGATGATGACC 20785393 CAGCCATGTG Statistics Matches: 35, Mismatches: 4, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 42 35 1.00 ACGTcount: A:0.25, C:0.19, G:0.33, T:0.23 Consensus pattern (42 bp): CTATGTACGAGTAGGAGTGCGCATAAGCCGATGATGACCTGG Found at i:20785402 original size:42 final size:42 Alignment explanation

Indices: 20785320--20785402 Score: 121 Period size: 42 Copynumber: 2.0 Consensus size: 42 20785310 GACTATGTAC * * ** * 20785320 GAGTAGGAGTGTGCATACGCCGATGATGACCTGGCTATGTGT 1 GAGTAGGAGTGCGCATAAGCCGATGATGACCCAGCCATGTGT 20785362 GAGTAGGAGTGCGCATAAGCCGATGATGACCCAGCCATGTG 1 GAGTAGGAGTGCGCATAAGCCGATGATGACCCAGCCATGTG 20785403 CTAGTGGAGA Statistics Matches: 36, Mismatches: 5, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 42 36 1.00 ACGTcount: A:0.24, C:0.19, G:0.35, T:0.22 Consensus pattern (42 bp): GAGTAGGAGTGCGCATAAGCCGATGATGACCCAGCCATGTGT Found at i:20793445 original size:28 final size:27 Alignment explanation

Indices: 20793364--20793531 Score: 165 Period size: 28 Copynumber: 5.8 Consensus size: 27 20793354 TTTTAACATG * 20793364 TTTGTCTTAATTTAGACAAGAAAACATG 1 TTTGTCTTAATTTAGACAAGAAAA-ATA * * 20793392 TTTAATCTTAATTTGGACAAGAAAAAATA 1 TTT-GTCTTAATTTAGACAAG-AAAAATA * 20793421 TGTGTCTTAATTTAAGACAAGAAAAATA 1 TTTGTCTTAATTT-AGACAAGAAAAATA * * 20793449 TTTTTTGTCTTAATCTAGACAAGAAAATGTA 1 ---TTTGTCTTAATTTAGACAAGAAAA-ATA 20793480 TTTGTCTTAATTTAGACAAGATAAAATAA 1 TTTGTCTTAATTTAGACAAGA-AAAAT-A * 20793509 TTTTGTCTTAATTAAAGACAAGA 1 -TTTGTCTTAATT-TAGACAAGA 20793532 GTTGTCTTAA Statistics Matches: 117, Mismatches: 12, Indels: 19 0.79 0.08 0.13 Matches are distributed among these distances: 28 40 0.34 29 29 0.25 30 27 0.23 31 21 0.18 ACGTcount: A:0.42, C:0.08, G:0.12, T:0.37 Consensus pattern (27 bp): TTTGTCTTAATTTAGACAAGAAAAATA Found at i:20793470 original size:59 final size:58 Alignment explanation

Indices: 20793366--20793520 Score: 188 Period size: 59 Copynumber: 2.7 Consensus size: 58 20793356 TTAACATGTT * * ** * * 20793366 TGTCTTAATTTAGACAAG-AAAACATGTTTAATCTTAATTTGGACAAGAAAAAATATG 1 TGTCTTAATTTAGACAAGAAAAATATATTTTGTCTTAATCTAGACAAGAAAAAATATG * ** * 20793423 TGTCTTAATTTAAGACAAGAAAAATATTTTTTGTCTTAATCTAGACAAGAAAATGTATT 1 TGTCTTAATTT-AGACAAGAAAAATATATTTTGTCTTAATCTAGACAAGAAAAAATATG 20793482 TGTCTTAATTTAGACAAGATAAAATA-ATTTTGTCTTAAT 1 TGTCTTAATTTAGACAAGA-AAAATATATTTTGTCTTAAT 20793521 TAAAGACAAG Statistics Matches: 85, Mismatches: 10, Indels: 5 0.85 0.10 0.05 Matches are distributed among these distances: 57 11 0.13 58 27 0.32 59 47 0.55 ACGTcount: A:0.41, C:0.08, G:0.12, T:0.38 Consensus pattern (58 bp): TGTCTTAATTTAGACAAGAAAAATATATTTTGTCTTAATCTAGACAAGAAAAAATATG Found at i:20793695 original size:17 final size:17 Alignment explanation

Indices: 20793673--20793705 Score: 50 Period size: 17 Copynumber: 1.9 Consensus size: 17 20793663 ATAATAATAA 20793673 AATAA-GAGTTATTATTC 1 AATAAGGAGTT-TTATTC 20793690 AATAAGGAGTTTTATT 1 AATAAGGAGTTTTATT 20793706 TTATCAAGAA Statistics Matches: 15, Mismatches: 0, Indels: 2 0.88 0.00 0.12 Matches are distributed among these distances: 17 10 0.67 18 5 0.33 ACGTcount: A:0.39, C:0.03, G:0.15, T:0.42 Consensus pattern (17 bp): AATAAGGAGTTTTATTC Found at i:20800894 original size:43 final size:43 Alignment explanation

Indices: 20800847--20800956 Score: 130 Period size: 43 Copynumber: 2.6 Consensus size: 43 20800837 TTGACTCGGC * ** 20800847 TATGTGCGAGTAGGGAATGTGCATAGGCCGATGATGACCCCGT 1 TATGTGCGAGTAGGGAATGCGCATAACCCGATGATGACCCCGT * * * 20800890 TATGTGCGAGTAGGGAATGCGCATAACCCGGTGATGACCCAGC 1 TATGTGCGAGTAGGGAATGCGCATAACCCGATGATGACCCCGT * * * * 20800933 CATGTACGAGTAAGGAGTGCGCAT 1 TATGTGCGAGTAGGGAATGCGCAT 20800957 GAGCTGGTGA Statistics Matches: 57, Mismatches: 10, Indels: 0 0.85 0.15 0.00 Matches are distributed among these distances: 43 57 1.00 ACGTcount: A:0.25, C:0.20, G:0.34, T:0.21 Consensus pattern (43 bp): TATGTGCGAGTAGGGAATGCGCATAACCCGATGATGACCCCGT Found at i:20808415 original size:42 final size:42 Alignment explanation

Indices: 20808356--20808448 Score: 109 Period size: 42 Copynumber: 2.2 Consensus size: 42 20808346 TTTTATGTGA * * 20808356 TAATGATGACCCAGCTATGTGCG-TGTGGGAGTGCACATAAGC 1 TAATGATGACCCAGCTATGTCCGATGT-AGAGTGCACATAAGC ** * 20808398 TAATGATGACGTAGCTATGTCCGATGTAGAGTGCACATGAGC 1 TAATGATGACCCAGCTATGTCCGATGTAGAGTGCACATAAGC * 20808440 TGATG-TGAC 1 TAATGATGAC 20808449 TGTATTTGAG Statistics Matches: 44, Mismatches: 6, Indels: 3 0.83 0.11 0.06 Matches are distributed among these distances: 41 4 0.09 42 37 0.84 43 3 0.07 ACGTcount: A:0.27, C:0.17, G:0.30, T:0.26 Consensus pattern (42 bp): TAATGATGACCCAGCTATGTCCGATGTAGAGTGCACATAAGC Found at i:20809745 original size:54 final size:54 Alignment explanation

Indices: 20809685--20809839 Score: 229 Period size: 54 Copynumber: 2.9 Consensus size: 54 20809675 TGATATTGTT 20809685 CGGCTATGTGCACGATGTGGTGGGATGCACATGAGCTGGGTGAGATGGGATGTC 1 CGGCTATGTGCACGATGTGGTGGGATGCACATGAGCTGGGTGAGATGGGATGTC * * * 20809739 TGGCTATGTACACGATGTGGTGGGATGCACATGAGCTGGGTGGGATGGGATGTC 1 CGGCTATGTGCACGATGTGGTGGGATGCACATGAGCTGGGTGAGATGGGATGTC ** * * * * 20809793 CGATTATGTGCATGATGTTGTGGGATGCACATGAGCTAGGTGTGATG 1 CGGCTATGTGCACGATGTGGTGGGATGCACATGAGCTGGGTGAGATG 20809840 CACATGATGA Statistics Matches: 90, Mismatches: 11, Indels: 0 0.89 0.11 0.00 Matches are distributed among these distances: 54 90 1.00 ACGTcount: A:0.19, C:0.13, G:0.41, T:0.27 Consensus pattern (54 bp): CGGCTATGTGCACGATGTGGTGGGATGCACATGAGCTGGGTGAGATGGGATGTC Found at i:20812651 original size:2 final size:2 Alignment explanation

Indices: 20812646--20812686 Score: 82 Period size: 2 Copynumber: 20.5 Consensus size: 2 20812636 ATACTTATGT 20812646 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 20812687 GGTTTGAATT Statistics Matches: 39, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 39 1.00 ACGTcount: A:0.49, C:0.00, G:0.00, T:0.51 Consensus pattern (2 bp): TA Found at i:20816409 original size:27 final size:28 Alignment explanation

Indices: 20816379--20816432 Score: 85 Period size: 28 Copynumber: 2.0 Consensus size: 28 20816369 TCGAAGTATA 20816379 AGGGCAAAATC-GT-AATTTTTTACCCCG 1 AGGGC-AAATCGGTAAATTTTTTACCCCG 20816406 AGGGCAAATCGGTAAATTTTTTACCCC 1 AGGGCAAATCGGTAAATTTTTTACCCC 20816433 AGCACTTGTC Statistics Matches: 25, Mismatches: 0, Indels: 3 0.89 0.00 0.11 Matches are distributed among these distances: 26 5 0.20 27 7 0.28 28 13 0.52 ACGTcount: A:0.30, C:0.22, G:0.19, T:0.30 Consensus pattern (28 bp): AGGGCAAATCGGTAAATTTTTTACCCCG Found at i:20817827 original size:38 final size:39 Alignment explanation

Indices: 20817776--20817878 Score: 122 Period size: 38 Copynumber: 2.6 Consensus size: 39 20817766 ATAGTGCCCG 20817776 TGTGCACGATGTGGGGGGATGTACACAAT-AG-CA-CTGA 1 TGTGCACGATGTGGGGGGATGTACACAATGAGCCAGCT-A * ** * 20817813 TTGTGCACGATGTGGGGGGATGCACATGATGGTGCCAGCTA 1 -TGTGCACGATGTGGGGGGATGTACACAAT-GAGCCAGCTA 20817854 TGTGCACGATGTGGGGGGATGTACA 1 TGTGCACGATGTGGGGGGATGTACA 20817879 TGAGCCGGGA Statistics Matches: 56, Mismatches: 5, Indels: 6 0.84 0.07 0.09 Matches are distributed among these distances: 38 26 0.46 40 25 0.45 41 3 0.05 42 2 0.04 ACGTcount: A:0.22, C:0.16, G:0.39, T:0.23 Consensus pattern (39 bp): TGTGCACGATGTGGGGGGATGTACACAATGAGCCAGCTA Found at i:20818254 original size:21 final size:21 Alignment explanation

Indices: 20818212--20818254 Score: 59 Period size: 21 Copynumber: 2.0 Consensus size: 21 20818202 GGCAACAATT ** 20818212 TCGCTGCAGCGAAATTCATTC 1 TCGCTGCAGCGAAAAACATTC * 20818233 TCGCTGCAGCGAAAAACTTTC 1 TCGCTGCAGCGAAAAACATTC 20818254 T 1 T 20818255 ATTTTGTCTC Statistics Matches: 19, Mismatches: 3, Indels: 0 0.86 0.14 0.00 Matches are distributed among these distances: 21 19 1.00 ACGTcount: A:0.26, C:0.28, G:0.19, T:0.28 Consensus pattern (21 bp): TCGCTGCAGCGAAAAACATTC Found at i:20826097 original size:38 final size:40 Alignment explanation

Indices: 20826010--20826113 Score: 131 Period size: 38 Copynumber: 2.6 Consensus size: 40 20826000 CACCCGGCTT * 20826010 ATGTACATCCCCCCACATCATGCACATAGCTGGCGCCATC 1 ATGTACATCCCCCCACATCATGCACATAGCTAGCGCCATC * * * * * 20826050 ATGTGCATCCCCCCACATCGTGCACA-ATC-AGTGCCATT 1 ATGTACATCCCCCCACATCATGCACATAGCTAGCGCCATC * 20826088 GTGTACATCCCCCCACATCATGCACA 1 ATGTACATCCCCCCACATCATGCACA 20826114 CAAACACTAT Statistics Matches: 55, Mismatches: 9, Indels: 2 0.83 0.14 0.03 Matches are distributed among these distances: 38 29 0.53 39 2 0.04 40 24 0.44 ACGTcount: A:0.25, C:0.39, G:0.14, T:0.21 Consensus pattern (40 bp): ATGTACATCCCCCCACATCATGCACATAGCTAGCGCCATC Found at i:20832086 original size:2 final size:2 Alignment explanation

Indices: 20832076--20832114 Score: 60 Period size: 2 Copynumber: 19.5 Consensus size: 2 20832066 ACACATTTAC * * 20832076 AT AT GT AT AT AT AT AT AT AT AT AT AT AT AT AC AT AT AT A 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A 20832115 CCTATCATCA Statistics Matches: 33, Mismatches: 4, Indels: 0 0.89 0.11 0.00 Matches are distributed among these distances: 2 33 1.00 ACGTcount: A:0.49, C:0.03, G:0.03, T:0.46 Consensus pattern (2 bp): AT Found at i:20832180 original size:20 final size:20 Alignment explanation

Indices: 20832119--20832181 Score: 101 Period size: 20 Copynumber: 3.2 Consensus size: 20 20832109 TATATACCTA 20832119 TCATCATCATC-TCACCAAC 1 TCATCATCATCATCACCAAC * 20832138 CCATCATCATCATCACCAAC 1 TCATCATCATCATCACCAAC * 20832158 TCATCATCATCATCACCAGC 1 TCATCATCATCATCACCAAC 20832178 TCAT 1 TCAT 20832182 GCGCACTCCC Statistics Matches: 40, Mismatches: 3, Indels: 1 0.91 0.07 0.02 Matches are distributed among these distances: 19 10 0.25 20 30 0.75 ACGTcount: A:0.32, C:0.41, G:0.02, T:0.25 Consensus pattern (20 bp): TCATCATCATCATCACCAAC Found at i:20832228 original size:43 final size:43 Alignment explanation

Indices: 20832180--20832341 Score: 254 Period size: 43 Copynumber: 3.8 Consensus size: 43 20832170 TCACCAGCTC ** * 20832180 ATGCGCACTCCCTACTCGCACATGGCTGGGTCATCACCGGGTT 1 ATGCGCACTCCCTACTCGCACATAACCGGGTCATCACCGGGTT 20832223 ATGCGCACTCCCTACTCGCACATAACCGGGTCATCACCGGGTT 1 ATGCGCACTCCCTACTCGCACATAACCGGGTCATCACCGGGTT * 20832266 ATGCGCACTCCCTACTCGCACATAACCGGGTCATC-CTCGGCTT 1 ATGCGCACTCCCTACTCGCACATAACCGGGTCATCAC-CGGGTT * * 20832309 ATGCGCACTCCCTACTCGCACATAGCCGAGTCA 1 ATGCGCACTCCCTACTCGCACATAACCGGGTCA 20832342 ACATAATTAC Statistics Matches: 112, Mismatches: 6, Indels: 2 0.93 0.05 0.02 Matches are distributed among these distances: 42 1 0.01 43 111 0.99 ACGTcount: A:0.20, C:0.38, G:0.21, T:0.22 Consensus pattern (43 bp): ATGCGCACTCCCTACTCGCACATAACCGGGTCATCACCGGGTT Found at i:20834882 original size:65 final size:65 Alignment explanation

Indices: 20834778--20834909 Score: 239 Period size: 65 Copynumber: 2.0 Consensus size: 65 20834768 TTATAACTAA 20834778 CTCCACATGAGCTTTTATACCAATATGCAACAATTATATTTCCAAAAATTTTAACTTCGAATAGC 1 CTCCACATGAGCTTTTATACCAATATGCAACAATTATATTTCCAAAAATTTTAACTTCGAATAGC * 20834843 CTCCACATGAGCTTTTATATCAATATGCAAC-ATATATATTTCCAAAAATTTTAACTTCGAATAG 1 CTCCACATGAGCTTTTATACCAATATGCAACAAT-TATATTTCCAAAAATTTTAACTTCGAATAG 20834907 C 65 C 20834908 CT 1 CT 20834910 TAGGCATATC Statistics Matches: 65, Mismatches: 1, Indels: 2 0.96 0.01 0.03 Matches are distributed among these distances: 64 2 0.03 65 63 0.97 ACGTcount: A:0.36, C:0.21, G:0.08, T:0.35 Consensus pattern (65 bp): CTCCACATGAGCTTTTATACCAATATGCAACAATTATATTTCCAAAAATTTTAACTTCGAATAGC Found at i:20836845 original size:26 final size:26 Alignment explanation

Indices: 20836785--20836846 Score: 97 Period size: 26 Copynumber: 2.4 Consensus size: 26 20836775 ACCATATACC * 20836785 CATCATCAACATGCACACCATCCCAT 1 CATCATCATCATGCACACCATCCCAT * 20836811 CATCATCATCATGCACATCATCCCAT 1 CATCATCATCATGCACACCATCCCAT * 20836837 TATCATCATC 1 CATCATCATC 20836847 TCATATGCAA Statistics Matches: 33, Mismatches: 3, Indels: 0 0.92 0.08 0.00 Matches are distributed among these distances: 26 33 1.00 ACGTcount: A:0.32, C:0.39, G:0.03, T:0.26 Consensus pattern (26 bp): CATCATCATCATGCACACCATCCCAT Found at i:20837139 original size:42 final size:42 Alignment explanation

Indices: 20837081--20837246 Score: 296 Period size: 42 Copynumber: 4.0 Consensus size: 42 20837071 ATGGGGGCAA * * 20837081 CAACGGCTTATGTGCACTCCCACTCACACATAGCCGGGTCAG 1 CAACGGCTTATGTGCACTCCTACTCGCACATAGCCGGGTCAG * * 20837123 CAACGGCTTATATGCACTCCTACTCGCACATAGTCGGGTCAG 1 CAACGGCTTATGTGCACTCCTACTCGCACATAGCCGGGTCAG 20837165 CAACGGCTTATGTGCACTCCTACTCGCACATAGCCGGGTCAG 1 CAACGGCTTATGTGCACTCCTACTCGCACATAGCCGGGTCAG 20837207 CAACGGCTTATGTGCACTCCTACTCGCACATAGCCGGGTC 1 CAACGGCTTATGTGCACTCCTACTCGCACATAGCCGGGTC 20837247 CAAATCAACA Statistics Matches: 118, Mismatches: 6, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 42 118 1.00 ACGTcount: A:0.22, C:0.34, G:0.22, T:0.22 Consensus pattern (42 bp): CAACGGCTTATGTGCACTCCTACTCGCACATAGCCGGGTCAG Found at i:20841703 original size:37 final size:37 Alignment explanation

Indices: 20841609--20841704 Score: 115 Period size: 38 Copynumber: 2.5 Consensus size: 37 20841599 GGCTCATGTG * * 20841609 CATCCCCTCTCATCGTGCACATAGTCGGTGCCATCGTA 1 CATCCCC-CACATCGTGCACATAGTCAGTGCCATCGTA 20841647 CATCCCCCCACATCGTGCACA-A-TCAGTGCCATCGTGTA 1 CAT-CCCCCACATCGTGCACATAGTCAGTGCCATC--GTA * 20841685 CATCCCCCACATCATGCACA 1 CATCCCCCACATCGTGCACA 20841705 CGGGCACTAT Statistics Matches: 52, Mismatches: 3, Indels: 7 0.84 0.05 0.11 Matches are distributed among these distances: 36 10 0.19 37 17 0.33 38 21 0.40 39 4 0.08 ACGTcount: A:0.23, C:0.41, G:0.15, T:0.22 Consensus pattern (37 bp): CATCCCCCACATCGTGCACATAGTCAGTGCCATCGTA Found at i:20841810 original size:2 final size:2 Alignment explanation

Indices: 20841803--20841834 Score: 64 Period size: 2 Copynumber: 16.0 Consensus size: 2 20841793 CATGCACAAC 20841803 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT 20841835 CATCATAATG Statistics Matches: 30, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 30 1.00 ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50 Consensus pattern (2 bp): AT Found at i:20849852 original size:115 final size:115 Alignment explanation

Indices: 20849663--20849874 Score: 329 Period size: 115 Copynumber: 1.8 Consensus size: 115 20849653 CACGACCCAG * * * * 20849663 AACCTCCCTCGGGCCCATGACAACAGTCGCGATGTCCCGATTGATACTCATTACTCGGAATACCA 1 AACCTCCCTCGGACCCATGACAACAGTCGCGATGTCCCGATTGACACTCATTACCCGAAATACCA * 20849728 ATTGAAACCCCGCAAGGCTTACATATTAGTTTTAAATTGTCATAACTCGA 66 ATTGAAACCCCGCAAGGCTTACATATCAGTTTTAAATTGTCATAACTCGA * * 20849778 AACCTCTCTCGGACCCATGACAACCA-TCGCGATGTCCCGATTGACACTCATTACCCGAAATGCC 1 AACCTCCCTCGGACCCATGACAA-CAGTCGCGATGTCCCGATTGACACTCATTACCCGAAATACC 20849842 AATT-AGAACCCCGCAAGGCTTACATATCAGTTT 65 AATTGA-AACCCCGCAAGGCTTACATATCAGTTT 20849875 CTTGCCTTTT Statistics Matches: 88, Mismatches: 7, Indels: 4 0.89 0.07 0.04 Matches are distributed among these distances: 114 1 0.01 115 85 0.97 116 2 0.02 ACGTcount: A:0.29, C:0.30, G:0.16, T:0.25 Consensus pattern (115 bp): AACCTCCCTCGGACCCATGACAACAGTCGCGATGTCCCGATTGACACTCATTACCCGAAATACCA ATTGAAACCCCGCAAGGCTTACATATCAGTTTTAAATTGTCATAACTCGA Found at i:20851810 original size:26 final size:26 Alignment explanation

Indices: 20851781--20851839 Score: 66 Period size: 26 Copynumber: 2.3 Consensus size: 26 20851771 ATTTTCTAGT * * 20851781 TTTCCTTTGT-TTTTCTCTTTTCTTCC 1 TTTCCTTTCTATTTTCTCTCTTCTT-C * * 20851807 TTTCCTCTCTATTTTCTCTCTTGTTC 1 TTTCCTTTCTATTTTCTCTCTTCTTC 20851833 TTTCCTT 1 TTTCCTT 20851840 ATGGCTGGCC Statistics Matches: 27, Mismatches: 5, Indels: 2 0.79 0.15 0.06 Matches are distributed among these distances: 26 15 0.56 27 12 0.44 ACGTcount: A:0.02, C:0.29, G:0.03, T:0.66 Consensus pattern (26 bp): TTTCCTTTCTATTTTCTCTCTTCTTC Found at i:20866910 original size:11 final size:11 Alignment explanation

Indices: 20866894--20866919 Score: 52 Period size: 11 Copynumber: 2.4 Consensus size: 11 20866884 TTTCGTATGC 20866894 AATAAAATAAT 1 AATAAAATAAT 20866905 AATAAAATAAT 1 AATAAAATAAT 20866916 AATA 1 AATA 20866920 TCTTATTGAG Statistics Matches: 15, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 11 15 1.00 ACGTcount: A:0.73, C:0.00, G:0.00, T:0.27 Consensus pattern (11 bp): AATAAAATAAT Found at i:20871371 original size:17 final size:19 Alignment explanation

Indices: 20871330--20871371 Score: 52 Period size: 20 Copynumber: 2.2 Consensus size: 19 20871320 ATGTATATAT 20871330 ATGAATATATGTTGTGCATTG 1 ATGAATATATGTTGTG-A-TG 20871351 ATGAATA-ATGTTGTG-TG 1 ATGAATATATGTTGTGATG 20871368 ATGA 1 ATGA 20871372 TGTTGAAGTG Statistics Matches: 21, Mismatches: 0, Indels: 4 0.84 0.00 0.16 Matches are distributed among these distances: 17 6 0.29 20 8 0.38 21 7 0.33 ACGTcount: A:0.31, C:0.02, G:0.26, T:0.40 Consensus pattern (19 bp): ATGAATATATGTTGTGATG Found at i:20871408 original size:37 final size:36 Alignment explanation

Indices: 20871366--20871474 Score: 157 Period size: 37 Copynumber: 2.9 Consensus size: 36 20871356 TAATGTTGTG 20871366 TGATGATGTTGAAGTGTGCGAGTAGGAAGTGCACACA 1 TGATGATGTTGAAGTGTGCGAGTAGG-AGTGCACACA * 20871403 TGATGATGTTGAAGTGTGCGAATAGGGAGTGCACACA 1 TGATGATGTTGAAGTGTGCGAGTA-GGAGTGCACACA * 20871440 TGATGATGATAGAA-TGTGCGAGTATGGAGTGCACA 1 TGATGATG-TTGAAGTGTGCGAGTA-GGAGTGCACA 20871475 TGACAATGAA Statistics Matches: 66, Mismatches: 4, Indels: 4 0.89 0.05 0.05 Matches are distributed among these distances: 37 60 0.91 38 6 0.09 ACGTcount: A:0.30, C:0.10, G:0.35, T:0.25 Consensus pattern (36 bp): TGATGATGTTGAAGTGTGCGAGTAGGAGTGCACACA Found at i:20875270 original size:12 final size:12 Alignment explanation

Indices: 20875250--20875279 Score: 51 Period size: 12 Copynumber: 2.5 Consensus size: 12 20875240 AGCACCTTAA * 20875250 CTGCTCAAGTGG 1 CTGCACAAGTGG 20875262 CTGCACAAGTGG 1 CTGCACAAGTGG 20875274 CTGCAC 1 CTGCAC 20875280 TGTCGAAGAA Statistics Matches: 17, Mismatches: 1, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 12 17 1.00 ACGTcount: A:0.20, C:0.30, G:0.30, T:0.20 Consensus pattern (12 bp): CTGCACAAGTGG Found at i:20890661 original size:20 final size:20 Alignment explanation

Indices: 20890638--20890687 Score: 91 Period size: 20 Copynumber: 2.5 Consensus size: 20 20890628 AAATAAAAAC 20890638 CAAAGAGAAACAAGAAAATT 1 CAAAGAGAAACAAGAAAATT 20890658 CAAAGAGAAACAAGAAAATT 1 CAAAGAGAAACAAGAAAATT * 20890678 CAAAGGGAAA 1 CAAAGAGAAA 20890688 TTCACGTTTT Statistics Matches: 29, Mismatches: 1, Indels: 0 0.97 0.03 0.00 Matches are distributed among these distances: 20 29 1.00 ACGTcount: A:0.64, C:0.10, G:0.18, T:0.08 Consensus pattern (20 bp): CAAAGAGAAACAAGAAAATT Found at i:20896498 original size:2 final size:2 Alignment explanation

Indices: 20896493--20896531 Score: 78 Period size: 2 Copynumber: 19.5 Consensus size: 2 20896483 TCTATACATG 20896493 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 20896532 TCATCATCAA Statistics Matches: 37, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 37 1.00 ACGTcount: A:0.49, C:0.00, G:0.00, T:0.51 Consensus pattern (2 bp): TA Found at i:20896605 original size:43 final size:42 Alignment explanation

Indices: 20896558--20896683 Score: 117 Period size: 42 Copynumber: 3.0 Consensus size: 42 20896548 TCTCCCATCG * * 20896558 TCATCAGCTCATATGCACTCCGCACTAGCACATGGCTGGGTCA 1 TCATCAGCTCATATGCACTCC-CACTAGCACATAGCCGGGTCA * * * * 20896601 TCATCGGCTTATGTGCACTCCCACTCGCACATAGCCGGGTCA 1 TCATCAGCTCATATGCACTCCCACTAGCACATAGCCGGGTCA * * * * ** * * 20896643 TCATCGGCTTATGTACACTCCTGCTCGAACATAGCCGGGTC 1 TCATCAGCTCATATGCACTCCCACTAGCACATAGCCGGGTC 20896684 CACATCAACA Statistics Matches: 73, Mismatches: 10, Indels: 1 0.87 0.12 0.01 Matches are distributed among these distances: 42 55 0.75 43 18 0.25 ACGTcount: A:0.21, C:0.33, G:0.21, T:0.25 Consensus pattern (42 bp): TCATCAGCTCATATGCACTCCCACTAGCACATAGCCGGGTCA Found at i:20896643 original size:42 final size:42 Alignment explanation

Indices: 20896571--20896683 Score: 154 Period size: 42 Copynumber: 2.7 Consensus size: 42 20896561 TCAGCTCATA * * * 20896571 TGCACTCCGCACTAGCACATGGCTGGGTCATCATCGGCTTATG 1 TGCACTCC-CACTCGCACATAGCCGGGTCATCATCGGCTTATG 20896614 TGCACTCCCACTCGCACATAGCCGGGTCATCATCGGCTTATG 1 TGCACTCCCACTCGCACATAGCCGGGTCATCATCGGCTTATG * ** * 20896656 TACACTCCTGCTCGAACATAGCCGGGTC 1 TGCACTCCCACTCGCACATAGCCGGGTC 20896684 CACATCAACA Statistics Matches: 63, Mismatches: 7, Indels: 1 0.89 0.10 0.01 Matches are distributed among these distances: 42 55 0.87 43 8 0.13 ACGTcount: A:0.19, C:0.34, G:0.23, T:0.24 Consensus pattern (42 bp): TGCACTCCCACTCGCACATAGCCGGGTCATCATCGGCTTATG Found at i:20900423 original size:63 final size:61 Alignment explanation

Indices: 20900309--20900429 Score: 215 Period size: 63 Copynumber: 2.0 Consensus size: 61 20900299 TAAGTCTAAT * 20900309 TGGATGAAGTCCATGAGTTTAATATGCTTGCAAGGGAACTGTAATGGGTTGCAGTTATTGA 1 TGGATGAAGTCCATGAGATTAATATGCTTGCAAGGGAACTGTAATGGGTTGCAGTTATTGA 20900370 NTGGATGAAGTCCATGAGATTAATATGCCTTGCAAGGGAACTGTAATGGGTTGCAGTTAT 1 -TGGATGAAGTCCATGAGATTAATATG-CTTGCAAGGGAACTGTAATGGGTTGCAGTTAT 20900430 GACATCCTTA Statistics Matches: 57, Mismatches: 1, Indels: 1 0.97 0.02 0.02 Matches are distributed among these distances: 62 25 0.44 63 32 0.56 ACGTcount: A:0.28, C:0.11, G:0.29, T:0.31 Consensus pattern (61 bp): TGGATGAAGTCCATGAGATTAATATGCTTGCAAGGGAACTGTAATGGGTTGCAGTTATTGA Found at i:20901448 original size:35 final size:35 Alignment explanation

Indices: 20901380--20901628 Score: 252 Period size: 35 Copynumber: 7.1 Consensus size: 35 20901370 CACTTCATTA * * 20901380 AAATTGTTTAATTAAG---TTGTGACTAATTAATT 1 AAATTGTTTAATTAAGTCATTGGGCCTAATTAATT * 20901412 AAATTGTTTAATTAAGTCATTCGGCCTAATTAATT 1 AAATTGTTTAATTAAGTCATTGGGCCTAATTAATT * * 20901447 AAATTGTTTAATTAAGTCATTGTGCTTAATTAATT 1 AAATTGTTTAATTAAGTCATTGGGCCTAATTAATT **** * * * * 20901482 AAATTAAACAATTAAGTTATTGGGCTTAGTTGATT 1 AAATTGTTTAATTAAGTCATTGGGCCTAATTAATT * * 20901517 AAATTTGTTTAATTAAGTCATTGAGCCTAATTGATT 1 AAA-TTGTTTAATTAAGTCATTGGGCCTAATTAATT * ** * 20901553 AAAATTGTTTAATTAAATTGTTGGGCTTAATTAATT 1 -AAATTGTTTAATTAAGTCATTGGGCCTAATTAATT * * * * 20901589 AAATTTTTTAATTACGTTATTGGGCCTAATTGATT 1 AAATTGTTTAATTAAGTCATTGGGCCTAATTAATT 20901624 AAATT 1 AAATT 20901629 AAACAATTAA Statistics Matches: 178, Mismatches: 34, Indels: 7 0.81 0.16 0.03 Matches are distributed among these distances: 32 16 0.09 35 109 0.61 36 50 0.28 37 3 0.02 ACGTcount: A:0.35, C:0.06, G:0.13, T:0.45 Consensus pattern (35 bp): AAATTGTTTAATTAAGTCATTGGGCCTAATTAATT Found at i:20901477 original size:18 final size:18 Alignment explanation

Indices: 20901419--20901479 Score: 56 Period size: 18 Copynumber: 3.4 Consensus size: 18 20901409 ATTAAATTGT 20901419 TTAATTAAGTCATTCG-GC 1 TTAATTAAGTCATT-GTGC * * * 20901437 CTAATTAATTAAATTGT-- 1 TTAATTAAGT-CATTGTGC 20901454 TTAATTAAGTCATTGTGC 1 TTAATTAAGTCATTGTGC 20901472 TTAATTAA 1 TTAATTAA 20901480 TTAAATTAAA Statistics Matches: 33, Mismatches: 6, Indels: 8 0.70 0.13 0.17 Matches are distributed among these distances: 16 5 0.15 17 8 0.24 18 17 0.52 19 3 0.09 ACGTcount: A:0.34, C:0.10, G:0.11, T:0.44 Consensus pattern (18 bp): TTAATTAAGTCATTGTGC Found at i:20901593 original size:107 final size:106 Alignment explanation

Indices: 20901402--20901628 Score: 305 Period size: 107 Copynumber: 2.1 Consensus size: 106 20901392 TAAGTTGTGA * * 20901402 CTAATTAATTAAA-TTGTTTAATTAAGTCATTCGGCCTAATTAATTAAATTGTTTAATTAAGTCA 1 CTAATTGATTAAATTTGTTTAATTAAGTCATTCGGCCTAATTAATTAAATTGTTTAATTAAATCA * 20901466 TTGTGCTTAATTAATTAAATTAAACAATTAAGTTATTGGGC 66 TTGGGCTTAATTAATTAAATTAAACAATTAAGTTATTGGGC * * * 20901507 TTAGTTGATTAAATTTGTTTAATTAAGTCATT-GAGCCTAATTGATTAAAATTGTTTAATTAAAT 1 CTAATTGATTAAATTTGTTTAATTAAGTCATTCG-GCCTAATTAATT-AAATTGTTTAATTAAAT ** **** * 20901571 TGTTGGGCTTAATTAATTAAATTTTTTAATTACGTTATTGGGC 64 CATTGGGCTTAATTAATTAAATTAAACAATTAAGTTATTGGGC 20901614 CTAATTGATTAAATT 1 CTAATTGATTAAATT 20901629 AAACAATTAA Statistics Matches: 104, Mismatches: 15, Indels: 4 0.85 0.12 0.03 Matches are distributed among these distances: 105 11 0.11 106 29 0.28 107 64 0.62 ACGTcount: A:0.35, C:0.07, G:0.13, T:0.45 Consensus pattern (106 bp): CTAATTGATTAAATTTGTTTAATTAAGTCATTCGGCCTAATTAATTAAATTGTTTAATTAAATCA TTGGGCTTAATTAATTAAATTAAACAATTAAGTTATTGGGC Found at i:20901619 original size:142 final size:138 Alignment explanation

Indices: 20901376--20901642 Score: 365 Period size: 142 Copynumber: 1.9 Consensus size: 138 20901366 CAAGCACTTC * 20901376 ATTAAAATTGTTTAATTAAGTTGTGACTAATTAATTAAATTGTTTAATTAAGTCATTCGGCCTAA 1 ATTAAAATTGTTTAATTAAGTTGTGACTAATTAATTAAATTGTTTAATTAAATCATTCGGCCTAA * * 20901441 TTAATTAAATTGTTTAATTAAGTCATTGTGCTTAATTAATTAAATTAAACAATTAAGTTATTGGG 66 TTAATTAAATTGTTTAATTAAGTCATTGGGCCTAATTAATTAAATTAAACAATTAAGTTATTGGG 20901506 CTTAGTTG 131 CTTAGTTG * * ** * 20901514 ATTAAATTTGTTTAATTAAGTCAT-TGAGCCTAATTGATTAAAATTGTTTAATTAAATTGTTGGG 1 ATTAAAATTGTTTAATTAAGT--TGTGA--CTAATTAATT-AAATTGTTTAATTAAATCATTCGG * * * * * 20901578 CTTAATTAATTAAATTTTTTAATTACGTTATTGGGCCTAATTGATTAAATTAAACAATTAAGTTA 61 CCTAATTAATTAAATTGTTTAATTAAGTCATTGGGCCTAATTAATTAAATTAAACAATTAAGTTA 20901643 ATTTGACTTT Statistics Matches: 111, Mismatches: 13, Indels: 6 0.85 0.10 0.05 Matches are distributed among these distances: 138 20 0.18 139 3 0.03 140 1 0.01 141 9 0.08 142 78 0.70 ACGTcount: A:0.36, C:0.06, G:0.13, T:0.45 Consensus pattern (138 bp): ATTAAAATTGTTTAATTAAGTTGTGACTAATTAATTAAATTGTTTAATTAAATCATTCGGCCTAA TTAATTAAATTGTTTAATTAAGTCATTGGGCCTAATTAATTAAATTAAACAATTAAGTTATTGGG CTTAGTTG Found at i:20905853 original size:20 final size:20 Alignment explanation

Indices: 20905827--20905872 Score: 56 Period size: 21 Copynumber: 2.2 Consensus size: 20 20905817 AATCTTACCT * * 20905827 TTTTTCACTTGTTTCCTTGAA 1 TTTTACACTT-TTCCCTTGAA 20905848 TTTTCACACTTTTCCCTTGAA 1 TTTT-ACACTTTTCCCTTGAA 20905869 TTTT 1 TTTT 20905873 TCCACCTAGG Statistics Matches: 22, Mismatches: 2, Indels: 2 0.85 0.08 0.08 Matches are distributed among these distances: 21 17 0.77 22 5 0.23 ACGTcount: A:0.15, C:0.22, G:0.07, T:0.57 Consensus pattern (20 bp): TTTTACACTTTTCCCTTGAA Found at i:20905874 original size:22 final size:22 Alignment explanation

Indices: 20905832--20905872 Score: 66 Period size: 21 Copynumber: 1.9 Consensus size: 22 20905822 TACCTTTTTT * 20905832 CACTTGTTTCCTTGAATTTTCA 1 CACTTGTTCCCTTGAATTTTCA 20905854 CACTT-TTCCCTTGAATTTT 1 CACTTGTTCCCTTGAATTTT 20905873 TCCACCTAGG Statistics Matches: 18, Mismatches: 1, Indels: 1 0.90 0.05 0.05 Matches are distributed among these distances: 21 13 0.72 22 5 0.28 ACGTcount: A:0.17, C:0.24, G:0.07, T:0.51 Consensus pattern (22 bp): CACTTGTTCCCTTGAATTTTCA Found at i:20924702 original size:31 final size:30 Alignment explanation

Indices: 20924648--20924732 Score: 91 Period size: 31 Copynumber: 2.8 Consensus size: 30 20924638 AAAGGTGAAG * * * 20924648 TTGTAATTATCAAAAG-TTGAAGGATCAAT 1 TTGTAATTATGAAAAGTTTGAAGGACCAAA 20924677 TTGTAATTTATGAAAAGTTTGAAGGACCAAA 1 TTGTAA-TTATGAAAAGTTTGAAGGACCAAA * ** 20924708 TTATAAAGAATGAAAAGTTTGAAGG 1 TTGT-AATTATGAAAAGTTTGAAGG 20924733 GTTAAATGGT Statistics Matches: 47, Mismatches: 6, Indels: 4 0.82 0.11 0.07 Matches are distributed among these distances: 29 6 0.13 30 9 0.19 31 30 0.64 32 2 0.04 ACGTcount: A:0.44, C:0.05, G:0.20, T:0.32 Consensus pattern (30 bp): TTGTAATTATGAAAAGTTTGAAGGACCAAA Found at i:20928251 original size:34 final size:35 Alignment explanation

Indices: 20928153--20928294 Score: 198 Period size: 35 Copynumber: 4.1 Consensus size: 35 20928143 CTTACTCAAC * 20928153 TCATGTGCACTCCCTACACGCACAAG-GCAATATCA 1 TCATGTGCACTCCCTACACGCAC-AGTTCAATATCA * 20928188 TCATGTGCACTCCCTACACGCACATTTCAATATCA 1 TCATGTGCACTCCCTACACGCACAGTTCAATATCA 20928223 TCATGTGCACTCCCTA-ACGCACAGTTCAATATCA 1 TCATGTGCACTCCCTACACGCACAGTTCAATATCA * * * * * 20928257 TAATGCGCACTCCCTAAAAGCACATTTCAATATCA 1 TCATGTGCACTCCCTACACGCACAGTTCAATATCA 20928292 TCA 1 TCA 20928295 CACATACTCC Statistics Matches: 97, Mismatches: 8, Indels: 4 0.89 0.07 0.04 Matches are distributed among these distances: 34 32 0.33 35 65 0.67 ACGTcount: A:0.32, C:0.32, G:0.11, T:0.25 Consensus pattern (35 bp): TCATGTGCACTCCCTACACGCACAGTTCAATATCA Found at i:20928257 original size:69 final size:70 Alignment explanation

Indices: 20928153--20928294 Score: 216 Period size: 69 Copynumber: 2.0 Consensus size: 70 20928143 CTTACTCAAC * * * * 20928153 TCATGTGCACTCCCTACACGCACAAGGCAATATCATCATGTGCACTCCCTACACGCACATTTCAA 1 TCATGTGCACTCCCTACACGCACAAGGCAATATCATAATGCGCACTCCCTAAAAGCACATTTCAA 20928218 TATCA 66 TATCA * 20928223 TCATGTGCACTCCCTA-ACGCAC-AGTTCAATATCATAATGCGCACTCCCTAAAAGCACATTTCA 1 TCATGTGCACTCCCTACACGCACAAG-GCAATATCATAATGCGCACTCCCTAAAAGCACATTTCA 20928286 ATATCA 65 ATATCA 20928292 TCA 1 TCA 20928295 CACATACTCC Statistics Matches: 66, Mismatches: 5, Indels: 3 0.89 0.07 0.04 Matches are distributed among these distances: 68 2 0.03 69 48 0.73 70 16 0.24 ACGTcount: A:0.32, C:0.32, G:0.11, T:0.25 Consensus pattern (70 bp): TCATGTGCACTCCCTACACGCACAAGGCAATATCATAATGCGCACTCCCTAAAAGCACATTTCAA TATCA Found at i:20931135 original size:19 final size:19 Alignment explanation

Indices: 20931111--20931152 Score: 57 Period size: 19 Copynumber: 2.2 Consensus size: 19 20931101 GGTGGTAAGG 20931111 TGAAAATAGCATGGGAGCA 1 TGAAAATAGCATGGGAGCA * ** 20931130 TGAAAATGGTGTGGGAGCA 1 TGAAAATAGCATGGGAGCA 20931149 TGAA 1 TGAA 20931153 GGAAAAGAAG Statistics Matches: 20, Mismatches: 3, Indels: 0 0.87 0.13 0.00 Matches are distributed among these distances: 19 20 1.00 ACGTcount: A:0.38, C:0.07, G:0.36, T:0.19 Consensus pattern (19 bp): TGAAAATAGCATGGGAGCA Found at i:20936562 original size:17 final size:17 Alignment explanation

Indices: 20936540--20936574 Score: 61 Period size: 17 Copynumber: 2.1 Consensus size: 17 20936530 GGAGGTTTTC 20936540 CTCCCCCTAAGTGTGTG 1 CTCCCCCTAAGTGTGTG * 20936557 CTCCCCCTTAGTGTGTG 1 CTCCCCCTAAGTGTGTG 20936574 C 1 C 20936575 ATATTTTATT Statistics Matches: 17, Mismatches: 1, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 17 17 1.00 ACGTcount: A:0.09, C:0.37, G:0.23, T:0.31 Consensus pattern (17 bp): CTCCCCCTAAGTGTGTG Found at i:20945311 original size:42 final size:43 Alignment explanation

Indices: 20945264--20945450 Score: 184 Period size: 42 Copynumber: 4.3 Consensus size: 43 20945254 TGTGATAATA * * * 20945264 ATGACCCAGCTATATGCGTGT-GGGAGTGCACATAAGCTGATG 1 ATGACCCAGCTATGTGCGAGTAAGGAGTGCACATAAGCTGATG * * * 20945306 ATGACCCAGCTATGTGC-ATGT-GGGAGTGTACATGAGCTGATG 1 ATGACCCAGCTATGTGCGA-GTAAGGAGTGCACATAAGCTGATG * ** * 20945348 ATGACCC-GACTATGTACGAGTAAGGAGTGCGTATAAGCCGATG 1 ATGACCCAG-CTATGTGCGAGTAAGGAGTGCACATAAGCTGATG * * * 20945391 ATGACCCGGCTATGTGCGAGTAAAGAGTGCTCATAAGCAGATGATG 1 ATGACCCAGCTATGTGCGAGTAAGGAGTGCACATAAGC---TGATG 20945437 ATGACCCAGCTATG 1 ATGACCCAGCTATG 20945451 ACGATGTATG Statistics Matches: 120, Mismatches: 17, Indels: 12 0.81 0.11 0.08 Matches are distributed among these distances: 41 1 0.01 42 53 0.44 43 48 0.40 44 1 0.01 46 17 0.14 ACGTcount: A:0.28, C:0.19, G:0.30, T:0.23 Consensus pattern (43 bp): ATGACCCAGCTATGTGCGAGTAAGGAGTGCACATAAGCTGATG Found at i:20945391 original size:43 final size:43 Alignment explanation

Indices: 20945344--20945437 Score: 136 Period size: 43 Copynumber: 2.2 Consensus size: 43 20945334 TACATGAGCT * * 20945344 GATGATGACCCGACTATGTACGAGTAAGGAGTGCGT-ATAAGCC 1 GATGATGACCCGACTATGTACGAGTAAAGAGTGC-TCATAAGCA * * 20945387 GATGATGACCCGGCTATGTGCGAGTAAAGAGTGCTCATAAGCA 1 GATGATGACCCGACTATGTACGAGTAAAGAGTGCTCATAAGCA 20945430 GATGATGA 1 GATGATGA 20945438 TGACCCAGCT Statistics Matches: 46, Mismatches: 4, Indels: 2 0.88 0.08 0.04 Matches are distributed among these distances: 42 1 0.02 43 45 0.98 ACGTcount: A:0.31, C:0.17, G:0.31, T:0.21 Consensus pattern (43 bp): GATGATGACCCGACTATGTACGAGTAAAGAGTGCTCATAAGCA Found at i:20945439 original size:46 final size:43 Alignment explanation

Indices: 20945340--20945454 Score: 135 Period size: 43 Copynumber: 2.6 Consensus size: 43 20945330 AGTGTACATG * * 20945340 AGCTGATGATGACCCGACTATGTACGAGTAAGGAGTGCGTATA 1 AGCTGATGATGACCCGGCTATGTACGAGTAAAGAGTGCGTATA * * 20945383 AGCCGATGATGACCCGGCTATGTGCGAGTAAAGAGTGC-TCATA 1 AGCTGATGATGACCCGGCTATGTACGAGTAAAGAGTGCGT-ATA * 20945426 AGCAGATGATGATGACCCAGCTATG-ACGA 1 AGC---TGATGATGACCCGGCTATGTACGA 20945455 TGTATGAAGT Statistics Matches: 61, Mismatches: 7, Indels: 6 0.82 0.09 0.08 Matches are distributed among these distances: 42 1 0.02 43 40 0.66 45 3 0.05 46 17 0.28 ACGTcount: A:0.30, C:0.19, G:0.30, T:0.21 Consensus pattern (43 bp): AGCTGATGATGACCCGGCTATGTACGAGTAAAGAGTGCGTATA Found at i:20950367 original size:20 final size:20 Alignment explanation

Indices: 20950342--20950401 Score: 75 Period size: 20 Copynumber: 3.0 Consensus size: 20 20950332 GTTTTGGACC * 20950342 AAATGTATCGATACATTTTG 1 AAATGTATCCATACATTTTG * * 20950362 AAATGTATCCAAACATTTGG 1 AAATGTATCCATACATTTTG * * 20950382 GAATGCATCCATACATTTTG 1 AAATGTATCCATACATTTTG 20950402 GGCAAAATCA Statistics Matches: 33, Mismatches: 7, Indels: 0 0.82 0.17 0.00 Matches are distributed among these distances: 20 33 1.00 ACGTcount: A:0.35, C:0.15, G:0.15, T:0.35 Consensus pattern (20 bp): AAATGTATCCATACATTTTG Found at i:20950674 original size:21 final size:22 Alignment explanation

Indices: 20950644--20950727 Score: 91 Period size: 21 Copynumber: 3.9 Consensus size: 22 20950634 TAGGTCCTTC * * 20950644 AAAACTTATTTAAACAAGTTTA 1 AAAACTTATTCAAACAAGTTCA * 20950666 AAAAC-TATTCGAACAAGTTCA 1 AAAACTTATTCAAACAAGTTCA * * * 20950687 AAAACTTATTTAAATAACTTCA 1 AAAACTTATTCAAACAAGTTCA * 20950709 AAAA-TCATTCAAACAAGTT 1 AAAACTTATTCAAACAAGTT 20950728 TAAGGATCAC Statistics Matches: 50, Mismatches: 11, Indels: 3 0.78 0.17 0.05 Matches are distributed among these distances: 21 29 0.58 22 21 0.42 ACGTcount: A:0.50, C:0.14, G:0.05, T:0.31 Consensus pattern (22 bp): AAAACTTATTCAAACAAGTTCA Found at i:20950712 original size:43 final size:43 Alignment explanation

Indices: 20950644--20950727 Score: 116 Period size: 43 Copynumber: 2.0 Consensus size: 43 20950634 TAGGTCCTTC * * * 20950644 AAAACTTATTTAAACAAGTTTAAAAA-CTATTCGAACAAGTTCA 1 AAAACTTATTTAAACAACTTCAAAAATC-ATTCAAACAAGTTCA * 20950687 AAAACTTATTTAAATAACTTCAAAAATCATTCAAACAAGTT 1 AAAACTTATTTAAACAACTTCAAAAATCATTCAAACAAGTT 20950728 TAAGGATCAC Statistics Matches: 36, Mismatches: 4, Indels: 2 0.86 0.10 0.05 Matches are distributed among these distances: 43 35 0.97 44 1 0.03 ACGTcount: A:0.50, C:0.14, G:0.05, T:0.31 Consensus pattern (43 bp): AAAACTTATTTAAACAACTTCAAAAATCATTCAAACAAGTTCA Found at i:20950741 original size:21 final size:21 Alignment explanation

Indices: 20950705--20950757 Score: 70 Period size: 21 Copynumber: 2.5 Consensus size: 21 20950695 TTTAAATAAC * * 20950705 TTCAAAAATCATTCAAACAAG 1 TTCAAAGATCACTCAAACAAG * * 20950726 TTTAAGGATCACTCAAACAAG 1 TTCAAAGATCACTCAAACAAG 20950747 TTCAAAGATCA 1 TTCAAAGATCA 20950758 TTGAAGCAAT Statistics Matches: 26, Mismatches: 6, Indels: 0 0.81 0.19 0.00 Matches are distributed among these distances: 21 26 1.00 ACGTcount: A:0.47, C:0.19, G:0.09, T:0.25 Consensus pattern (21 bp): TTCAAAGATCACTCAAACAAG Found at i:20950983 original size:21 final size:21 Alignment explanation

Indices: 20950954--20951000 Score: 53 Period size: 21 Copynumber: 2.3 Consensus size: 21 20950944 CATTCTAAGT * 20950954 AATA-GTATCAATACCT-GAA 1 AATAGGTATCAATAACTGGAA * 20950973 GAATAGGTATCGATAACTGGAA 1 -AATAGGTATCAATAACTGGAA 20950995 AATAGG 1 AATAGG 20951001 AAGCCTTCTC Statistics Matches: 23, Mismatches: 2, Indels: 3 0.82 0.07 0.11 Matches are distributed among these distances: 20 4 0.17 21 16 0.70 22 3 0.13 ACGTcount: A:0.45, C:0.11, G:0.21, T:0.23 Consensus pattern (21 bp): AATAGGTATCAATAACTGGAA Found at i:20971784 original size:46 final size:46 Alignment explanation

Indices: 20971717--20971810 Score: 161 Period size: 46 Copynumber: 2.0 Consensus size: 46 20971707 TTGAATTTAC * * 20971717 ACACCTCTACCGACCAATATATTGAAAATAGTCATATTAACTCTCA 1 ACACCCCTACCGACCAATATATTGAAAATAATCATATTAACTCTCA * 20971763 ACACCCCTACCGGCCAATATATTGAAAATAATCATATTAACTCTCA 1 ACACCCCTACCGACCAATATATTGAAAATAATCATATTAACTCTCA 20971809 AC 1 AC 20971811 TAGTATATTT Statistics Matches: 45, Mismatches: 3, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 46 45 1.00 ACGTcount: A:0.39, C:0.28, G:0.06, T:0.27 Consensus pattern (46 bp): ACACCCCTACCGACCAATATATTGAAAATAATCATATTAACTCTCA Done.