Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Scaffold2335

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 46498
ACGTcount: A:0.31, C:0.19, G:0.19, T:0.30


Found at i:1775 original size:40 final size:40

Alignment explanation

Indices: 1731--1954 Score: 220 Period size: 40 Copynumber: 5.7 Consensus size: 40 1721 AGCTGACCAT 1731 ATCCGGAGCTAAGATCCGAAGGCATTTG-GCGAGATACTAA 1 ATCCGG-GCTAAGATCCGAAGGCATTTGTGCGAGATACTAA * 1771 ATCC-GACTAAGA-CCGAAGGCATTTGTGGCGAGATACTAA 1 ATCCGGGCTAAGATCCGAAGGCATTTGT-GCGAGATACTAA * 1810 ATCCGGACTAAGATCCG-AGGCAGTTTGTGCGAGATACT-A 1 ATCCGGGCTAAGATCCGAAGGCA-TTTGTGCGAGATACTAA * * * 1849 TTCCGGGCTTAAG-TCCCAAGGCA-TTGTGCGAGTTACTAA 1 ATCCGGGC-TAAGATCCGAAGGCATTTGTGCGAGATACTAA * 1888 ATCCGGGTTAA-AGTCCCG-AGGCATTTGTGCGA-ATTACTATA 1 ATCCGGGCTAAGA-T-CCGAAGGCATTTGTGCGAGA-TACTA-A * 1929 A-CCGGGCTATG-TCCCGAAGGCATTTG 1 ATCCGGGCTAAGAT-CCGAAGGCATTTG 1955 AATGAGAGCT Statistics Matches: 157, Mismatches: 11, Indels: 32 0.79 0.05 0.16 Matches are distributed among these distances: 37 13 0.08 38 22 0.14 39 45 0.29 40 67 0.43 41 10 0.06 ACGTcount: A:0.28, C:0.21, G:0.27, T:0.24 Consensus pattern (40 bp): ATCCGGGCTAAGATCCGAAGGCATTTGTGCGAGATACTAA Found at i:1808 original size:39 final size:39 Alignment explanation

Indices: 1731--1893 Score: 185 Period size: 39 Copynumber: 4.2 Consensus size: 39 1721 AGCTGACCAT 1731 ATCCGGAGCTAAGATCCGAAGGCATT-TGGCGAGATACTAA 1 ATCCGGA-CTAAGATCCGAAGGCATTGT-GCGAGATACTAA 1771 ATCC-GACTAAGA-CCGAAGGCATTTGTGGCGAGATACTAA 1 ATCCGGACTAAGATCCGAAGGCA-TTGT-GCGAGATACTAA 1810 ATCCGGACTAAGATCCG-AGGCAGTTTGTGCGAGATACT-A 1 ATCCGGACTAAGATCCGAAGGCA--TTGTGCGAGATACTAA * * * * 1849 TTCCGGGCTTAAG-TCCCAAGGCATTGTGCGAGTTACTAA 1 ATCCGGAC-TAAGATCCGAAGGCATTGTGCGAGATACTAA 1888 ATCCGG 1 ATCCGG 1894 GTTAAAGTCC Statistics Matches: 110, Mismatches: 5, Indels: 17 0.83 0.04 0.13 Matches are distributed among these distances: 37 9 0.08 38 21 0.19 39 36 0.33 40 36 0.33 41 8 0.07 ACGTcount: A:0.29, C:0.21, G:0.27, T:0.23 Consensus pattern (39 bp): ATCCGGACTAAGATCCGAAGGCATTGTGCGAGATACTAA Found at i:1888 original size:78 final size:77 Alignment explanation

Indices: 1731--1952 Score: 229 Period size: 78 Copynumber: 2.8 Consensus size: 77 1721 AGCTGACCAT * * * * 1731 ATCCGGAGCTAAGATCCGAAGGCATTTG-GCGAGATACTAAATCC-GACTAAGACCGAAGGCATT 1 ATCCGGA-CTAAGATCCG-AGGCATTTGTGCGAGATACT-ATTCCGGGCTAAGTCCCAAGGCA-T 1794 TGTGGCGAGATACTAA 62 TGTGGCGAGATACTAA 1810 ATCCGGACTAAGATCCGAGGCAGTTTGTGCGAGATACTATTCCGGGCTTAAGTCCCAAGGCATTG 1 ATCCGGACTAAGATCCGAGGCA-TTTGTGCGAGATACTATTCCGGGC-TAAGTCCCAAGGCATTG * 1875 T-GCGAGTTACTAA 64 TGGCGAGATACTAA ** * * 1888 ATCCGGGTTAA-AGTCCCGAGGCATTTGTGCGA-ATTACTATAACCGGGCTATGTCCCGAAGGCA 1 ATCCGGACTAAGA-T-CCGAGGCATTTGTGCGAGA-TACTAT-TCCGGGCTAAGTCCC-AAGGCA 1951 TT 61 TT 1953 TGAATGAGAG Statistics Matches: 125, Mismatches: 9, Indels: 18 0.82 0.06 0.12 Matches are distributed among these distances: 77 7 0.06 78 61 0.49 79 45 0.36 80 12 0.10 ACGTcount: A:0.28, C:0.22, G:0.27, T:0.24 Consensus pattern (77 bp): ATCCGGACTAAGATCCGAGGCATTTGTGCGAGATACTATTCCGGGCTAAGTCCCAAGGCATTGTG GCGAGATACTAA Found at i:11238 original size:40 final size:40 Alignment explanation

Indices: 11194--11457 Score: 298 Period size: 40 Copynumber: 6.7 Consensus size: 40 11184 TTGAATGATG * 11194 TCCGGGCTAAGTCCCGAAGGC-TTTGTGCTA-AGTGAC-ATAA 1 TCCGGGCTAAGTCCCGAAGGCATTTGTGCGAGA-T-ACTA-AA * 11234 TCCGGACTAAGAT-CCGAAGGCATTTGTGCGAGATACTAAA 1 TCCGGGCTAAG-TCCCGAAGGCATTTGTGCGAGATACTAAA * 11274 TCCGGACTAGAGAT-CCGAAGGCATTTGTGCGAGATACTAAA 1 TCCGGGCTA-AG-TCCCGAAGGCATTTGTGCGAGATACTAAA * * 11315 TCCGGACTAAGAT-CCGAAGGCATTTGTGCGAGATACTAAT 1 TCCGGGCTAAG-TCCCGAAGGCATTTGTGCGAGATACTAAA * 11355 TCC-GGCTAAGTCCCGAAGGCATTTGTGCGAGTTACTAAA 1 TCCGGGCTAAGTCCCGAAGGCATTTGTGCGAGATACTAAA * 11394 TCCGGGTTAAGTCCCGAAGGCA-TTGTGCGA-ATTACTATAA 1 TCCGGGCTAAGTCCCGAAGGCATTTGTGCGAGA-TACTA-AA * 11434 -CC-GGCTATGTCCCGAAGGCATTTG 1 TCCGGGCTAAGTCCCGAAGGCATTTG 11458 AACGAGGAGC Statistics Matches: 204, Mismatches: 10, Indels: 21 0.87 0.04 0.09 Matches are distributed among these distances: 38 17 0.08 39 52 0.25 40 83 0.41 41 51 0.25 42 1 0.00 ACGTcount: A:0.28, C:0.22, G:0.26, T:0.25 Consensus pattern (40 bp): TCCGGGCTAAGTCCCGAAGGCATTTGTGCGAGATACTAAA Found at i:11317 original size:81 final size:79 Alignment explanation

Indices: 11198--11457 Score: 329 Period size: 81 Copynumber: 3.3 Consensus size: 79 11188 ATGATGTCCG * 11198 GGCTAAGTCCCGAAGGC-TTTGTGCTA-AGTGAC-ATAATCCGGACTAAGATCCGAAGGCATTTG 1 GGCTAAGTCCCGAAGGCATTTGTGCGAGA-T-ACTA-AATCCGGACTAAGATCCGAAGGCATTTG 11260 TGCGAGATACTAAATCC 63 TGCGAGATACTAAATCC 11277 GGACTAGAGAT-CCGAAGGCATTTGTGCGAGATACTAAATCCGGACTAAGATCCGAAGGCATTTG 1 GG-CTA-AG-TCCCGAAGGCATTTGTGCGAGATACTAAATCCGGACTAAGATCCGAAGGCATTTG * 11341 TGCGAGATACTAATTCC 63 TGCGAGATACTAAATCC * ** 11358 GGCTAAGTCCCGAAGGCATTTGTGCGAGTTACTAAATCCGGGTTAAG-TCCCGAAGGCA-TTGTG 1 GGCTAAGTCCCGAAGGCATTTGTGCGAGATACTAAATCCGGACTAAGAT-CCGAAGGCATTTGTG 11421 CGA-ATTACTATAA-CC 65 CGAGA-TACTA-AATCC * 11436 GGCTATGTCCCGAAGGCATTTG 1 GGCTAAGTCCCGAAGGCATTTG 11458 AACGAGGAGC Statistics Matches: 164, Mismatches: 7, Indels: 21 0.85 0.04 0.11 Matches are distributed among these distances: 77 1 0.01 78 38 0.23 79 49 0.30 80 6 0.04 81 58 0.35 82 11 0.07 83 1 0.01 ACGTcount: A:0.28, C:0.21, G:0.26, T:0.25 Consensus pattern (79 bp): GGCTAAGTCCCGAAGGCATTTGTGCGAGATACTAAATCCGGACTAAGATCCGAAGGCATTTGTGC GAGATACTAAATCC Found at i:11475 original size:78 final size:78 Alignment explanation

Indices: 11198--11489 Score: 237 Period size: 81 Copynumber: 3.7 Consensus size: 78 11188 ATGATGTCCG ** * * 11198 GGCTAAGTCCCGAAGGC-TTTGTGCTAAGTGAC-ATAATCCGGACTAAGAT-CCGAAGGCATTTG 1 GGCTAAGTCCCGAAGGCATTTGAAC-GAGTGACTA-AATCCGG-TTAA-ATCCCGAAGGCATTTG 11260 TGCGAGATACTAA-ATCC 62 TGCGAGATACTAATA-CC ** * 11277 GGACTAGAGAT-CCGAAGGCATTTGTGCGAGAT-ACTAAATCCGGACTAAGAT-CCGAAGGCATT 1 GG-CTA-AG-TCCCGAAGGCATTTGAACGAG-TGACTAAATCCGG-TTAA-ATCCCGAAGGCATT * 11339 TGTGCGAGATACTAATTCC 60 TGTGCGAGATACTAATACC ** * * 11358 GGCTAAGTCCCGAAGGCATTTGTGCGAGTTACTAAATCCGGGTTAAGTCCCGAAGGCA-TTGTGC 1 GGCTAAGTCCCGAAGGCATTTGAACGAGTGACTAAATCC-GGTTAAATCCCGAAGGCATTTGTGC 11422 GA-ATTACT-ATAACC 65 GAGA-TACTAAT-ACC * * * 11436 GGCTATGTCCCGAAGGCATTTGAACGAG-GAGCTATATCCGGTTAAATTCCGAAG 1 GGCTAAGTCCCGAAGGCATTTGAACGAGTGA-CTAAATCCGGTTAAATCCCGAAG 11490 TACGTGATTT Statistics Matches: 187, Mismatches: 12, Indels: 30 0.82 0.05 0.13 Matches are distributed among these distances: 77 17 0.09 78 49 0.26 79 44 0.24 80 8 0.04 81 59 0.32 82 10 0.05 ACGTcount: A:0.29, C:0.21, G:0.26, T:0.24 Consensus pattern (78 bp): GGCTAAGTCCCGAAGGCATTTGAACGAGTGACTAAATCCGGTTAAATCCCGAAGGCATTTGTGCG AGATACTAATACC Found at i:14382 original size:36 final size:37 Alignment explanation

Indices: 14310--14387 Score: 122 Period size: 36 Copynumber: 2.1 Consensus size: 37 14300 TTATTACGAA * * 14310 GTCTTACCCGGACATAATCTCCACACGAAGTTATCGG 1 GTCTTACCCGGACATAATCCCCACACGAAGTCATCGG * 14347 GTCTTACCCGGACA-AATCCCCACACGTAGTCATCGG 1 GTCTTACCCGGACATAATCCCCACACGAAGTCATCGG 14383 GTCTT 1 GTCTT 14388 TAGAGCTCGG Statistics Matches: 38, Mismatches: 3, Indels: 1 0.90 0.07 0.02 Matches are distributed among these distances: 36 24 0.63 37 14 0.37 ACGTcount: A:0.24, C:0.32, G:0.19, T:0.24 Consensus pattern (37 bp): GTCTTACCCGGACATAATCCCCACACGAAGTCATCGG Found at i:14584 original size:47 final size:47 Alignment explanation

Indices: 14506--15129 Score: 1049 Period size: 47 Copynumber: 13.5 Consensus size: 47 14496 CCCTTCGGGA * * * * * * 14506 CTTATCACATTTATACACTTTCACATCCATCACGTTGGCCACTCGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC * * 14553 CCTGTCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 14600 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 14647 CTTATCACATATATACACTTTCACATTCATCACATCGG-C-TTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC * 14692 CTTATCACATATATACACTTTCGCATTCATCAC-TCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 14738 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 14785 CTTATCACATATATATACACTTTCACATTCATCACATCGGCC-TTAGGC 1 CTTATCAC--ATATATACACTTTCACATTCATCACATCGGCCATTAGGC * 14833 CTTATCACATATATATACTTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 14880 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 14927 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 14974 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 15021 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGG- 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 15067 CTTATCA-ATAT-TACAC-TTCAC--TCAT---ATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 15106 CTTATCACATATATACACTTTCAC 1 CTTATCACATATATACACTTTCAC 15130 GTATACAACT Statistics Matches: 553, Mismatches: 14, Indels: 25 0.93 0.02 0.04 Matches are distributed among these distances: 38 13 0.02 39 7 0.01 40 4 0.01 41 9 0.02 42 5 0.01 43 5 0.01 44 9 0.02 45 43 0.08 46 77 0.14 47 335 0.61 48 14 0.03 49 32 0.06 ACGTcount: A:0.29, C:0.30, G:0.09, T:0.33 Consensus pattern (47 bp): CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC Found at i:15110 original size:39 final size:39 Alignment explanation

Indices: 15046--15125 Score: 119 Period size: 38 Copynumber: 2.1 Consensus size: 39 15036 CACTTTCACA 15046 TTCATCACATCGGCCATTAGGCTTATCA-ATAT-TACAC 1 TTCATCACATCGGCCATTAGGCTTATCACATATATACAC * 15083 TTCACTCATATCGGCCATTAGGCCTTATCACATATATACAC 1 TTCA-TCACATCGGCCATTAGG-CTTATCACATATATACAC 15124 TT 1 TT 15126 TCACGTATAC Statistics Matches: 38, Mismatches: 1, Indels: 4 0.88 0.02 0.09 Matches are distributed among these distances: 37 4 0.11 38 16 0.42 39 7 0.18 40 4 0.11 41 7 0.18 ACGTcount: A:0.29, C:0.28, G:0.10, T:0.34 Consensus pattern (39 bp): TTCATCACATCGGCCATTAGGCTTATCACATATATACAC Found at i:18824 original size:40 final size:39 Alignment explanation

Indices: 18671--19043 Score: 352 Period size: 39 Copynumber: 9.8 Consensus size: 39 18661 CGAGCATGAT * * * 18671 TGCTCTTCGGGACCTAGCCCGGATATACCACGCAGCA-GAA 1 TGCTCTTCGGGACTTAG-CCGGATATATCAC-TAGCACGAA * 18711 TGCTCTTCGGG-TTTAGCACGGATATATCACTAGCACGAA 1 TGCTCTTCGGGACTTAGC-CGGATATATCACTAGCACGAA * * 18750 TGCTCTTC-GAACTTAGTCCGGATACATCACTAGCACGAA 1 TGCTCTTCGGGACTTAG-CCGGATATATCACTAGCACGAA * * 18789 TGCTCTTCGGGATTTAACCTGGATATATCACTAGCACGAA 1 TGCTCTTCGGGACTTAGCC-GGATATATCACTAGCACGAA * * * * * * * 18829 --CT-TGTCCGGA-TTA-TC--A-CTAGCAC--GAATG-C 1 TGCTCT-TCGGGACTTAGCCGGATATATCACTAGCACGAA ** * 18858 T-CT-TCGGAGACTTAGCCGGATATATCA-TAGCACGAA 1 TGCTCTTCGGGACTTAGCCGGATATATCACTAGCACGAA 18894 TGCTCTTCGGGACTTAGCCCGGATATATCACTAGCACGAA 1 TGCTCTTCGGGACTTAG-CCGGATATATCACTAGCACGAA 18934 TGCTCTTC-GGACTTAGCCCGGATATATCACTAGCACGAA 1 TGCTCTTCGGGACTTAG-CCGGATATATCACTAGCACGAA 18973 TGCTCTTC-GGACTTAGCCCGGATATATCACTAGCACGAA 1 TGCTCTTCGGGACTTAG-CCGGATATATCACTAGCACGAA * 19012 TGCTCTTCGGGACTTAGCCAAGATATATCACT 1 TGCTCTTCGGGACTTAGCC-GGATATATCACT 19044 CTCAATTCTC Statistics Matches: 279, Mismatches: 32, Indels: 44 0.79 0.09 0.12 Matches are distributed among these distances: 29 2 0.01 30 9 0.03 31 1 0.00 32 5 0.02 33 2 0.01 34 4 0.01 35 3 0.01 36 2 0.01 37 6 0.02 38 22 0.08 39 151 0.54 40 72 0.26 ACGTcount: A:0.26, C:0.27, G:0.21, T:0.26 Consensus pattern (39 bp): TGCTCTTCGGGACTTAGCCGGATATATCACTAGCACGAA Found at i:18879 original size:66 final size:67 Alignment explanation

Indices: 18757--18893 Score: 208 Period size: 66 Copynumber: 2.1 Consensus size: 67 18747 GAATGCTCTT * 18757 CGAACTTAGTCCGGATACATCACTAGCACGAATGCTCTTCGGGATTTAACCTGGATATATCACTA 1 CGAACTTAGTCCGGATACATCACTAGCACGAATGCTCTTCGGGACTTAACCTGGATATATCA-TA 18822 GCA 65 GCA * * 18825 CGAACTT-GTCCGGAT-TATCACTAGCACGAATGCTCTTCGGAGACTTAGCC-GGATATATCATA 1 CGAACTTAGTCCGGATACATCACTAGCACGAATGCTCTTCGG-GACTTAACCTGGATATATCATA 18887 GCA 65 GCA 18890 CGAA 1 CGAA 18894 TGCTCTTCGG Statistics Matches: 65, Mismatches: 3, Indels: 5 0.89 0.04 0.07 Matches are distributed among these distances: 65 9 0.14 66 34 0.52 67 15 0.23 68 7 0.11 ACGTcount: A:0.29, C:0.25, G:0.20, T:0.26 Consensus pattern (67 bp): CGAACTTAGTCCGGATACATCACTAGCACGAATGCTCTTCGGGACTTAACCTGGATATATCATAG CA Found at i:18968 original size:79 final size:79 Alignment explanation

Indices: 18834--19043 Score: 356 Period size: 79 Copynumber: 2.7 Consensus size: 79 18824 ACGAACTTGT 18834 CCGGAT-TATCACTAGCACGAATGCTCTTCGGAGACTTAG-CCGGATATATCA-TAGCACGAATG 1 CCGGATATATCACTAGCACGAATGCTCTTCGG-GACTTAGCCCGGATATATCACTAGCACGAATG 18896 CTCTTCGGGACTTAGC 65 CTCTTC-GGACTTAGC 18912 CCGGATATATCACTAGCACGAATGCTCTTC-GGACTTAGCCCGGATATATCACTAGCACGAATGC 1 CCGGATATATCACTAGCACGAATGCTCTTCGGGACTTAGCCCGGATATATCACTAGCACGAATGC 18976 TCTTCGGACTTAGC 66 TCTTCGGACTTAGC ** 18990 CCGGATATATCACTAGCACGAATGCTCTTCGGGACTTAGCCAAGATATATCACT 1 CCGGATATATCACTAGCACGAATGCTCTTCGGGACTTAGCCCGGATATATCACT 19044 CTCAATTCTC Statistics Matches: 126, Mismatches: 2, Indels: 7 0.93 0.01 0.05 Matches are distributed among these distances: 77 7 0.06 78 58 0.46 79 61 0.48 ACGTcount: A:0.27, C:0.27, G:0.21, T:0.26 Consensus pattern (79 bp): CCGGATATATCACTAGCACGAATGCTCTTCGGGACTTAGCCCGGATATATCACTAGCACGAATGC TCTTCGGACTTAGC Found at i:18980 original size:184 final size:183 Alignment explanation

Indices: 18671--19011 Score: 526 Period size: 184 Copynumber: 1.9 Consensus size: 183 18661 CGAGCATGAT * * 18671 TGCTCTTCGGGACCTAGCCCGGATATACCACGCAGCAGAATGCTCTTCGGGTTTAGCACGGATAT 1 TGCTCTTCGGGACCTAGCCCGGATATACCAAGCAGCAGAATGCTCTTCGGGCTTAGCACGGATAT * 18736 ATCACTAGCACGAATGCTCTTCGAACTTAGTCCGGATACATCACTAGCACGAATGCTCTTCGGGA 66 ATCACTAGCACGAATGCTCTTCGAACTTAGCCCGGATACATCACTAGCACGAATGCTCTTC-GGA * * 18801 TTTAACCTGGATATATCACTAGCACGAACTTGTCCGGATTATCACTAGCACGAA 130 CTTAACCCGGATATATCACTAGCACGAACTTGTCCGGATTATCACTAGCACGAA * * * 18855 TGCTCTTCGGAGACTTAG-CCGGATATATCATAGCA-C-GAATGCTCTTCGGGACTTAGCCCGGA 1 TGCTCTTCGG-GACCTAGCCCGGATATACCA-AGCAGCAGAATGCTCTTCGGG-CTTAGCACGGA * * 18917 TATATCACTAGCACGAATGCTCTTCGGACTTAGCCCGGATATATCACTAGCACGAATGCTCTTCG 63 TATATCACTAGCACGAATGCTCTTCGAACTTAGCCCGGATACATCACTAGCACGAATGCTCTTCG * 18982 GACTTAGCCCGGATATATCACTAGCACGAA 128 GACTTAACCCGGATATATCACTAGCACGAA 19012 TGCTCTTCGG Statistics Matches: 143, Mismatches: 11, Indels: 7 0.89 0.07 0.04 Matches are distributed among these distances: 183 42 0.29 184 92 0.64 185 9 0.06 ACGTcount: A:0.26, C:0.27, G:0.21, T:0.26 Consensus pattern (183 bp): TGCTCTTCGGGACCTAGCCCGGATATACCAAGCAGCAGAATGCTCTTCGGGCTTAGCACGGATAT ATCACTAGCACGAATGCTCTTCGAACTTAGCCCGGATACATCACTAGCACGAATGCTCTTCGGAC TTAACCCGGATATATCACTAGCACGAACTTGTCCGGATTATCACTAGCACGAA Found at i:19088 original size:12 final size:13 Alignment explanation

Indices: 19061--19089 Score: 51 Period size: 13 Copynumber: 2.3 Consensus size: 13 19051 CTCATGTACA 19061 CATACACATATAG 1 CATACACATATAG 19074 CATACACAT-TAG 1 CATACACATATAG 19086 CATA 1 CATA 19090 TCATTTACAT Statistics Matches: 16, Mismatches: 0, Indels: 1 0.94 0.00 0.06 Matches are distributed among these distances: 12 7 0.44 13 9 0.56 ACGTcount: A:0.45, C:0.24, G:0.07, T:0.24 Consensus pattern (13 bp): CATACACATATAG Found at i:20304 original size:37 final size:37 Alignment explanation

Indices: 20254--20332 Score: 113 Period size: 37 Copynumber: 2.1 Consensus size: 37 20244 TTATTACGAA * * * 20254 GTCTTACCCGGACATAATCTCCACACGAAGTTATCGG 1 GTCTTACCCGGACAAAATCCCCACACGAAGTCATCGG * * 20291 GTCTTACCCGGACAAAATCCCCACGCGTAGTCATCGG 1 GTCTTACCCGGACAAAATCCCCACACGAAGTCATCGG 20328 GTCTT 1 GTCTT 20333 TAGAGCTCGT Statistics Matches: 37, Mismatches: 5, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 37 37 1.00 ACGTcount: A:0.24, C:0.32, G:0.20, T:0.24 Consensus pattern (37 bp): GTCTTACCCGGACAAAATCCCCACACGAAGTCATCGG Found at i:20529 original size:47 final size:47 Alignment explanation

Indices: 20451--20919 Score: 731 Period size: 47 Copynumber: 9.9 Consensus size: 47 20441 CCCTTCGGGA * * * * * * * 20451 CTTATCACATTTATACACTTTCACATCCATCACGTTGGCCACTCGAC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC * * 20498 CCTGTCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 20545 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 20592 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC * * * 20639 ATTATCACATATATACACTTTCACATTCATCACATCAGCTATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC * * 20686 CTTATCACATATATACACTTTCGCATTCATCACTTCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC * * 20733 CTTACCACATATATACACCTTCACATTCATCACATCGGCCATTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC * 20780 CTTATCACATATATACACTTTCACATTCATCACATCGGCCGTTAGGC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC * 20827 CTTATCACATATATATACACTTTCACATTCATCACATCGGCCGTTAGGC 1 CTTATCAC--ATATATACACTTTCACATTCATCACATCGGCCATTAGGC * 20876 CTTATCACATATATATACACTTTAACATTCATCACATCGGCCAT 1 CTTATCAC--ATATATACACTTTCACATTCATCACATCGGCCAT 20920 ATATATACAC Statistics Matches: 392, Mismatches: 28, Indels: 2 0.93 0.07 0.00 Matches are distributed among these distances: 47 311 0.79 49 81 0.21 ACGTcount: A:0.29, C:0.30, G:0.09, T:0.32 Consensus pattern (47 bp): CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGC Found at i:20612 original size:94 final size:94 Alignment explanation

Indices: 20451--21129 Score: 872 Period size: 94 Copynumber: 7.3 Consensus size: 94 20441 CCCTTCGGGA * * * * * * * * * 20451 CTTATCACATTTATACACTTTCACATCCATCACGTTGGCCACTCGACCCTGTCACATATATACAC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC 20516 TTTCACATTCATCACATCGGCCATTAGGC 66 TTTCACATTCATCACATCGGCCATTAGGC 20545 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC 20610 TTTCACATTCATCACATCGGCCATTAGGC 66 TTTCACATTCATCACATCGGCCATTAGGC * * * 20639 ATTATCACATATATACACTTTCACATTCATCACATCAGCTATTAGGCCTTATCACATATATACAC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC * * 20704 TTTCGCATTCATCACTTCGGCCATTAGGC 66 TTTCACATTCATCACATCGGCCATTAGGC * * 20733 CTTACCACATATATACACCTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC * 20798 TTTCACATTCATCACATCGGCCGTTAGGC 66 TTTCACATTCATCACATCGGCCATTAGGC * 20827 CTTATCACATATATATACACTTTCACATTCATCACATCGGCCGTTAGGCCTTATCACATATATAT 1 CTTATCAC--ATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCAC--ATATAT * 20892 ACACTTTAACATTCATCACATCGGCCA-TA--- 62 ACACTTTCACATTCATCACATCGGCCATTAGGC * * * * * 20921 -TATAT-ACACT-T-T-CACATTCATCA--CATCGGCCGTTAGGCC-TTA-TCGC--AT---ATA 1 CT-TATCACA-TATATACACTTTCA-CATTCATC--AC-ATCGGCCATTAGGC-CTTATCACATA 20972 TATACACTTTCACATTCATCACATCGGCCATTAGGC 59 TATACACTTTCACATTCATCACATCGGCCATTAGGC 21008 CTTATCACATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATAC 1 CTTATCAC--ATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATAC 21073 ACTTTCACATTCATCACATCGGCCATTAGGC 64 ACTTTCACATTCATCACATCGGCCATTAGGC * 21104 CTTATCAGATATATATACACTTTCAC 1 CTTATC--ACATATATACACTTTCAC 21130 GTATACACAC Statistics Matches: 513, Mismatches: 39, Indels: 64 0.83 0.06 0.10 Matches are distributed among these distances: 83 32 0.06 84 2 0.00 87 3 0.01 88 9 0.02 89 9 0.02 90 15 0.03 91 15 0.03 92 9 0.02 93 9 0.02 94 270 0.53 96 106 0.21 97 2 0.00 98 32 0.06 ACGTcount: A:0.29, C:0.29, G:0.09, T:0.32 Consensus pattern (94 bp): CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC TTTCACATTCATCACATCGGCCATTAGGC Found at i:20705 original size:141 final size:141 Alignment explanation

Indices: 20451--21129 Score: 979 Period size: 141 Copynumber: 4.8 Consensus size: 141 20441 CCCTTCGGGA * * * * * * * * * 20451 CTTATCACATTTATACACTTTCACATCCATCACGTTGGCCACTCGACCCTGTCACATATATACAC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC 20516 TTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACACTTTCACATTCATCACATC 66 TTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACACTTTCACATTCATCACATC 20581 GGCCATTAGGC 131 GGCCATTAGGC * 20592 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCATTATCACATATATACAC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC * * * * 20657 TTTCACATTCATCACATCAGCTATTAGGCCTTATCACATATATACACTTTCGCATTCATCACTTC 66 TTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACACTTTCACATTCATCACATC 20722 GGCCATTAGGC 131 GGCCATTAGGC * * 20733 CTTACCACATATATACACCTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC 1 CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC * 20798 TTTCACATTCATCACATCGGCCGTTAGGCCTTATCACATATATATACACTTTCACATTCATCACA 66 TTTCACATTCATCACATCGGCCATTAGGCCTTATCAC--ATATATACACTTTCACATTCATCACA * 20863 TCGGCCGTTAGGC 129 TCGGCCATTAGGC * 20876 CTTATCACATATATATACACTTTAACATTCATCACAT----C----GGCC--AT---ATATATAC 1 CTTATCAC--ATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATAC * * 20928 ACTTTCACATTCATCACATCGGCCGTTAGGCCTTATCGCATATATATACACTTTCACATTCATCA 64 ACTTTCACATTCATCACATCGGCCATTAGGCCTTAT--CACATATATACACTTTCACATTCATCA 20993 CATCGGCCATTAGGC 127 CATCGGCCATTAGGC 21008 CTTATCACATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATAC 1 CTTATCAC--ATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATAC * 21073 ACTTTCACATTCATCACATCGGCCATTAGGCCTTATCAGATATATATACACTTTCAC 64 ACTTTCACATTCATCACATCGGCCATTAGGCCTTATC--ACATATATACACTTTCAC 21130 GTATACACAC Statistics Matches: 487, Mismatches: 30, Indels: 38 0.88 0.05 0.07 Matches are distributed among these distances: 132 118 0.24 134 2 0.00 135 2 0.00 136 1 0.00 137 4 0.01 140 4 0.01 141 224 0.46 142 2 0.00 143 44 0.09 145 86 0.18 ACGTcount: A:0.29, C:0.29, G:0.09, T:0.32 Consensus pattern (141 bp): CTTATCACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACAC TTTCACATTCATCACATCGGCCATTAGGCCTTATCACATATATACACTTTCACATTCATCACATC GGCCATTAGGC Found at i:20926 original size:34 final size:34 Alignment explanation

Indices: 20883--20951 Score: 129 Period size: 34 Copynumber: 2.0 Consensus size: 34 20873 GGCCTTATCA 20883 CATATATATACACTTTAACATTCATCACATCGGC 1 CATATATATACACTTTAACATTCATCACATCGGC * 20917 CATATATATACACTTTCACATTCATCACATCGGC 1 CATATATATACACTTTAACATTCATCACATCGGC 20951 C 1 C 20952 GTTAGGCCTT Statistics Matches: 34, Mismatches: 1, Indels: 0 0.97 0.03 0.00 Matches are distributed among these distances: 34 34 1.00 ACGTcount: A:0.33, C:0.29, G:0.06, T:0.32 Consensus pattern (34 bp): CATATATATACACTTTAACATTCATCACATCGGC Found at i:20927 original size:83 final size:83 Alignment explanation

Indices: 20834--21002 Score: 320 Period size: 83 Copynumber: 2.0 Consensus size: 83 20824 GGCCTTATCA 20834 CATATATATACACTTTCACATTCATCACATCGGCCGTTAGGCCTTATCACATATATATACACTTT 1 CATATATATACACTTTCACATTCATCACATCGGCCGTTAGGCCTTATCACATATATATACACTTT 20899 AACATTCATCACATCGGC 66 AACATTCATCACATCGGC * 20917 CATATATATACACTTTCACATTCATCACATCGGCCGTTAGGCCTTATCGCATATATATACACTTT 1 CATATATATACACTTTCACATTCATCACATCGGCCGTTAGGCCTTATCACATATATATACACTTT * 20982 CACATTCATCACATCGGC 66 AACATTCATCACATCGGC 21000 CAT 1 CAT 21003 TAGGCCTTAT Statistics Matches: 84, Mismatches: 2, Indels: 0 0.98 0.02 0.00 Matches are distributed among these distances: 83 84 1.00 ACGTcount: A:0.30, C:0.28, G:0.09, T:0.33 Consensus pattern (83 bp): CATATATATACACTTTCACATTCATCACATCGGCCGTTAGGCCTTATCACATATATATACACTTT AACATTCATCACATCGGC Found at i:20971 original size:49 final size:49 Alignment explanation

Indices: 20917--21129 Score: 369 Period size: 49 Copynumber: 4.4 Consensus size: 49 20907 TCACATCGGC * 20917 CATATATATACACTTTCACATTCATCACATCGGCCGTTAGGCCTTATCG 1 CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCG * 20966 CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCA 1 CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCG 21015 CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTAT-- 1 CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCG * 21062 CACATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCAG 1 CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATC-G 21112 -ATATATATACACTTTCAC 1 CATATATATACACTTTCAC 21130 GTATACACAC Statistics Matches: 157, Mismatches: 4, Indels: 6 0.94 0.02 0.04 Matches are distributed among these distances: 47 46 0.29 49 111 0.71 ACGTcount: A:0.30, C:0.28, G:0.09, T:0.33 Consensus pattern (49 bp): CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCG Found at i:21094 original size:96 final size:97 Alignment explanation

Indices: 20917--21129 Score: 383 Period size: 96 Copynumber: 2.2 Consensus size: 97 20907 TCACATCGGC * * 20917 CATATATATACACTTTCACATTCATCACATCGGCCGTTAGGCCTTATCGCATATATATACACTTT 1 CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATC-CACATATATACACTTT 20982 CACATTCATCACATCGGCCATTAGGCCTTATCA 65 CACATTCATCACATCGGCCATTAGGCCTTATCA 21015 CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTAT-CACATATATACACTTTC 1 CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCCACATATATACACTTTC 21079 ACATTCATCACATCGGCCATTAGGCCTTATCA 66 ACATTCATCACATCGGCCATTAGGCCTTATCA * 21111 GATATATATACACTTTCAC 1 CATATATATACACTTTCAC 21130 GTATACACAC Statistics Matches: 112, Mismatches: 3, Indels: 2 0.96 0.03 0.02 Matches are distributed among these distances: 96 66 0.59 98 46 0.41 ACGTcount: A:0.30, C:0.28, G:0.09, T:0.33 Consensus pattern (97 bp): CATATATATACACTTTCACATTCATCACATCGGCCATTAGGCCTTATCCACATATATACACTTTC ACATTCATCACATCGGCCATTAGGCCTTATCA Found at i:29735 original size:24 final size:24 Alignment explanation

Indices: 29708--29823 Score: 160 Period size: 24 Copynumber: 4.8 Consensus size: 24 29698 TTGACATTCC * * 29708 AATCCGCACACTTAGTGCCATATA 1 AATCCGCACACATAGTGCCATACA * 29732 AATCTGCACACATAGTGCCATACA 1 AATCCGCACACATAGTGCCATACA * * 29756 ATTCCGCACACATAGTGTCATACA 1 AATCCGCACACATAGTGCCATACA * 29780 AGTCCGCACACATAGTGCCATACA 1 AATCCGCACACATAGTGCCATACA * * 29804 AGTTCGCACACATAGTGCCA 1 AATCCGCACACATAGTGCCA 29824 AAGTCATTTC Statistics Matches: 83, Mismatches: 9, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 24 83 1.00 ACGTcount: A:0.34, C:0.30, G:0.15, T:0.22 Consensus pattern (24 bp): AATCCGCACACATAGTGCCATACA Found at i:29745 original size:48 final size:48 Alignment explanation

Indices: 29665--29823 Score: 153 Period size: 48 Copynumber: 3.3 Consensus size: 48 29655 ATGTTAGCCA * * * * * * 29665 GCACACATAGTGCCTTAGC-AATTCACATACATATTGACATTCCAA-TCC 1 GCACACATAGTGCCATA-CAAATCCGCACACATAGTGACA-TACAATTCC * * * * 29713 GCACACTTAGTGCCATATAAATCTGCACACATAGTGCCATACAATTCC 1 GCACACATAGTGCCATACAAATCCGCACACATAGTGACATACAATTCC * * * 29761 GCACACATAGTGTCATACAAGTCCGCACACATAGTGCCATACAAGTT-C 1 GCACACATAGTGCCATACAAATCCGCACACATAGTGACATACAA-TTCC 29809 GCACACATAGTGCCA 1 GCACACATAGTGCCA 29824 AAGTCATTTC Statistics Matches: 92, Mismatches: 16, Indels: 6 0.81 0.14 0.05 Matches are distributed among these distances: 47 4 0.04 48 86 0.93 49 2 0.02 ACGTcount: A:0.33, C:0.30, G:0.14, T:0.23 Consensus pattern (48 bp): GCACACATAGTGCCATACAAATCCGCACACATAGTGACATACAATTCC Found at i:29769 original size:72 final size:72 Alignment explanation

Indices: 29665--29823 Score: 189 Period size: 72 Copynumber: 2.2 Consensus size: 72 29655 ATGTTAGCCA * * * * * 29665 GCACACATAGTGCCTTAGCAATTCACATACATATTGACATTCCAA-TCCGCACACTTAGTGCCAT 1 GCACACATAGTGCCATAGCAATTCACACACATAGTGACA-TACAAGTCCGCACACATAGTGCCAT * 29729 ATAAA-TC 65 ACAAATTC * * 29736 TGCACACATAGTGCCATA-CAATTCCGCACACATAGTGTCATACAAGTCCGCACACATAGTGCCA 1 -GCACACATAGTGCCATAGCAATT-CACACACATAGTGACATACAAGTCCGCACACATAGTGCCA * 29800 TACAAGTTC 64 TACAAATTC 29809 GCACACATAGTGCCA 1 GCACACATAGTGCCA 29824 AAGTCATTTC Statistics Matches: 75, Mismatches: 9, Indels: 6 0.83 0.10 0.07 Matches are distributed among these distances: 71 9 0.12 72 64 0.85 73 2 0.03 ACGTcount: A:0.33, C:0.30, G:0.14, T:0.23 Consensus pattern (72 bp): GCACACATAGTGCCATAGCAATTCACACACATAGTGACATACAAGTCCGCACACATAGTGCCATA CAAATTC Found at i:35808 original size:19 final size:19 Alignment explanation

Indices: 35786--35824 Score: 69 Period size: 19 Copynumber: 2.1 Consensus size: 19 35776 TTTACTTTTT 35786 CCACCAATGCTATGAAATC 1 CCACCAATGCTATGAAATC * 35805 CCACCAATGCTGTGAAATC 1 CCACCAATGCTATGAAATC 35824 C 1 C 35825 TGCTCCCTCT Statistics Matches: 19, Mismatches: 1, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 19 19 1.00 ACGTcount: A:0.33, C:0.33, G:0.13, T:0.21 Consensus pattern (19 bp): CCACCAATGCTATGAAATC Found at i:38009 original size:39 final size:40 Alignment explanation

Indices: 37816--38025 Score: 221 Period size: 40 Copynumber: 5.3 Consensus size: 40 37806 CTAACGGGAT * * * * 37816 TAAGTCCCGAAGACATTTGTGCTAGTGATTA-ATTCCAGGC 1 TAAGTCCCGAAGGCATTTGTGCGAGTGACTATA-TCCGGGC * * * 37856 TAAGTCTCGAAGGCATTTGTGGGAGTTACTA-ATTCCGGGC 1 TAAGTCCCGAAGGCATTTGTGCGAGTGACTATA-TCCGGGC * 37896 TAAGTCCCGAAGGCATTTGTGCGAGTTACTATATCCGGGC 1 TAAGTCCCGAAGGCATTTGTGCGAGTGACTATATCCGGGC * * 37936 TAAGTCCCGAAGGCATTTGTGCTAGTGACCATATCCGGGC 1 TAAGTCCCGAAGGCATTTGTGCGAGTGACTATATCCGGGC * * * ** * 37976 TAAGACCCGACGGC-CTTGTGCGAGTGGTTATATCC-GGA 1 TAAGTCCCGAAGGCATTTGTGCGAGTGACTATATCCGGGC * 38014 TAAATCCCGAAG 1 TAAGTCCCGAAG 38026 ATACTTGGGT Statistics Matches: 146, Mismatches: 23, Indels: 4 0.84 0.13 0.02 Matches are distributed among these distances: 38 11 0.08 39 16 0.11 40 118 0.81 41 1 0.01 ACGTcount: A:0.24, C:0.22, G:0.27, T:0.27 Consensus pattern (40 bp): TAAGTCCCGAAGGCATTTGTGCGAGTGACTATATCCGGGC Found at i:43342 original size:27 final size:27 Alignment explanation

Indices: 43312--43488 Score: 153 Period size: 27 Copynumber: 6.6 Consensus size: 27 43302 ATATTGAGTC * * 43312 CGCACACTCAGTGCTATATAATCAACT 1 CGCACACTTAGTGCTACATAATCAACT * * 43339 CGCACACTTAGTGCTACGTAATCAAAT 1 CGCACACTTAGTGCTACATAATCAACT * 43366 CGCACACTTA-TGCTACATAGTCAAACT 1 CGCACACTTAGTGCTACATAATC-AACT ** ** * 43393 CGCCACACTTAGTGCCGCATGGTC-ATT 1 CG-CACACTTAGTGCTACATAATCAACT * * ** 43420 CACACACTTAGTGC-ATCATATTCATTT 1 CGCACACTTAGTGCTA-CATAATCAACT * * * 43447 CGCACACTTAGTGCAACATAGTCAAAT 1 CGCACACTTAGTGCTACATAATCAACT 43474 CGCACACTTAGTGCT 1 CGCACACTTAGTGCT 43489 GTACAATTTA Statistics Matches: 123, Mismatches: 21, Indels: 12 0.79 0.13 0.08 Matches are distributed among these distances: 26 27 0.22 27 78 0.63 28 9 0.07 29 9 0.07 ACGTcount: A:0.30, C:0.29, G:0.14, T:0.27 Consensus pattern (27 bp): CGCACACTTAGTGCTACATAATCAACT Done.