Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: scaffold_1052

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 37500
ACGTcount: A:0.32, C:0.16, G:0.19, T:0.33


Found at i:206 original size:44 final size:46

Alignment explanation

Indices: 105--272  Score: 155
Period size: 50  Copynumber: 3.6  Consensus size: 46

         95 TTATGAGAGC

                              *        *   *
        105 CAGTGTAAGACCATGTCTAGGACATGGCATCGGCATTGAGACGAGTGT
          1 CAGTGTAAGACCATGTCTGGGACAT-GAATCAGCA-TGAGACGAGTGT

                                                *
        153 CAGTGTAAGA-CATGTCTGGGACATGAATCAGC-TGCGA-GATGTGT
          1 CAGTGTAAGACCATGTCTGGGACATGAATCAGCATGAGACGA-GTGT

                                       *   *    *  *       *
        197 CAGTGTAAGACCATGTCTGGGACATGGCATCTGCACGGATATGCGAGAG-
          1 CAGTGTAAGACCATGTCTGGGACAT-GAATCAGCA-TGAGA--CGAGTGT

        246 CTAGTGTAAGACCATGTCTGGGACATG
          1 C-AGTGTAAGACCATGTCTGGGACATG

        273 GCATCGGCCT

Statistics
Matches: 101,  Mismatches: 10, Indels: 17
        0.79            0.08        0.13

Matches are distributed among these distances:
   43        2  0.02
   44       18  0.18
   45       14  0.14
   46       12  0.12
   47       13  0.13
   48       12  0.12
   49        2  0.02
   50       26  0.26
   51        2  0.02

ACGTcount: A:0.27, C:0.18, G:0.32, T:0.23

Consensus pattern (46 bp):
CAGTGTAAGACCATGTCTGGGACATGAATCAGCATGAGACGAGTGT


Found at i:3547 original size:43 final size:42

Alignment explanation

Indices: 3458--3566  Score: 130
Period size: 43  Copynumber: 2.6  Consensus size: 42

       3448 GATTACGTGT

                      *                  *           *
       3458 AAGACCATATTTGGGATATGGCATCGATATGAGACTTCATGT
          1 AAGACCATATCTGGGATATGGCATCGATACGAGACTTCATGC

                     *     *                       *
       3500 AAGACCATAGCTGGGCTATTGGCATCGATACGAGA-TTATATGC
          1 AAGACCATATCTGGGATA-TGGCATCGATACGAGACTT-CATGC

                           *
       3543 AAGACCATATCTGGGGTATGGCAT
          1 AAGACCATATCTGGGATATGGCAT

       3567 TAGTATGATA

Statistics
Matches: 57,  Mismatches: 8, Indels: 4
        0.83            0.12        0.06

Matches are distributed among these distances:
   42       23  0.40
   43       34  0.60

ACGTcount: A:0.30, C:0.17, G:0.26, T:0.28

Consensus pattern (42 bp):
AAGACCATATCTGGGATATGGCATCGATACGAGACTTCATGC


Found at i:9725 original size:39 final size:39

Alignment explanation

Indices: 9682--9838  Score: 242
Period size: 39  Copynumber: 4.0  Consensus size: 39

       9672 AAAGATACTA

                       *           *
       9682 GAAATGTATCCGGGCTAAAGTCCAGCAGGCTTCGTGCTG
          1 GAAATGTATCCAGGCTAAAGTCCCGCAGGCTTCGTGCTG

                       *             *
       9721 GAAATGTATCCGGGCTAAAGTCCCGTAGGCTTCGTGCTG
          1 GAAATGTATCCAGGCTAAAGTCCCGCAGGCTTCGTGCTG

                                     **          *
       9760 GAAATGTATCCAGGCTAAAGTCCCGTTGGCTTCGTGCAG
          1 GAAATGTATCCAGGCTAAAGTCCCGCAGGCTTCGTGCTG

                                            *
       9799 GAAATGTATCCAGGCTAAAGTCCCGCAGGCTTTGTGCTG
          1 GAAATGTATCCAGGCTAAAGTCCCGCAGGCTTCGTGCTG

       9838 G
          1 G

       9839 TAATATAATT

Statistics
Matches: 109,  Mismatches: 9, Indels: 0
        0.92            0.08        0.00

Matches are distributed among these distances:
   39      109  1.00

ACGTcount: A:0.22, C:0.23, G:0.30, T:0.25

Consensus pattern (39 bp):
GAAATGTATCCAGGCTAAAGTCCCGCAGGCTTCGTGCTG


Found at i:12110 original size:28 final size:28

Alignment explanation

Indices: 12074--12173  Score: 103
Period size: 28  Copynumber: 3.5  Consensus size: 28

      12064 GTGAACATAG

      12074 GCACTGTGTGTGCGAGTTCAGTAACCGA
          1 GCACTGTGTGTGCGAGTTCAGTAACCGA

               *         *  *      ****
      12102 GCATTGTGTGTGCCAGAT-AGGTTGTTGA
          1 GCACTGTGTGTGCGAGTTCA-GTAACCGA

                                *
      12130 GGCACTGTGTGTGCGAGTTCCGTAACCGA
          1 -GCACTGTGTGTGCGAGTTCAGTAACCGA

      12159 GCACTGTGTGTGCGA
          1 GCACTGTGTGTGCGA

      12174 CATCGATTGT

Statistics
Matches: 54,  Mismatches: 15, Indels: 6
        0.72            0.20        0.08

Matches are distributed among these distances:
   27        1  0.02
   28       34  0.63
   29       19  0.35

ACGTcount: A:0.18, C:0.19, G:0.35, T:0.28

Consensus pattern (28 bp):
GCACTGTGTGTGCGAGTTCAGTAACCGA


Found at i:12141 original size:57 final size:57

Alignment explanation

Indices: 12072--12200  Score: 204
Period size: 57  Copynumber: 2.3  Consensus size: 57

      12062 AAGTGAACAT

                                             *           *    *
      12072 AGGCACTGTGTGTGCGAGTTCAGTAACCGAGCATTGTGTGTGCCAGATAGGTTGTTG
          1 AGGCACTGTGTGTGCGAGTTCAGTAACCGAGCACTGTGTGTGCCACATAGATTGTTG

                                 *                     *    *
      12129 AGGCACTGTGTGTGCGAGTTCCGTAACCGAGCACTGTGTGTGCGACATCGATTGTTG
          1 AGGCACTGTGTGTGCGAGTTCAGTAACCGAGCACTGTGTGTGCCACATAGATTGTTG

      12186 AGGCACTGTGTGTGC
          1 AGGCACTGTGTGTGC

      12201 AAGATCGGTT

Statistics
Matches: 66,  Mismatches: 6, Indels: 0
        0.92            0.08        0.00

Matches are distributed among these distances:
   57       66  1.00

ACGTcount: A:0.18, C:0.19, G:0.35, T:0.29

Consensus pattern (57 bp):
AGGCACTGTGTGTGCGAGTTCAGTAACCGAGCACTGTGTGTGCCACATAGATTGTTG


Found at i:12142 original size:29 final size:29

Alignment explanation

Indices: 12106--12230  Score: 117
Period size: 29  Copynumber: 4.3  Consensus size: 29

      12096 AACCGAGCAT

                     *    *
      12106 TGTGTGTGCCAGATAGGTTGTTGAGGCAC
          1 TGTGTGTGCGAGATCGGTTGTTGAGGCAC

                        *  *  ****
      12135 TGTGTGTGCGAGTTCCGTAACCGA-GCAC
          1 TGTGTGTGCGAGATCGGTTGTTGAGGCAC

                       *    *
      12163 TGTGTGTGCGACATCGATTGTTGAGGCAC
          1 TGTGTGTGCGAGATCGGTTGTTGAGGCAC

                     *             *
      12192 TGTGTGTGCAAGATCGGTTGTTGGGGCAC
          1 TGTGTGTGCGAGATCGGTTGTTGAGGCAC

      12221 TAAGTGTGTG
          1 T--GTGTGTG

      12231 AAATGAGATC

Statistics
Matches: 73,  Mismatches: 20, Indels: 4
        0.75            0.21        0.04

Matches are distributed among these distances:
   28       20  0.27
   29       46  0.63
   31        7  0.10

ACGTcount: A:0.17, C:0.16, G:0.37, T:0.30

Consensus pattern (29 bp):
TGTGTGTGCGAGATCGGTTGTTGAGGCAC


Found at i:15061 original size:39 final size:39

Alignment explanation

Indices: 15016--15214  Score: 290
Period size: 39  Copynumber: 5.1  Consensus size: 39

      15006 ATGCAAGATA

                        *              *       *
      15016 CTGGAAATGTATTCGGGCTAAAGTCCCATAGGCTTTGTG
          1 CTGGAAATGTATCCGGGCTAAAGTCCCGTAGGCTTCGTG

                                        *       *
      15055 CTGGAAATGTATCCGGGCTAAAGTCCCGCAGGCTTCATG
          1 CTGGAAATGTATCCGGGCTAAAGTCCCGTAGGCTTCGTG

                                     *
      15094 CTGGAAATGTATCCGGGCTAAAGTCTCGTAGGCTTCGTG
          1 CTGGAAATGTATCCGGGCTAAAGTCCCGTAGGCTTCGTG

                          *              *
      15133 CTGGAAATGTATCCAGGCTAAAGTCCCGTTGGCTTCGTG
          1 CTGGAAATGTATCCGGGCTAAAGTCCCGTAGGCTTCGTG

             *            *             *      *
      15172 CAGGAAATGTATCCAGGCTAAAGTCCCGCAGGCTTTGTG
          1 CTGGAAATGTATCCGGGCTAAAGTCCCGTAGGCTTCGTG

      15211 CTGG
          1 CTGG

      15215 TAATATAATT

Statistics
Matches: 144,  Mismatches: 16, Indels: 0
        0.90            0.10        0.00

Matches are distributed among these distances:
   39      144  1.00

ACGTcount: A:0.22, C:0.22, G:0.29, T:0.27

Consensus pattern (39 bp):
CTGGAAATGTATCCGGGCTAAAGTCCCGTAGGCTTCGTG


Found at i:20062 original size:38 final size:38

Alignment explanation

Indices: 19999--20099  Score: 139
Period size: 38  Copynumber: 2.6  Consensus size: 38

      19989 ATTTGATGTG

                           *
      19999 TATCCGGGTTTAAAGACCCGCAGGCTTCGTGCTGGTAAAA
          1 TATCCGGG-TTAAA-TCCCGCAGGCTTCGTGCTGGTAAAA

                                               **
      20039 TATCCGGGTTAAATCCCGCAGGCTTCGTGCTGGTAGTA
          1 TATCCGGGTTAAATCCCGCAGGCTTCGTGCTGGTAAAA

                 *  *
      20077 TATCCAGGATAAATCCCGCAGGC
          1 TATCCGGGTTAAATCCCGCAGGC

      20100 CTAGTACTGG

Statistics
Matches: 56,  Mismatches: 5, Indels: 2
        0.89            0.08        0.03

Matches are distributed among these distances:
   38       43  0.77
   39        5  0.09
   40        8  0.14

ACGTcount: A:0.24, C:0.25, G:0.27, T:0.25

Consensus pattern (38 bp):
TATCCGGGTTAAATCCCGCAGGCTTCGTGCTGGTAAAA


Found at i:20110 original size:38 final size:38

Alignment explanation

Indices: 20015--20122  Score: 135
Period size: 38  Copynumber: 2.8  Consensus size: 38

      20005 GGTTTAAAGA

                                  *      *  *
      20015 CCCGCAGGCTTCGTGCTGGTAAAATATCCGGGTTAAAT
          1 CCCGCAGGCTTCGTGCTGGTAATATATCCAGGATAAAT

                                 *
      20053 CCCGCAGGCTTCGTGCTGGTAGTATATCCAGGATAAAT
          1 CCCGCAGGCTTCGTGCTGGTAATATATCCAGGATAAAT

                     * *  *      *     *
      20091 CCCGCAGGCCTAGTACTGGTATTATATTCAGG
          1 CCCGCAGGCTTCGTGCTGGTAATATATCCAGG

      20123 CCTTCATGCC

Statistics
Matches: 61,  Mismatches: 9, Indels: 0
        0.87            0.13        0.00

Matches are distributed among these distances:
   38       61  1.00

ACGTcount: A:0.23, C:0.24, G:0.26, T:0.27

Consensus pattern (38 bp):
CCCGCAGGCTTCGTGCTGGTAATATATCCAGGATAAAT


Found at i:20168 original size:43 final size:43

Alignment explanation

Indices: 20115--20269  Score: 256
Period size: 43  Copynumber: 3.6  Consensus size: 43

      20105 ACTGGTATTA

                                                 *
      20115 TATTCAGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTAATGTG
          1 TATTCAGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTGATGTG

                 *       *               *
      20158 TATTCGGGCCTTCGTGCCTAGCAGGCTTCATGCCGGTGATGTG
          1 TATTCAGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTGATGTG

                 *
      20201 TATTCGGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTGATGTG
          1 TATTCAGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTGATGTG

                         *
      20244 TATTCAGGCCTTCGTGCCTAGCAGGC
          1 TATTCAGGCCTTCATGCCTAGCAGGC

      20270 GTAATGCCGG

Statistics
Matches: 104,  Mismatches: 8, Indels: 0
        0.93            0.07        0.00

Matches are distributed among these distances:
   43      104  1.00

ACGTcount: A:0.14, C:0.26, G:0.30, T:0.30

Consensus pattern (43 bp):
TATTCAGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTGATGTG


Found at i:20260 original size:86 final size:86

Alignment explanation

Indices: 20115--20282  Score: 291
Period size: 86  Copynumber: 2.0  Consensus size: 86

      20105 ACTGGTATTA

                                                            *
      20115 TATTCAGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTAATGTGTATTCGGGCCTTCGTGCCTAGC
          1 TATTCAGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTAATGTGTATTCAGGCCTTCGTGCCTAGC

                * *
      20180 AGGCTTCATGCCGGTGATGTG
         66 AGGCGTAATGCCGGTGATGTG

                 *                               *
      20201 TATTCGGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTGATGTGTATTCAGGCCTTCGTGCCTAGC
          1 TATTCAGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTAATGTGTATTCAGGCCTTCGTGCCTAGC

      20266 AGGCGTAATGCCGGTGA
         66 AGGCGTAATGCCGGTGA

      20283 AATGATATGT

Statistics
Matches: 77,  Mismatches: 5, Indels: 0
        0.94            0.06        0.00

Matches are distributed among these distances:
   86       77  1.00

ACGTcount: A:0.14, C:0.26, G:0.31, T:0.29

Consensus pattern (86 bp):
TATTCAGGCCTTCATGCCTAGCAGGCTTCGTGCCGGTAATGTGTATTCAGGCCTTCGTGCCTAGC
AGGCGTAATGCCGGTGATGTG


Found at i:24849 original size:11 final size:11

Alignment explanation

Indices: 24833--24864  Score: 64
Period size: 11  Copynumber: 2.9  Consensus size: 11

      24823 GTAGAAAAAT

      24833 TATTTTTATTA
          1 TATTTTTATTA

      24844 TATTTTTATTA
          1 TATTTTTATTA

      24855 TATTTTTATT
          1 TATTTTTATT

      24865 GTCTACACTT

Statistics
Matches: 21,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   11       21  1.00

ACGTcount: A:0.25, C:0.00, G:0.00, T:0.75

Consensus pattern (11 bp):
TATTTTTATTA


Found at i:27484 original size:39 final size:40

Alignment explanation

Indices: 27367--27591  Score: 219
Period size: 40  Copynumber: 5.7  Consensus size: 40

      27357 GCTCCTCGTT

                            *     *
      27367 CAAATGCCTTCGGGACATAGCCTGG-TTATAGTAACTCGCA
          1 CAAATGCCTTCGGGACTTAGCCCGGATT-TAGTAACTCGCA

                               *
      27407 CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA
          1 CAAATGCCTTCGGGACTTAGCCCGGATTTAGTAACTCGCA

                                      *      *
      27447 CAAATGCCTTCGGG-CTTAGCCCGGAATTAGTATCTCGCA
          1 CAAATGCCTTCGGGACTTAGCCCGGATTTAGTAACTCGCA

                                 *      * *  *    *
      27486 CAAATGCCTTC-GGATCTTAGTCCGGATATGGTCACTTAGCA
          1 CAAATGCCTTCGGGA-CTTAGCCCGGATTTAGTAAC-TCGCA

                                 *   *  * *  *    *
      27527 CAAA-GCCTTC-GGATCTTAGTCCGAATATGGTCACTTAGCA
          1 CAAATGCCTTCGGGA-CTTAGCCCGGATTTAGTAAC-TCGCA

      27567 CAAA-GCCTTCGGGACTTAGCCCGGA
          1 CAAATGCCTTCGGGACTTAGCCCGGA

      27592 CATCATTCAA

Statistics
Matches: 164,  Mismatches: 16, Indels: 10
        0.86            0.08        0.05

Matches are distributed among these distances:
   38        2  0.01
   39       33  0.20
   40      116  0.71
   41       13  0.08

ACGTcount: A:0.26, C:0.27, G:0.22, T:0.25

Consensus pattern (40 bp):
CAAATGCCTTCGGGACTTAGCCCGGATTTAGTAACTCGCA


Found at i:27511 original size:79 final size:81

Alignment explanation

Indices: 27367--27591  Score: 212
Period size: 79  Copynumber: 2.8  Consensus size: 81

      27357 GCTCCTCGTT

                                  *  *
      27367 CAAATGCCTTCGGGACATAGCCTGGTTATAGTAACTCGCACAAATGCCTTCGGGACTTAACCCGG
          1 CAAATGCCTTCGGGACATAGCCCGGATATAGTAACTCGCACAAATGCCTTCGGGACTTAACCCGG

              *         *
      27432 ATTTAGTAAC-TCGCA
         66 ATATAGTAACTTAGCA

                            *                 *                          **
      27447 CAAATGCCTTCGGG-CTTAGCCCGGA-ATTAGTATCTCGCACAAATGCCTTC-GGATCTTAGTCC
          1 CAAATGCCTTCGGGACATAGCCCGGATA-TAGTAACTCGCACAAATGCCTTCGGGA-CTTAACCC

                  *  *
      27509 GGATATGGTCACTTAGCA
         64 GGATATAGTAACTTAGCA

                             *   *   *    *  *    *                      *
      27527 CAAA-GCCTTC-GGATCTTAGTCCGAATATGGTCACTTAGCACAAA-GCCTTCGGGACTTAGCCC
          1 CAAATGCCTTCGGGA-CATAGCCCGGATATAGTAAC-TCGCACAAATGCCTTCGGGACTTAACCC

      27589 GGA
         64 GGA

      27592 CATCATTCAA

Statistics
Matches: 120,  Mismatches: 17, Indels: 16
        0.78            0.11        0.10

Matches are distributed among these distances:
   78        6  0.05
   79       51  0.43
   80       51  0.43
   81       12  0.10

ACGTcount: A:0.26, C:0.27, G:0.22, T:0.25

Consensus pattern (81 bp):
CAAATGCCTTCGGGACATAGCCCGGATATAGTAACTCGCACAAATGCCTTCGGGACTTAACCCGG
ATATAGTAACTTAGCA


Found at i:35404 original size:40 final size:40

Alignment explanation

Indices: 35360--35583  Score: 246
Period size: 40  Copynumber: 5.6  Consensus size: 40

      35350 GCTCCTCGTT

                            *
      35360 CAAATGCCTTCGGGACATAGCCCGGTTATAGTAACTCGCA
          1 CAAATGCCTTCGGGACTTAGCCCGGTTATAGTAACTCGCA

                               *
      35400 CAAATGCCTTCGGGACTTAACCCGGATT-TAGTAACTCGCA
          1 CAAATGCCTTCGGGACTTAGCCCGG-TTATAGTAACTCGCA

                                      *       *
      35440 CAAATGCCTTCGGG-CTTAGCCCGG-AATTAGTATCTCGCA
          1 CAAATGCCTTCGGGACTTAGCCCGGTTA-TAGTAACTCGCA

                                 *             *
      35479 CAAATGCCTTC-GGATCTTAGTCCGGATT-TAGTATCTCGCA
          1 CAAATGCCTTCGGGA-CTTAGCCCGG-TTATAGTAACTCGCA

                                 *    *   *  *    *
      35519 CAAATGCCTTC-GGATCTTAGTCCGGATATGGTCACTTAGCA
          1 CAAATGCCTTCGGGA-CTTAGCCCGGTTATAGTAAC-TCGCA

      35560 CAAA-GCCTTCGGGACTTAGCCCGG
          1 CAAATGCCTTCGGGACTTAGCCCGG

      35584 ACATCGTTCA

Statistics
Matches: 161,  Mismatches: 13, Indels: 20
        0.83            0.07        0.10

Matches are distributed among these distances:
   38        2  0.01
   39       32  0.20
   40      114  0.71
   41       13  0.08

ACGTcount: A:0.25, C:0.27, G:0.22, T:0.26

Consensus pattern (40 bp):
CAAATGCCTTCGGGACTTAGCCCGGTTATAGTAACTCGCA


Found at i:35462 original size:79 final size:78

Alignment explanation

Indices: 35360--35584  Score: 274
Period size: 79  Copynumber: 2.8  Consensus size: 78

      35350 GCTCCTCGTT

                            *                                          *
      35360 CAAATGCCTTCGGGACATAGCCCGGTTATAGTAACTCGCACAAATGCCTTCGGGACTTAACCCGG
          1 CAAATGCCTTCGGG-CTTAGCCCGG-TATAGTAACTCGCACAAATGCCTTCGGGACTTAGCCCGG

      35425 ATTTAGTAACTCGCA
         64 ATTTAGTAACTCGCA

                                    *       *                           *
      35440 CAAATGCCTTCGGGCTTAGCCCGGAATTAGTATCTCGCACAAATGCCTTC-GGATCTTAGTCCGG
          1 CAAATGCCTTCGGGCTTAGCCCGGTA-TAGTAACTCGCACAAATGCCTTCGGGA-CTTAGCCCGG

                    *
      35504 ATTTAGTATCTCGCA
         64 ATTTAGTAACTCGCA

                          *     *        *  *    *
      35519 CAAATGCCTTCGGATCTTAGTCCGGATATGGTCACTTAGCACAAA-GCCTTCGGGACTTAGCCCG
          1 CAAATGCCTTCGG-GCTTAGCCCGG-TATAGTAAC-TCGCACAAATGCCTTCGGGACTTAGCCCG

      35583 GA
         63 GA

      35585 CATCGTTCAA

Statistics
Matches: 125,  Mismatches: 14, Indels: 12
        0.83            0.09        0.08

Matches are distributed among these distances:
   78        4  0.03
   79       66  0.53
   80       43  0.34
   81       12  0.10

ACGTcount: A:0.25, C:0.27, G:0.22, T:0.26

Consensus pattern (78 bp):
CAAATGCCTTCGGGCTTAGCCCGGTATAGTAACTCGCACAAATGCCTTCGGGACTTAGCCCGGAT
TTAGTAACTCGCA


Found at i:35544 original size:119 final size:120

Alignment explanation

Indices: 35360--35584  Score: 298
Period size: 119  Copynumber: 1.9  Consensus size: 120

      35350 GCTCCTCGTT

      35360 CAAATGCCTTCGGGACATAGCCCGGTTATAGTAACTCGCACAAATGCCTTCGGGACTTAACCCGG
          1 CAAATGCCTTCGGGACATAGCCCGGTTATAGTAACTCGCACAAATGCCTTCGGGACTTAACCCGG

              *         *
      35425 ATTTAGTAAC-TCGCACAAATGCCTTCGGG-CTTAGCCCGGAATTAGTATCTCGCA
         66 ATATAGTAACTTAGCACAAA-GCCTTCGGGACTTAGCCCGGAATTAGTATCTCGCA

                             *   *             *                          **
      35479 CAAATGCCTTC-GGATCTTAGTCCGGATT-TAGTATCTCGCACAAATGCCTTC-GGATCTTAGTC
          1 CAAATGCCTTCGGGA-CATAGCCCGG-TTATAGTAACTCGCACAAATGCCTTCGGGA-CTTAACC

                   *  *
      35541 CGGATATGGTCACTTAGCACAAAGCCTTCGGGACTTAGCCCGGA
         63 CGGATATAGTAACTTAGCACAAAGCCTTCGGGACTTAGCCCGGA

      35585 CATCGTTCAA

Statistics
Matches: 92,  Mismatches: 9, Indels: 9
        0.84            0.08        0.08

Matches are distributed among these distances:
  118        6  0.07
  119       65  0.71
  120       21  0.23

ACGTcount: A:0.25, C:0.27, G:0.22, T:0.26

Consensus pattern (120 bp):
CAAATGCCTTCGGGACATAGCCCGGTTATAGTAACTCGCACAAATGCCTTCGGGACTTAACCCGG
ATATAGTAACTTAGCACAAAGCCTTCGGGACTTAGCCCGGAATTAGTATCTCGCA


Done.