Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Scaffold2071

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 45480
ACGTcount: A:0.31, C:0.18, G:0.17, T:0.34


Found at i:243 original size:13 final size:13

Alignment explanation

Indices: 225--250  Score: 52
Period size: 13  Copynumber: 2.0  Consensus size: 13

       215 CAATTTTTTG

       225 TGTATCGATACAT
         1 TGTATCGATACAT

       238 TGTATCGATACAT
         1 TGTATCGATACAT

       251 ACTTGCTGTA

Statistics
Matches: 13,  Mismatches: 0,  Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  13   13  1.00

ACGTcount: A:0.31, C:0.15, G:0.15, T:0.38

Consensus pattern (13 bp):
TGTATCGATACAT


Found at i:1980 original size:29 final size:28

Alignment explanation

Indices: 1920--1988  Score: 70
Period size: 29  Copynumber: 2.5  Consensus size: 28

      1910 TTATTCACGA

                       *
      1920 GGATACT-CTAATCCCATTCTTTCGAAG
         1 GGATACTCCTAACCCCATTCTTTCGAAG

           *                       *  *
      1947 AGATACTCCTAACCCTCATTCTTTGGAGG
         1 GGATACTCCTAACCC-CATTCTTTCGAAG

                *
      1976 GGATATTCC-AACC
         1 GGATACTCCTAACC

      1989 TCGTTTTTAA

Statistics
Matches: 34,  Mismatches: 6,  Indels: 3
        0.79            0.14        0.07

Matches are distributed among these distances:
  27    6  0.18
  28   10  0.29
  29   18  0.53

ACGTcount: A:0.26, C:0.28, G:0.16, T:0.30

Consensus pattern (28 bp):
GGATACTCCTAACCCCATTCTTTCGAAG


Found at i:5403 original size:8 final size:8

Alignment explanation

Indices: 5382--5429  Score: 51
Period size: 9  Copynumber: 5.4  Consensus size: 8

      5372 GATAAGGGAG

      5382 AAAGAAAA
         1 AAAGAAAA

      5390 ATAAGAAAA
         1 A-AAGAAAA

      5399 AAAGAAGAAA
         1 AAAG-A-AAA

      5409 AGAAGAAAA
         1 A-AAGAAAA

      5418 AATAGAAAA
         1 AA-AGAAAA

      5427 AAA
         1 AAA

      5430 AACTAAGCTA

Statistics
Matches: 35,  Mismatches: 0,  Indels: 10
        0.78            0.00        0.22

Matches are distributed among these distances:
   8    6  0.17
   9   21  0.60
  10    5  0.14
  11    3  0.09

ACGTcount: A:0.81, C:0.00, G:0.15, T:0.04

Consensus pattern (8 bp):
AAAGAAAA


Found at i:5414 original size:19 final size:19

Alignment explanation

Indices: 5386--5428  Score: 61
Period size: 19  Copynumber: 2.3  Consensus size: 19

      5376 AGGGAGAAAG

                *
      5386 AAAAATAAGAAAAAA-AGA
         1 AAAAAGAAGAAAAAATAGA

      5404 AGAAAAGAAGAAAAAATAGA
         1 A-AAAAGAAGAAAAAATAGA

      5424 AAAAA
         1 AAAAA

      5429 AAACTAAGCT

Statistics
Matches: 22,  Mismatches: 1,  Indels: 3
        0.85            0.04        0.12

Matches are distributed among these distances:
  18    1  0.05
  19   17  0.77
  20    4  0.18

ACGTcount: A:0.81, C:0.00, G:0.14, T:0.05

Consensus pattern (19 bp):
AAAAAGAAGAAAAAATAGA


Found at i:6580 original size:163 final size:162

Alignment explanation

Indices: 6301--6741  Score: 593
Period size: 163  Copynumber: 2.7  Consensus size: 162

      6291 AGTAAAAATT

                                                   **  *
      6301 AAATATTAATCA-TTTTTTAATCTTTTATATTCCTATGGAGGTCGGTAGGGGTATAAAAGATTTC
         1 AAATATTAATCATTTTTTTAATCTTTTATATTCCTATGGAAATCCGTAGGGGTATAAAAGATTTC

                     *         ** *               *              *   *
      6365 AAAAACTTTTGGTATCTTTTCAGATTTCCATTAAGGTATGAATAAT-ATTTTTCGAAAGTAAATA
        66 AAAAACTTTTCGTATCTTTTTGGGTTTCCATTAAGGTA-CAA-AATGATTTTTCAAAAATAAATA

             *             *        *  **
      6429 AATAAAATTAAATTAATACTTTCATGATTGAAAC
       129 AACAAAATTAAATTAACACTTTCATAATCAAAAC

                                                       *    *
      6463 AAATATTAATCATTTTTTTAATCTTTTATATTCCTATGGAAATCTGTAGAGGTATAAAAGATTTC
         1 AAATATTAATCATTTTTTTAATCTTTTATATTCCTATGGAAATCCGTAGGGGTATAAAAGATTTC

                     *                *
      6528 -AAAACTTTTTGTATCTTTTTGGGTTTTCATTAAGGTACAAAATGGCATTTTTCAAAAATAAATA
        66 AAAAACTTTTCGTATCTTTTTGGGTTTCCATTAAGGTACAAAAT-G-ATTTTTCAAAAATAAATA

                         *                  *
      6592 AACAAAATTAAATTGACACTTTCATAATCAAAAT
       129 AACAAAATTAAATTAACACTTTCATAATCAAAAC

                                           **
      6626 AAATATTAATCATTTTTTT-ATCTTTTATATTTTTATGGAAATCCGTAGGGGTATAAAAGATTTC
         1 AAATATTAATCATTTTTTTAATCTTTTATATTCCTATGGAAATCCGTAGGGGTATAAAAGATTTC

                                                  *       *
      6690 AAAAACTTTTCGTATCTTTTTGGGTTTCCATTAAGGTACGAAATGATATTTC
        66 AAAAACTTTTCGTATCTTTTTGGGTTTCCATTAAGGTACAAAATGATTTTTC

      6742 GAGAAAGAGA

Statistics
Matches: 247,  Mismatches: 27,  Indels: 11
        0.87            0.09        0.04

Matches are distributed among these distances:
 160    3  0.01
 161    8  0.03
 162   86  0.35
 163  150  0.61

ACGTcount: A:0.37, C:0.10, G:0.12, T:0.41

Consensus pattern (162 bp):
AAATATTAATCATTTTTTTAATCTTTTATATTCCTATGGAAATCCGTAGGGGTATAAAAGATTTC
AAAAACTTTTCGTATCTTTTTGGGTTTCCATTAAGGTACAAAATGATTTTTCAAAAATAAATAAA
CAAAATTAAATTAACACTTTCATAATCAAAAC


Found at i:9234 original size:50 final size:50

Alignment explanation

Indices: 9154--9287  Score: 187
Period size: 50  Copynumber: 2.7  Consensus size: 50

      9144 TTCACAATAT

                        *           *
      9154 GTATCGATACATTATTCATTGTATCGATACATTCTGGGTTTTACCCAGAC
         1 GTATCGATACATTTTTCATTGTATCAATACATTCTGGGTTTTACCCAGAC

                           *                           *    *
      9204 GTATCGATACATTTTTTATTGTATCAATACATTCTGGGTTTTACTCAGAT
         1 GTATCGATACATTTTTCATTGTATCAATACATTCTGGGTTTTACCCAGAC

                         *   *      **
      9254 GTATCGATACATTTCTCAATGTATCGGTACATTC
         1 GTATCGATACATTTTTCATTGTATCAATACATTC

      9288 AGGCATTTTT

Statistics
Matches: 74,  Mismatches: 10,  Indels: 0
        0.88            0.12        0.00

Matches are distributed among these distances:
  50   74  1.00

ACGTcount: A:0.26, C:0.18, G:0.15, T:0.41

Consensus pattern (50 bp):
GTATCGATACATTTTTCATTGTATCAATACATTCTGGGTTTTACCCAGAC


Found at i:9277 original size:20 final size:21

Alignment explanation

Indices: 9247--9286  Score: 64
Period size: 20  Copynumber: 2.0  Consensus size: 21

      9237 CTGGGTTTTA

      9247 CTCAGATGTATCGATACATTT
         1 CTCAGATGTATCGATACATTT

                        *
      9268 CTCA-ATGTATCGGTACATT
         1 CTCAGATGTATCGATACATT

      9287 CAGGCATTTT

Statistics
Matches: 18,  Mismatches: 1,  Indels: 1
        0.90            0.05        0.05

Matches are distributed among these distances:
  20   14  0.78
  21    4  0.22

ACGTcount: A:0.28, C:0.20, G:0.15, T:0.38

Consensus pattern (21 bp):
CTCAGATGTATCGATACATTT


Found at i:11625 original size:13 final size:13

Alignment explanation

Indices: 11607--11632  Score: 52
Period size: 13  Copynumber: 2.0  Consensus size: 13

     11597 CAATTTTTTG

     11607 TGTATCGATACAT
         1 TGTATCGATACAT

     11620 TGTATCGATACAT
         1 TGTATCGATACAT

     11633 ACTTTGTGTA

Statistics
Matches: 13,  Mismatches: 0,  Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  13   13  1.00

ACGTcount: A:0.31, C:0.15, G:0.15, T:0.38

Consensus pattern (13 bp):
TGTATCGATACAT


Found at i:11644 original size:32 final size:33

Alignment explanation

Indices: 11587--11650  Score: 103
Period size: 32  Copynumber: 2.0  Consensus size: 33

     11577 TACAAGCCAA

                         **
     11587 TGTATCGATACAATTTTTTGTGTATCGATACAT
         1 TGTATCGATACAATACTTTGTGTATCGATACAT

     11620 TGTATCGATAC-ATACTTTGTGTATCGATACA
         1 TGTATCGATACAATACTTTGTGTATCGATACA

     11651 AGTTTGGCTA

Statistics
Matches: 29,  Mismatches: 2,  Indels: 1
        0.91            0.06        0.03

Matches are distributed among these distances:
  32   18  0.62
  33   11  0.38

ACGTcount: A:0.28, C:0.14, G:0.16, T:0.42

Consensus pattern (33 bp):
TGTATCGATACAATACTTTGTGTATCGATACAT


Found at i:16294 original size:15 final size:15

Alignment explanation

Indices: 16274--16304  Score: 53
Period size: 15  Copynumber: 2.1  Consensus size: 15

     16264 ATTTCGTTTG

                  *
     16274 GAGCTTCTTCATTTT
         1 GAGCTTCCTCATTTT

     16289 GAGCTTCCTCATTTT
         1 GAGCTTCCTCATTTT

     16304 G
         1 G

     16305 GACATTTTTA

Statistics
Matches: 15,  Mismatches: 1,  Indels: 0
        0.94            0.06        0.00

Matches are distributed among these distances:
  15   15  1.00

ACGTcount: A:0.13, C:0.23, G:0.16, T:0.48

Consensus pattern (15 bp):
GAGCTTCCTCATTTT


Found at i:19242 original size:157 final size:158

Alignment explanation

Indices: 19041--19333  Score: 387
Period size: 157  Copynumber: 1.9  Consensus size: 158

     19031 TTAAACCCCA

                  *
     19041 TTATTTCTGAGGGGATACTCCAACCCCAGCTTTATTTCTAAAACATCGATTTTTCACAATCGAG-
         1 TTATTTCCGAGGGGATACTCCAACCCCAGCTTTATTTCTAAAACATCGATTTTTCACAATCG-GT

                           *    *                 *          *  *
     19105 GATACTCCAACCCCATTATTTTTGAGGGGATACTCCAACTCTGGCTTTATTTCTAAAACATTAAT
        65 GATACTCCAACCCCATCATTTCTGAGGGGATACTCCAACCCTGGCTTTATCTCCAAAACATTAAT

     19170 TTTTCATAATCAGGGATACTTCAGCCTTG
       130 TTTTCATAATCAGGGATACTTCAGCCTTG

                               *         *                            *
     19199 TTATTTCCGA-GGGATACTCTAACCCC-GATTTTATTTCCT-AAACATCGATTTTTCACCATCGG
         1 TTATTTCCGAGGGGATACTCCAACCCCAG-CTTTATTT-CTAAAACATCGATTTTTCACAATCGG

                  *   * * *                            **             *
     19261 TGATACTTCAATCTCGTCATTTCTGAGGGGATACTCCAACCCTGTTTTTATCTCCAAAATATTAA
        64 TGATACTCCAACCCCATCATTTCTGAGGGGATACTCCAACCCTGGCTTTATCTCCAAAACATTAA

     19326 TTTTTCAT
       129 TTTTTCAT

     19334 CGGGGAATGT

Statistics
Matches: 116,  Mismatches: 16,  Indels: 7
        0.83            0.12        0.05

Matches are distributed among these distances:
 156    2  0.02
 157  103  0.89
 158   11  0.09

ACGTcount: A:0.27, C:0.23, G:0.13, T:0.37

Consensus pattern (158 bp):
TTATTTCCGAGGGGATACTCCAACCCCAGCTTTATTTCTAAAACATCGATTTTTCACAATCGGTG
ATACTCCAACCCCATCATTTCTGAGGGGATACTCCAACCCTGGCTTTATCTCCAAAACATTAATT
TTTCATAATCAGGGATACTTCAGCCTTG


Found at i:19253 original size:78 final size:78

Alignment explanation

Indices: 19026--19302  Score: 299
Period size: 79  Copynumber: 3.5  Consensus size: 78

     19016 CAAAATTGGA

                  *    * *
     19026 GATACTTAAACCCCATTATTTCTGAGGGGATACTCCAACCCC-AGCTTTATTTCTAAAACATCGA
         1 GATACTTCAACCTCGTTATTTCTGAGGGGATACTCCAACCCCGA-CTTTATTTCTAAAACATCGA

     19090 TTTTTCACAATCGAG
        65 TTTTTCACAATCG-G

                 *     * *      *                 * * *                 **
     19105 GATACTCCAACCCCATTATTTTTGAGGGGATACTCCAACTCTGGCTTTATTTCTAAAACATTAAT
         1 GATACTTCAACCTCGTTATTTCTGAGGGGATACTCCAACCCCGACTTTATTTCTAAAACATCGAT

                 *
     19170 TTTTCATAATCAGG
        66 TTTTCACAATC-GG

                    *   *        *            *        *
     19184 GATACTTCAGCCTTGTTATTTCCGA-GGGATACTCTAACCCCGATTTTATTTCCT-AAACATCGA
         1 GATACTTCAACCTCGTTATTTCTGAGGGGATACTCCAACCCCGACTTTATTT-CTAAAACATCGA

                   *
     19247 TTTTTCACCATCGG
        65 TTTTTCACAATCGG

                      *     *
     19261 TGATACTTCAATCTCGTCATTTCTGAGGGGATACTCCAACCC
         1 -GATACTTCAACCTCGTTATTTCTGAGGGGATACTCCAACCC

     19303 TGTTTTTATC

Statistics
Matches: 162,  Mismatches: 31,  Indels: 10
        0.80            0.15        0.05

Matches are distributed among these distances:
  77    2  0.01
  78   58  0.36
  79  101  0.62
  80    1  0.01

ACGTcount: A:0.27, C:0.25, G:0.14, T:0.34

Consensus pattern (78 bp):
GATACTTCAACCTCGTTATTTCTGAGGGGATACTCCAACCCCGACTTTATTTCTAAAACATCGAT
TTTTCACAATCGG


Found at i:21390 original size:12 final size:12

Alignment explanation

Indices: 21372--21403  Score: 55
Period size: 12  Copynumber: 2.7  Consensus size: 12

     21362 AAGGTGGTGT

     21372 ATTTATTTATTC
         1 ATTTATTTATTC

           *
     21384 TTTTATTTATTC
         1 ATTTATTTATTC

     21396 ATTTATTT
         1 ATTTATTT

     21404 TGTTTTGTTT

Statistics
Matches: 18,  Mismatches: 2,  Indels: 0
        0.90            0.10        0.00

Matches are distributed among these distances:
  12   18  1.00

ACGTcount: A:0.22, C:0.06, G:0.00, T:0.72

Consensus pattern (12 bp):
ATTTATTTATTC


Found at i:23894 original size:32 final size:30

Alignment explanation

Indices: 23858--23916  Score: 75
Period size: 32  Copynumber: 1.9  Consensus size: 30

     23848 GAAAACAAAT

     23858 ACAAAGAGCT-TAGAAAAATAATAACAATATGA
         1 ACAAA-AGCTCTAGAAAAAT-ATAA-AATATGA

                      *
     23890 ACAAAAGCTCTTGAAAAATATAAAATA
         1 ACAAAAGCTCTAGAAAAATATAAAATA

     23917 CTTTGATCTA

Statistics
Matches: 25,  Mismatches: 1,  Indels: 4
        0.83            0.03        0.13

Matches are distributed among these distances:
  30    4  0.16
  31    8  0.32
  32   13  0.52

ACGTcount: A:0.59, C:0.10, G:0.10, T:0.20

Consensus pattern (30 bp):
ACAAAAGCTCTAGAAAAATATAAAATATGA


Found at i:24215 original size:13 final size:13

Alignment explanation

Indices: 24197--24222  Score: 52
Period size: 13  Copynumber: 2.0  Consensus size: 13

     24187 AATTTTTTTG

     24197 TGTATCGATACAT
         1 TGTATCGATACAT

     24210 TGTATCGATACAT
         1 TGTATCGATACAT

     24223 ACTTGGTGTA

Statistics
Matches: 13,  Mismatches: 0,  Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  13   13  1.00

ACGTcount: A:0.31, C:0.15, G:0.15, T:0.38

Consensus pattern (13 bp):
TGTATCGATACAT


Found at i:26801 original size:13 final size:13

Alignment explanation

Indices: 26783--26820  Score: 67
Period size: 13  Copynumber: 2.9  Consensus size: 13

     26773 ATAATCCCCC

     26783 TGTATCGATACAG
         1 TGTATCGATACAG

                       *
     26796 TGTATCGATACAT
         1 TGTATCGATACAG

     26809 TGTATCGATACA
         1 TGTATCGATACA

     26821 AAGAAAAATG

Statistics
Matches: 24,  Mismatches: 1,  Indels: 0
        0.96            0.04        0.00

Matches are distributed among these distances:
  13   24  1.00

ACGTcount: A:0.32, C:0.16, G:0.18, T:0.34

Consensus pattern (13 bp):
TGTATCGATACAG


Found at i:28430 original size:15 final size:15

Alignment explanation

Indices: 28394--28431  Score: 58
Period size: 15  Copynumber: 2.5  Consensus size: 15

     28384 TTCATCAATT

                 *       *
     28394 TCATTTGGAGCTTCT
         1 TCATTTTGAGCTTCC

     28409 TCATTTTGAGCTTCC
         1 TCATTTTGAGCTTCC

     28424 TCATTTTG
         1 TCATTTTG

     28432 GACATTTTTA

Statistics
Matches: 21,  Mismatches: 2,  Indels: 0
        0.91            0.09        0.00

Matches are distributed among these distances:
  15   21  1.00

ACGTcount: A:0.13, C:0.21, G:0.16, T:0.50

Consensus pattern (15 bp):
TCATTTTGAGCTTCC


Found at i:30262 original size:21 final size:21

Alignment explanation

Indices: 30233--30277  Score: 56
Period size: 21  Copynumber: 2.1  Consensus size: 21

     30223 AAGTTTTTAT

                     *
     30233 TTTTCTTAGCTAAC-TCATTA
         1 TTTTCTTAGCCAACTTCATTA

                             *
     30253 TTTTCATTAGCCAACTTCTTTA
         1 TTTTC-TTAGCCAACTTCATTA

     30275 TTT
         1 TTT

     30278 CAACTTGCAA

Statistics
Matches: 21,  Mismatches: 2,  Indels: 2
        0.84            0.08        0.08

Matches are distributed among these distances:
  20    5  0.24
  21    8  0.38
  22    8  0.38

ACGTcount: A:0.22, C:0.20, G:0.04, T:0.53

Consensus pattern (21 bp):
TTTTCTTAGCCAACTTCATTA


Found at i:32194 original size:32 final size:30

Alignment explanation

Indices: 32158--32216  Score: 75
Period size: 32  Copynumber: 1.9  Consensus size: 30

     32148 GAAAACAAAT

     32158 ACAAAGAGCT-TAGAAAAATAATAACAATATGA
         1 ACAAA-AGCTCTAGAAAAAT-ATAA-AATATGA

                      *
     32190 ACAAAAGCTCTTGAAAAATATAAAATA
         1 ACAAAAGCTCTAGAAAAATATAAAATA

     32217 CTTTGATCTA

Statistics
Matches: 25,  Mismatches: 1,  Indels: 4
        0.83            0.03        0.13

Matches are distributed among these distances:
  30    4  0.16
  31    8  0.32
  32   13  0.52

ACGTcount: A:0.59, C:0.10, G:0.10, T:0.20

Consensus pattern (30 bp):
ACAAAAGCTCTAGAAAAATATAAAATATGA


Found at i:38185 original size:13 final size:13

Alignment explanation

Indices: 38167--38193  Score: 54
Period size: 13  Copynumber: 2.1  Consensus size: 13

     38157 ACATTTGTTC

     38167 ATGTATCGATACA
         1 ATGTATCGATACA

     38180 ATGTATCGATACA
         1 ATGTATCGATACA

     38193 A
         1 A

     38194 AGCATAATGT

Statistics
Matches: 14,  Mismatches: 0,  Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  13   14  1.00

ACGTcount: A:0.41, C:0.15, G:0.15, T:0.30

Consensus pattern (13 bp):
ATGTATCGATACA


Found at i:38274 original size:13 final size:13

Alignment explanation

Indices: 38256--38288  Score: 66
Period size: 13  Copynumber: 2.5  Consensus size: 13

     38246 ATAATATTCA

     38256 ATACAAAGTATCG
         1 ATACAAAGTATCG

     38269 ATACAAAGTATCG
         1 ATACAAAGTATCG

     38282 ATACAAA
         1 ATACAAA

     38289 TACCTATGTA

Statistics
Matches: 20,  Mismatches: 0,  Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  13   20  1.00

ACGTcount: A:0.52, C:0.15, G:0.12, T:0.21

Consensus pattern (13 bp):
ATACAAAGTATCG


Found at i:40833 original size:15 final size:13

Alignment explanation

Indices: 40797--40837  Score: 55
Period size: 13  Copynumber: 3.0  Consensus size: 13

     40787 ACATTTTTCT

     40797 TTGTATCGATACA
         1 TTGTATCGATACA

             *
     40810 TTCTATCGATACA
         1 TTGTATCGATACA

     40823 TAATGTATCGATACA
         1 T--TGTATCGATACA

     40838 GGGTAATTAT

Statistics
Matches: 24,  Mismatches: 2,  Indels: 2
        0.86            0.07        0.07

Matches are distributed among these distances:
  13   13  0.54
  15   11  0.46

ACGTcount: A:0.34, C:0.17, G:0.12, T:0.37

Consensus pattern (13 bp):
TTGTATCGATACA


Found at i:41001 original size:20 final size:20

Alignment explanation

Indices: 40958--41003  Score: 58
Period size: 21  Copynumber: 2.2  Consensus size: 20

     40948 AAATCTTTTG

     40958 CAAAATACTTGTTTTTCACTT
         1 CAAAATACTTGTTTTTCAC-T

               *
     40979 CAAATTACTTCGTTTTTCA-T
         1 CAAAATACTT-GTTTTTCACT

     40999 CAAAA
         1 CAAAA

     41004 CCAGCATCAA

Statistics
Matches: 22,  Mismatches: 2,  Indels: 3
        0.81            0.07        0.11

Matches are distributed among these distances:
  20    5  0.23
  21    9  0.41
  22    8  0.36

ACGTcount: A:0.33, C:0.20, G:0.04, T:0.43

Consensus pattern (20 bp):
CAAAATACTTGTTTTTCACT


Found at i:43411 original size:13 final size:13

Alignment explanation

Indices: 43393--43418  Score: 52
Period size: 13  Copynumber: 2.0  Consensus size: 13

     43383 TACAGCAAGT

     43393 ATGTATCGATACA
         1 ATGTATCGATACA

     43406 ATGTATCGATACA
         1 ATGTATCGATACA

     43419 CAGAAAATTG

Statistics
Matches: 13,  Mismatches: 0,  Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  13   13  1.00

ACGTcount: A:0.38, C:0.15, G:0.15, T:0.31

Consensus pattern (13 bp):
ATGTATCGATACA


Found at i:45026 original size:33 final size:33

Alignment explanation

Indices: 44984--45052  Score: 111
Period size: 33  Copynumber: 2.1  Consensus size: 33

     44974 AGTTGATCAA

                        *           *
     44984 TTCACTTTCGCAATGCATGGATGAGCACTTTAG
         1 TTCACTTTCGCAACGCATGGATGAGAACTTTAG

                       *
     45017 TTCACTTTCGCAGCGCATGGATGAGAACTTTAG
         1 TTCACTTTCGCAACGCATGGATGAGAACTTTAG

     45050 TTC
         1 TTC

     45053 TCTTAACCGA

Statistics
Matches: 33,  Mismatches: 3,  Indels: 0
        0.92            0.08        0.00

Matches are distributed among these distances:
  33   33  1.00

ACGTcount: A:0.23, C:0.22, G:0.22, T:0.33

Consensus pattern (33 bp):
TTCACTTTCGCAACGCATGGATGAGAACTTTAG


Done.