Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Scaffold1316

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 40060
ACGTcount: A:0.32, C:0.19, G:0.19, T:0.31


Found at i:1375 original size:1 final size:1

Alignment explanation

Indices: 1369--1402 Score: 50 Period size: 1 Copynumber: 34.0 Consensus size: 1 1359 AAACCCAAAG ** 1369 AAAAAAAAAAAAAAAAAAAAAAACCAAAAAAAAA 1 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 1403 CAAATGATAT Statistics Matches: 31, Mismatches: 2, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 1 31 1.00 ACGTcount: A:0.94, C:0.06, G:0.00, T:0.00 Consensus pattern (1 bp): A Found at i:1403 original size:11 final size:11 Alignment explanation

Indices: 1371--1406 Score: 54 Period size: 11 Copynumber: 3.3 Consensus size: 11 1361 ACCCAAAGAA * 1371 AAAAAAAAAAA 1 AAAAAAAAAAC 1382 AAAAAAAAAAC 1 AAAAAAAAAAC * 1393 CAAAAAAAAAC 1 AAAAAAAAAAC 1404 AAA 1 AAA 1407 TGATATTTAC Statistics Matches: 22, Mismatches: 3, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 11 22 1.00 ACGTcount: A:0.92, C:0.08, G:0.00, T:0.00 Consensus pattern (11 bp): AAAAAAAAAAC Found at i:1406 original size:10 final size:10 Alignment explanation

Indices: 1364--1406 Score: 54 Period size: 10 Copynumber: 4.4 Consensus size: 10 1354 GGAGAAAACC 1364 CAAAGAAAAAA 1 CAAA-AAAAAA 1375 -AAAAAAAAA 1 CAAAAAAAAA * 1384 -AAAAAAAAC 1 CAAAAAAAAA 1393 CAAAAAAAAA 1 CAAAAAAAAA 1403 CAAA 1 CAAA 1407 TGATATTTAC Statistics Matches: 29, Mismatches: 2, Indels: 3 0.85 0.06 0.09 Matches are distributed among these distances: 9 14 0.48 10 15 0.52 ACGTcount: A:0.88, C:0.09, G:0.02, T:0.00 Consensus pattern (10 bp): CAAAAAAAAA Found at i:9798 original size:14 final size:14 Alignment explanation

Indices: 9776--9823 Score: 69 Period size: 14 Copynumber: 3.4 Consensus size: 14 9766 ACCTAAACCG * * 9776 ACCAATTTGTTCAT 1 ACCATTTTGTTCCT 9790 ACCATTTTGTTCCT 1 ACCATTTTGTTCCT * 9804 ACCCTTTTGTTCCT 1 ACCATTTTGTTCCT 9818 ACCATT 1 ACCATT 9824 CCATTCGTAC Statistics Matches: 30, Mismatches: 4, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 14 30 1.00 ACGTcount: A:0.19, C:0.29, G:0.06, T:0.46 Consensus pattern (14 bp): ACCATTTTGTTCCT Found at i:11494 original size:21 final size:20 Alignment explanation

Indices: 11460--11527 Score: 75 Period size: 21 Copynumber: 3.2 Consensus size: 20 11450 ACATTCTTGT 11460 AAAGAGAAAA-CAAAGAAAAGA 1 AAAGA-AAAAGCAAA-AAAAGA * 11481 AAAGAAAAAGCAAAAGAAGAA 1 AAAGAAAAAGCAAAAAAAG-A * 11502 AAAGAAAAAGAAATAAAAAGA 1 AAAGAAAAAGCAA-AAAAAGA 11523 AAAGA 1 AAAGA 11528 GAGGCAAGAG Statistics Matches: 41, Mismatches: 3, Indels: 6 0.82 0.06 0.12 Matches are distributed among these distances: 20 8 0.20 21 28 0.68 22 5 0.12 ACGTcount: A:0.78, C:0.03, G:0.18, T:0.01 Consensus pattern (20 bp): AAAGAAAAAGCAAAAAAAGA Found at i:11495 original size:6 final size:5 Alignment explanation

Indices: 11471--11527 Score: 55 Period size: 5 Copynumber: 11.0 Consensus size: 5 11461 AAGAGAAAAC * 11471 AAAGA AAAGA AAAGAA AAAGCA AAAGA AGA-A AAAGAA AAAGA AATA-A 1 AAAGA AAAGA AAAG-A AAAG-A AAAGA AAAGA AAAG-A AAAGA AA-AGA 11518 AAAGA AAAGA 1 AAAGA AAAGA 11528 GAGGCAAGAG Statistics Matches: 44, Mismatches: 3, Indels: 10 0.77 0.05 0.18 Matches are distributed among these distances: 4 4 0.09 5 24 0.55 6 16 0.36 ACGTcount: A:0.79, C:0.02, G:0.18, T:0.02 Consensus pattern (5 bp): AAAGA Found at i:11520 original size:25 final size:25 Alignment explanation

Indices: 11475--11523 Score: 64 Period size: 27 Copynumber: 1.9 Consensus size: 25 11465 GAAAACAAAG 11475 AAAAGAAAAGAAAAAGCAAAAGAAGA 1 AAAAGAAAAGAAAAA-CAAAAGAAGA * 11501 AAAAGAAAAAGAAATA-AAAAGAA 1 AAAAG-AAAAGAAAAACAAAAGAA 11524 AAGAGAGGCA Statistics Matches: 21, Mismatches: 1, Indels: 3 0.84 0.04 0.12 Matches are distributed among these distances: 25 7 0.33 26 5 0.24 27 9 0.43 ACGTcount: A:0.80, C:0.02, G:0.16, T:0.02 Consensus pattern (25 bp): AAAAGAAAAGAAAAACAAAAGAAGA Found at i:12148 original size:24 final size:24 Alignment explanation

Indices: 12116--12161 Score: 92 Period size: 24 Copynumber: 1.9 Consensus size: 24 12106 CTGATGTTTT 12116 TGATGTGAGCTTGCCTACGAACAA 1 TGATGTGAGCTTGCCTACGAACAA 12140 TGATGTGAGCTTGCCTACGAAC 1 TGATGTGAGCTTGCCTACGAAC 12162 CAAATGGAGA Statistics Matches: 22, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 24 22 1.00 ACGTcount: A:0.26, C:0.22, G:0.26, T:0.26 Consensus pattern (24 bp): TGATGTGAGCTTGCCTACGAACAA Found at i:13667 original size:12 final size:12 Alignment explanation

Indices: 13652--13681 Score: 51 Period size: 12 Copynumber: 2.5 Consensus size: 12 13642 ATTGTAGATT 13652 AAAAAAATCGAA 1 AAAAAAATCGAA * 13664 AAAAAAATTGAA 1 AAAAAAATCGAA 13676 AAAAAA 1 AAAAAA 13682 TTAAAAAAAA Statistics Matches: 17, Mismatches: 1, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 12 17 1.00 ACGTcount: A:0.80, C:0.03, G:0.07, T:0.10 Consensus pattern (12 bp): AAAAAAATCGAA Found at i:13689 original size:12 final size:11 Alignment explanation

Indices: 13652--13691 Score: 53 Period size: 11 Copynumber: 3.5 Consensus size: 11 13642 ATTGTAGATT * 13652 AAAAAAATCGAA 1 AAAAAAAT-TAA * 13664 AAAAAAATTGA 1 AAAAAAATTAA 13675 AAAAAAATTAA 1 AAAAAAATTAA 13686 AAAAAA 1 AAAAAA 13692 CAAATTTTTT Statistics Matches: 25, Mismatches: 3, Indels: 1 0.86 0.10 0.03 Matches are distributed among these distances: 11 17 0.68 12 8 0.32 ACGTcount: A:0.80, C:0.03, G:0.05, T:0.12 Consensus pattern (11 bp): AAAAAAATTAA Found at i:14830 original size:20 final size:20 Alignment explanation

Indices: 14807--14860 Score: 63 Period size: 20 Copynumber: 2.7 Consensus size: 20 14797 AGTTTTTCCC * 14807 AGCTCGATTTAGCTCACATG 1 AGCTCAATTTAGCTCACATG * *** 14827 AGCTTAATTTAGCTCGTTTG 1 AGCTCAATTTAGCTCACATG 14847 AGCTCAATTTAGCT 1 AGCTCAATTTAGCT 14861 TACTTTAGCT Statistics Matches: 28, Mismatches: 6, Indels: 0 0.82 0.18 0.00 Matches are distributed among these distances: 20 28 1.00 ACGTcount: A:0.24, C:0.20, G:0.19, T:0.37 Consensus pattern (20 bp): AGCTCAATTTAGCTCACATG Found at i:14842 original size:30 final size:30 Alignment explanation

Indices: 14807--14880 Score: 98 Period size: 30 Copynumber: 2.5 Consensus size: 30 14797 AGTTTTTCCC 14807 AGCTCGATTT-AGCTCACA-TGAGCTTAATTT 1 AGCTCG-TTTGAGCTCA-ATTGAGCTTAATTT * * 14837 AGCTCGTTTGAGCTCAATTTAGCTTACTTT 1 AGCTCGTTTGAGCTCAATTGAGCTTAATTT 14867 AGCTCGTTTGAGCT 1 AGCTCGTTTGAGCT 14881 TGGCTTAAGT Statistics Matches: 40, Mismatches: 2, Indels: 4 0.87 0.04 0.09 Matches are distributed among these distances: 29 4 0.10 30 36 0.90 ACGTcount: A:0.22, C:0.20, G:0.19, T:0.39 Consensus pattern (30 bp): AGCTCGTTTGAGCTCAATTGAGCTTAATTT Found at i:14870 original size:20 final size:20 Alignment explanation

Indices: 14807--14871 Score: 53 Period size: 20 Copynumber: 3.2 Consensus size: 20 14797 AGTTTTTCCC * * * * 14807 AGCTCGATTTAGCTCACATG 1 AGCTCAATTTAGCTTACTTT * 14827 AGCTTAATTTAGC-T-CGTTT 1 AGCTCAATTTAGCTTAC-TTT 14846 GAGCTCAATTTAGCTTACTTT 1 -AGCTCAATTTAGCTTACTTT 14867 AGCTC 1 AGCTC 14872 GTTTGAGCTT Statistics Matches: 35, Mismatches: 6, Indels: 8 0.71 0.12 0.16 Matches are distributed among these distances: 18 1 0.03 19 1 0.03 20 28 0.80 21 4 0.11 22 1 0.03 ACGTcount: A:0.23, C:0.22, G:0.17, T:0.38 Consensus pattern (20 bp): AGCTCAATTTAGCTTACTTT Found at i:17626 original size:17 final size:18 Alignment explanation

Indices: 17587--17628 Score: 52 Period size: 17 Copynumber: 2.4 Consensus size: 18 17577 AAGAAGAAAA 17587 ACAAAA-AGATGAGTGAT 1 ACAAAAGAGATGAGTGAT * 17604 AAAAAAGAGA-GAGTGAT 1 ACAAAAGAGATGAGTGAT * 17621 TCAAAAGA 1 ACAAAAGA 17629 AAAAGAAATG Statistics Matches: 21, Mismatches: 3, Indels: 2 0.81 0.12 0.08 Matches are distributed among these distances: 17 18 0.86 18 3 0.14 ACGTcount: A:0.57, C:0.05, G:0.24, T:0.14 Consensus pattern (18 bp): ACAAAAGAGATGAGTGAT Found at i:20436 original size:24 final size:24 Alignment explanation

Indices: 20404--20451 Score: 96 Period size: 24 Copynumber: 2.0 Consensus size: 24 20394 AGAGGAGGGG 20404 AATGATGTGAGCTTGCCTACGAAC 1 AATGATGTGAGCTTGCCTACGAAC 20428 AATGATGTGAGCTTGCCTACGAAC 1 AATGATGTGAGCTTGCCTACGAAC 20452 CAAATGGAGA Statistics Matches: 24, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 24 24 1.00 ACGTcount: A:0.29, C:0.21, G:0.25, T:0.25 Consensus pattern (24 bp): AATGATGTGAGCTTGCCTACGAAC Found at i:23583 original size:30 final size:29 Alignment explanation

Indices: 23547--23645 Score: 92 Period size: 30 Copynumber: 3.3 Consensus size: 29 23537 CGAGCTCACT 23547 CCTAGCTCATA-TTCAGCTCACGAGCTAAA 1 CCTAGCTCA-ACTTCAGCTCACGAGCTAAA * * * * * 23576 CCATAGCTCAACTTCAGCTTAGGAGTTTAG 1 CC-TAGCTCAACTTCAGCTCACGAGCTAAA * 23606 CCTCAGCTCAACTTTAGCTCACGAGCTAAA 1 CCT-AGCTCAACTTCAGCTCACGAGCTAAA * 23636 GCTTAGCTCA 1 -CCTAGCTCA 23646 TTTTAGTTTA Statistics Matches: 54, Mismatches: 12, Indels: 7 0.74 0.16 0.10 Matches are distributed among these distances: 29 4 0.07 30 48 0.89 31 2 0.04 ACGTcount: A:0.28, C:0.29, G:0.16, T:0.26 Consensus pattern (29 bp): CCTAGCTCAACTTCAGCTCACGAGCTAAA Found at i:25388 original size:30 final size:30 Alignment explanation

Indices: 25354--25438 Score: 80 Period size: 30 Copynumber: 2.8 Consensus size: 30 25344 TAAACTAAAA * 25354 TGAGCTAAGCTTTAGCTCGTGAGCTAAAGT 1 TGAGCTAAGATTTAGCTCGTGAGCTAAAGT * * * * * * * 25384 TGAGCTGAGATTAAACTCCTAAGTTGAAGT 1 TGAGCTAAGATTTAGCTCGTGAGCTAAAGT * * 25414 TGAGCTAAGGTTTAGCTCGCGAGCT 1 TGAGCTAAGATTTAGCTCGTGAGCT 25439 GAATATGAGC Statistics Matches: 39, Mismatches: 16, Indels: 0 0.71 0.29 0.00 Matches are distributed among these distances: 30 39 1.00 ACGTcount: A:0.27, C:0.16, G:0.27, T:0.29 Consensus pattern (30 bp): TGAGCTAAGATTTAGCTCGTGAGCTAAAGT Found at i:26462 original size:30 final size:30 Alignment explanation

Indices: 26428--26524 Score: 106 Period size: 30 Copynumber: 3.2 Consensus size: 30 26418 AGCTCACTCC 26428 TAGCTCATA-TTTAGCTCACGAGCTAAACCT 1 TAGCTCA-ACTTTAGCTCACGAGCTAAACCT * * * * * * 26458 TAGCTCAACTTCAGCTTAGGAGTTTAGCCT 1 TAGCTCAACTTTAGCTCACGAGCTAAACCT * * 26488 CAGCTCAACTTTAGCTCACGAGCTAAAGCT 1 TAGCTCAACTTTAGCTCACGAGCTAAACCT 26518 TAGCTCA 1 TAGCTCA 26525 TTTTAGTTTA Statistics Matches: 51, Mismatches: 15, Indels: 2 0.75 0.22 0.03 Matches are distributed among these distances: 29 1 0.02 30 50 0.98 ACGTcount: A:0.28, C:0.27, G:0.16, T:0.29 Consensus pattern (30 bp): TAGCTCAACTTTAGCTCACGAGCTAAACCT Found at i:28234 original size:9 final size:10 Alignment explanation

Indices: 28219--28258 Score: 53 Period size: 10 Copynumber: 4.0 Consensus size: 10 28209 AAGAAATTCG * 28219 AAAAAAAATT 1 AAAAAAAATC 28229 AAAAAAAATC 1 AAAAAAAATC ** 28239 AAAAAAAAAA 1 AAAAAAAATC 28249 AAAAAAAATC 1 AAAAAAAATC 28259 GAAGTAAAAA Statistics Matches: 25, Mismatches: 5, Indels: 0 0.83 0.17 0.00 Matches are distributed among these distances: 10 25 1.00 ACGTcount: A:0.85, C:0.05, G:0.00, T:0.10 Consensus pattern (10 bp): AAAAAAAATC Found at i:28234 original size:10 final size:10 Alignment explanation

Indices: 28219--28257 Score: 53 Period size: 9 Copynumber: 3.9 Consensus size: 10 28209 AAGAAATTCG * 28219 AAAAAAAATT 1 AAAAAAAATA 28229 AAAAAAAATCA 1 AAAAAAAAT-A 28240 AAAAAAAA-A 1 AAAAAAAATA 28249 AAAAAAAAT 1 AAAAAAAAT 28258 CGAAGTAAAA Statistics Matches: 26, Mismatches: 1, Indels: 4 0.84 0.03 0.13 Matches are distributed among these distances: 9 9 0.35 10 9 0.35 11 8 0.31 ACGTcount: A:0.87, C:0.03, G:0.00, T:0.10 Consensus pattern (10 bp): AAAAAAAATA Found at i:28242 original size:20 final size:20 Alignment explanation

Indices: 28219--28275 Score: 69 Period size: 20 Copynumber: 2.7 Consensus size: 20 28209 AAGAAATTCG ** 28219 AAAAAAAATTAAAAAAAATC 1 AAAAAAAAAAAAAAAAAATC 28239 AAAAAAAAAAAAAAAAAATC 1 AAAAAAAAAAAAAAAAAATC 28259 GAAGTAAAAAAAAAAAA 1 -AA--AAAAAAAAAAAA 28276 GTGAAAAGTC Statistics Matches: 32, Mismatches: 2, Indels: 3 0.86 0.05 0.08 Matches are distributed among these distances: 20 18 0.56 21 2 0.06 23 12 0.38 ACGTcount: A:0.84, C:0.04, G:0.04, T:0.09 Consensus pattern (20 bp): AAAAAAAAAAAAAAAAAATC Found at i:29205 original size:37 final size:37 Alignment explanation

Indices: 29154--29224 Score: 101 Period size: 37 Copynumber: 1.9 Consensus size: 37 29144 CATTCTTGTA 29154 AAGAGAAAACAAAGAAAA-GAAAAGAAAAAGAAAAAGC 1 AAGAGAAAACAAAGAAAATG-AAAGAAAAAGAAAAAGC * 29191 AAGAGAAGAA-AAAGAAAATGAAATAAAAAGAAAA 1 AAGAGAA-AACAAAGAAAATGAAAGAAAAAGAAAA 29225 GAGAGGCAAG Statistics Matches: 31, Mismatches: 1, Indels: 4 0.86 0.03 0.11 Matches are distributed among these distances: 37 28 0.90 38 3 0.10 ACGTcount: A:0.76, C:0.03, G:0.18, T:0.03 Consensus pattern (37 bp): AAGAGAAAACAAAGAAAATGAAAGAAAAAGAAAAAGC Found at i:29224 original size:6 final size:6 Alignment explanation

Indices: 29164--29213 Score: 50 Period size: 6 Copynumber: 8.2 Consensus size: 6 29154 AAGAGAAAAC * 29164 AAAG-A AAAG-A AAAGAA AAAGAA AAAGCAA GAGAAGAA AAAGAA AATGAA 1 AAAGAA AAAGAA AAAGAA AAAGAA AAAG-AA -A-AAGAA AAAGAA AAAGAA 29213 A 1 A 29214 TAAAAAGAAA Statistics Matches: 40, Mismatches: 1, Indels: 7 0.83 0.02 0.15 Matches are distributed among these distances: 5 9 0.22 6 22 0.55 7 3 0.08 8 3 0.08 9 3 0.08 ACGTcount: A:0.76, C:0.02, G:0.20, T:0.02 Consensus pattern (6 bp): AAAGAA Found at i:31470 original size:30 final size:30 Alignment explanation

Indices: 31436--31532 Score: 106 Period size: 30 Copynumber: 3.2 Consensus size: 30 31426 AGCTCACTCC 31436 TAGCTCATA-TTTAGCTCACGAGCTAAACCT 1 TAGCTCA-ACTTTAGCTCACGAGCTAAACCT * * * * * * 31466 TAGCTCAACTTCAGCTTAGGAGTTTAGCCT 1 TAGCTCAACTTTAGCTCACGAGCTAAACCT * * 31496 CAGCTCAACTTTAGCTCACGAGCTAAAGCT 1 TAGCTCAACTTTAGCTCACGAGCTAAACCT 31526 TAGCTCA 1 TAGCTCA 31533 TTTTAGTTTA Statistics Matches: 51, Mismatches: 15, Indels: 2 0.75 0.22 0.03 Matches are distributed among these distances: 29 1 0.02 30 50 0.98 ACGTcount: A:0.28, C:0.27, G:0.16, T:0.29 Consensus pattern (30 bp): TAGCTCAACTTTAGCTCACGAGCTAAACCT Found at i:35335 original size:30 final size:30 Alignment explanation

Indices: 35301--35397 Score: 90 Period size: 30 Copynumber: 3.2 Consensus size: 30 35291 TAAACTAAAA 35301 TGAGCTAAGCTTTAGCTCGTGAGCTAAAGT 1 TGAGCTAAGCTTTAGCTCGTGAGCTAAAGT * * * * * * 35331 TGAGCTGAGGC-TAAACTCCTAAGCTGAAGT 1 TGAGCT-AAGCTTTAGCTCGTGAGCTAAAGT * * 35361 TGAGCTAAGGTTTAGCTCGTGAGTTGAAAG- 1 TGAGCTAAGCTTTAGCTCGTGAGCT-AAAGT 35391 TGAGCTA 1 TGAGCTA 35398 GGAGTGAGCT Statistics Matches: 50, Mismatches: 14, Indels: 6 0.71 0.20 0.09 Matches are distributed among these distances: 29 2 0.04 30 42 0.84 31 6 0.12 ACGTcount: A:0.28, C:0.15, G:0.29, T:0.28 Consensus pattern (30 bp): TGAGCTAAGCTTTAGCTCGTGAGCTAAAGT Found at i:38282 original size:13 final size:13 Alignment explanation

Indices: 38264--38288 Score: 50 Period size: 13 Copynumber: 1.9 Consensus size: 13 38254 AACCTACAAA 38264 AATTAAATAACAT 1 AATTAAATAACAT 38277 AATTAAATAACA 1 AATTAAATAACA 38289 CACAACTATT Statistics Matches: 12, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 13 12 1.00 ACGTcount: A:0.64, C:0.08, G:0.00, T:0.28 Consensus pattern (13 bp): AATTAAATAACAT Found at i:38534 original size:21 final size:21 Alignment explanation

Indices: 38510--38549 Score: 71 Period size: 21 Copynumber: 1.9 Consensus size: 21 38500 GAAAAAGGAA * 38510 TGAGCTGAAACGAGCTAAATT 1 TGAGCTCAAACGAGCTAAATT 38531 TGAGCTCAAACGAGCTAAA 1 TGAGCTCAAACGAGCTAAA 38550 ACGAGCTCAA Statistics Matches: 18, Mismatches: 1, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 21 18 1.00 ACGTcount: A:0.40, C:0.17, G:0.23, T:0.20 Consensus pattern (21 bp): TGAGCTCAAACGAGCTAAATT Found at i:40028 original size:14 final size:14 Alignment explanation

Indices: 39986--40030 Score: 56 Period size: 14 Copynumber: 3.3 Consensus size: 14 39976 TTAAAGAAGC 39986 AACTCATTAAATTA 1 AACTCATTAAATTA * * 40000 AATTCATCAAA-TA 1 AACTCATTAAATTA * 40013 AACTCATTTAATTA 1 AACTCATTAAATTA 40027 AACT 1 AACT 40031 AAGATGAGTT Statistics Matches: 25, Mismatches: 5, Indels: 2 0.78 0.16 0.06 Matches are distributed among these distances: 13 10 0.40 14 15 0.60 ACGTcount: A:0.49, C:0.16, G:0.00, T:0.36 Consensus pattern (14 bp): AACTCATTAAATTA Done.