Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Scaffold930

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 41042
ACGTcount: A:0.32, C:0.20, G:0.17, T:0.32


Found at i:996 original size:34 final size:32

Alignment explanation

Indices: 958--1040 Score: 85 Period size: 34 Copynumber: 2.5 Consensus size: 32 948 AACTAAAGAG 958 CAAAGAAAAAAAAAAAAGAAGGCAAAAAACAAAA 1 CAAAGAAAAAAAAAAAA-AA-GCAAAAAACAAAA * * *** 992 CAAAGAAAAACAGGAGAAAAAGCAAAAAACACGC 1 CAAAGAAAAA-A-AAAAAAAAGCAAAAAACAAAA 1026 CAAAGAAAAAAAAAA 1 CAAAGAAAAAAAAAA 1041 CCAAACTCCA Statistics Matches: 40, Mismatches: 7, Indels: 6 0.75 0.13 0.11 Matches are distributed among these distances: 32 2 0.05 33 1 0.03 34 30 0.75 35 3 0.08 36 4 0.10 ACGTcount: A:0.75, C:0.12, G:0.13, T:0.00 Consensus pattern (32 bp): CAAAGAAAAAAAAAAAAAAGCAAAAAACAAAA Found at i:5557 original size:18 final size:19 Alignment explanation

Indices: 5515--5595 Score: 62 Period size: 18 Copynumber: 4.3 Consensus size: 19 5505 TCCATCTTCT * 5515 TCTCTCCTC-CTCCT-GTCA 1 TCTCTCCTCTCTCCTCCT-A * * 5533 TCCCTCCTCTCT-CTGCTA 1 TCTCTCCTCTCTCCTCCTA 5551 TCTCTCCTATCTCTCCTCCTA 1 TCTCTCC--TCTCTCCTCCTA * * 5572 TCTCTACTATCT-CTCCTA 1 TCTCTCCTCTCTCCTCCTA 5590 TCTCTC 1 TCTCTC 5596 TTTCTATCAT Statistics Matches: 51, Mismatches: 7, Indels: 10 0.75 0.10 0.15 Matches are distributed among these distances: 18 28 0.55 19 7 0.14 20 5 0.10 21 11 0.22 ACGTcount: A:0.09, C:0.47, G:0.02, T:0.42 Consensus pattern (19 bp): TCTCTCCTCTCTCCTCCTA Found at i:5560 original size:9 final size:9 Alignment explanation

Indices: 5536--5595 Score: 66 Period size: 9 Copynumber: 6.3 Consensus size: 9 5526 CCTGTCATCC * 5536 CTCCTCTCT 1 CTCCTATCT * 5545 CTGCTATCT 1 CTCCTATCT 5554 CTCCTATCTCT 1 CTCCTA--TCT 5565 CCTCCTATCT 1 -CTCCTATCT * 5575 CTACTATCT 1 CTCCTATCT 5584 CTCCTATCT 1 CTCCTATCT 5593 CTC 1 CTC 5596 TTTCTATCAT Statistics Matches: 43, Mismatches: 5, Indels: 6 0.80 0.09 0.11 Matches are distributed among these distances: 9 31 0.72 10 3 0.07 11 3 0.07 12 6 0.14 ACGTcount: A:0.10, C:0.45, G:0.02, T:0.43 Consensus pattern (9 bp): CTCCTATCT Found at i:5574 original size:21 final size:21 Alignment explanation

Indices: 5539--5627 Score: 83 Period size: 21 Copynumber: 4.2 Consensus size: 21 5529 GTCATCCCTC * * 5539 CTCTCTCTGCTATCTCTCCTAT 1 CTCTC-CTCCTATCTCTTCTAT * 5561 CTCTCCTCCTATCTCTACTAT 1 CTCTCCTCCTATCTCTTCTAT * * 5582 CTCTCCT-ATCTCTCTTTCTAT 1 CTCTCCTCCTATCTC-TTCTAT * 5603 CATC-CCTCCTATCTCTTCTTT 1 C-TCTCCTCCTATCTCTTCTAT 5624 CTCT 1 CTCT 5628 TTGAGGGCTA Statistics Matches: 55, Mismatches: 8, Indels: 9 0.76 0.11 0.12 Matches are distributed among these distances: 20 7 0.13 21 36 0.65 22 12 0.22 ACGTcount: A:0.10, C:0.42, G:0.01, T:0.47 Consensus pattern (21 bp): CTCTCCTCCTATCTCTTCTAT Found at i:5603 original size:30 final size:30 Alignment explanation

Indices: 5535--5595 Score: 104 Period size: 30 Copynumber: 2.0 Consensus size: 30 5525 TCCTGTCATC * * 5535 CCTCCTCTCTCTGCTATCTCTCCTATCTCT 1 CCTCCTATCTCTACTATCTCTCCTATCTCT 5565 CCTCCTATCTCTACTATCTCTCCTATCTCT 1 CCTCCTATCTCTACTATCTCTCCTATCTCT 5595 C 1 C 5596 TTTCTATCAT Statistics Matches: 29, Mismatches: 2, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 30 29 1.00 ACGTcount: A:0.10, C:0.46, G:0.02, T:0.43 Consensus pattern (30 bp): CCTCCTATCTCTACTATCTCTCCTATCTCT Found at i:11811 original size:28 final size:29 Alignment explanation

Indices: 11780--11842 Score: 94 Period size: 28 Copynumber: 2.2 Consensus size: 29 11770 CAAGGAAGTA * 11780 AAATCAA-CAAGAAATATGAAAGGAAAAG 1 AAATCAAGCAAGAAATATCAAAGGAAAAG * 11808 AAAT-AAGCAAGAAATTTCAAAGGAAAAG 1 AAATCAAGCAAGAAATATCAAAGGAAAAG 11836 AAATCAA 1 AAATCAA 11843 TAAAGAAGAA Statistics Matches: 31, Mismatches: 2, Indels: 3 0.86 0.06 0.08 Matches are distributed among these distances: 27 2 0.06 28 27 0.87 29 2 0.06 ACGTcount: A:0.63, C:0.08, G:0.16, T:0.13 Consensus pattern (29 bp): AAATCAAGCAAGAAATATCAAAGGAAAAG Found at i:12709 original size:58 final size:58 Alignment explanation

Indices: 12606--12716 Score: 168 Period size: 58 Copynumber: 1.9 Consensus size: 58 12596 AACAATAGGC * * * 12606 CCATAAATATATGCAAATTGGGCTCCACTCTTCTCGGACATGTGAATTGGGCTTCAAG 1 CCATAAATACATGCAAATTGGGCTCCACTCTTCTCGGAAAAGTGAATTGGGCTTCAAG * * * 12664 CCATGAATACATGCAAATTGGGCTTCACTCTTCTTGGAAAAGTGAATTGGGCT 1 CCATAAATACATGCAAATTGGGCTCCACTCTTCTCGGAAAAGTGAATTGGGCT 12717 CTGATGACAA Statistics Matches: 47, Mismatches: 6, Indels: 0 0.89 0.11 0.00 Matches are distributed among these distances: 58 47 1.00 ACGTcount: A:0.28, C:0.21, G:0.22, T:0.30 Consensus pattern (58 bp): CCATAAATACATGCAAATTGGGCTCCACTCTTCTCGGAAAAGTGAATTGGGCTTCAAG Found at i:16255 original size:82 final size:80 Alignment explanation

Indices: 16116--16270 Score: 208 Period size: 82 Copynumber: 1.9 Consensus size: 80 16106 CTAAACAAGA * * 16116 ACAATCCTATCCAATAAGGAAAAGTCTATTCCTTATCCTAATATGATTAGGACTTATCTAAACAA 1 ACAATCCTATCCAATAAGGAAAAGTCTATTCCTTATCCTAATATGATTAGAAATTATCTAAACAA * 16181 CTTAAAATCCTAAAC 66 ATTAAAATCCTAAAC * 16196 ACAATCCTAAT-CAATAAGGATAAAGTCTAACTT-CTTATCAC-AATCTGATTAGAAATTATCTA 1 ACAATCCT-ATCCAATAAGGA-AAAGTCT-A-TTCCTTATC-CTAATATGATTAGAAATTATCTA 16258 AACAAATTAAAAT 61 AACAAATTAAAAT 16271 TGAAAGGTCA Statistics Matches: 66, Mismatches: 4, Indels: 8 0.85 0.05 0.10 Matches are distributed among these distances: 80 17 0.26 81 9 0.14 82 37 0.56 83 3 0.05 ACGTcount: A:0.44, C:0.19, G:0.07, T:0.30 Consensus pattern (80 bp): ACAATCCTATCCAATAAGGAAAAGTCTATTCCTTATCCTAATATGATTAGAAATTATCTAAACAA ATTAAAATCCTAAAC Found at i:24602 original size:12 final size:11 Alignment explanation

Indices: 24580--24655 Score: 52 Period size: 12 Copynumber: 6.9 Consensus size: 11 24570 AGAGAAAGAA * 24580 GAGAGAGGAGG 1 GAGATAGGAGG 24591 GATGATAGGAGG 1 GA-GATAGGAGG * 24603 AGAGACAGGA-- 1 -GAGATAGGAGG * 24613 GAGATAGTA-- 1 GAGATAGGAGG 24622 GAGATAGGAGG 1 GAGATAGGAGG 24633 GATGATAGGAGG 1 GA-GATAGGAGG * 24645 AGAGACAGGAG 1 -GAGATAGGAG 24656 AGATAATAGA Statistics Matches: 53, Mismatches: 6, Indels: 11 0.76 0.09 0.16 Matches are distributed among these distances: 9 15 0.28 11 4 0.08 12 30 0.57 13 4 0.08 ACGTcount: A:0.39, C:0.03, G:0.49, T:0.09 Consensus pattern (11 bp): GAGATAGGAGG Found at i:24612 original size:24 final size:24 Alignment explanation

Indices: 24579--24664 Score: 85 Period size: 24 Copynumber: 3.8 Consensus size: 24 24569 AAGAGAAAGA * 24579 AGAGAGAGGAGGGATGATAGGAGG 1 AGAGACAGGAGGGATGATAGGAGG * 24603 AGAGACAGGA--GA-GAT---AGT 1 AGAGACAGGAGGGATGATAGGAGG * 24621 AGAGATAGGAGGGATGATAGGAGG 1 AGAGACAGGAGGGATGATAGGAGG * * 24645 AGAGACAGGAGAGATAATAG 1 AGAGACAGGAGGGATGATAG 24665 AGAGAGGAGG Statistics Matches: 49, Mismatches: 7, Indels: 12 0.72 0.10 0.18 Matches are distributed among these distances: 18 11 0.22 20 2 0.04 21 6 0.12 22 2 0.04 24 28 0.57 ACGTcount: A:0.42, C:0.02, G:0.45, T:0.10 Consensus pattern (24 bp): AGAGACAGGAGGGATGATAGGAGG Found at i:24631 original size:42 final size:42 Alignment explanation

Indices: 24570--24688 Score: 193 Period size: 42 Copynumber: 2.8 Consensus size: 42 24560 TAGCCCTCAA * * 24570 AGAGAAAGAAGAGAGAGGAGGGATGATAGGAGGAGAGACAGG 1 AGAGATAGTAGAGAGAGGAGGGATGATAGGAGGAGAGACAGG * 24612 AGAGATAGTAGAGATAGGAGGGATGATAGGAGGAGAGACAGG 1 AGAGATAGTAGAGAGAGGAGGGATGATAGGAGGAGAGACAGG * * 24654 AGAGATAATAGAGAGAGGAGGGATGACAGGAGGAG 1 AGAGATAGTAGAGAGAGGAGGGATGATAGGAGGAG 24689 GAGAGAAGAA Statistics Matches: 71, Mismatches: 6, Indels: 0 0.92 0.08 0.00 Matches are distributed among these distances: 42 71 1.00 ACGTcount: A:0.43, C:0.03, G:0.46, T:0.08 Consensus pattern (42 bp): AGAGATAGTAGAGAGAGGAGGGATGATAGGAGGAGAGACAGG Found at i:31178 original size:18 final size:18 Alignment explanation

Indices: 31148--31194 Score: 51 Period size: 18 Copynumber: 2.6 Consensus size: 18 31138 ATTTTTTTCT 31148 CTCCTC-CTCCTATCATCC 1 CTCCTCTCT-CTATCATCC * * 31166 CTCCTCTCTCTATTATCT 1 CTCCTCTCTCTATCATCC * 31184 CTCCTATCTCT 1 CTCCTCTCTCT 31195 CGTTCTGTCA Statistics Matches: 25, Mismatches: 3, Indels: 2 0.83 0.10 0.07 Matches are distributed among these distances: 18 23 0.92 19 2 0.08 ACGTcount: A:0.11, C:0.47, G:0.00, T:0.43 Consensus pattern (18 bp): CTCCTCTCTCTATCATCC Found at i:31191 original size:9 final size:9 Alignment explanation

Indices: 31179--31260 Score: 50 Period size: 9 Copynumber: 9.8 Consensus size: 9 31169 CTCTCTCTAT 31179 TATCTCTCC 1 TATCTCTCC * 31188 TATCTCTCG 1 TATCTCTCC * 31197 T-TCTGT-C 1 TATCTCTCC * * 31204 -ATCTGTTC 1 TATCTCTCC ** 31212 TATCTCTAT 1 TATCTCTCC 31221 TATCTCTCC 1 TATCTCTCC 31230 TATCTCTCC 1 TATCTCTCC * 31239 T-TCTAT-C 1 TATCTCTCC * 31246 -ATCCCTCC 1 TATCTCTCC 31254 TATCTCT 1 TATCTCT 31261 TCTTTCTCTT Statistics Matches: 55, Mismatches: 12, Indels: 12 0.70 0.15 0.15 Matches are distributed among these distances: 7 9 0.16 8 10 0.18 9 36 0.65 ACGTcount: A:0.12, C:0.37, G:0.04, T:0.48 Consensus pattern (9 bp): TATCTCTCC Found at i:31269 original size:42 final size:42 Alignment explanation

Indices: 31151--31260 Score: 157 Period size: 42 Copynumber: 2.6 Consensus size: 42 31141 TTTTTCTCTC * * 31151 CTCCTCCTATCATCCCTCCTCTCTCTATTATCTCTCCTATCT 1 CTCCTTCTATCATCCCTCCTATCTCTATTATCTCTCCTATCT * * ** * 31193 CTCGTTCTGTCATCTGTTCTATCTCTATTATCTCTCCTATCT 1 CTCCTTCTATCATCCCTCCTATCTCTATTATCTCTCCTATCT 31235 CTCCTTCTATCATCCCTCCTATCTCT 1 CTCCTTCTATCATCCCTCCTATCTCT 31261 TCTTTCTCTT Statistics Matches: 56, Mismatches: 12, Indels: 0 0.82 0.18 0.00 Matches are distributed among these distances: 42 56 1.00 ACGTcount: A:0.12, C:0.40, G:0.03, T:0.45 Consensus pattern (42 bp): CTCCTTCTATCATCCCTCCTATCTCTATTATCTCTCCTATCT Found at i:38243 original size:158 final size:156 Alignment explanation

Indices: 38075--38363 Score: 348 Period size: 156 Copynumber: 1.8 Consensus size: 156 38065 CCAAAGCTTT * * * 38075 AATACCTTGTATCCAAATAAGAG-GAACATGTGTTCGTTG-CTCAATCACAAAAAGACCAGAATT 1 AATACCTTGTATCCAAA-AAGAGAGAACATGT-TTCGTAGCCT-AATCACAAAAACAACAG-ATT * * * * 38138 ATTACGAGAATTCATGTATGTCTCATTTGGTAGACGAACTTTGATTCTAAGGGAAATGGCGAAAA 62 ATTACGAGAATGCATGTATATCTCATTTGATAGACGAACTTTGATTCTAAGGAAAATGGCGAAAA 38203 GAAAAGCTTTTCCAATGATACCAATGCATC 127 GAAAAGCTTTTCCAATGATACCAATGCATC * * * * * * 38233 AATATCTTGTATCCAAACAGGGAGAACATGTTTTGTAGCCTAATCACAAAAACAATAGTTTATTA 1 AATACCTTGTATCCAAAAAGAGAGAACATGTTTCGTAGCCTAATCACAAAAACAACAGATTATTA * * * * * * * 38298 CTAGAATGCATGTTTATCTCATTTGATAGATGAATTTTGATTCTATGTAAAATGGTGAAAAGAAA 66 CGAGAATGCATGTATATCTCATTTGATAGACGAACTTTGATTCTAAGGAAAATGGCGAAAAGAAA 38363 A 131 A 38364 AGCTCTTCTT Statistics Matches: 109, Mismatches: 20, Indels: 6 0.81 0.15 0.04 Matches are distributed among these distances: 156 61 0.56 157 22 0.20 158 26 0.24 ACGTcount: A:0.38, C:0.15, G:0.17, T:0.30 Consensus pattern (156 bp): AATACCTTGTATCCAAAAAGAGAGAACATGTTTCGTAGCCTAATCACAAAAACAACAGATTATTA CGAGAATGCATGTATATCTCATTTGATAGACGAACTTTGATTCTAAGGAAAATGGCGAAAAGAAA AGCTTTTCCAATGATACCAATGCATC Found at i:38800 original size:20 final size:20 Alignment explanation

Indices: 38763--38800 Score: 58 Period size: 20 Copynumber: 1.9 Consensus size: 20 38753 TGGAGATTAT * 38763 TACTAGAATCCATGTTTGTC 1 TACTAGAATCCATATTTGTC * 38783 TACTAGAATCTATATTTG 1 TACTAGAATCCATATTTG 38801 CCTCATTTGA Statistics Matches: 16, Mismatches: 2, Indels: 0 0.89 0.11 0.00 Matches are distributed among these distances: 20 16 1.00 ACGTcount: A:0.29, C:0.16, G:0.13, T:0.42 Consensus pattern (20 bp): TACTAGAATCCATATTTGTC Found at i:39918 original size:159 final size:158 Alignment explanation

Indices: 39557--40135 Score: 357 Period size: 159 Copynumber: 3.7 Consensus size: 158 39547 GATAAGAATG * ** * * * * 39557 GAGATTATTATGAGAATA-CACATTCATCTCATTTAAGGGATGAATTTTTATTCTAAGGAAATGT 1 GAGATTATTATGAGAA-ACCATATTTGTCTCATTTAAGGCATGAACTTTGATTCTAAGGAAATGG * * * * 39621 TGGAAAAAAAAGCTTTTCTTGCAAAACCAAAGCATTAATATCATGTATCCAAATGAGAAAAATGT 65 TGGAAAAAAAAGCGTTTCTTGCAAAACCAAAGCATCAATATCATGTATCCAAATAAGAAAAATGC * * * 39686 GCTTTTGCTGCCTAACCATGAATAGACTA 130 ACTTTTGCTGCCCAACCATGAAAAGACTA * ** * * * * 39715 GAGATTACTACAAGAATCCATGTTTGTCTTATTTAAGGGTATGAACTTTGATTCTAAGGAATATG 1 GAGATTATTATGAGAAACCATATTTGTCTCATTTAA-GGCATGAACTTTGATTCTAAGGAA-ATG * * * * * * *** 39780 GTGGAAAAAGATG-GTTTCTTGCAAAAGCAAAGTATCAATATCGTGTATCCTAATAAGGGGAATG 64 GTGGAAAAAAAAGCGTTTCTTGCAAAACCAAAGCATCAATATCATGTATCCAAATAAGAAAAATG * ** ** 39844 CATTTTTTTTGCCCAATTATTGAAAAGACTA 129 CACTTTTGCTGCCCAACCA-TGAAAAGACTA * * * * * 39875 GAGATTATTATGAGAAACCATATTTGTCTCATTTGAGGCATGAA-TTT-ACATCTTAGAAAAATA 1 GAGATTATTATGAGAAACCATATTTGTCTCATTTAAGGCATGAACTTTGA-TTCTAAG-GAAATG * * ** ** * * * * * 39938 GT-GAAAACAAAATCATTTCTTGGGAAATTAAAGCATCATTATCATGTATCCCAATGACAAGAA- 64 GTGGAAAA-AAAAGCGTTTCTTGCAAAACCAAAGCATCAATATCATGTATCCAAATAAGAAAAAT * * * * 40001 -CACTTTTGCAGTGCCAAATCTAT-AAAAGAATG 128 GCACTTTTGC--TGCCCAA-CCATGAAAAGACTA ** * * * * * * ** 40033 GAGATTATTACAAGAATCCATCTTTGCCTCATTTAAGGGATTAACTATGA-TCTAAGGGAAATCA 1 GAGATTATTATGAGAAACCATATTTGTCTCATTTAAGGCATGAACTTTGATTCTAA-GGAAATGG * * * ** * 40097 TGGAAAGAAAAGCCTTTTTTGTGAAACCAAAGTATCAAT 65 TGGAAAAAAAAGCGTTTCTTGCAAAACCAAAGCATCAAT 40136 TACTGTTTTA Statistics Matches: 315, Mismatches: 91, Indels: 30 0.72 0.21 0.07 Matches are distributed among these distances: 157 12 0.04 158 116 0.37 159 132 0.42 160 55 0.17 ACGTcount: A:0.38, C:0.14, G:0.17, T:0.32 Consensus pattern (158 bp): GAGATTATTATGAGAAACCATATTTGTCTCATTTAAGGCATGAACTTTGATTCTAAGGAAATGGT GGAAAAAAAAGCGTTTCTTGCAAAACCAAAGCATCAATATCATGTATCCAAATAAGAAAAATGCA CTTTTGCTGCCCAACCATGAAAAGACTA Found at i:40191 original size:160 final size:158 Alignment explanation

Indices: 40012--40383 Score: 395 Period size: 160 Copynumber: 2.3 Consensus size: 158 40002 ACTTTTGCAG * * 40012 TGCCAAATCTAT-AAAAGAATGGAGATTATTACAAGAATCCATCTTTGCCTCATTTAAGGGATT- 1 TGCCAAATCT-TGAAAAGAATGGAGATTATTACAAGAATCCATGTTTGCCTCATTTAAGCGATTG * ** 40075 AACTATG-ATCTA-AGGGAAATCATGG-A-AAGAAAAGCCTTTTTTGTGAAACCAAAGTATCAAT 65 AACTATGTAT-TATAAGGAAA--ATGGTAGAAGAAAAGCCTTTTTTGTGAAACCAAAACATCAA- * 40136 TACTGTTTTATCCCAATTCGGAAAACAT-TTTTTGT 126 TACTG-TATATCCCAATT-GGAAAACATGTTTTT-T ** * 40171 TGCCAAAATCTTGAAAAGAATTCAGATTATTACATGAATCCATGTTT-CTCTCATTTAAGCGATT 1 TGCC-AAATCTTGAAAAGAATGGAGATTATTACAAGAATCCATGTTTGC-CTCATTTAAGCGATT * * * * 40235 GAACTTTGTATTATAAGGAAAATGGTAGAAGAAAAGCTTTTTTTGTGAAACTAAAACATCAATAT 64 GAACTATGTATTATAAGGAAAATGGTAGAAGAAAAGCCTTTTTTGTGAAACCAAAACATCAATAC * * 40300 TGTATATCCTAATTGGAGAACATGTTTTTT 129 TGTATATCCCAATTGGAAAACATGTTTTTT * * * * * 40330 AGCCTAATCATGAAAA-AATGGAGATT-TATATAAGAATCCATGTTTGTCTCATTT 1 TGCCAAATCTTGAAAAGAATGGAGATTAT-TACAAGAATCCATGTTTGCCTCATTT 40384 GAGAGACAAA Statistics Matches: 179, Mismatches: 23, Indels: 24 0.79 0.10 0.11 Matches are distributed among these distances: 156 1 0.01 157 30 0.17 158 10 0.06 159 18 0.10 160 69 0.39 161 13 0.07 162 38 0.21 ACGTcount: A:0.37, C:0.13, G:0.15, T:0.35 Consensus pattern (158 bp): TGCCAAATCTTGAAAAGAATGGAGATTATTACAAGAATCCATGTTTGCCTCATTTAAGCGATTGA ACTATGTATTATAAGGAAAATGGTAGAAGAAAAGCCTTTTTTGTGAAACCAAAACATCAATACTG TATATCCCAATTGGAAAACATGTTTTTT Done.