Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Scaffold1664

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 20208
ACGTcount: A:0.32, C:0.15, G:0.19, T:0.34


Found at i:2978 original size:13 final size:13

Alignment explanation

Indices: 2960--2985 Score: 52 Period size: 13 Copynumber: 2.0 Consensus size: 13 2950 CAATTTTTTG 2960 TGTATCGATACAT 1 TGTATCGATACAT 2973 TGTATCGATACAT 1 TGTATCGATACAT 2986 ACTTTGGTGT Statistics Matches: 13, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 13 13 1.00 ACGTcount: A:0.31, C:0.15, G:0.15, T:0.38 Consensus pattern (13 bp): TGTATCGATACAT Found at i:2982 original size:33 final size:33 Alignment explanation

Indices: 2940--3004 Score: 96 Period size: 33 Copynumber: 2.0 Consensus size: 33 2930 TACAAGCCAA * * 2940 TGTATCGATACA-ATTTTTTGTGTATCGATACAT 1 TGTATCGATACATA-CTTTGGTGTATCGATACAT 2973 TGTATCGATACATACTTTGGTGTATCGATACA 1 TGTATCGATACATACTTTGGTGTATCGATACA 3005 AGTTTGGCTA Statistics Matches: 29, Mismatches: 2, Indels: 2 0.88 0.06 0.06 Matches are distributed among these distances: 33 28 0.97 34 1 0.03 ACGTcount: A:0.28, C:0.14, G:0.17, T:0.42 Consensus pattern (33 bp): TGTATCGATACATACTTTGGTGTATCGATACAT Found at i:4040 original size:37 final size:36 Alignment explanation

Indices: 3949--4036 Score: 108 Period size: 37 Copynumber: 2.4 Consensus size: 36 3939 TTTAAAAATA 3949 AATATATTTTAATAA-T-TTGAGATAAATATAACTTT 1 AATAT-TTTTAATAATTATTGAGATAAATATAACTTT * * * * 3984 AGTGATTTTTAATTATTATTGATATAAATATAATTTT 1 AAT-ATTTTTAATAATTATTGAGATAAATATAACTTT 4021 AATATTTTTAATAATT 1 AATATTTTTAATAATT 4037 TATTAATTTT Statistics Matches: 44, Mismatches: 6, Indels: 5 0.80 0.11 0.09 Matches are distributed among these distances: 35 10 0.23 36 15 0.34 37 19 0.43 ACGTcount: A:0.42, C:0.01, G:0.06, T:0.51 Consensus pattern (36 bp): AATATTTTTAATAATTATTGAGATAAATATAACTTT Found at i:6373 original size:44 final size:44 Alignment explanation

Indices: 6311--6407 Score: 185 Period size: 44 Copynumber: 2.2 Consensus size: 44 6301 TGAATTATTA 6311 TATATATATATAATTTATTTATATCAACATAAAATTAAATTAAT 1 TATATATATATAATTTATTTATATCAACATAAAATTAAATTAAT 6355 TATATATATATAATTTATTTATATCAACATAAAATTAAATTAAT 1 TATATATATATAATTTATTTATATCAACATAAAATTAAATTAAT * 6399 TTTATATAT 1 TATATATAT 6408 TTTTATAAAA Statistics Matches: 52, Mismatches: 1, Indels: 0 0.98 0.02 0.00 Matches are distributed among these distances: 44 52 1.00 ACGTcount: A:0.48, C:0.04, G:0.00, T:0.47 Consensus pattern (44 bp): TATATATATATAATTTATTTATATCAACATAAAATTAAATTAAT Found at i:6702 original size:24 final size:26 Alignment explanation

Indices: 6673--6743 Score: 69 Period size: 25 Copynumber: 2.8 Consensus size: 26 6663 ATTATAAAAA 6673 ATATAAAATTAATTTAATTTATAT-C 1 ATATAAAATTAATTTAATTTATATAC * 6698 -TATAAAAATAAATTT--TTTATATAC 1 ATAT-AAAATTAATTTAATTTATATAC * 6722 ATATAAACTTAATAATTAATTT 1 ATATAAAATTAAT--TTAATTT 6744 GATACTTCGA Statistics Matches: 36, Mismatches: 3, Indels: 11 0.72 0.06 0.22 Matches are distributed among these distances: 23 7 0.19 24 11 0.31 25 13 0.36 26 2 0.06 28 3 0.08 ACGTcount: A:0.49, C:0.04, G:0.00, T:0.46 Consensus pattern (26 bp): ATATAAAATTAATTTAATTTATATAC Found at i:15750 original size:13 final size:13 Alignment explanation

Indices: 15732--15758 Score: 54 Period size: 13 Copynumber: 2.1 Consensus size: 13 15722 TACAAAACTT 15732 ATATATCGATACA 1 ATATATCGATACA 15745 ATATATCGATACA 1 ATATATCGATACA 15758 A 1 A 15759 CACTTTATGT Statistics Matches: 14, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 13 14 1.00 ACGTcount: A:0.48, C:0.15, G:0.07, T:0.30 Consensus pattern (13 bp): ATATATCGATACA Found at i:16673 original size:57 final size:57 Alignment explanation

Indices: 16565--16673 Score: 130 Period size: 57 Copynumber: 1.9 Consensus size: 57 16555 TGAAGAAAAA * * ** 16565 TGCCAATGTGCTGAGGCAAGGCCAGCGACATCGGGCTTAGGAATGTTAAAGATGAAG 1 TGCCAATGTGCTGAGGCAAGGCCAGCGACATCGGACTTAAGAATGACAAAGATGAAG * ** * 16622 TGCCAATGTGTTGATTCAAGGCCAGCGACATTGGACTTAA-AGATGACAAAGA 1 TGCCAATGTGCTGAGGCAAGGCCAGCGACATCGGACTTAAGA-ATGACAAAGA 16674 CGCCAATAGC Statistics Matches: 43, Mismatches: 8, Indels: 2 0.81 0.15 0.04 Matches are distributed among these distances: 56 1 0.02 57 42 0.98 ACGTcount: A:0.32, C:0.17, G:0.29, T:0.21 Consensus pattern (57 bp): TGCCAATGTGCTGAGGCAAGGCCAGCGACATCGGACTTAAGAATGACAAAGATGAAG Found at i:16835 original size:91 final size:92 Alignment explanation

Indices: 16721--17046 Score: 545 Period size: 91 Copynumber: 3.6 Consensus size: 92 16711 AAGGATGGAC * * 16721 AAGGCGCCAAATATGCTGATTCAAGGCCAACGATATTGGGACTTGGAGGTGCCAATGTG-TGATT 1 AAGGTGCC-AATATGCTGATTCAAGGCCAGCGATATT-GGACTTGGAGGTGCCAATGTGCTGATT 16785 C-AGGCCAGCTACATTGGACTTAAAGATG 64 CAAGGCCAGCTACATTGGACTTAAAGATG * 16813 AAGGTGCCAATATGCTGATTCAAGGCTAGCGATATTGGACTTGGAGGTGCCAATGTGCTGATTCA 1 AAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTTGGAGGTGCCAATGTGCTGATTCA 16878 AGGCCAGCTA-ATTGGACTTAAAGATG 66 AGGCCAGCTACATTGGACTTAAAGATG 16904 AAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTTGGAGGTGCCAATGTGCTGATTCA 1 AAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTTGGAGGTGCCAATGTGCTGATTCA 16969 AGGCCAGCTACATTGGACTTAAAGATG 66 AGGCCAGCTACATTGGACTTAAAGATG * * * 16996 AAGGTGCCAATATGCT-ATTCAAGGCCAGCAATA-TGTACTTAGAGGTGCCAA 1 AAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTTGGAGGTGCCAA 17047 AGGTGCCAAT Statistics Matches: 224, Mismatches: 7, Indels: 8 0.94 0.03 0.03 Matches are distributed among these distances: 90 37 0.17 91 138 0.62 92 49 0.22 ACGTcount: A:0.29, C:0.18, G:0.28, T:0.25 Consensus pattern (92 bp): AAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTTGGAGGTGCCAATGTGCTGATTCA AGGCCAGCTACATTGGACTTAAAGATG Found at i:16866 original size:43 final size:46 Alignment explanation

Indices: 16617--17089 Score: 384 Period size: 43 Copynumber: 10.0 Consensus size: 46 16607 ATGTTAAAGA * * * 16617 TGAA-GTGCCAATGTGTTGATTCAAGGCCAGCGACATTGGACTTAAAG 1 TGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTT--AG ** * * * 16664 ATGACAAAGACGCCAATA-GCTGATTCAAGGCTAGCGATCTAGGACTTAAGG 1 -TG---AAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTT-A-G * * 16715 ATGGACAAGGCGCCAAATATGCTGATTCAAGGCCAACGATATTGGGACTT-G 1 -T-G--AAGGTGCC-AATATGCTGATTCAAGGCCAGCGATATT-GGACTTAG * * * 16766 -G-AGGTGCCAATGTG-TGATTC-AGGCCAGCTACATTGGACTTAAAG 1 TGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTT--AG * 16810 ATGAAGGTGCCAATATGCTGATTCAAGGCTAGCGATATTGGACTT-G 1 -TGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTTAG * * 16856 -G-AGGTGCCAATGTGCTGATTCAAGGCCAGCTA-ATTGGACTTAAAG 1 TGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTT--AG 16901 ATGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTT-G 1 -TGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTTAG * * * 16947 -G-AGGTGCCAATGTGCTGATTCAAGGCCAGCTACATTGGACTTAAAG 1 TGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTT--AG * * 16993 ATGAAGGTGCCAATATGCT-ATTCAAGGCCAGCAATA-TGTACTTAGAGG 1 -TGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTT--A-G 17041 TGCCAAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTTAG 1 TG---AAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTTAG 17090 AAGAAGACGC Statistics Matches: 350, Mismatches: 39, Indels: 71 0.76 0.08 0.15 Matches are distributed among these distances: 41 6 0.02 42 20 0.06 43 72 0.21 44 8 0.02 45 7 0.02 46 4 0.01 47 23 0.07 48 55 0.16 49 41 0.12 50 19 0.05 51 53 0.15 52 17 0.05 53 19 0.05 54 6 0.02 ACGTcount: A:0.30, C:0.18, G:0.27, T:0.24 Consensus pattern (46 bp): TGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGACTTAG Found at i:17078 original size:142 final size:134 Alignment explanation

Indices: 16814--17087 Score: 372 Period size: 142 Copynumber: 2.0 Consensus size: 134 16804 TTAAAGATGA * * * 16814 AGGTGCCAATATGCTGATTCAAGGCTAGCGATATTGGACTTGGAGGTGCCAATGTGCTGATTCAA 1 AGGTGCCAATATGCTGATTCAAGGCCAGCGACATTGGACTTGGAGGTGCCAATATGCTGATTCAA 16879 GGCCAGCTAATTGGACTTAAAGATGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGAC 66 GGCCAGCTAATTGGACTTAAAGATGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGAC 16944 TTGG 131 TTGG * * 16948 AGGTGCCAATGTGCTGATTCAAGGCCAGCTACATTGGACTTAAAGATGAAGGTGCCAATATGCT- 1 AGGTGCCAATATGCTGATTCAAGGCCAGCGACATTGGACTT---G--G-AGGTGCCAATATGCTG * * * 17012 ATTCAAGGCCAGC-AATATGTACTTAGAGGTGCCAAAGGTGCCAATATGCTGATTCAAGGCCAGC 60 ATTCAAGGCCAGCTAAT-TGGACTTAAAGATG---AAGGTGCCAATATGCTGATTCAAGGCCAGC 17076 GATATTGGACTT 121 GATATTGGACTT 17088 AGAAGAAGAC Statistics Matches: 122, Mismatches: 8, Indels: 12 0.86 0.06 0.08 Matches are distributed among these distances: 134 37 0.30 137 1 0.01 138 3 0.02 139 25 0.20 140 14 0.11 142 42 0.34 ACGTcount: A:0.29, C:0.18, G:0.27, T:0.26 Consensus pattern (134 bp): AGGTGCCAATATGCTGATTCAAGGCCAGCGACATTGGACTTGGAGGTGCCAATATGCTGATTCAA GGCCAGCTAATTGGACTTAAAGATGAAGGTGCCAATATGCTGATTCAAGGCCAGCGATATTGGAC TTGG Found at i:17167 original size:28 final size:28 Alignment explanation

Indices: 17136--17260 Score: 153 Period size: 28 Copynumber: 4.4 Consensus size: 28 17126 GTTTGCATCA * * * * 17136 ACTTGTGTGCTTTTGAAGGTTGCCACTG 1 ACTTGTGGGCTTCTAAAGATTGCCACTG 17164 ACTTGTGGGCTTCTAAAGATTGCCACTG 1 ACTTGTGGGCTTCTAAAGATTGCCACTG 17192 ACTTGTGGGCTTCTAAAGATTGCCACTG 1 ACTTGTGGGCTTCTAAAGATTGCCACTG * ** * 17220 A-TTGTGGGCTTTTGAAAAGGGTGCCACTA 1 ACTTGTGGGCTTCT--AAAGATTGCCACTG 17249 ACTTGTGGGCTT 1 ACTTGTGGGCTT 17261 AAAAGGAAAA Statistics Matches: 86, Mismatches: 8, Indels: 4 0.88 0.08 0.04 Matches are distributed among these distances: 27 11 0.13 28 53 0.62 29 12 0.14 30 10 0.12 ACGTcount: A:0.19, C:0.18, G:0.28, T:0.34 Consensus pattern (28 bp): ACTTGTGGGCTTCTAAAGATTGCCACTG Found at i:17289 original size:34 final size:35 Alignment explanation

Indices: 17251--17325 Score: 107 Period size: 35 Copynumber: 2.2 Consensus size: 35 17241 TGCCACTAAC ** * 17251 TTGTGGGCTTA-AAAGGAAAAAGAGTGCTACGGAG 1 TTGTGGGCTTACAAAAAAAAAAGAGTGCCACGGAG * 17285 TTGTGAGCTTACAAAAAAAAAAGAGTGCCACGGAG 1 TTGTGGGCTTACAAAAAAAAAAGAGTGCCACGGAG 17320 TTGTGG 1 TTGTGG 17326 ACTTTGGAAA Statistics Matches: 35, Mismatches: 5, Indels: 1 0.85 0.12 0.02 Matches are distributed among these distances: 34 10 0.29 35 25 0.71 ACGTcount: A:0.36, C:0.11, G:0.32, T:0.21 Consensus pattern (35 bp): TTGTGGGCTTACAAAAAAAAAAGAGTGCCACGGAG Found at i:17352 original size:29 final size:29 Alignment explanation

Indices: 17303--17475 Score: 188 Period size: 29 Copynumber: 5.9 Consensus size: 29 17293 TTACAAAAAA * * * 17303 AAAAGAGTGCCACGGAGTTGTGGACTTTGG 1 AAAAGA-TGCCACTGACTTGTGGGCTTTGG * * 17333 AAAAGATGCCACCGACTTGTGGGCTTCGG 1 AAAAGATGCCACTGACTTGTGGGCTTTGG * * 17362 AAAAGGGTGCCACTGATTTGTGGGCTTT-G 1 AAAA-GATGCCACTGACTTGTGGGCTTTGG * * * * 17391 -AAGGTTGCCACTGACTTGTGGACTTTGAA 1 AAAAGATGCCACTGACTTGTGGGCTTTG-G * * 17420 AAAAGATGCTACTGACTTGTGGGCTTTGA 1 AAAAGATGCCACTGACTTGTGGGCTTTGG 17449 AAAAGATGCCACTGACTTGTGGGCTTT 1 AAAAGATGCCACTGACTTGTGGGCTTT 17476 TGAAGGGTGA Statistics Matches: 121, Mismatches: 18, Indels: 9 0.82 0.12 0.06 Matches are distributed among these distances: 27 20 0.17 28 2 0.02 29 51 0.42 30 48 0.40 ACGTcount: A:0.25, C:0.17, G:0.31, T:0.28 Consensus pattern (29 bp): AAAAGATGCCACTGACTTGTGGGCTTTGG Found at i:17434 original size:57 final size:58 Alignment explanation

Indices: 17156--17475 Score: 208 Period size: 57 Copynumber: 5.4 Consensus size: 58 17146 TTTTGAAGGT 17156 TGCCACTGACTTGTGGGCTTCT----AAAGATTGCCACTGACTTGTGGGCTTCT--AAAGA 1 TGCCACTGACTTGTGGGCTT-TGAAAAAAGA-TGCCACTGACTTGTGGGCTT-TGAAAAGA * * * * 17211 TTGCCACTGA-TTGTGGGCTTTTG-AAAAGGGTGCCACTAACTTGTGGGCTTAAAAGGAAAAAGA 1 -TGCCACTGACTTGTGGGC-TTTGAAAAAAGATGCCACTGACTTGTGGGCTT----TG-AAAAGA * * * * ** * * * 17274 GTGCTACGGAGTTGTGAGCTTACAAAAAAAAAAGAGTGCCACGGAGTTGTGGACTTTGGAAAAGA 1 -TGCCACTGACTTGTGGGCTT----TGAAAAAAGA-TGCCACTGACTTGTGGGCTTT-GAAAAGA * * * * * * * * 17339 TGCCACCGACTTGTGGGCTTCGGAAAAGGGTGCCACTGATTTGTGGGCTTTG-AAGGT 1 TGCCACTGACTTGTGGGCTTTGAAAAAAGATGCCACTGACTTGTGGGCTTTGAAAAGA * * 17396 TGCCACTGACTTGTGGACTTTGAAAAAAGATGCTACTGACTTGTGGGCTTTGAAAAAGA 1 TGCCACTGACTTGTGGGCTTTGAAAAAAGATGCCACTGACTTGTGGGCTTTG-AAAAGA 17455 TGCCACTGACTTGTGGGCTTT 1 TGCCACTGACTTGTGGGCTTT 17476 TGAAGGGTGA Statistics Matches: 201, Mismatches: 43, Indels: 37 0.72 0.15 0.13 Matches are distributed among these distances: 55 9 0.04 56 11 0.05 57 66 0.33 58 4 0.02 59 41 0.20 60 5 0.02 63 14 0.07 64 23 0.11 65 6 0.03 66 1 0.00 68 5 0.02 69 16 0.08 ACGTcount: A:0.26, C:0.17, G:0.29, T:0.28 Consensus pattern (58 bp): TGCCACTGACTTGTGGGCTTTGAAAAAAGATGCCACTGACTTGTGGGCTTTGAAAAGA Found at i:17459 original size:86 final size:87 Alignment explanation

Indices: 17310--17481 Score: 240 Period size: 86 Copynumber: 2.0 Consensus size: 87 17300 AAAAAAAGAG * * * * 17310 TGCCACGGAGTTGTGGACTTTGGAAAAGATGCCACCGACTTGTGGGCTTCGGAAAAGGGTGCCAC 1 TGCCACGGACTTGTGGACTTTGAAAAAGATGCCACCGACTTGTGGGCTTCGGAAAAAGATGCCAC * 17375 TGATTTGTGGGC-TTTGAAGGT 66 TGACTTGTGGGCTTTTGAAGGT * * * * 17396 TGCCACTGACTTGTGGACTTTGAAAAAAGATGCTACTGACTTGTGGGCTT-TGAAAAAGATGCCA 1 TGCCACGGACTTGTGGACTTTG-AAAAAGATGCCACCGACTTGTGGGCTTCGGAAAAAGATGCCA 17460 CTGACTTGTGGGCTTTTGAAGG 65 CTGACTTGTGGGCTTTTGAAGG 17482 GTGAGGAATG Statistics Matches: 75, Mismatches: 9, Indels: 3 0.86 0.10 0.03 Matches are distributed among these distances: 86 43 0.57 87 32 0.43 ACGTcount: A:0.23, C:0.17, G:0.31, T:0.28 Consensus pattern (87 bp): TGCCACGGACTTGTGGACTTTGAAAAAGATGCCACCGACTTGTGGGCTTCGGAAAAAGATGCCAC TGACTTGTGGGCTTTTGAAGGT Found at i:18908 original size:1 final size:1 Alignment explanation

Indices: 18902--18961 Score: 57 Period size: 1 Copynumber: 60.0 Consensus size: 1 18892 CCTTTTGTTG * * * * * * * 18902 TTTTTTTTTGTTTTTTGTTTTGTTTTATTTTTTTTTGTTGTTTTTTTTTGTTTTTTTTTT 1 TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 18962 GCGGAGGATG Statistics Matches: 45, Mismatches: 14, Indels: 0 0.76 0.24 0.00 Matches are distributed among these distances: 1 45 1.00 ACGTcount: A:0.02, C:0.00, G:0.10, T:0.88 Consensus pattern (1 bp): T Found at i:18912 original size:10 final size:10 Alignment explanation

Indices: 18894--18960 Score: 75 Period size: 10 Copynumber: 6.7 Consensus size: 10 18884 ATCCAGACCC * 18894 TTTTGTTGTT 1 TTTTTTTGTT 18904 TTTTTTTGTT 1 TTTTTTTGTT 18914 TTTTGTTT-TGT 1 TTTT-TTTGT-T * 18925 TTTATTT-TT 1 TTTTTTTGTT * 18934 TTTTGTTGTT 1 TTTTTTTGTT 18944 TTTTTTTGTT 1 TTTTTTTGTT 18954 TTTTTTT 1 TTTTTTT 18961 TGCGGAGGAT Statistics Matches: 49, Mismatches: 5, Indels: 6 0.82 0.08 0.10 Matches are distributed among these distances: 9 6 0.12 10 36 0.73 11 7 0.14 ACGTcount: A:0.01, C:0.00, G:0.12, T:0.87 Consensus pattern (10 bp): TTTTTTTGTT Found at i:18912 original size:13 final size:12 Alignment explanation

Indices: 18894--18961 Score: 82 Period size: 13 Copynumber: 5.2 Consensus size: 12 18884 ATCCAGACCC 18894 TTTTGTTGTTTTT 1 TTTTGTT-TTTTT 18907 TTTTGTTTTTTGT 1 TTTTGTTTTTT-T * 18920 TTTGTTTTATTTTT 1 TTT-TGTT-TTTTT 18934 TTTTGTTGTTTTT 1 TTTTGTT-TTTTT 18947 TTTTGTTTTTTT 1 TTTTGTTTTTTT 18959 TTT 1 TTT 18962 GCGGAGGATG Statistics Matches: 49, Mismatches: 3, Indels: 7 0.83 0.05 0.12 Matches are distributed among these distances: 12 12 0.24 13 26 0.53 14 7 0.14 15 4 0.08 ACGTcount: A:0.01, C:0.00, G:0.12, T:0.87 Consensus pattern (12 bp): TTTTGTTTTTTT Done.