Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Ga10g01422.F6

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 4149
ACGTcount: A:0.28, C:0.23, G:0.23, T:0.26


Found at i:59 original size:2 final size:2

Alignment explanation

Indices: 52--95  Score: 70
Period size: 2  Copynumber: 22.0  Consensus size: 2

       42 CCCACTCCTC

                      *           *
       52 CT CT CT CT AT CT CT CT GT CT CT CT CT CT CT CT CT CT CT CT CT
        1 CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT

       94 CT
        1 CT

       96 TTCATTTTTC

Statistics
Matches: 38,  Mismatches: 4,  Indels: 0
        0.90            0.10            0.00

Matches are distributed among these distances:
  2       38  1.00

ACGTcount: A:0.02, C:0.45, G:0.02, T:0.50

Consensus pattern (2 bp):
CT

Found at i:2044 original size:33 final size:33

Alignment explanation

Indices: 2006--2330  Score: 217
Period size: 33  Copynumber: 9.8  Consensus size: 33

     1996 ACCAACCCGC

                      *                *  *
     2006 GCAGCCAAGTGCGAATCCATATGGCCAGCTTGC
        1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA

          *     *                    **   *
     2039 TCAGCCCAGTGCTAATCCATATGGCCAAAATGG
        1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA

             *              *          *
     2072 GCAACCAAGTGCTAATCCCTATGGCCAGCCTGA
        1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA

                                      **   *
     2105 GCAGCCCAA-TGCTAATCCATATGGCCAAAATGG
        1 GCAG-CCAAGTGCTAATCCATATGGCCAGCATGA

             *              *          *  *
     2138 GCAACCAAGTGCTAATCCCTATGGCCAGCCTGT
        1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA

                *   *    *           **   *
     2171 GCAGCCCAGTACTAACCCATATGGCCAAAATGG
        1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA

                             *          *  *
     2204 GCA-CTCAAGTGCTAATCCCTATGGCCAGCCTGT
        1 GCAGC-CAAGTGCTAATCCATATGGCCAGCATGA

                *        *     **    ** * *
     2237 GCAGCCCAGTGCTAACCCATACAGCCAAAACGG
        1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA

                                     *  *  *
     2270 GCAGCCAAGTGCTAATCTC-TATGGCCTGCCTGC
        1 GCAGCCAAGTGCTAATC-CATATGGCCAGCATGA

                 *     *  *
     2303 GCCA-CCCAGTGCAAACCCATATGGCCAG
        1 G-CAGCCAAGTGCTAATCCATATGGCCAG

     2331 TCAAGTGCTA

Statistics
Matches: 217,  Mismatches: 68,  Indels: 14
        0.73            0.23            0.05

Matches are distributed among these distances:
 32        6  0.03
 33      203  0.94
 34        8  0.04

ACGTcount: A:0.27, C:0.32, G:0.22, T:0.18

Consensus pattern (33 bp):
GCAGCCAAGTGCTAATCCATATGGCCAGCATGA

Found at i:2095 original size:66 final size:65

Alignment explanation

Indices: 2010--2329  Score: 419
Period size: 66  Copynumber: 4.8  Consensus size: 65

     2000 ACCCGCGCAG

                  *     *          *                  *
     2010 CCAAGTGCGAATCCATATGGCCAGCTTGCTCAGCCCAGTGCTAATCCATATGGCCAAAATGGGCA
        1 CCAAGTGCTAATCCCTATGGCCAGCCTG-TCAGCCCAGTGCTAACCCATATGGCCAAAATGGGCA

     2075 A
       65 A

                                       *       *      *
     2076 CCAAGTGCTAATCCCTATGGCCAGCCTGAGCAGCCCAATGCTAATCCATATGGCCAAAATGGGCA
        1 CCAAGTGCTAATCCCTATGGCCAGCCTG-TCAGCCCAGTGCTAACCCATATGGCCAAAATGGGCA

     2141 A
       65 A

                                                 *
     2142 CCAAGTGCTAATCCCTATGGCCAGCCTGTGCAGCCCAGTACTAACCCATATGGCCAAAATGGGC-
        1 CCAAGTGCTAATCCCTATGGCCAGCCTGT-CAGCCCAGTGCTAACCCATATGGCCAAAATGGGCA

     2206 A
       65 A

                                                             **       *
     2207 CTCAAGTGCTAATCCCTATGGCCAGCCTGTGCAGCCCAGTGCTAACCCATACAGCCAAAACGGGC
        1 C-CAAGTGCTAATCCCTATGGCCAGCCTGT-CAGCCCAGTGCTAACCCATATGGCCAAAATGGGC

           *
     2272 AG
       64 AA

                       *        *       *           *
     2274 CCAAGTGCTAATCTCTATGGCCTGCCTGCGCCA-CCCAGTGCAAACCCATATGGCCA
        1 CCAAGTGCTAATCCCTATGGCCAGCCT--GTCAGCCCAGTGCTAACCCATATGGCCA

     2330 GTCAAGTGCT

Statistics
Matches: 228,  Mismatches: 21,  Indels: 10
        0.88            0.08            0.04

Matches are distributed among these distances:
 65        2  0.01
 66      222  0.97
 67        3  0.01
 68        1  0.00

ACGTcount: A:0.28, C:0.33, G:0.22, T:0.18

Consensus pattern (65 bp):
CCAAGTGCTAATCCCTATGGCCAGCCTGTCAGCCCAGTGCTAACCCATATGGCCAAAATGGGCAA

Found at i:2396 original size:33 final size:33

Alignment explanation

Indices: 2336--2443  Score: 117
Period size: 33  Copynumber: 3.3  Consensus size: 33

     2326 GCCAGTCAAG

                   *   * *  *   *
     2336 TGCTAATCCCTATGGCCAGCCAACACAACCAAA
        1 TGCTAATCCGTATAGTCAACCAGCACAACCAAA

              *    *   *     * *
     2369 TGCTGATCCTTATGGTCAATCTGCACAACCAAA
        1 TGCTAATCCGTATAGTCAACCAGCACAACCAAA

                                      *
     2402 TGCTAATCCGTATAGTCAACCAGCACAAGCAAA
        1 TGCTAATCCGTATAGTCAACCAGCACAACCAAA

     2435 TGCTAATCC
        1 TGCTAATCC

     2444 ATACAGCCAA

Statistics
Matches: 62,  Mismatches: 13,  Indels: 0
        0.83            0.17            0.00

Matches are distributed among these distances:
 33       62  1.00

ACGTcount: A:0.34, C:0.31, G:0.14, T:0.21

Consensus pattern (33 bp):
TGCTAATCCGTATAGTCAACCAGCACAACCAAA

Found at i:2455 original size:33 final size:33

Alignment explanation

Indices: 2389--2611  Score: 207
Period size: 33  Copynumber: 6.8  Consensus size: 33

     2379 TATGGTCAAT

                 *              *  *  *
     2389 CTGCACAACCAAATGCTAATCCGTATAGTCAAC
        1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC

           *
     2422 CAGCACAAG-CAAATGCTAATCCATACAGCCAAC
        1 CTGCAC-AGCCAAATGCTAATCCATACAGCCAAC

             *    ** *              *
     2455 CTGTACAGGTACATGCTAATCCATA-TGACCAAC
        1 CTGCACAGCCAAATGCTAATCCATACAG-CCAAC

              *       *              *
     2488 CTGCTCAGCCAAGTGCTAATCCATACAACCAAC
        1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC

              *     *                    *
     2521 CTGCTCAGCCGAATGCTAATCCATACAGCCAGC
        1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC

            *       *              **
     2554 CTACACAGCCTAATGCTAATCCATATGGCCAAC
        1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC

             **       *
     2587 CTGTTCAGCCAAGTGCTAATCCATA
        1 CTGCACAGCCAAATGCTAATCCATA

     2612 TGCCCAACCC

Statistics
Matches: 153,  Mismatches: 33,  Indels: 8
        0.79            0.17            0.04

Matches are distributed among these distances:
 32        3  0.02
 33      149  0.97
 34        1  0.01

ACGTcount: A:0.34, C:0.32, G:0.14, T:0.20

Consensus pattern (33 bp):
CTGCACAGCCAAATGCTAATCCATACAGCCAAC

Found at i:2620 original size:99 final size:99

Alignment explanation

Indices: 2433--2611  Score: 288
Period size: 99  Copynumber: 1.8  Consensus size: 99

     2423 AGCACAAGCA

                                  **    *
     2433 AATGCTAATCCATACAGCCAACCTGTACAGGTACATGCTAATCCATATGACCAACCTGCTCAGCC
        1 AATGCTAATCCATACAGCCAACCTACACAGCTACATGCTAATCCATATGACCAACCTGCTCAGCC

     2498 AAGTGCTAATCCATACAACCAACCTGCTCAGCCG
       66 AAGTGCTAATCCATACAACCAACCTGCTCAGCCG

                              *                             *        *
     2532 AATGCTAATCCATACAGCCAGCCTACACAGCCTA-ATGCTAATCCATATGGCCAACCTGTTCAGC
        1 AATGCTAATCCATACAGCCAACCTACACAG-CTACATGCTAATCCATATGACCAACCTGCTCAGC

     2596 CAAGTGCTAATCCATA
       65 CAAGTGCTAATCCATA

     2612 TGCCCAACCC

Statistics
Matches: 73,  Mismatches: 6,  Indels: 2
        0.90            0.07            0.02

Matches are distributed among these distances:
 99       71  0.97
100        2  0.03

ACGTcount: A:0.32, C:0.32, G:0.14, T:0.21

Consensus pattern (99 bp):
AATGCTAATCCATACAGCCAACCTACACAGCTACATGCTAATCCATATGACCAACCTGCTCAGCC
AAGTGCTAATCCATACAACCAACCTGCTCAGCCG

Found at i:3368 original size:18 final size:17

Alignment explanation

Indices: 3347--3638  Score: 93
Period size: 18  Copynumber: 17.8  Consensus size: 17

     3337 TGGGGATGAA

     3347 CATGGGCATGAACCCAGG
        1 CATGGGCATGAA-CCAGG

                *         *
     3365 CATGGGGATGAA-CA-A
        1 CATGGGCATGAACCAGG

     3380 CATGGGCATGAATCCAGG
        1 CATGGGCATGAA-CCAGG

                *     *   *
     3398 CATGGGGATG-AGCA-A
        1 CATGGGCATGAACCAGG

          *        *
     3413 TATGGGCATAAATCCAGG
        1 CATGGGCATGAA-CCAGG

                *     *   *
     3431 CATGGGGATG-AGCA-A
        1 CATGGGCATGAACCAGG

          *
     3446 TATGGGCATGAATCCAGG
        1 CATGGGCATGAA-CCAGG

                *     *   *
     3464 CATGGGGATG-AGCA-A
        1 CATGGGCATGAACCAGG

          *
     3479 TATGGGCATGAATCCAGG
        1 CATGGGCATGAA-CCAGG

                *     **  *
     3497 CATGGGGATG-AGTA-A
        1 CATGGGCATGAACCAGG

          *            **
     3512 TATGGGCATGAATAAAGG
        1 CATGGGCATGAA-CCAGG

                *
     3530 CATGGGGATG-A--A--
        1 CATGGGCATGAACCAGG

     3542 CATGGGCATGAATCCAGG
        1 CATGGGCATGAA-CCAGG

                *         *
     3560 CATGGGGATGAA-CA-A
        1 CATGGGCATGAACCAGG

     3575 CATGGGCATGAATCCAGG
        1 CATGGGCATGAA-CCAGG

                           *
     3593 CATGGAG-ATGAA-CA-A
        1 CATGG-GCATGAACCAGG

          *          *
     3608 TATGGGCATGAGCGCAGG
        1 CATGGGCATGAAC-CAGG

                *
     3626 CATGGGGATGAAC
        1 CATGGGCATGAAC

     3639 ATGGGAATGG

Statistics
Matches: 192,  Mismatches: 53,  Indels: 58
        0.63            0.17            0.19

Matches are distributed among these distances:
 12        9  0.05
 13        1  0.01
 14        2  0.01
 15       61  0.32
 16       18  0.09
 17       18  0.09
 18       82  0.43
 19        1  0.01

ACGTcount: A:0.32, C:0.16, G:0.35, T:0.17

Consensus pattern (17 bp):
CATGGGCATGAACCAGG

Found at i:3387 original size:33 final size:33

Alignment explanation

Indices: 3336--3639  Score: 433
Period size: 33  Copynumber: 9.4  Consensus size: 33

     3326 GGGCATGGGA

                                    *
     3336 ATGGGGATGAAC---ATGGGCATGAACCCAGGC
        1 ATGGGGATGAACAATATGGGCATGAATCCAGGC

                        *
     3366 ATGGGGATGAACAACATGGGCATGAATCCAGGC
        1 ATGGGGATGAACAATATGGGCATGAATCCAGGC

                    *            *
     3399 ATGGGGATGAGCAATATGGGCATAAATCCAGGC
        1 ATGGGGATGAACAATATGGGCATGAATCCAGGC

                    *
     3432 ATGGGGATGAGCAATATGGGCATGAATCCAGGC
        1 ATGGGGATGAACAATATGGGCATGAATCCAGGC

                    *
     3465 ATGGGGATGAGCAATATGGGCATGAATCCAGGC
        1 ATGGGGATGAACAATATGGGCATGAATCCAGGC

                    **               **
     3498 ATGGGGATGAGTAATATGGGCATGAATAAAGGC
        1 ATGGGGATGAACAATATGGGCATGAATCCAGGC

     3531 ATGGGGATGAAC---ATGGGCATGAATCCAGGC
        1 ATGGGGATGAACAATATGGGCATGAATCCAGGC

                        *
     3561 ATGGGGATGAACAACATGGGCATGAATCCAGGC
        1 ATGGGGATGAACAATATGGGCATGAATCCAGGC

              *                     *
     3594 ATGGAGATGAACAATATGGGCATG-AGCGCAGGC
        1 ATGGGGATGAACAATATGGGCATGAATC-CAGGC

     3627 ATGGGGATGAACA
        1 ATGGGGATGAACA

     3640 TGGGAATGGG

Statistics
Matches: 251,  Mismatches: 16,  Indels: 11
        0.90            0.06            0.04

Matches are distributed among these distances:
 30       40  0.16
 32        2  0.01
 33      209  0.83

ACGTcount: A:0.33, C:0.15, G:0.36, T:0.17

Consensus pattern (33 bp):
ATGGGGATGAACAATATGGGCATGAATCCAGGC

Done.