Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Ga10g01422.F2

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 3969
ACGTcount: A:0.28, C:0.24, G:0.23, T:0.26


Found at i:59 original size:2 final size:2

Alignment explanation

Indices: 52--95 Score: 70 Period size: 2 Copynumber: 22.0 Consensus size: 2 42 CCCACTCCTC * * 52 CT CT CT CT AT CT CT CT GT CT CT CT CT CT CT CT CT CT CT CT CT 1 CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT 94 CT 1 CT 96 TTCATTTTTC Statistics Matches: 38, Mismatches: 4, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 2 38 1.00 ACGTcount: A:0.02, C:0.45, G:0.02, T:0.50 Consensus pattern (2 bp): CT Found at i:1864 original size:33 final size:33 Alignment explanation

Indices: 1826--2150 Score: 217 Period size: 33 Copynumber: 9.8 Consensus size: 33 1816 ACCAACCCGC * * * 1826 GCAGCCAAGTGCGAATCCATATGGCCAGCTTGC 1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA * * ** * 1859 TCAGCCCAGTGCTAATCCATATGGCCAAAATGG 1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA * * * 1892 GCAACCAAGTGCTAATCCCTATGGCCAGCCTGA 1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA ** * 1925 GCAGCCCAA-TGCTAATCCATATGGCCAAAATGG 1 GCAG-CCAAGTGCTAATCCATATGGCCAGCATGA * * * * 1958 GCAACCAAGTGCTAATCCCTATGGCCAGCCTGT 1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA * * * ** * 1991 GCAGCCCAGTACTAACCCATATGGCCAAAATGG 1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA * * * 2024 GCA-CTCAAGTGCTAATCCCTATGGCCAGCCTGT 1 GCAGC-CAAGTGCTAATCCATATGGCCAGCATGA * * ** ** * * 2057 GCAGCCCAGTGCTAACCCATACAGCCAAAACGG 1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA * * * 2090 GCAGCCAAGTGCTAATCTC-TATGGCCTGCCTGC 1 GCAGCCAAGTGCTAATC-CATATGGCCAGCATGA * * * 2123 GCCA-CCCAGTGCAAACCCATATGGCCAG 1 G-CAGCCAAGTGCTAATCCATATGGCCAG 2151 TCAAGTGCTA Statistics Matches: 217, Mismatches: 68, Indels: 14 0.73 0.23 0.05 Matches are distributed among these distances: 32 6 0.03 33 203 0.94 34 8 0.04 ACGTcount: A:0.27, C:0.32, G:0.22, T:0.18 Consensus pattern (33 bp): GCAGCCAAGTGCTAATCCATATGGCCAGCATGA Found at i:1915 original size:66 final size:65 Alignment explanation

Indices: 1830--2149 Score: 419 Period size: 66 Copynumber: 4.8 Consensus size: 65 1820 ACCCGCGCAG * * * * 1830 CCAAGTGCGAATCCATATGGCCAGCTTGCTCAGCCCAGTGCTAATCCATATGGCCAAAATGGGCA 1 CCAAGTGCTAATCCCTATGGCCAGCCTG-TCAGCCCAGTGCTAACCCATATGGCCAAAATGGGCA 1895 A 65 A * * * 1896 CCAAGTGCTAATCCCTATGGCCAGCCTGAGCAGCCCAATGCTAATCCATATGGCCAAAATGGGCA 1 CCAAGTGCTAATCCCTATGGCCAGCCTG-TCAGCCCAGTGCTAACCCATATGGCCAAAATGGGCA 1961 A 65 A * 1962 CCAAGTGCTAATCCCTATGGCCAGCCTGTGCAGCCCAGTACTAACCCATATGGCCAAAATGGGC- 1 CCAAGTGCTAATCCCTATGGCCAGCCTGT-CAGCCCAGTGCTAACCCATATGGCCAAAATGGGCA 2026 A 65 A ** * 2027 CTCAAGTGCTAATCCCTATGGCCAGCCTGTGCAGCCCAGTGCTAACCCATACAGCCAAAACGGGC 1 C-CAAGTGCTAATCCCTATGGCCAGCCTGT-CAGCCCAGTGCTAACCCATATGGCCAAAATGGGC * 2092 AG 64 AA * * * * 2094 CCAAGTGCTAATCTCTATGGCCTGCCTGCGCCA-CCCAGTGCAAACCCATATGGCCA 1 CCAAGTGCTAATCCCTATGGCCAGCCT--GTCAGCCCAGTGCTAACCCATATGGCCA 2150 GTCAAGTGCT Statistics Matches: 228, Mismatches: 21, Indels: 10 0.88 0.08 0.04 Matches are distributed among these distances: 65 2 0.01 66 222 0.97 67 3 0.01 68 1 0.00 ACGTcount: A:0.28, C:0.33, G:0.22, T:0.18 Consensus pattern (65 bp): CCAAGTGCTAATCCCTATGGCCAGCCTGTCAGCCCAGTGCTAACCCATATGGCCAAAATGGGCAA Found at i:2216 original size:33 final size:33 Alignment explanation

Indices: 2156--2263 Score: 117 Period size: 33 Copynumber: 3.3 Consensus size: 33 2146 GCCAGTCAAG * * * * * 2156 TGCTAATCCCTATGGCCAGCCAACACAACCAAA 1 TGCTAATCCGTATAGTCAACCAGCACAACCAAA * * * * * 2189 TGCTGATCCTTATGGTCAATCTGCACAACCAAA 1 TGCTAATCCGTATAGTCAACCAGCACAACCAAA * 2222 TGCTAATCCGTATAGTCAACCAGCACAAGCAAA 1 TGCTAATCCGTATAGTCAACCAGCACAACCAAA 2255 TGCTAATCC 1 TGCTAATCC 2264 ATACAGCCAA Statistics Matches: 62, Mismatches: 13, Indels: 0 0.83 0.17 0.00 Matches are distributed among these distances: 33 62 1.00 ACGTcount: A:0.34, C:0.31, G:0.14, T:0.21 Consensus pattern (33 bp): TGCTAATCCGTATAGTCAACCAGCACAACCAAA Found at i:2275 original size:33 final size:33 Alignment explanation

Indices: 2209--2431 Score: 207 Period size: 33 Copynumber: 6.8 Consensus size: 33 2199 TATGGTCAAT * * * * 2209 CTGCACAACCAAATGCTAATCCGTATAGTCAAC 1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC * 2242 CAGCACAAG-CAAATGCTAATCCATACAGCCAAC 1 CTGCAC-AGCCAAATGCTAATCCATACAGCCAAC * ** * * 2275 CTGTACAGGTACATGCTAATCCATA-TGACCAAC 1 CTGCACAGCCAAATGCTAATCCATACAG-CCAAC * * * 2308 CTGCTCAGCCAAGTGCTAATCCATACAACCAAC 1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC * * * 2341 CTGCTCAGCCGAATGCTAATCCATACAGCCAGC 1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC * * ** 2374 CTACACAGCCTAATGCTAATCCATATGGCCAAC 1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC ** * 2407 CTGTTCAGCCAAGTGCTAATCCATA 1 CTGCACAGCCAAATGCTAATCCATA 2432 TGCCCAACCC Statistics Matches: 153, Mismatches: 33, Indels: 8 0.79 0.17 0.04 Matches are distributed among these distances: 32 3 0.02 33 149 0.97 34 1 0.01 ACGTcount: A:0.34, C:0.32, G:0.14, T:0.20 Consensus pattern (33 bp): CTGCACAGCCAAATGCTAATCCATACAGCCAAC Found at i:2440 original size:99 final size:99 Alignment explanation

Indices: 2253--2431 Score: 288 Period size: 99 Copynumber: 1.8 Consensus size: 99 2243 AGCACAAGCA ** * 2253 AATGCTAATCCATACAGCCAACCTGTACAGGTACATGCTAATCCATATGACCAACCTGCTCAGCC 1 AATGCTAATCCATACAGCCAACCTACACAGCTACATGCTAATCCATATGACCAACCTGCTCAGCC 2318 AAGTGCTAATCCATACAACCAACCTGCTCAGCCG 66 AAGTGCTAATCCATACAACCAACCTGCTCAGCCG * * * 2352 AATGCTAATCCATACAGCCAGCCTACACAGCCTA-ATGCTAATCCATATGGCCAACCTGTTCAGC 1 AATGCTAATCCATACAGCCAACCTACACAG-CTACATGCTAATCCATATGACCAACCTGCTCAGC 2416 CAAGTGCTAATCCATA 65 CAAGTGCTAATCCATA 2432 TGCCCAACCC Statistics Matches: 73, Mismatches: 6, Indels: 2 0.90 0.07 0.02 Matches are distributed among these distances: 99 71 0.97 100 2 0.03 ACGTcount: A:0.32, C:0.32, G:0.14, T:0.21 Consensus pattern (99 bp): AATGCTAATCCATACAGCCAACCTACACAGCTACATGCTAATCCATATGACCAACCTGCTCAGCC AAGTGCTAATCCATACAACCAACCTGCTCAGCCG Found at i:3188 original size:18 final size:17 Alignment explanation

Indices: 3167--3458 Score: 93 Period size: 18 Copynumber: 17.8 Consensus size: 17 3157 TGGGGATGAA 3167 CATGGGCATGAACCCAGG 1 CATGGGCATGAA-CCAGG * * 3185 CATGGGGATGAA-CA-A 1 CATGGGCATGAACCAGG 3200 CATGGGCATGAATCCAGG 1 CATGGGCATGAA-CCAGG * * * 3218 CATGGGGATG-AGCA-A 1 CATGGGCATGAACCAGG * * 3233 TATGGGCATAAATCCAGG 1 CATGGGCATGAA-CCAGG * * * 3251 CATGGGGATG-AGCA-A 1 CATGGGCATGAACCAGG * 3266 TATGGGCATGAATCCAGG 1 CATGGGCATGAA-CCAGG * * * 3284 CATGGGGATG-AGCA-A 1 CATGGGCATGAACCAGG * 3299 TATGGGCATGAATCCAGG 1 CATGGGCATGAA-CCAGG * ** * 3317 CATGGGGATG-AGTA-A 1 CATGGGCATGAACCAGG * ** 3332 TATGGGCATGAATAAAGG 1 CATGGGCATGAA-CCAGG * 3350 CATGGGGATG-A--A-- 1 CATGGGCATGAACCAGG 3362 CATGGGCATGAATCCAGG 1 CATGGGCATGAA-CCAGG * * 3380 CATGGGGATGAA-CA-A 1 CATGGGCATGAACCAGG 3395 CATGGGCATGAATCCAGG 1 CATGGGCATGAA-CCAGG * 3413 CATGGAG-ATGAA-CA-A 1 CATGG-GCATGAACCAGG * * 3428 TATGGGCATGAGCGCAGG 1 CATGGGCATGAAC-CAGG * 3446 CATGGGGATGAAC 1 CATGGGCATGAAC 3459 ATGGGAATGG Statistics Matches: 192, Mismatches: 53, Indels: 58 0.63 0.17 0.19 Matches are distributed among these distances: 12 9 0.05 13 1 0.01 14 2 0.01 15 61 0.32 16 18 0.09 17 18 0.09 18 82 0.43 19 1 0.01 ACGTcount: A:0.32, C:0.16, G:0.35, T:0.17 Consensus pattern (17 bp): CATGGGCATGAACCAGG Found at i:3207 original size:33 final size:33 Alignment explanation

Indices: 3156--3459 Score: 433 Period size: 33 Copynumber: 9.4 Consensus size: 33 3146 GGGCATGGGA * 3156 ATGGGGATGAAC---ATGGGCATGAACCCAGGC 1 ATGGGGATGAACAATATGGGCATGAATCCAGGC * 3186 ATGGGGATGAACAACATGGGCATGAATCCAGGC 1 ATGGGGATGAACAATATGGGCATGAATCCAGGC * * 3219 ATGGGGATGAGCAATATGGGCATAAATCCAGGC 1 ATGGGGATGAACAATATGGGCATGAATCCAGGC * 3252 ATGGGGATGAGCAATATGGGCATGAATCCAGGC 1 ATGGGGATGAACAATATGGGCATGAATCCAGGC * 3285 ATGGGGATGAGCAATATGGGCATGAATCCAGGC 1 ATGGGGATGAACAATATGGGCATGAATCCAGGC ** ** 3318 ATGGGGATGAGTAATATGGGCATGAATAAAGGC 1 ATGGGGATGAACAATATGGGCATGAATCCAGGC 3351 ATGGGGATGAAC---ATGGGCATGAATCCAGGC 1 ATGGGGATGAACAATATGGGCATGAATCCAGGC * 3381 ATGGGGATGAACAACATGGGCATGAATCCAGGC 1 ATGGGGATGAACAATATGGGCATGAATCCAGGC * * 3414 ATGGAGATGAACAATATGGGCATG-AGCGCAGGC 1 ATGGGGATGAACAATATGGGCATGAATC-CAGGC 3447 ATGGGGATGAACA 1 ATGGGGATGAACA 3460 TGGGAATGGG Statistics Matches: 251, Mismatches: 16, Indels: 11 0.90 0.06 0.04 Matches are distributed among these distances: 30 40 0.16 32 2 0.01 33 209 0.83 ACGTcount: A:0.33, C:0.15, G:0.36, T:0.17 Consensus pattern (33 bp): ATGGGGATGAACAATATGGGCATGAATCCAGGC Done.