Tandem Repeats Finder Program written by:
Gary Benson
Program in Bioinformatics
Boston University
Version 4.09
Sequence: Ga10g01422.F2
Parameters: 2 7 7 80 10 50 1000
Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000
Length: 3969
ACGTcount: A:0.28, C:0.24, G:0.23, T:0.26
Found at i:59 original size:2 final size:2
Alignment explanation
Indices: 52--95 Score: 70
Period size: 2 Copynumber: 22.0 Consensus size: 2
42 CCCACTCCTC
* *
52 CT CT CT CT AT CT CT CT GT CT CT CT CT CT CT CT CT CT CT CT CT
1 CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT
94 CT
1 CT
96 TTCATTTTTC
Statistics
Matches: 38, Mismatches: 4, Indels: 0
0.90 0.10 0.00
Matches are distributed among these distances:
2 38 1.00
ACGTcount: A:0.02, C:0.45, G:0.02, T:0.50
Consensus pattern (2 bp):
CT
Found at i:1864 original size:33 final size:33
Alignment explanation
Indices: 1826--2150 Score: 217
Period size: 33 Copynumber: 9.8 Consensus size: 33
1816 ACCAACCCGC
* * *
1826 GCAGCCAAGTGCGAATCCATATGGCCAGCTTGC
1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA
* * ** *
1859 TCAGCCCAGTGCTAATCCATATGGCCAAAATGG
1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA
* * *
1892 GCAACCAAGTGCTAATCCCTATGGCCAGCCTGA
1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA
** *
1925 GCAGCCCAA-TGCTAATCCATATGGCCAAAATGG
1 GCAG-CCAAGTGCTAATCCATATGGCCAGCATGA
* * * *
1958 GCAACCAAGTGCTAATCCCTATGGCCAGCCTGT
1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA
* * * ** *
1991 GCAGCCCAGTACTAACCCATATGGCCAAAATGG
1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA
* * *
2024 GCA-CTCAAGTGCTAATCCCTATGGCCAGCCTGT
1 GCAGC-CAAGTGCTAATCCATATGGCCAGCATGA
* * ** ** * *
2057 GCAGCCCAGTGCTAACCCATACAGCCAAAACGG
1 GCAGCCAAGTGCTAATCCATATGGCCAGCATGA
* * *
2090 GCAGCCAAGTGCTAATCTC-TATGGCCTGCCTGC
1 GCAGCCAAGTGCTAATC-CATATGGCCAGCATGA
* * *
2123 GCCA-CCCAGTGCAAACCCATATGGCCAG
1 G-CAGCCAAGTGCTAATCCATATGGCCAG
2151 TCAAGTGCTA
Statistics
Matches: 217, Mismatches: 68, Indels: 14
0.73 0.23 0.05
Matches are distributed among these distances:
32 6 0.03
33 203 0.94
34 8 0.04
ACGTcount: A:0.27, C:0.32, G:0.22, T:0.18
Consensus pattern (33 bp):
GCAGCCAAGTGCTAATCCATATGGCCAGCATGA
Found at i:1915 original size:66 final size:65
Alignment explanation
Indices: 1830--2149 Score: 419
Period size: 66 Copynumber: 4.8 Consensus size: 65
1820 ACCCGCGCAG
* * * *
1830 CCAAGTGCGAATCCATATGGCCAGCTTGCTCAGCCCAGTGCTAATCCATATGGCCAAAATGGGCA
1 CCAAGTGCTAATCCCTATGGCCAGCCTG-TCAGCCCAGTGCTAACCCATATGGCCAAAATGGGCA
1895 A
65 A
* * *
1896 CCAAGTGCTAATCCCTATGGCCAGCCTGAGCAGCCCAATGCTAATCCATATGGCCAAAATGGGCA
1 CCAAGTGCTAATCCCTATGGCCAGCCTG-TCAGCCCAGTGCTAACCCATATGGCCAAAATGGGCA
1961 A
65 A
*
1962 CCAAGTGCTAATCCCTATGGCCAGCCTGTGCAGCCCAGTACTAACCCATATGGCCAAAATGGGC-
1 CCAAGTGCTAATCCCTATGGCCAGCCTGT-CAGCCCAGTGCTAACCCATATGGCCAAAATGGGCA
2026 A
65 A
** *
2027 CTCAAGTGCTAATCCCTATGGCCAGCCTGTGCAGCCCAGTGCTAACCCATACAGCCAAAACGGGC
1 C-CAAGTGCTAATCCCTATGGCCAGCCTGT-CAGCCCAGTGCTAACCCATATGGCCAAAATGGGC
*
2092 AG
64 AA
* * * *
2094 CCAAGTGCTAATCTCTATGGCCTGCCTGCGCCA-CCCAGTGCAAACCCATATGGCCA
1 CCAAGTGCTAATCCCTATGGCCAGCCT--GTCAGCCCAGTGCTAACCCATATGGCCA
2150 GTCAAGTGCT
Statistics
Matches: 228, Mismatches: 21, Indels: 10
0.88 0.08 0.04
Matches are distributed among these distances:
65 2 0.01
66 222 0.97
67 3 0.01
68 1 0.00
ACGTcount: A:0.28, C:0.33, G:0.22, T:0.18
Consensus pattern (65 bp):
CCAAGTGCTAATCCCTATGGCCAGCCTGTCAGCCCAGTGCTAACCCATATGGCCAAAATGGGCAA
Found at i:2216 original size:33 final size:33
Alignment explanation
Indices: 2156--2263 Score: 117
Period size: 33 Copynumber: 3.3 Consensus size: 33
2146 GCCAGTCAAG
* * * * *
2156 TGCTAATCCCTATGGCCAGCCAACACAACCAAA
1 TGCTAATCCGTATAGTCAACCAGCACAACCAAA
* * * * *
2189 TGCTGATCCTTATGGTCAATCTGCACAACCAAA
1 TGCTAATCCGTATAGTCAACCAGCACAACCAAA
*
2222 TGCTAATCCGTATAGTCAACCAGCACAAGCAAA
1 TGCTAATCCGTATAGTCAACCAGCACAACCAAA
2255 TGCTAATCC
1 TGCTAATCC
2264 ATACAGCCAA
Statistics
Matches: 62, Mismatches: 13, Indels: 0
0.83 0.17 0.00
Matches are distributed among these distances:
33 62 1.00
ACGTcount: A:0.34, C:0.31, G:0.14, T:0.21
Consensus pattern (33 bp):
TGCTAATCCGTATAGTCAACCAGCACAACCAAA
Found at i:2275 original size:33 final size:33
Alignment explanation
Indices: 2209--2431 Score: 207
Period size: 33 Copynumber: 6.8 Consensus size: 33
2199 TATGGTCAAT
* * * *
2209 CTGCACAACCAAATGCTAATCCGTATAGTCAAC
1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC
*
2242 CAGCACAAG-CAAATGCTAATCCATACAGCCAAC
1 CTGCAC-AGCCAAATGCTAATCCATACAGCCAAC
* ** * *
2275 CTGTACAGGTACATGCTAATCCATA-TGACCAAC
1 CTGCACAGCCAAATGCTAATCCATACAG-CCAAC
* * *
2308 CTGCTCAGCCAAGTGCTAATCCATACAACCAAC
1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC
* * *
2341 CTGCTCAGCCGAATGCTAATCCATACAGCCAGC
1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC
* * **
2374 CTACACAGCCTAATGCTAATCCATATGGCCAAC
1 CTGCACAGCCAAATGCTAATCCATACAGCCAAC
** *
2407 CTGTTCAGCCAAGTGCTAATCCATA
1 CTGCACAGCCAAATGCTAATCCATA
2432 TGCCCAACCC
Statistics
Matches: 153, Mismatches: 33, Indels: 8
0.79 0.17 0.04
Matches are distributed among these distances:
32 3 0.02
33 149 0.97
34 1 0.01
ACGTcount: A:0.34, C:0.32, G:0.14, T:0.20
Consensus pattern (33 bp):
CTGCACAGCCAAATGCTAATCCATACAGCCAAC
Found at i:2440 original size:99 final size:99
Alignment explanation
Indices: 2253--2431 Score: 288
Period size: 99 Copynumber: 1.8 Consensus size: 99
2243 AGCACAAGCA
** *
2253 AATGCTAATCCATACAGCCAACCTGTACAGGTACATGCTAATCCATATGACCAACCTGCTCAGCC
1 AATGCTAATCCATACAGCCAACCTACACAGCTACATGCTAATCCATATGACCAACCTGCTCAGCC
2318 AAGTGCTAATCCATACAACCAACCTGCTCAGCCG
66 AAGTGCTAATCCATACAACCAACCTGCTCAGCCG
* * *
2352 AATGCTAATCCATACAGCCAGCCTACACAGCCTA-ATGCTAATCCATATGGCCAACCTGTTCAGC
1 AATGCTAATCCATACAGCCAACCTACACAG-CTACATGCTAATCCATATGACCAACCTGCTCAGC
2416 CAAGTGCTAATCCATA
65 CAAGTGCTAATCCATA
2432 TGCCCAACCC
Statistics
Matches: 73, Mismatches: 6, Indels: 2
0.90 0.07 0.02
Matches are distributed among these distances:
99 71 0.97
100 2 0.03
ACGTcount: A:0.32, C:0.32, G:0.14, T:0.21
Consensus pattern (99 bp):
AATGCTAATCCATACAGCCAACCTACACAGCTACATGCTAATCCATATGACCAACCTGCTCAGCC
AAGTGCTAATCCATACAACCAACCTGCTCAGCCG
Found at i:3188 original size:18 final size:17
Alignment explanation
Indices: 3167--3458 Score: 93
Period size: 18 Copynumber: 17.8 Consensus size: 17
3157 TGGGGATGAA
3167 CATGGGCATGAACCCAGG
1 CATGGGCATGAA-CCAGG
* *
3185 CATGGGGATGAA-CA-A
1 CATGGGCATGAACCAGG
3200 CATGGGCATGAATCCAGG
1 CATGGGCATGAA-CCAGG
* * *
3218 CATGGGGATG-AGCA-A
1 CATGGGCATGAACCAGG
* *
3233 TATGGGCATAAATCCAGG
1 CATGGGCATGAA-CCAGG
* * *
3251 CATGGGGATG-AGCA-A
1 CATGGGCATGAACCAGG
*
3266 TATGGGCATGAATCCAGG
1 CATGGGCATGAA-CCAGG
* * *
3284 CATGGGGATG-AGCA-A
1 CATGGGCATGAACCAGG
*
3299 TATGGGCATGAATCCAGG
1 CATGGGCATGAA-CCAGG
* ** *
3317 CATGGGGATG-AGTA-A
1 CATGGGCATGAACCAGG
* **
3332 TATGGGCATGAATAAAGG
1 CATGGGCATGAA-CCAGG
*
3350 CATGGGGATG-A--A--
1 CATGGGCATGAACCAGG
3362 CATGGGCATGAATCCAGG
1 CATGGGCATGAA-CCAGG
* *
3380 CATGGGGATGAA-CA-A
1 CATGGGCATGAACCAGG
3395 CATGGGCATGAATCCAGG
1 CATGGGCATGAA-CCAGG
*
3413 CATGGAG-ATGAA-CA-A
1 CATGG-GCATGAACCAGG
* *
3428 TATGGGCATGAGCGCAGG
1 CATGGGCATGAAC-CAGG
*
3446 CATGGGGATGAAC
1 CATGGGCATGAAC
3459 ATGGGAATGG
Statistics
Matches: 192, Mismatches: 53, Indels: 58
0.63 0.17 0.19
Matches are distributed among these distances:
12 9 0.05
13 1 0.01
14 2 0.01
15 61 0.32
16 18 0.09
17 18 0.09
18 82 0.43
19 1 0.01
ACGTcount: A:0.32, C:0.16, G:0.35, T:0.17
Consensus pattern (17 bp):
CATGGGCATGAACCAGG
Found at i:3207 original size:33 final size:33
Alignment explanation
Indices: 3156--3459 Score: 433
Period size: 33 Copynumber: 9.4 Consensus size: 33
3146 GGGCATGGGA
*
3156 ATGGGGATGAAC---ATGGGCATGAACCCAGGC
1 ATGGGGATGAACAATATGGGCATGAATCCAGGC
*
3186 ATGGGGATGAACAACATGGGCATGAATCCAGGC
1 ATGGGGATGAACAATATGGGCATGAATCCAGGC
* *
3219 ATGGGGATGAGCAATATGGGCATAAATCCAGGC
1 ATGGGGATGAACAATATGGGCATGAATCCAGGC
*
3252 ATGGGGATGAGCAATATGGGCATGAATCCAGGC
1 ATGGGGATGAACAATATGGGCATGAATCCAGGC
*
3285 ATGGGGATGAGCAATATGGGCATGAATCCAGGC
1 ATGGGGATGAACAATATGGGCATGAATCCAGGC
** **
3318 ATGGGGATGAGTAATATGGGCATGAATAAAGGC
1 ATGGGGATGAACAATATGGGCATGAATCCAGGC
3351 ATGGGGATGAAC---ATGGGCATGAATCCAGGC
1 ATGGGGATGAACAATATGGGCATGAATCCAGGC
*
3381 ATGGGGATGAACAACATGGGCATGAATCCAGGC
1 ATGGGGATGAACAATATGGGCATGAATCCAGGC
* *
3414 ATGGAGATGAACAATATGGGCATG-AGCGCAGGC
1 ATGGGGATGAACAATATGGGCATGAATC-CAGGC
3447 ATGGGGATGAACA
1 ATGGGGATGAACA
3460 TGGGAATGGG
Statistics
Matches: 251, Mismatches: 16, Indels: 11
0.90 0.06 0.04
Matches are distributed among these distances:
30 40 0.16
32 2 0.01
33 209 0.83
ACGTcount: A:0.33, C:0.15, G:0.36, T:0.17
Consensus pattern (33 bp):
ATGGGGATGAACAATATGGGCATGAATCCAGGC
Done.