Tandem Repeats Finder Program written by:
Gary Benson
Program in Bioinformatics
Boston University
Version 4.09
Sequence: Scaffold2127
Parameters: 2 7 7 80 10 50 1000
Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000
Length: 22383
ACGTcount: A:0.31, C:0.21, G:0.18, T:0.30
Found at i:2966 original size:118 final size:119
Alignment explanation
Indices: 2785--3006 Score: 299
Period size: 118 Copynumber: 1.9 Consensus size: 119
2775 TCCTCGTTCA
*
2785 AATGCCTTCGGGACATAGCCCGGTTATAGTAACTCGCACAAATGCCTTCGGGACTTAACCCGGAT
1 AATGCCTTCGGGACATAGCCCGGATATAGTAACTCGCACAAATGCCTTCGGGACTTAACCCGGAT
* *
2850 TTAGTAAC-TCGCACAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCAC
66 ATAGTAACTTAGCACAAA-GCCTTCGGGACTTAACCCGGATTTAGTAACTCGCAC
* * **
2904 AATGCCTTCGGG-CTTAGCCCGGA-ATTAGTATCTCGCACAAATGCCTTC-GGATCTTAGTCCGG
1 AATGCCTTCGGGACATAGCCCGGATA-TAGTAACTCGCACAAATGCCTTCGGGA-CTTAACCCGG
* * *
2966 ATATGGTCACTTAGCACAAAGCCTTCGGGACTTAGCCCGGA
64 ATATAGTAACTTAGCACAAAGCCTTCGGGACTTAACCCGGA
3007 CATCATTCAA
Statistics
Matches: 90, Mismatches: 10, Indels: 7
0.84 0.09 0.07
Matches are distributed among these distances:
117 4 0.04
118 66 0.73
119 20 0.22
ACGTcount: A:0.25, C:0.27, G:0.23, T:0.25
Consensus pattern (119 bp):
AATGCCTTCGGGACATAGCCCGGATATAGTAACTCGCACAAATGCCTTCGGGACTTAACCCGGAT
ATAGTAACTTAGCACAAAGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCAC
Found at i:3006 original size:40 final size:40
Alignment explanation
Indices: 2783--3006 Score: 287
Period size: 40 Copynumber: 5.7 Consensus size: 40
2773 GCTCCTCGTT
* *
2783 CAAATGCCTTCGGGACATAGCCCGGTTATAGTAACTCGCA
1 CAAATGCCTTCGGGACTTAGCCCGGATATAGTAACTCGCA
* *
2823 CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA
1 CAAATGCCTTCGGGACTTAGCCCGGATATAGTAACTCGCA
* *
2863 CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA
1 CAAATGCCTTCGGGACTTAGCCCGGATATAGTAACTCGCA
*
2903 C-AATGCCTTCGGG-CTTAGCCCGGA-ATTAGTATCTCGCA
1 CAAATGCCTTCGGGACTTAGCCCGGATA-TAGTAACTCGCA
* * * *
2941 CAAATGCCTTC-GGATCTTAGTCCGGATATGGTCACTTAGCA
1 CAAATGCCTTCGGGA-CTTAGCCCGGATATAGTAAC-TCGCA
2982 CAAA-GCCTTCGGGACTTAGCCCGGA
1 CAAATGCCTTCGGGACTTAGCCCGGA
3007 CATCATTCAA
Statistics
Matches: 164, Mismatches: 13, Indels: 14
0.86 0.07 0.07
Matches are distributed among these distances:
38 24 0.15
39 21 0.13
40 107 0.65
41 12 0.07
ACGTcount: A:0.25, C:0.28, G:0.22, T:0.25
Consensus pattern (40 bp):
CAAATGCCTTCGGGACTTAGCCCGGATATAGTAACTCGCA
Found at i:6532 original size:19 final size:19
Alignment explanation
Indices: 6504--6542 Score: 51
Period size: 19 Copynumber: 2.1 Consensus size: 19
6494 TGTTGTTGTC
*
6504 GCTGTCGCACACCATCGTT
1 GCTGTCGCACACAATCGTT
* *
6523 GCTGTTGCACGCAATCGTT
1 GCTGTCGCACACAATCGTT
6542 G
1 G
6543 TCGCCACACC
Statistics
Matches: 17, Mismatches: 3, Indels: 0
0.85 0.15 0.00
Matches are distributed among these distances:
19 17 1.00
ACGTcount: A:0.15, C:0.31, G:0.26, T:0.28
Consensus pattern (19 bp):
GCTGTCGCACACAATCGTT
Found at i:12576 original size:39 final size:40
Alignment explanation
Indices: 12531--12698 Score: 106
Period size: 40 Copynumber: 4.2 Consensus size: 40
12521 CGGGGTTTAG
* * *
12531 CCGGATATAACCACTCGCA-CAAGGCCTTCGGGTCTTAAC
1 CCGGATATAACCACTAGCATAAAGGCCTTCGGGACTTAAC
*** * *
12570 CCGGATATGGTCACTAGCATAAATGCCTTCGGGACTTAGC
1 CCGGATATAACCACTAGCATAAAGGCCTTCGGGACTTAAC
** * * * **
12610 CCGGATATAGTCGCTAGCACAAATGCCTTC-GGATCTTAGT
1 CCGGATATAACCACTAGCATAAAGGCCTTCGGGA-CTTAAC
* ** * * * *
12650 CCGGATGTAGTCGCTTAGCACAAAAGCCTTCGGGACTTAGC
1 CCGGATATAACCAC-TAGCATAAAGGCCTTCGGGACTTAAC
12691 CCGGATAT
1 CCGGATAT
12699 CATTCGAGTA
Statistics
Matches: 109, Mismatches: 16, Indels: 6
0.83 0.12 0.05
Matches are distributed among these distances:
39 18 0.17
40 61 0.56
41 27 0.25
42 3 0.03
ACGTcount: A:0.25, C:0.27, G:0.24, T:0.24
Consensus pattern (40 bp):
CCGGATATAACCACTAGCATAAAGGCCTTCGGGACTTAAC
Found at i:12681 original size:41 final size:40
Alignment explanation
Indices: 12554--12698 Score: 193
Period size: 40 Copynumber: 3.6 Consensus size: 40
12544 CTCGCACAAG
* * * * *
12554 GCCTTCGGGTCTTAACCCGGATATGGTCACTAGCATAAAT
1 GCCTTCGGGACTTAGCCCGGATATAGTCGCTAGCACAAAT
12594 GCCTTCGGGACTTAGCCCGGATATAGTCGCTAGCACAAAT
1 GCCTTCGGGACTTAGCCCGGATATAGTCGCTAGCACAAAT
* * *
12634 GCCTTC-GGATCTTAGTCCGGATGTAGTCGCTTAGCACAAAA
1 GCCTTCGGGA-CTTAGCCCGGATATAGTCGC-TAGCACAAAT
12675 GCCTTCGGGACTTAGCCCGGATAT
1 GCCTTCGGGACTTAGCCCGGATAT
12699 CATTCGAGTA
Statistics
Matches: 92, Mismatches: 10, Indels: 5
0.86 0.09 0.05
Matches are distributed among these distances:
39 3 0.03
40 59 0.64
41 27 0.29
42 3 0.03
ACGTcount: A:0.23, C:0.26, G:0.25, T:0.26
Consensus pattern (40 bp):
GCCTTCGGGACTTAGCCCGGATATAGTCGCTAGCACAAAT
Found at i:20592 original size:39 final size:40
Alignment explanation
Indices: 20547--20714 Score: 106
Period size: 40 Copynumber: 4.2 Consensus size: 40
20537 CGGGGTTTAG
* * *
20547 CCGGATATAACCACTCGCA-CAAGGCCTTCGGGTCTTAAC
1 CCGGATATAACCACTAGCATAAAGGCCTTCGGGACTTAAC
*** * *
20586 CCGGATATGGTCACTAGCATAAATGCCTTCGGGACTTAGC
1 CCGGATATAACCACTAGCATAAAGGCCTTCGGGACTTAAC
** * * * **
20626 CCGGATATAGTCGCTAGCACAAATGCCTTC-GGATCTTAGT
1 CCGGATATAACCACTAGCATAAAGGCCTTCGGGA-CTTAAC
* ** * * * *
20666 CCGGATGTAGTCGCTTAGCACAAAAGCCTTCGGGACTTAGC
1 CCGGATATAACCAC-TAGCATAAAGGCCTTCGGGACTTAAC
20707 CCGGATAT
1 CCGGATAT
20715 CATTCGAGTA
Statistics
Matches: 109, Mismatches: 16, Indels: 6
0.83 0.12 0.05
Matches are distributed among these distances:
39 18 0.17
40 61 0.56
41 27 0.25
42 3 0.03
ACGTcount: A:0.25, C:0.27, G:0.24, T:0.24
Consensus pattern (40 bp):
CCGGATATAACCACTAGCATAAAGGCCTTCGGGACTTAAC
Found at i:20697 original size:41 final size:40
Alignment explanation
Indices: 20570--20714 Score: 193
Period size: 40 Copynumber: 3.6 Consensus size: 40
20560 CTCGCACAAG
* * * * *
20570 GCCTTCGGGTCTTAACCCGGATATGGTCACTAGCATAAAT
1 GCCTTCGGGACTTAGCCCGGATATAGTCGCTAGCACAAAT
20610 GCCTTCGGGACTTAGCCCGGATATAGTCGCTAGCACAAAT
1 GCCTTCGGGACTTAGCCCGGATATAGTCGCTAGCACAAAT
* * *
20650 GCCTTC-GGATCTTAGTCCGGATGTAGTCGCTTAGCACAAAA
1 GCCTTCGGGA-CTTAGCCCGGATATAGTCGC-TAGCACAAAT
20691 GCCTTCGGGACTTAGCCCGGATAT
1 GCCTTCGGGACTTAGCCCGGATAT
20715 CATTCGAGTA
Statistics
Matches: 92, Mismatches: 10, Indels: 5
0.86 0.09 0.05
Matches are distributed among these distances:
39 3 0.03
40 59 0.64
41 27 0.29
42 3 0.03
ACGTcount: A:0.23, C:0.26, G:0.25, T:0.26
Consensus pattern (40 bp):
GCCTTCGGGACTTAGCCCGGATATAGTCGCTAGCACAAAT
Done.
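
The records above share a fixed layout: a "Found at i:" line, an "Indices" line, a "Period size" line, and a "Statistics" block whose second line gives matches, mismatches and indels as fractions of their sum (for example 17, 3, 0 gives 0.85 0.15 0.00). The following is a minimal sketch of reading those fields back out of a report. It is not part of Tandem Repeats Finder. The names parse_records, fractions and SAMPLE are hypothetical, and the regular expressions assume the line layout seen in this file only.

```python
import re

# Patterns for the summary lines of one repeat record.
# Assumption: the layout matches the lines in this report; other TRF
# versions or output modes may differ.
INDICES_RE = re.compile(r"Indices:\s*(\d+)--(\d+)\s+Score:\s*(\d+)")
PERIOD_RE = re.compile(
    r"Period size:\s*(\d+)\s+Copynumber:\s*([\d.]+)\s+Consensus size:\s*(\d+)"
)
STATS_RE = re.compile(
    r"Matches:\s*(\d+),\s*Mismatches:\s*(\d+),\s*Indels:\s*(\d+)"
)


def parse_records(text):
    """Return one dict per 'Found at i:' record in a report string."""
    records = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line.startswith("Found at i:"):
            # A new record starts here; later lines fill it in.
            current = {}
            records.append(current)
            continue
        if current is None:
            continue  # still in the file header
        m = INDICES_RE.match(line)
        if m:
            current["start"] = int(m.group(1))
            current["end"] = int(m.group(2))
            current["score"] = int(m.group(3))
            continue
        m = PERIOD_RE.match(line)
        if m:
            current["period"] = int(m.group(1))
            current["copies"] = float(m.group(2))
            current["consensus_size"] = int(m.group(3))
            continue
        m = STATS_RE.match(line)
        if m:
            current["matches"] = int(m.group(1))
            current["mismatches"] = int(m.group(2))
            current["indels"] = int(m.group(3))
    return records


def fractions(record):
    """Matches, mismatches and indels as fractions of their sum."""
    keys = ("matches", "mismatches", "indels")
    total = sum(record[k] for k in keys)
    return tuple(round(record[k] / total, 2) for k in keys)


# Summary lines copied from the period-19 record in this report.
SAMPLE = """\
Found at i:6532 original size:19 final size:19
Alignment explanation
Indices: 6504--6542 Score: 51
Period size: 19 Copynumber: 2.1 Consensus size: 19
Statistics
Matches: 17, Mismatches: 3, Indels: 0
0.85 0.15 0.00
"""

records = parse_records(SAMPLE)
print(records[0])
print(fractions(records[0]))
```

Run against the whole report, parse_records would return six records, one per "Found at i:" line. Alignment rows and consensus sequences are deliberately ignored in this sketch.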