Tandem Repeats Finder Program written by:
Gary Benson
Program in Bioinformatics
Boston University
Version 4.09
Sequence: NTFQ01013817.1 Kokia drynarioides strain JFW-HI SEQ_128845, whole genome shotgun sequence
Parameters: 2 7 7 80 10 50 1000
Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000
Length: 333904
ACGTcount: A:0.34, C:0.17, G:0.17, T:0.33
Warning! 141 characters in sequence are not A, C, G, or T
File 2 of 2
Found at i:303506 original size:26 final size:26
Alignment explanation
Indices: 303403--303512 Score: 75
Period size: 26 Copynumber: 4.2 Consensus size: 26
303393 TTATAGTAAA
*
303403 AAAATATAATTTAATTATTT-T-AA-T
1 AAAATATAATTTTATT-TTTATAAATT
* * *
303427 AAATTATATTTTTATAATTTTAAAAATT
1 AAAATATAATTTTAT--TTTTATAAATT
* * *
303455 AAATTA-AATTTTTATATTTAGAAATT
1 AAAATATAA-TTTTATTTTTATAAATT
*
303481 AAAATATAATTTTATTTTTATTAATTT
1 AAAATATAATTTTATTTTTA-TAAATT
303508 AAAAT
1 AAAAT
303513 TTTAAAAATT
Statistics
Matches: 67, Mismatches: 11, Indels: 13
0.74 0.12 0.14
Matches are distributed among these distances:
24 12 0.18
25 3 0.04
26 25 0.37
27 14 0.21
28 13 0.19
ACGTcount: A:0.47, C:0.00, G:0.01, T:0.52
Consensus pattern (26 bp):
AAAATATAATTTTATTTTTATAAATT
Found at i:309385 original size:143 final size:143
Alignment explanation
Indices: 309127--309412 Score: 554
Period size: 143 Copynumber: 2.0 Consensus size: 143
309117 TAACAGGATT
309127 ATCCTACAAACACAACTAATATTAATACTATCCTAGCCACCACTTAAGCACAATACTTAACTAGA
1 ATCCTACAAACACAACTAATATTAATACTATCCTAGCCACCACTTAAGCACAATACTTAACTAGA
*
309192 ATAACTTATGTGTAGCTTTCGGAACCAGTGCAGGAGTAGGTTGCATGGGGAAGGTGGAGGTAAAT
66 ATAACTTATGTGTAGCTTTCGGAACCAGTGCAGGAGTAGGTTGCACGGGGAAGGTGGAGGTAAAT
309257 CGTCAGCCTCGAA
131 CGTCAGCCTCGAA
*
309270 ATCCTACAAACACAATTAATATTAATACTATCCTAGCCACCACTTAAGCACAATACTTAACTAGA
1 ATCCTACAAACACAACTAATATTAATACTATCCTAGCCACCACTTAAGCACAATACTTAACTAGA
309335 ATAACTTATGTGTAGCTTTCGGAACCAGTGCAGGAGTAGGTTGCACGGGGAAGGTGGAGGTAAAT
66 ATAACTTATGTGTAGCTTTCGGAACCAGTGCAGGAGTAGGTTGCACGGGGAAGGTGGAGGTAAAT
309400 CGTCAGCCTCGAA
131 CGTCAGCCTCGAA
309413 GTCGTGATCA
Statistics
Matches: 141, Mismatches: 2, Indels: 0
0.99 0.01 0.00
Matches are distributed among these distances:
143 141 1.00
ACGTcount: A:0.34, C:0.21, G:0.20, T:0.24
Consensus pattern (143 bp):
ATCCTACAAACACAACTAATATTAATACTATCCTAGCCACCACTTAAGCACAATACTTAACTAGA
ATAACTTATGTGTAGCTTTCGGAACCAGTGCAGGAGTAGGTTGCACGGGGAAGGTGGAGGTAAAT
CGTCAGCCTCGAA
Found at i:310279 original size:2 final size:2
Alignment explanation
Indices: 310266--310297 Score: 55
Period size: 2 Copynumber: 16.0 Consensus size: 2
310256 CAACACCTTT
*
310266 AC AC AC AT AC AC AC AC AC AC AC AC AC AC AC AC
1 AC AC AC AC AC AC AC AC AC AC AC AC AC AC AC AC
310298 TATAATTTAA
Statistics
Matches: 28, Mismatches: 2, Indels: 0
0.93 0.07 0.00
Matches are distributed among these distances:
2 28 1.00
ACGTcount: A:0.50, C:0.47, G:0.00, T:0.03
Consensus pattern (2 bp):
AC
Found at i:315239 original size:2 final size:2
Alignment explanation
Indices: 315232--315262 Score: 53
Period size: 2 Copynumber: 15.5 Consensus size: 2
315222 ACATACATTC
*
315232 AT AT AT AT AT AT AT AT AT AT AT AT AC AT AT A
1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A
315263 AAAATAGTCT
Statistics
Matches: 27, Mismatches: 2, Indels: 0
0.93 0.07 0.00
Matches are distributed among these distances:
2 27 1.00
ACGTcount: A:0.52, C:0.03, G:0.00, T:0.45
Consensus pattern (2 bp):
AT
Found at i:316583 original size:21 final size:21
Alignment explanation
Indices: 316537--316583 Score: 51
Period size: 20 Copynumber: 2.3 Consensus size: 21
316527 GGGTTATTTG
* * *
316537 GGTTAAAAGGTTTGGGTTTAA
1 GGTTAAAAGGGTTGGGGTAAA
*
316558 -TTTAAAAGGGTTGGGGTAAA
1 GGTTAAAAGGGTTGGGGTAAA
316578 GGTTAA
1 GGTTAA
316584 TAAAGGTTTC
Statistics
Matches: 20, Mismatches: 5, Indels: 2
0.74 0.19 0.07
Matches are distributed among these distances:
20 16 0.80
21 4 0.20
ACGTcount: A:0.32, C:0.00, G:0.34, T:0.34
Consensus pattern (21 bp):
GGTTAAAAGGGTTGGGGTAAA
Found at i:331380 original size:23 final size:24
Alignment explanation
Indices: 331320--331386 Score: 75
Period size: 23 Copynumber: 2.8 Consensus size: 24
331310 AAAAAATAAA
*
331320 CGGTCAATAGTCAACGGGTC-AGGT
1 CGGTCAA-AGTCAATGGGTCGAGGT
*
331344 CGATCAAAGTCAATGGGTCGA-GT
1 CGGTCAAAGTCAATGGGTCGAGGT
* *
331367 TGGTCAAAGTCAATAGGTCG
1 CGGTCAAAGTCAATGGGTCG
331387 TGTTCGATTT
Statistics
Matches: 37, Mismatches: 5, Indels: 3
0.82 0.11 0.07
Matches are distributed among these distances:
23 30 0.81
24 7 0.19
ACGTcount: A:0.28, C:0.18, G:0.31, T:0.22
Consensus pattern (24 bp):
CGGTCAAAGTCAATGGGTCGAGGT
Done.