Tandem Repeats Finder Program written by:
Gary Benson
Program in Bioinformatics
Boston University
Version 4.09
Sequence: NTFQ01012634.1 Kokia drynarioides strain JFW-HI SEQ_127643, whole genome shotgun sequence
Parameters: 2 7 7 80 10 50 1000
Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000
Length: 29803
ACGTcount: A:0.34, C:0.16, G:0.16, T:0.34
Warning! 14 characters in sequence are not A, C, G, or T
Found at i:479 original size:59 final size:59
Alignment explanation
Indices: 385--672 Score: 348
Period size: 59 Copynumber: 4.9 Consensus size: 59
375 AAGGGTCCCG
* * * *
385 AAACTTTCAAAAATCCTATTTTTTACCCCCAAACTTCTAGAAATCCCATTTATT-ACCCCA
1 AAACTTCCAAAAATCCCA-TTTTTACCCCCAAACTTCTAAAAATCCCATTT-TTGACCTCA
* * *
445 AAACTTCCAAAAATCCCAATTTTACCCCTAAACTT-TCAAAAATCCCATTTTTGACCTTA
1 AAACTTCCAAAAATCCCATTTTTACCCCCAAACTTCT-AAAAATCCCATTTTTGACCTCA
* * * * *
504 AAACCTCCAAAAATTCCATTTTTACCCCCGAACTTCTAAAAATCCCATTTTTGATCTCG
1 AAACTTCCAAAAATCCCATTTTTACCCCCAAACTTCTAAAAATCCCATTTTTGACCTCA
**
563 AAACTTCCAAAAATCCCATTTTTACCCCCAAACTTCTAAAAATCCCATTTTTGACCTTG
1 AAACTTCCAAAAATCCCATTTTTACCCCCAAACTTCTAAAAATCCCATTTTTGACCTCA
* * * * *
622 GAACTTCC-AAAATTCCATTTTTAACCTCGAAACTTCTAAAAATTCCATTTT
1 AAACTTCCAAAAATCCCATTTTT-ACCCCCAAACTTCTAAAAATCCCATTTT
673 AGCCCCGTAC
Statistics
Matches: 199, Mismatches: 25, Indels: 9
0.85 0.11 0.04
Matches are distributed among these distances:
58 16 0.08
59 166 0.83
60 17 0.09
ACGTcount: A:0.35, C:0.29, G:0.03, T:0.33
Consensus pattern (59 bp):
AAACTTCCAAAAATCCCATTTTTACCCCCAAACTTCTAAAAATCCCATTTTTGACCTCA
Found at i:720 original size:59 final size:58
Alignment explanation
Indices: 404--732 Score: 272
Period size: 59 Copynumber: 5.6 Consensus size: 58
394 AAAATCCTAT
* * * * * *
404 TTTTTACCCCCAAACTTCTAGAAATCCCA-TTTATTACCCCAAAACTTCCAAAAATCCCA
1 TTTTTACCCCGAAACTTCTAAAAATCCCATTTTA-GACCCC-GAACTTCCCAAAATTCCA
* * * *** * *
463 ATTTTACCCCTAAACTT-TCAAAAATCCCATTTTTGACCTTAAAACCTCCAAAAATTCCA
1 TTTTTACCCCGAAACTTCT-AAAAATCCCATTTTAGACC-CCGAACTTCCCAAAATTCCA
* * * * *
522 TTTTTACCCCCG-AACTTCTAAAAATCCCATTTTTGATCTCGAAACTTCCAAAAATCCCA
1 TTTTTA-CCCCGAAACTTCTAAAAATCCCATTTTAGACCCCG-AACTTCCCAAAATTCCA
* * **
581 TTTTTACCCCCAAACTTCTAAAAATCCCATTTTTGACCTTGGAACTT-CCAAAATTCCA
1 TTTTTACCCCGAAACTTCTAAAAATCCCATTTTAGACC-CCGAACTTCCCAAAATTCCA
* * *
639 TTTTTAACCTCGAAACTTCTAAAAATTCCATTTTAG-CCCCGTACTTCCCAAAATTCCA
1 TTTTT-ACCCCGAAACTTCTAAAAATCCCATTTTAGACCCCGAACTTCCCAAAATTCCA
* * *
697 TTTTTGACTCCGAAACTTCCTAAAATTACCATTTTA
1 TTTTT-ACCCCGAAACTT-CTAAAAATCCCATTTTA
733 CCCCCGGATG
Statistics
Matches: 226, Mismatches: 33, Indels: 22
0.80 0.12 0.08
Matches are distributed among these distances:
57 5 0.02
58 48 0.21
59 163 0.72
60 10 0.04
ACGTcount: A:0.33, C:0.29, G:0.04, T:0.33
Consensus pattern (58 bp):
TTTTTACCCCGAAACTTCTAAAAATCCCATTTTAGACCCCGAACTTCCCAAAATTCCA
Found at i:722 original size:30 final size:30
Alignment explanation
Indices: 381--731 Score: 249
Period size: 30 Copynumber: 11.9 Consensus size: 30
371 CCCCAAGGGT
* * * *
381 CCCGAAACTTTCAAAAATCCTATTTTTTAC
1 CCCGAAACTTCCTAAAATCCCATTTTTGAC
*
411 CCCCAAACTT-CTAGAAATCCCATTTATT-AC
1 CCCGAAACTTCCTA-AAATCCCATTT-TTGAC
* * *
441 CCCAAAACTTCCAAAAATCCCAATTTT-AC
1 CCCGAAACTTCCTAAAATCCCATTTTTGAC
* * *
470 CCCTAAACTTTCAAAAATCCCATTTTTGAC
1 CCCGAAACTTCCTAAAATCCCATTTTTGAC
*** * * *
500 CTTAAAACCTCCAAAAATTCCATTTTT-ACC
1 CCCGAAACTTCCTAAAATCCCATTTTTGA-C
*
530 CCCG-AACTT-CTAAAAATCCCATTTTTGAT
1 CCCGAAACTTCCT-AAAATCCCATTTTTGAC
* *
559 CTCGAAACTTCCAAAAATCCCATTTTT-AC
1 CCCGAAACTTCCTAAAATCCCATTTTTGAC
*
588 CCCCAAACTT-CTAAAAATCCCATTTTTGAC
1 CCCGAAACTTCCT-AAAATCCCATTTTTGAC
** * * *
618 CTTGGAACTTCC-AAAATTCCATTTTTAAC
1 CCCGAAACTTCCTAAAATCCCATTTTTGAC
* * *
647 CTCGAAACTT-CTAAAAATTCCATTTTAG-C
1 CCCGAAACTTCCT-AAAATCCCATTTTTGAC
* * *
676 CCCG-TACTTCCCAAAATTCCATTTTTGAC
1 CCCGAAACTTCCTAAAATCCCATTTTTGAC
* *
705 TCCGAAACTTCCTAAAATTACCATTTT
1 CCCGAAACTTCCTAAAA-TCCCATTTT
732 ACCCCCGGAT
Statistics
Matches: 257, Mismatches: 46, Indels: 35
0.76 0.14 0.10
Matches are distributed among these distances:
28 21 0.08
29 106 0.41
30 116 0.45
31 14 0.05
ACGTcount: A:0.33, C:0.30, G:0.04, T:0.33
Consensus pattern (30 bp):
CCCGAAACTTCCTAAAATCCCATTTTTGAC
Found at i:756 original size:59 final size:58
Alignment explanation
Indices: 565--785 Score: 170
Period size: 59 Copynumber: 3.8 Consensus size: 58
555 TGATCTCGAA
* * * * **
565 ACTTCCAAAAATCCCATTTTT-ACCCCCAAACTT-CTAAAAATCCCATTTTTGA-CCTTGG
1 ACTTCCAAAAATTCCATTTTTGA-CCCGAAACTTCCTAAAATTACCA-TTTT-ACCCCCGG
* * *
623 AACTTCC-AAAATTCCATTTTTAACCTCGAAACTT-CTAAAAATT-CCATTTTAGCCCCGT
1 -ACTTCCAAAAATTCCATTTTTGACC-CGAAACTTCCT-AAAATTACCATTTTACCCCCGG
*
681 ACTTCCCAAAATTCCATTTTTGACTCCGAAACTTCCTAAAATTACCATTTTACCCCCGG
1 ACTTCCAAAAATTCCATTTTTGAC-CCGAAACTTCCTAAAATTACCATTTTACCCCCGG
* ** *
740 A-TGTCCAAAAAATCCA-TTTTGAACCCCGAATTTTCCCAAAATTACC
1 ACT-TCCAAAAATTCCATTTTTG-A-CCCGAAACTTCCTAAAATTACC
786 GTTTCACTCT
Statistics
Matches: 137, Mismatches: 14, Indels: 22
0.79 0.08 0.13
Matches are distributed among these distances:
57 7 0.05
58 58 0.42
59 66 0.48
60 6 0.04
ACGTcount: A:0.32, C:0.30, G:0.06, T:0.32
Consensus pattern (58 bp):
ACTTCCAAAAATTCCATTTTTGACCCGAAACTTCCTAAAATTACCATTTTACCCCCGG
Found at i:4242 original size:41 final size:39
Alignment explanation
Indices: 4197--4287 Score: 94
Period size: 39 Copynumber: 2.3 Consensus size: 39
4187 AATAGTTTTT
* *
4197 TAACGGCGTTTGGATCGA-AAACGCCGTAAAAAGTAAAGCAA
1 TAACGGCGTTT--ATC-ATAAACGCCGTAAAAAGCAAAACAA
* * *
4238 TAACGGTGTTTTTCATAAACGCCGTAAAAGGCAAAACAA
1 TAACGGCGTTTATCATAAACGCCGTAAAAAGCAAAACAA
*
4277 TAGCGGCGTTT
1 TAACGGCGTTT
4288 TCCCATAATC
Statistics
Matches: 42, Mismatches: 7, Indels: 4
0.79 0.13 0.08
Matches are distributed among these distances:
38 1 0.02
39 31 0.74
41 10 0.24
ACGTcount: A:0.37, C:0.18, G:0.23, T:0.22
Consensus pattern (39 bp):
TAACGGCGTTTATCATAAACGCCGTAAAAAGCAAAACAA
Found at i:4259 original size:39 final size:40
Alignment explanation
Indices: 4215--4327 Score: 111
Period size: 39 Copynumber: 2.9 Consensus size: 40
4205 TTTGGATCGA
* *
4215 AAACGCCGTAAAAAGTAAAGCAATAACGGTGTTTT-TCAT
1 AAACGCCGTAAAAAGTAAAGCAATAACGGCGTTTTCCCAT
* * * *
4254 AAACGCCGTAAAAGGCAAAACAATAGCGGCGTTTTCCCAT
1 AAACGCCGTAAAAAGTAAAGCAATAACGGCGTTTTCCCAT
* * * * **
4294 AATCGTCGCAGAAAGTAAAGCAATAGTGGCGTTT
1 AAACGCCGTAAAAAGTAAAGCAATAACGGCGTTT
4328 ATGAGAAAAA
Statistics
Matches: 59, Mismatches: 14, Indels: 1
0.80 0.19 0.01
Matches are distributed among these distances:
39 30 0.51
40 29 0.49
ACGTcount: A:0.38, C:0.19, G:0.21, T:0.22
Consensus pattern (40 bp):
AAACGCCGTAAAAAGTAAAGCAATAACGGCGTTTTCCCAT
Found at i:4318 original size:40 final size:38
Alignment explanation
Indices: 4215--4369 Score: 103
Period size: 40 Copynumber: 3.9 Consensus size: 38
4205 TTTGGATCGA
* * * *
4215 AAACGCCGTAAAAAGTAAAGCAATAACGGTGTTTTTCAT
1 AAACGCCG-CAAAAGTAAAGCAATAGCGGCGTTTTCCAT
* * *
4254 AAACGCCGTAAAAGGCAAAACAATAGCGGCGTTTTCCCAT
1 AAACGCCGCAAAA-GTAAAGCAATAGCGGCGTTTT-CCAT
* * * ** *
4294 AATCGTCGCAGAAAGTAAAGCAATAGTGGCGTTTATGAGAA
1 AAACGCCGCA-AAAGTAAAGCAATAGCGGCGTTT-T-CCAT
* *
4335 AAACGTCGCAAAAGTTAAGAGCATTAGCGGCGTTT
1 AAACGCCGCAAAAG-TAA-AGCAATAGCGGCGTTT
4370 ATAACAAAAT
Statistics
Matches: 91, Mismatches: 19, Indels: 9
0.76 0.16 0.08
Matches are distributed among these distances:
38 4 0.04
39 25 0.27
40 31 0.34
41 17 0.19
42 14 0.15
ACGTcount: A:0.38, C:0.17, G:0.23, T:0.22
Consensus pattern (38 bp):
AAACGCCGCAAAAGTAAAGCAATAGCGGCGTTTTCCAT
Found at i:4395 original size:41 final size:40
Alignment explanation
Indices: 4299--4489 Score: 140
Period size: 41 Copynumber: 4.7 Consensus size: 40
4289 CCCATAATCG
* * *
4299 TCGCAGAAAGTAA-AGCAATAGTGGCGTTTATGAGA-AAAACG
1 TCGCA-AAAGTAAGAGCATTAGCGGCGTTTAT-A-ACAAAACA
*
4340 TCGCAAAAGTTAAGAGCATTAGCGGCGTTTATAACAAAATA
1 TCGCAAAAG-TAAGAGCATTAGCGGCGTTTATAACAAAACA
* * *
4381 TCGCAAAATGTAAGAGCATTAGCGACG---ATGACAAAACG
1 TCGCAAAA-GTAAGAGCATTAGCGGCGTTTATAACAAAACA
* * * * *
4419 CCGCAAAAGGTAAGAGTATTAGCGGCGTTTATGAGAAAACG
1 TCGCAAAA-GTAAGAGCATTAGCGGCGTTTATAACAAAACA
* * * *
4460 CCACAAAAAATAAGAGCAATAGCGGCGTTT
1 TCGC-AAAAGTAAGAGCATTAGCGGCGTTT
4490 TCCCATAGAC
Statistics
Matches: 125, Mismatches: 17, Indels: 16
0.79 0.11 0.10
Matches are distributed among these distances:
38 31 0.25
40 5 0.04
41 68 0.54
42 21 0.17
ACGTcount: A:0.41, C:0.16, G:0.24, T:0.19
Consensus pattern (40 bp):
TCGCAAAAGTAAGAGCATTAGCGGCGTTTATAACAAAACA
Found at i:4467 original size:79 final size:79
Alignment explanation
Indices: 4334--4483 Score: 192
Period size: 79 Copynumber: 1.9 Consensus size: 79
4324 GTTTATGAGA
* * * * * **
4334 AAAACGTCGCAAAAGTTAAGAGCATTAGCGGCGTTTATAACAAAATATCGCAAAATGTAAGAGCA
1 AAAACGCCGCAAAAGGTAAGAGCATTAGCGGCGTTTATAACAAAACACCACAAAAAATAAGAGCA
*
4399 TTAGCGACGATGAC
66 ATAGCGACGATGAC
* * * *
4413 AAAACGCCGCAAAAGGTAAGAGTATTAGCGGCGTTTATGAGAAAACGCCACAAAAAATAAGAGCA
1 AAAACGCCGCAAAAGGTAAGAGCATTAGCGGCGTTTATAACAAAACACCACAAAAAATAAGAGCA
4478 ATAGCG
66 ATAGCG
4484 GCGTTTTCCC
Statistics
Matches: 59, Mismatches: 12, Indels: 0
0.83 0.17 0.00
Matches are distributed among these distances:
79 59 1.00
ACGTcount: A:0.43, C:0.17, G:0.23, T:0.17
Consensus pattern (79 bp):
AAAACGCCGCAAAAGGTAAGAGCATTAGCGGCGTTTATAACAAAACACCACAAAAAATAAGAGCA
ATAGCGACGATGAC
Found at i:5088 original size:10 final size:10
Alignment explanation
Indices: 5073--5110 Score: 51
Period size: 10 Copynumber: 3.7 Consensus size: 10
5063 GCTCCATGCT
5073 AATTTTTTTG
1 AATTTTTTTG
5083 AATTTTTTAT-
1 AATTTTTT-TG
5093 AATATTTTTTG
1 AAT-TTTTTTG
5104 AATTTTT
1 AATTTTT
5111 ATTTTTATTT
Statistics
Matches: 25, Mismatches: 0, Indels: 6
0.81 0.00 0.19
Matches are distributed among these distances:
10 16 0.64
11 9 0.36
ACGTcount: A:0.26, C:0.00, G:0.05, T:0.68
Consensus pattern (10 bp):
AATTTTTTTG
Found at i:5100 original size:20 final size:21
Alignment explanation
Indices: 5072--5142 Score: 76
Period size: 21 Copynumber: 3.4 Consensus size: 21
5062 TGCTCCATGC
5072 TAAT-TTTTTTGAATTTTTTA
1 TAATATTTTTTGAATTTTTTA
5092 TAATATTTTTTGAA-TTTTTA
1 TAATATTTTTTGAATTTTTTA
** *
5112 TTTTTATTTTTTGACTATTTTTA
1 -TAATATTTTTTGAAT-TTTTTA
5135 TAAT-TTTT
1 TAATATTTT
5143 AAATTATTTA
Statistics
Matches: 42, Mismatches: 5, Indels: 7
0.78 0.09 0.13
Matches are distributed among these distances:
20 10 0.24
21 24 0.57
22 2 0.05
23 6 0.14
ACGTcount: A:0.24, C:0.01, G:0.04, T:0.70
Consensus pattern (21 bp):
TAATATTTTTTGAATTTTTTA
Found at i:5704 original size:34 final size:31
Alignment explanation
Indices: 5643--5705 Score: 83
Period size: 34 Copynumber: 1.9 Consensus size: 31
5633 CCAATAAATA
5643 ATTTAAAAATTATAAAAAAAAATAAATCAAG
1 ATTTAAAAATTATAAAAAAAAATAAATCAAG
5674 ATTTAAAAAAGTTCATAAAAAAATAA-AAATCA
1 ATTT-AAAAA-TT-ATAAAAAAA-AATAAATCA
5706 GCATGAAATA
Statistics
Matches: 28, Mismatches: 0, Indels: 5
0.85 0.00 0.15
Matches are distributed among these distances:
31 4 0.14
32 5 0.18
33 2 0.07
34 15 0.54
35 2 0.07
ACGTcount: A:0.67, C:0.05, G:0.03, T:0.25
Consensus pattern (31 bp):
ATTTAAAAATTATAAAAAAAAATAAATCAAG
Found at i:14031 original size:79 final size:79
Alignment explanation
Indices: 13946--14102 Score: 217
Period size: 79 Copynumber: 2.0 Consensus size: 79
13936 ATTGGACAAC
** * **
13946 GGTACTGATACCGTTGATATACTGAGTCCTCCAAACAGTCCT-TCGATAGAACGACATCGATGAA
1 GGTACTGATACCGTTGATATACCAAGTCCTCCAAACAGTCCTCT-GACAGAACGACATCGACAAA
*
14010 ATAGATGCGGACAGT
65 ATAGACGCGGACAGT
* * *
14025 GGTACTGATACCGTTGATATACCAAGTCCTCCAAATAGTCCTCTGGCAGAATGACATCGACAAAA
1 GGTACTGATACCGTTGATATACCAAGTCCTCCAAACAGTCCTCTGACAGAACGACATCGACAAAA
14090 TAGACGCGGACAG
66 TAGACGCGGACAG
14103 CGAAGCCGCA
Statistics
Matches: 68, Mismatches: 9, Indels: 2
0.86 0.11 0.03
Matches are distributed among these distances:
79 67 0.99
80 1 0.01
ACGTcount: A:0.32, C:0.23, G:0.22, T:0.22
Consensus pattern (79 bp):
GGTACTGATACCGTTGATATACCAAGTCCTCCAAACAGTCCTCTGACAGAACGACATCGACAAAA
TAGACGCGGACAGT
Found at i:17608 original size:41 final size:38
Alignment explanation
Indices: 17563--17666 Score: 109
Period size: 41 Copynumber: 2.6 Consensus size: 38
17553 TAAAAAAATT
17563 AAAGGTAAAGCAATAGCGGCATTTATGAGAAAAACGTCACA
1 AAAGGT-AAGCAATAGCGGCATTTATGAGAAAAACG-C-CA
* * * *
17604 AAAGGTAAGTCAATAGTGGCGTTTATGGGAAAAACGCCT
1 AAAGGTAAG-CAATAGCGGCATTTATGAGAAAAACGCCA
* *
17643 AAAGGTCAAGCAATAACAGCATTT
1 AAAGGT-AAGCAATAGCGGCATTT
17667 TCCCATAAAC
Statistics
Matches: 53, Mismatches: 8, Indels: 6
0.79 0.12 0.09
Matches are distributed among these distances:
39 17 0.32
40 7 0.13
41 29 0.55
ACGTcount: A:0.42, C:0.14, G:0.23, T:0.20
Consensus pattern (38 bp):
AAAGGTAAGCAATAGCGGCATTTATGAGAAAAACGCCA
Found at i:20647 original size:43 final size:43
Alignment explanation
Indices: 20574--20708 Score: 175
Period size: 43 Copynumber: 3.2 Consensus size: 43
20564 TTGTTAATAT
* *
20574 TAGCGGCGTTTGTGGGGAAAA-CGCCACTAAAGATCATGTTTTA
1 TAGCGGCGTTTGT-GGGAAAAGCGCCGCTAAAGATCATGTTCTA
* *
20617 TAGCGGTGTTTGTGGGAAAAGCGCTGCTAAAGATCATGTTCTA
1 TAGCGGCGTTTGTGGGAAAAGCGCCGCTAAAGATCATGTTCTA
* * * *
20660 TAACGGCGTTTGTTGG-AAAGCGCCGCTAAAGGTTATGTTCTA
1 TAGCGGCGTTTGTGGGAAAAGCGCCGCTAAAGATCATGTTCTA
20702 TAGCGGC
1 TAGCGGC
20709 ATTTTTTCGT
Statistics
Matches: 80, Mismatches: 11, Indels: 3
0.85 0.12 0.03
Matches are distributed among these distances:
42 36 0.45
43 44 0.55
ACGTcount: A:0.25, C:0.16, G:0.30, T:0.29
Consensus pattern (43 bp):
TAGCGGCGTTTGTGGGAAAAGCGCCGCTAAAGATCATGTTCTA
Found at i:22780 original size:6 final size:6
Alignment explanation
Indices: 22769--22796 Score: 56
Period size: 6 Copynumber: 4.7 Consensus size: 6
22759 TAAACTCGAA
22769 TTTTAT TTTTAT TTTTAT TTTTAT TTTT
1 TTTTAT TTTTAT TTTTAT TTTTAT TTTT
22797 CCACTCTCGC
Statistics
Matches: 22, Mismatches: 0, Indels: 0
1.00 0.00 0.00
Matches are distributed among these distances:
6 22 1.00
ACGTcount: A:0.14, C:0.00, G:0.00, T:0.86
Consensus pattern (6 bp):
TTTTAT
Found at i:23613 original size:20 final size:19
Alignment explanation
Indices: 23556--23619 Score: 55
Period size: 20 Copynumber: 3.5 Consensus size: 19
23546 AGAATATAAT
*
23556 AGTACAAAAATATATTAAAG
1 AGTATAAAAATATA-TAAAG
*
23576 AGTGT-AAAAT-T-T-AAG
1 AGTATAAAAATATATAAAG
23591 AGTATAAAAATATATGAAAG
1 AGTATAAAAATATAT-AAAG
*
23611 AGTGTAAAA
1 AGTATAAAA
23620 CTACTTAAGT
Statistics
Matches: 35, Mismatches: 4, Indels: 10
0.71 0.08 0.20
Matches are distributed among these distances:
15 7 0.20
16 6 0.17
17 1 0.03
18 2 0.06
19 5 0.14
20 14 0.40
ACGTcount: A:0.56, C:0.02, G:0.16, T:0.27
Consensus pattern (19 bp):
AGTATAAAAATATATAAAG
Found at i:24952 original size:21 final size:21
Alignment explanation
Indices: 24920--24968 Score: 91
Period size: 21 Copynumber: 2.4 Consensus size: 21
24910 AGATATCAAG
24920 TAGGTA-TAAATTATAAAATT
1 TAGGTACTAAATTATAAAATT
24940 TAGGTACTAAATTATAAAATT
1 TAGGTACTAAATTATAAAATT
24961 TAGGTACT
1 TAGGTACT
24969 TAGTACATAT
Statistics
Matches: 28, Mismatches: 0, Indels: 1
0.97 0.00 0.03
Matches are distributed among these distances:
20 6 0.21
21 22 0.79
ACGTcount: A:0.45, C:0.04, G:0.12, T:0.39
Consensus pattern (21 bp):
TAGGTACTAAATTATAAAATT
Done.
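
Note on reading this report programmatically: the following is a minimal Python sketch, not part of the Tandem Repeats Finder distribution. It assumes the line layout seen above ("Indices: ... Score: ...", "Period size: ... Copynumber: ... Consensus size: ...", "Matches: ..., Mismatches: ..., Indels: ..."). The names Repeat, parse_report and fractions are hypothetical and chosen for illustration only. The fractions() helper reflects an observation about the records above rather than a documented rule: the three numbers printed under each Statistics line appear to be each count divided by the sum of the three counts, rounded to two decimals.

```python
import re
from dataclasses import dataclass

# Patterns assume the report layout shown above (an assumption, not a spec).
INDICES = re.compile(r"Indices:\s*(\d+)--(\d+)\s+Score:\s*(\d+)")
PERIOD = re.compile(
    r"Period size:\s*(\d+)\s+Copynumber:\s*([\d.]+)\s+Consensus size:\s*(\d+)"
)
STATS = re.compile(r"Matches:\s*(\d+),\s*Mismatches:\s*(\d+),\s*Indels:\s*(\d+)")


@dataclass
class Repeat:
    """One repeat record; field names are hypothetical."""
    start: int
    end: int
    score: int
    period: int
    copies: float
    consensus_size: int
    matches: int = 0
    mismatches: int = 0
    indels: int = 0

    def fractions(self):
        """Matches, mismatches and indels as shares of their sum (2 decimals)."""
        total = self.matches + self.mismatches + self.indels
        if total == 0:
            return (0.0, 0.0, 0.0)
        counts = (self.matches, self.mismatches, self.indels)
        return tuple(round(n / total, 2) for n in counts)


def parse_report(text):
    """Collect one Repeat per 'Indices' / 'Period size' pair found in text."""
    repeats = []
    span = None  # (start, end, score) waiting for its 'Period size' line
    for raw in text.splitlines():
        line = raw.strip()
        if m := INDICES.match(line):
            span = tuple(int(g) for g in m.groups())
        elif (m := PERIOD.match(line)) and span is not None:
            repeats.append(Repeat(*span, int(m[1]), float(m[2]), int(m[3])))
            span = None
        elif (m := STATS.match(line)) and repeats:
            r = repeats[-1]
            r.matches, r.mismatches, r.indels = (int(g) for g in m.groups())
    return repeats


# Usage example on the header and statistics lines of the first record above.
SAMPLE = """\
Found at i:479 original size:59 final size:59
Alignment explanation
Indices: 385--672 Score: 348
Period size: 59 Copynumber: 4.9 Consensus size: 59
Statistics
Matches: 199, Mismatches: 25, Indels: 9
0.85 0.11 0.04
"""

first = parse_report(SAMPLE)[0]
print(first.start, first.end, first.period, first.copies, first.fractions())
```

The sketch skips the alignment rows and consensus sequences on purpose; keeping only the numeric summary makes it easy to compare the recomputed fractions against the values printed in the report.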