Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: NTFQ01012634.1 Kokia drynarioides strain JFW-HI SEQ_127643, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 29803
ACGTcount: A:0.34, C:0.16, G:0.16, T:0.34

Warning! 14 characters in sequence are not A, C, G, or T


Found at i:479 original size:59 final size:59

Alignment explanation

Indices: 385--672 Score: 348 Period size: 59 Copynumber: 4.9 Consensus size: 59 375 AAGGGTCCCG * * * * 385 AAACTTTCAAAAATCCTATTTTTTACCCCCAAACTTCTAGAAATCCCATTTATT-ACCCCA 1 AAACTTCCAAAAATCCCA-TTTTTACCCCCAAACTTCTAAAAATCCCATTT-TTGACCTCA * * * 445 AAACTTCCAAAAATCCCAATTTTACCCCTAAACTT-TCAAAAATCCCATTTTTGACCTTA 1 AAACTTCCAAAAATCCCATTTTTACCCCCAAACTTCT-AAAAATCCCATTTTTGACCTCA * * * * * 504 AAACCTCCAAAAATTCCATTTTTACCCCCGAACTTCTAAAAATCCCATTTTTGATCTCG 1 AAACTTCCAAAAATCCCATTTTTACCCCCAAACTTCTAAAAATCCCATTTTTGACCTCA ** 563 AAACTTCCAAAAATCCCATTTTTACCCCCAAACTTCTAAAAATCCCATTTTTGACCTTG 1 AAACTTCCAAAAATCCCATTTTTACCCCCAAACTTCTAAAAATCCCATTTTTGACCTCA * * * * * 622 GAACTTCC-AAAATTCCATTTTTAACCTCGAAACTTCTAAAAATTCCATTTT 1 AAACTTCCAAAAATCCCATTTTT-ACCCCCAAACTTCTAAAAATCCCATTTT 673 AGCCCCGTAC Statistics Matches: 199, Mismatches: 25, Indels: 9 0.85 0.11 0.04 Matches are distributed among these distances: 58 16 0.08 59 166 0.83 60 17 0.09 ACGTcount: A:0.35, C:0.29, G:0.03, T:0.33 Consensus pattern (59 bp): AAACTTCCAAAAATCCCATTTTTACCCCCAAACTTCTAAAAATCCCATTTTTGACCTCA Found at i:720 original size:59 final size:58 Alignment explanation

Indices: 404--732 Score: 272 Period size: 59 Copynumber: 5.6 Consensus size: 58 394 AAAATCCTAT * * * * * * 404 TTTTTACCCCCAAACTTCTAGAAATCCCA-TTTATTACCCCAAAACTTCCAAAAATCCCA 1 TTTTTACCCCGAAACTTCTAAAAATCCCATTTTA-GACCCC-GAACTTCCCAAAATTCCA * * * *** * * 463 ATTTTACCCCTAAACTT-TCAAAAATCCCATTTTTGACCTTAAAACCTCCAAAAATTCCA 1 TTTTTACCCCGAAACTTCT-AAAAATCCCATTTTAGACC-CCGAACTTCCCAAAATTCCA * * * * * 522 TTTTTACCCCCG-AACTTCTAAAAATCCCATTTTTGATCTCGAAACTTCCAAAAATCCCA 1 TTTTTA-CCCCGAAACTTCTAAAAATCCCATTTTAGACCCCG-AACTTCCCAAAATTCCA * * ** 581 TTTTTACCCCCAAACTTCTAAAAATCCCATTTTTGACCTTGGAACTT-CCAAAATTCCA 1 TTTTTACCCCGAAACTTCTAAAAATCCCATTTTAGACC-CCGAACTTCCCAAAATTCCA * * * 639 TTTTTAACCTCGAAACTTCTAAAAATTCCATTTTAG-CCCCGTACTTCCCAAAATTCCA 1 TTTTT-ACCCCGAAACTTCTAAAAATCCCATTTTAGACCCCGAACTTCCCAAAATTCCA * * * 697 TTTTTGACTCCGAAACTTCCTAAAATTACCATTTTA 1 TTTTT-ACCCCGAAACTT-CTAAAAATCCCATTTTA 733 CCCCCGGATG Statistics Matches: 226, Mismatches: 33, Indels: 22 0.80 0.12 0.08 Matches are distributed among these distances: 57 5 0.02 58 48 0.21 59 163 0.72 60 10 0.04 ACGTcount: A:0.33, C:0.29, G:0.04, T:0.33 Consensus pattern (58 bp): TTTTTACCCCGAAACTTCTAAAAATCCCATTTTAGACCCCGAACTTCCCAAAATTCCA Found at i:722 original size:30 final size:30 Alignment explanation

Indices: 381--731 Score: 249 Period size: 30 Copynumber: 11.9 Consensus size: 30 371 CCCCAAGGGT * * * * 381 CCCGAAACTTTCAAAAATCCTATTTTTTAC 1 CCCGAAACTTCCTAAAATCCCATTTTTGAC * 411 CCCCAAACTT-CTAGAAATCCCATTTATT-AC 1 CCCGAAACTTCCTA-AAATCCCATTT-TTGAC * * * 441 CCCAAAACTTCCAAAAATCCCAATTTT-AC 1 CCCGAAACTTCCTAAAATCCCATTTTTGAC * * * 470 CCCTAAACTTTCAAAAATCCCATTTTTGAC 1 CCCGAAACTTCCTAAAATCCCATTTTTGAC *** * * * 500 CTTAAAACCTCCAAAAATTCCATTTTT-ACC 1 CCCGAAACTTCCTAAAATCCCATTTTTGA-C * 530 CCCG-AACTT-CTAAAAATCCCATTTTTGAT 1 CCCGAAACTTCCT-AAAATCCCATTTTTGAC * * 559 CTCGAAACTTCCAAAAATCCCATTTTT-AC 1 CCCGAAACTTCCTAAAATCCCATTTTTGAC * 588 CCCCAAACTT-CTAAAAATCCCATTTTTGAC 1 CCCGAAACTTCCT-AAAATCCCATTTTTGAC ** * * * 618 CTTGGAACTTCC-AAAATTCCATTTTTAAC 1 CCCGAAACTTCCTAAAATCCCATTTTTGAC * * * 647 CTCGAAACTT-CTAAAAATTCCATTTTAG-C 1 CCCGAAACTTCCT-AAAATCCCATTTTTGAC * * * 676 CCCG-TACTTCCCAAAATTCCATTTTTGAC 1 CCCGAAACTTCCTAAAATCCCATTTTTGAC * * 705 TCCGAAACTTCCTAAAATTACCATTTT 1 CCCGAAACTTCCTAAAA-TCCCATTTT 732 ACCCCCGGAT Statistics Matches: 257, Mismatches: 46, Indels: 35 0.76 0.14 0.10 Matches are distributed among these distances: 28 21 0.08 29 106 0.41 30 116 0.45 31 14 0.05 ACGTcount: A:0.33, C:0.30, G:0.04, T:0.33 Consensus pattern (30 bp): CCCGAAACTTCCTAAAATCCCATTTTTGAC Found at i:756 original size:59 final size:58 Alignment explanation

Indices: 565--785 Score: 170 Period size: 59 Copynumber: 3.8 Consensus size: 58 555 TGATCTCGAA * * * * ** 565 ACTTCCAAAAATCCCATTTTT-ACCCCCAAACTT-CTAAAAATCCCATTTTTGA-CCTTGG 1 ACTTCCAAAAATTCCATTTTTGA-CCCGAAACTTCCTAAAATTACCA-TTTT-ACCCCCGG * * * 623 AACTTCC-AAAATTCCATTTTTAACCTCGAAACTT-CTAAAAATT-CCATTTTAGCCCCGT 1 -ACTTCCAAAAATTCCATTTTTGACC-CGAAACTTCCT-AAAATTACCATTTTACCCCCGG * 681 ACTTCCCAAAATTCCATTTTTGACTCCGAAACTTCCTAAAATTACCATTTTACCCCCGG 1 ACTTCCAAAAATTCCATTTTTGAC-CCGAAACTTCCTAAAATTACCATTTTACCCCCGG * ** * 740 A-TGTCCAAAAAATCCA-TTTTGAACCCCGAATTTTCCCAAAATTACC 1 ACT-TCCAAAAATTCCATTTTTG-A-CCCGAAACTTCCTAAAATTACC 786 GTTTCACTCT Statistics Matches: 137, Mismatches: 14, Indels: 22 0.79 0.08 0.13 Matches are distributed among these distances: 57 7 0.05 58 58 0.42 59 66 0.48 60 6 0.04 ACGTcount: A:0.32, C:0.30, G:0.06, T:0.32 Consensus pattern (58 bp): ACTTCCAAAAATTCCATTTTTGACCCGAAACTTCCTAAAATTACCATTTTACCCCCGG Found at i:4242 original size:41 final size:39 Alignment explanation

Indices: 4197--4287 Score: 94 Period size: 39 Copynumber: 2.3 Consensus size: 39 4187 AATAGTTTTT * * 4197 TAACGGCGTTTGGATCGA-AAACGCCGTAAAAAGTAAAGCAA 1 TAACGGCGTTT--ATC-ATAAACGCCGTAAAAAGCAAAACAA * * * 4238 TAACGGTGTTTTTCATAAACGCCGTAAAAGGCAAAACAA 1 TAACGGCGTTTATCATAAACGCCGTAAAAAGCAAAACAA * 4277 TAGCGGCGTTT 1 TAACGGCGTTT 4288 TCCCATAATC Statistics Matches: 42, Mismatches: 7, Indels: 4 0.79 0.13 0.08 Matches are distributed among these distances: 38 1 0.02 39 31 0.74 41 10 0.24 ACGTcount: A:0.37, C:0.18, G:0.23, T:0.22 Consensus pattern (39 bp): TAACGGCGTTTATCATAAACGCCGTAAAAAGCAAAACAA Found at i:4259 original size:39 final size:40 Alignment explanation

Indices: 4215--4327 Score: 111 Period size: 39 Copynumber: 2.9 Consensus size: 40 4205 TTTGGATCGA * * 4215 AAACGCCGTAAAAAGTAAAGCAATAACGGTGTTTT-TCAT 1 AAACGCCGTAAAAAGTAAAGCAATAACGGCGTTTTCCCAT * * * * 4254 AAACGCCGTAAAAGGCAAAACAATAGCGGCGTTTTCCCAT 1 AAACGCCGTAAAAAGTAAAGCAATAACGGCGTTTTCCCAT * * * * ** 4294 AATCGTCGCAGAAAGTAAAGCAATAGTGGCGTTT 1 AAACGCCGTAAAAAGTAAAGCAATAACGGCGTTT 4328 ATGAGAAAAA Statistics Matches: 59, Mismatches: 14, Indels: 1 0.80 0.19 0.01 Matches are distributed among these distances: 39 30 0.51 40 29 0.49 ACGTcount: A:0.38, C:0.19, G:0.21, T:0.22 Consensus pattern (40 bp): AAACGCCGTAAAAAGTAAAGCAATAACGGCGTTTTCCCAT Found at i:4318 original size:40 final size:38 Alignment explanation

Indices: 4215--4369 Score: 103 Period size: 40 Copynumber: 3.9 Consensus size: 38 4205 TTTGGATCGA * * * * 4215 AAACGCCGTAAAAAGTAAAGCAATAACGGTGTTTTTCAT 1 AAACGCCG-CAAAAGTAAAGCAATAGCGGCGTTTTCCAT * * * 4254 AAACGCCGTAAAAGGCAAAACAATAGCGGCGTTTTCCCAT 1 AAACGCCGCAAAA-GTAAAGCAATAGCGGCGTTTT-CCAT * * * ** * 4294 AATCGTCGCAGAAAGTAAAGCAATAGTGGCGTTTATGAGAA 1 AAACGCCGCA-AAAGTAAAGCAATAGCGGCGTTT-T-CCAT * * 4335 AAACGTCGCAAAAGTTAAGAGCATTAGCGGCGTTT 1 AAACGCCGCAAAAG-TAA-AGCAATAGCGGCGTTT 4370 ATAACAAAAT Statistics Matches: 91, Mismatches: 19, Indels: 9 0.76 0.16 0.08 Matches are distributed among these distances: 38 4 0.04 39 25 0.27 40 31 0.34 41 17 0.19 42 14 0.15 ACGTcount: A:0.38, C:0.17, G:0.23, T:0.22 Consensus pattern (38 bp): AAACGCCGCAAAAGTAAAGCAATAGCGGCGTTTTCCAT Found at i:4395 original size:41 final size:40 Alignment explanation

Indices: 4299--4489 Score: 140 Period size: 41 Copynumber: 4.7 Consensus size: 40 4289 CCCATAATCG * * * 4299 TCGCAGAAAGTAA-AGCAATAGTGGCGTTTATGAGA-AAAACG 1 TCGCA-AAAGTAAGAGCATTAGCGGCGTTTAT-A-ACAAAACA * 4340 TCGCAAAAGTTAAGAGCATTAGCGGCGTTTATAACAAAATA 1 TCGCAAAAG-TAAGAGCATTAGCGGCGTTTATAACAAAACA * * * 4381 TCGCAAAATGTAAGAGCATTAGCGACG---ATGACAAAACG 1 TCGCAAAA-GTAAGAGCATTAGCGGCGTTTATAACAAAACA * * * * * 4419 CCGCAAAAGGTAAGAGTATTAGCGGCGTTTATGAGAAAACG 1 TCGCAAAA-GTAAGAGCATTAGCGGCGTTTATAACAAAACA * * * * 4460 CCACAAAAAATAAGAGCAATAGCGGCGTTT 1 TCGC-AAAAGTAAGAGCATTAGCGGCGTTT 4490 TCCCATAGAC Statistics Matches: 125, Mismatches: 17, Indels: 16 0.79 0.11 0.10 Matches are distributed among these distances: 38 31 0.25 40 5 0.04 41 68 0.54 42 21 0.17 ACGTcount: A:0.41, C:0.16, G:0.24, T:0.19 Consensus pattern (40 bp): TCGCAAAAGTAAGAGCATTAGCGGCGTTTATAACAAAACA Found at i:4467 original size:79 final size:79 Alignment explanation

Indices: 4334--4483 Score: 192 Period size: 79 Copynumber: 1.9 Consensus size: 79 4324 GTTTATGAGA * * * * * ** 4334 AAAACGTCGCAAAAGTTAAGAGCATTAGCGGCGTTTATAACAAAATATCGCAAAATGTAAGAGCA 1 AAAACGCCGCAAAAGGTAAGAGCATTAGCGGCGTTTATAACAAAACACCACAAAAAATAAGAGCA * 4399 TTAGCGACGATGAC 66 ATAGCGACGATGAC * * * * 4413 AAAACGCCGCAAAAGGTAAGAGTATTAGCGGCGTTTATGAGAAAACGCCACAAAAAATAAGAGCA 1 AAAACGCCGCAAAAGGTAAGAGCATTAGCGGCGTTTATAACAAAACACCACAAAAAATAAGAGCA 4478 ATAGCG 66 ATAGCG 4484 GCGTTTTCCC Statistics Matches: 59, Mismatches: 12, Indels: 0 0.83 0.17 0.00 Matches are distributed among these distances: 79 59 1.00 ACGTcount: A:0.43, C:0.17, G:0.23, T:0.17 Consensus pattern (79 bp): AAAACGCCGCAAAAGGTAAGAGCATTAGCGGCGTTTATAACAAAACACCACAAAAAATAAGAGCA ATAGCGACGATGAC Found at i:5088 original size:10 final size:10 Alignment explanation

Indices: 5073--5110 Score: 51 Period size: 10 Copynumber: 3.7 Consensus size: 10 5063 GCTCCATGCT 5073 AATTTTTTTG 1 AATTTTTTTG 5083 AATTTTTTAT- 1 AATTTTTT-TG 5093 AATATTTTTTG 1 AAT-TTTTTTG 5104 AATTTTT 1 AATTTTT 5111 ATTTTTATTT Statistics Matches: 25, Mismatches: 0, Indels: 6 0.81 0.00 0.19 Matches are distributed among these distances: 10 16 0.64 11 9 0.36 ACGTcount: A:0.26, C:0.00, G:0.05, T:0.68 Consensus pattern (10 bp): AATTTTTTTG Found at i:5100 original size:20 final size:21 Alignment explanation

Indices: 5072--5142 Score: 76 Period size: 21 Copynumber: 3.4 Consensus size: 21 5062 TGCTCCATGC 5072 TAAT-TTTTTTGAATTTTTTA 1 TAATATTTTTTGAATTTTTTA 5092 TAATATTTTTTGAA-TTTTTA 1 TAATATTTTTTGAATTTTTTA ** * 5112 TTTTTATTTTTTGACTATTTTTA 1 -TAATATTTTTTGAAT-TTTTTA 5135 TAAT-TTTT 1 TAATATTTT 5143 AAATTATTTA Statistics Matches: 42, Mismatches: 5, Indels: 7 0.78 0.09 0.13 Matches are distributed among these distances: 20 10 0.24 21 24 0.57 22 2 0.05 23 6 0.14 ACGTcount: A:0.24, C:0.01, G:0.04, T:0.70 Consensus pattern (21 bp): TAATATTTTTTGAATTTTTTA Found at i:5704 original size:34 final size:31 Alignment explanation

Indices: 5643--5705 Score: 83 Period size: 34 Copynumber: 1.9 Consensus size: 31 5633 CCAATAAATA 5643 ATTTAAAAATTATAAAAAAAAATAAATCAAG 1 ATTTAAAAATTATAAAAAAAAATAAATCAAG 5674 ATTTAAAAAAGTTCATAAAAAAATAA-AAATCA 1 ATTT-AAAAA-TT-ATAAAAAAA-AATAAATCA 5706 GCATGAAATA Statistics Matches: 28, Mismatches: 0, Indels: 5 0.85 0.00 0.15 Matches are distributed among these distances: 31 4 0.14 32 5 0.18 33 2 0.07 34 15 0.54 35 2 0.07 ACGTcount: A:0.67, C:0.05, G:0.03, T:0.25 Consensus pattern (31 bp): ATTTAAAAATTATAAAAAAAAATAAATCAAG Found at i:14031 original size:79 final size:79 Alignment explanation

Indices: 13946--14102 Score: 217 Period size: 79 Copynumber: 2.0 Consensus size: 79 13936 ATTGGACAAC ** * ** 13946 GGTACTGATACCGTTGATATACTGAGTCCTCCAAACAGTCCT-TCGATAGAACGACATCGATGAA 1 GGTACTGATACCGTTGATATACCAAGTCCTCCAAACAGTCCTCT-GACAGAACGACATCGACAAA * 14010 ATAGATGCGGACAGT 65 ATAGACGCGGACAGT * * * 14025 GGTACTGATACCGTTGATATACCAAGTCCTCCAAATAGTCCTCTGGCAGAATGACATCGACAAAA 1 GGTACTGATACCGTTGATATACCAAGTCCTCCAAACAGTCCTCTGACAGAACGACATCGACAAAA 14090 TAGACGCGGACAG 66 TAGACGCGGACAG 14103 CGAAGCCGCA Statistics Matches: 68, Mismatches: 9, Indels: 2 0.86 0.11 0.03 Matches are distributed among these distances: 79 67 0.99 80 1 0.01 ACGTcount: A:0.32, C:0.23, G:0.22, T:0.22 Consensus pattern (79 bp): GGTACTGATACCGTTGATATACCAAGTCCTCCAAACAGTCCTCTGACAGAACGACATCGACAAAA TAGACGCGGACAGT Found at i:17608 original size:41 final size:38 Alignment explanation

Indices: 17563--17666 Score: 109 Period size: 41 Copynumber: 2.6 Consensus size: 38 17553 TAAAAAAATT 17563 AAAGGTAAAGCAATAGCGGCATTTATGAGAAAAACGTCACA 1 AAAGGT-AAGCAATAGCGGCATTTATGAGAAAAACG-C-CA * * * * 17604 AAAGGTAAGTCAATAGTGGCGTTTATGGGAAAAACGCCT 1 AAAGGTAAG-CAATAGCGGCATTTATGAGAAAAACGCCA * * 17643 AAAGGTCAAGCAATAACAGCATTT 1 AAAGGT-AAGCAATAGCGGCATTT 17667 TCCCATAAAC Statistics Matches: 53, Mismatches: 8, Indels: 6 0.79 0.12 0.09 Matches are distributed among these distances: 39 17 0.32 40 7 0.13 41 29 0.55 ACGTcount: A:0.42, C:0.14, G:0.23, T:0.20 Consensus pattern (38 bp): AAAGGTAAGCAATAGCGGCATTTATGAGAAAAACGCCA Found at i:20647 original size:43 final size:43 Alignment explanation

Indices: 20574--20708 Score: 175 Period size: 43 Copynumber: 3.2 Consensus size: 43 20564 TTGTTAATAT * * 20574 TAGCGGCGTTTGTGGGGAAAA-CGCCACTAAAGATCATGTTTTA 1 TAGCGGCGTTTGT-GGGAAAAGCGCCGCTAAAGATCATGTTCTA * * 20617 TAGCGGTGTTTGTGGGAAAAGCGCTGCTAAAGATCATGTTCTA 1 TAGCGGCGTTTGTGGGAAAAGCGCCGCTAAAGATCATGTTCTA * * * * 20660 TAACGGCGTTTGTTGG-AAAGCGCCGCTAAAGGTTATGTTCTA 1 TAGCGGCGTTTGTGGGAAAAGCGCCGCTAAAGATCATGTTCTA 20702 TAGCGGC 1 TAGCGGC 20709 ATTTTTTCGT Statistics Matches: 80, Mismatches: 11, Indels: 3 0.85 0.12 0.03 Matches are distributed among these distances: 42 36 0.45 43 44 0.55 ACGTcount: A:0.25, C:0.16, G:0.30, T:0.29 Consensus pattern (43 bp): TAGCGGCGTTTGTGGGAAAAGCGCCGCTAAAGATCATGTTCTA Found at i:22780 original size:6 final size:6 Alignment explanation

Indices: 22769--22796 Score: 56 Period size: 6 Copynumber: 4.7 Consensus size: 6 22759 TAAACTCGAA 22769 TTTTAT TTTTAT TTTTAT TTTTAT TTTT 1 TTTTAT TTTTAT TTTTAT TTTTAT TTTT 22797 CCACTCTCGC Statistics Matches: 22, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 6 22 1.00 ACGTcount: A:0.14, C:0.00, G:0.00, T:0.86 Consensus pattern (6 bp): TTTTAT Found at i:23613 original size:20 final size:19 Alignment explanation

Indices: 23556--23619 Score: 55 Period size: 20 Copynumber: 3.5 Consensus size: 19 23546 AGAATATAAT * 23556 AGTACAAAAATATATTAAAG 1 AGTATAAAAATATA-TAAAG * 23576 AGTGT-AAAAT-T-T-AAG 1 AGTATAAAAATATATAAAG 23591 AGTATAAAAATATATGAAAG 1 AGTATAAAAATATAT-AAAG * 23611 AGTGTAAAA 1 AGTATAAAA 23620 CTACTTAAGT Statistics Matches: 35, Mismatches: 4, Indels: 10 0.71 0.08 0.20 Matches are distributed among these distances: 15 7 0.20 16 6 0.17 17 1 0.03 18 2 0.06 19 5 0.14 20 14 0.40 ACGTcount: A:0.56, C:0.02, G:0.16, T:0.27 Consensus pattern (19 bp): AGTATAAAAATATATAAAG Found at i:24952 original size:21 final size:21 Alignment explanation

Indices: 24920--24968 Score: 91 Period size: 21 Copynumber: 2.4 Consensus size: 21 24910 AGATATCAAG 24920 TAGGTA-TAAATTATAAAATT 1 TAGGTACTAAATTATAAAATT 24940 TAGGTACTAAATTATAAAATT 1 TAGGTACTAAATTATAAAATT 24961 TAGGTACT 1 TAGGTACT 24969 TAGTACATAT Statistics Matches: 28, Mismatches: 0, Indels: 1 0.97 0.00 0.03 Matches are distributed among these distances: 20 6 0.21 21 22 0.79 ACGTcount: A:0.45, C:0.04, G:0.12, T:0.39 Consensus pattern (21 bp): TAGGTACTAAATTATAAAATT Done.