Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: NTFQ01013911.1 Kokia drynarioides strain JFW-HI SEQ_128941, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 12103
ACGTcount: A:0.34, C:0.19, G:0.19, T:0.27

Warning! 50 characters in sequence are not A, C, G, or T


Found at i:751 original size:19 final size:19

Alignment explanation

Indices: 722--767  Score: 83
Period size: 19  Copynumber: 2.4  Consensus size: 19

       712 TTATTTTGCT

       722 CTTTAGACTTTCATTTCATC
         1 CTTT-GACTTTCATTTCATC

       742 CTTTGACTTTCATTTCATC
         1 CTTTGACTTTCATTTCATC

       761 CTTTGAC
         1 CTTTGAC

       768 AGATCCCCAA

Statistics
Matches: 26, Mismatches: 0, Indels: 1
   0.96          0.00        0.04

Matches are distributed among these distances:
   19     22  0.85
   20      4  0.15

ACGTcount: A:0.17, C:0.26, G:0.07, T:0.50

Consensus pattern (19 bp):
CTTTGACTTTCATTTCATC


Found at i:4596 original size:24 final size:24

Alignment explanation

Indices: 4546--4596  Score: 84
Period size: 24  Copynumber: 2.1  Consensus size: 24

      4536 TGTATACCAA

                                **
      4546 TTATTGTTTCCTTTGATCCTCTTT
         1 TTATTGTTTCCTTTGATCCTCCCT

      4570 TTATTGTTTCCTTTGATCCTCCCT
         1 TTATTGTTTCCTTTGATCCTCCCT

      4594 TTA
         1 TTA

      4597 ATAGAATTTT

Statistics
Matches: 25, Mismatches: 2, Indels: 0
   0.93          0.07        0.00

Matches are distributed among these distances:
   24     25  1.00

ACGTcount: A:0.10, C:0.24, G:0.08, T:0.59

Consensus pattern (24 bp):
TTATTGTTTCCTTTGATCCTCCCT


Found at i:5465 original size:211 final size:210

Alignment explanation

Indices: 5178--5854  Score: 876
Period size: 211  Copynumber: 3.2  Consensus size: 210

      5168 CCGGCTTCAC

                     *               *                               *
      5178 GATGAGACACCGAGAAGCAGGTCGAAACAATAAAAGGTCAGCTTCCTGATGAGATACTAAGAAGT
         1 GATGAGACACTGAGAAGCAGGTCGAAGCAATAAAAGGTCAGCTTCCTGATGAGATACTGAGAAGT

               *        *     *
      5243 AAACCAAATTCGTTTTCCTAATGAGATACGGAGAAGCGAATTAAAACAAACGATGCGGTCATCTT
        66 -AACTAAATTCGTCTTCCTGATGAGATACGGAGAAGCGAATTAAAACAAACGATGCGGTCATCTT

                *                *
      5308 CCTGATGAGATACTAAGAAGAATACCAAATCAAACCCAAACGAGGCTCGAAACGAGCAAAATCTT
       130 CCTGACGAGATACTAAGAAGAAGACCAAATCAAACCCAAACGAGGCTCGAAACGAGCAAAATCTT

                   *
      5373 TGAACCCCGGCTTCCT
       195 TGAACCCCAGCTTCCT

                           *
      5389 GATGAGACACTGAGAAACAGGTCGAAGCAATAAAAGGTCAGCTTCCTGATGAGATACTGAGAAGT
         1 GATGAGACACTGAGAAGCAGGTCGAAGCAATAAAAGGTCAGCTTCCTGATGAGATACTGAGAAGT

            *                            *
      5454 GTACTAAATTCGTCTTCCTGATGAGATACGAAGAAGCGAATTAAAACAAACGATGCGGTCATCTT
        66 -AACTAAATTCGTCTTCCTGATGAGATACGGAGAAGCGAATTAAAACAAACGATGCGGTCATCTT

      5519 CCTGACGAGATAC-AGAGAAGAAGACCAAATCAAACCCAAACGAGGCTCGAAACGAGCAAAATCT
       130 CCTGACGAGATACTA-AGAAGAAGACCAAATCAAACCCAAACGAGGCTCGAAACGAGCAAAATCT

                   *     *
      5583 TTGAACCCTAGCTTTCT
       194 TTGAACCCCAGCTTCCT

                      *          *               *
      5600 GATGAGACACTAAGAAGCAGGTTGAAGCAATAAAAGGTTAGCTTCCTGATGAGATACTGAGAAGT
         1 GATGAGACACTGAGAAGCAGGTCGAAGCAATAAAAGGTCAGCTTCCTGATGAGATACTGAGAAGT

             *                      *   *            *         *   * * *
      5665 AAATCAAATTCGTCTTCCTGATGAGGTACAGAGAAGCGAATTGAAACGAGCAGCGACGTGATCAT
        66 AACT-AAATTCGTCTTCCTGATGAGATACGGAGAAGCGAATTAAAAC-A--AACGATGCGGTCAT

                    *   **  *                                      *
      5730 CTTCCTGACAAGACGCTGAGAAGAAGA-C---TCAAA--C--A--AGGCTCGAAACCAGCAAAAT
       127 CTTCCTGACGAGATACTAAGAAGAAGACCAAATCAAACCCAAACGAGGCTCGAAACGAGCAAAAT

              *
      5785 CTTCGAACCCCAGCTTCCT
       192 CTTTGAACCCCAGCTTCCT

           *            *                           *     *
      5804 CATGAGACA-TGGGGAAGCAGGTCGAAGCAATAAAA-GTCATCTTCCCGATGA
         1 GATGAGACACT-GAGAAGCAGGTCGAAGCAATAAAAGGTCAGCTTCCTGATGA

      5855 AAATACCGAG

Statistics
Matches: 415, Mismatches: 44, Indels: 22
   0.86          0.09        0.05

Matches are distributed among these distances:
  203     14  0.03
  204     64  0.15
  206      1  0.00
  208      1  0.00
  210      8  0.02
  211    293  0.71
  212      1  0.00
  213      1  0.00
  214     32  0.08

ACGTcount: A:0.38, C:0.21, G:0.22, T:0.19

Consensus pattern (210 bp):
GATGAGACACTGAGAAGCAGGTCGAAGCAATAAAAGGTCAGCTTCCTGATGAGATACTGAGAAGT
AACTAAATTCGTCTTCCTGATGAGATACGGAGAAGCGAATTAAAACAAACGATGCGGTCATCTTC
CTGACGAGATACTAAGAAGAAGACCAAATCAAACCCAAACGAGGCTCGAAACGAGCAAAATCTTT
GAACCCCAGCTTCCT


Found at i:6380 original size:17 final size:17

Alignment explanation

Indices: 6355--6432  Score: 61
Period size: 17  Copynumber: 4.5  Consensus size: 17

      6345 CCCAATCAAC

                      *
      6355 TTAAATTTATTTTAAAA
         1 TTAAATTTATTCTAAAA

               *           *
      6372 TTAAGTTTATTCTAAAT
         1 TTAAATTTATTCTAAAA

                    *   *   *
      6389 TTAAATTTGGTT-GAAAT
         1 TTAAATTT-ATTCTAAAA

      6406 TTAAATTTATT-TATAAA
         1 TTAAATTTATTCTA-AAA

      6423 TTTAAATTTA
         1 -TTAAATTTA

      6433 AAATTTATTT

Statistics
Matches: 49, Mismatches: 9, Indels: 5
   0.78          0.14        0.08

Matches are distributed among these distances:
   16      3  0.06
   17     35  0.71
   18     11  0.22

ACGTcount: A:0.41, C:0.01, G:0.05, T:0.53

Consensus pattern (17 bp):
TTAAATTTATTCTAAAA


Found at i:7124 original size:13 final size:13

Alignment explanation

Indices: 7106--7138  Score: 59
Period size: 12  Copynumber: 2.6  Consensus size: 13

      7096 TGAGATTTAT

      7106 TATTATTAAAAAA
         1 TATTATTAAAAAA

      7119 TATTATT-AAAAA
         1 TATTATTAAAAAA

      7131 TATTATTA
         1 TATTATTA

      7139 TTAATTAATA

Statistics
Matches: 19, Mismatches: 0, Indels: 2
   0.90          0.00        0.10

Matches are distributed among these distances:
   12     12  0.63
   13      7  0.37

ACGTcount: A:0.55, C:0.00, G:0.00, T:0.45

Consensus pattern (13 bp):
TATTATTAAAAAA


Found at i:7157 original size:28 final size:28

Alignment explanation

Indices: 7103--7157  Score: 65
Period size: 28  Copynumber: 2.0  Consensus size: 28

      7093 AAATGAGATT

                             *  **
      7103 TATTATTATTAAAAAATATTATTAAAAA
         1 TATTATTATTAAAAAATAATAAAAAAAA

                       **
      7131 TATTATTATTAATTAATAATAAAAAAA
         1 TATTATTATTAAAAAATAATAAAAAAA

      7158 CAAAGAGTCG

Statistics
Matches: 22, Mismatches: 5, Indels: 0
   0.81          0.19        0.00

Matches are distributed among these distances:
   28     22  1.00

ACGTcount: A:0.58, C:0.00, G:0.00, T:0.42

Consensus pattern (28 bp):
TATTATTATTAAAAAATAATAAAAAAAA


Found at i:7767 original size:12 final size:12

Alignment explanation

Indices: 7750--7774  Score: 50
Period size: 12  Copynumber: 2.1  Consensus size: 12

      7740 GGGAAAGGGC

      7750 GAGAGGATTTTG
         1 GAGAGGATTTTG

      7762 GAGAGGATTTTG
         1 GAGAGGATTTTG

      7774 G
         1 G

      7775 GTTTTTTTAA

Statistics
Matches: 13, Mismatches: 0, Indels: 0
   1.00          0.00        0.00

Matches are distributed among these distances:
   12     13  1.00

ACGTcount: A:0.24, C:0.00, G:0.44, T:0.32

Consensus pattern (12 bp):
GAGAGGATTTTG


Found at i:8469 original size:58 final size:59

Alignment explanation

Indices: 8294--8579  Score: 355
Period size: 59  Copynumber: 4.9  Consensus size: 59

      8284 CCCCAGATTG

                      *             * *   *        *           * *
      8294 TCCAAAAATTATCATTTTA-CCCTCGAGCTTTCAAAAATCTCATTTTTGACCTTGAACCT
         1 TCCAAAAATTACCATTTTACCCCT-AAACTTCCAAAAATCCCATTTTTGACCCTAAACCT

            * *                                                *
      8353 TTCTAAAATTACCATTTTACCCCTAAACTTCCAAAAATCCCATTTTTGACCCCAAACCT
         1 TCCAAAAATTACCATTTTACCCCTAAACTTCCAAAAATCCCATTTTTGACCCTAAACCT

                                                          *     *
      8412 TCCAAAAATTACCATTTTA-CCCTAAACTTCCAAAAATCCCATTTTTAACCCTGAACCT
         1 TCCAAAAATTACCATTTTACCCCTAAACTTCCAAAAATCCCATTTTTGACCCTAAACCT

                 *                       *         *
      8470 TCCAAATATTACCATTTTATCCCC-AAACTCCCAAAAATCTCATTTTTGACCCTAAACCT
         1 TCCAAAAATTACCATTTTA-CCCCTAAACTTCCAAAAATCCCATTTTTGACCCTAAACCT

                                  *                  *    *
      8529 TCCAAAAATTACCATTTTACCCCCAAAC-TCCGAAAAATCCCCTTTTCGACC
         1 TCCAAAAATTACCATTTTACCCCTAAACTTCC-AAAAATCCCATTTTTGACC

      8580 ACGAAACACC

Statistics
Matches: 197, Mismatches: 25, Indels: 10
   0.85          0.11        0.04

Matches are distributed among these distances:
   58     60  0.30
   59    130  0.66
   60      7  0.04

ACGTcount: A:0.34, C:0.31, G:0.03, T:0.32

Consensus pattern (59 bp):
TCCAAAAATTACCATTTTACCCCTAAACTTCCAAAAATCCCATTTTTGACCCTAAACCT


Found at i:8498 original size:117 final size:117

Alignment explanation

Indices: 8294--8574  Score: 379
Period size: 117  Copynumber: 2.4  Consensus size: 117

      8284 CCCCAGATTG

                      *            * *   *        *       *   *          *
      8294 TCCAAAAATTATCATTTTACCCTCGAGCTTTCAAAAATCTCATTTTTGACCTTGAACCTTTCTAA
         1 TCCAAAAATTACCATTTTACCCT-AAACTTCCAAAAATCCCATTTTTAACCCTGAACCTTTCCAA

                                  *
      8359 AATTACCATTTTACCCCTAAACTTCCAAAAATCCCATTTTTGACCCCAAACCT
        65 AATTACCATTTTACCCCTAAACTCCCAAAAATCCCATTTTTGACCCCAAACCT

      8412 TCCAAAAATTACCATTTTACCCTAAACTTCCAAAAATCCCATTTTTAACCCTGAACC-TTCCAAA
         1 TCCAAAAATTACCATTTTACCCTAAACTTCCAAAAATCCCATTTTTAACCCTGAACCTTTCCAAA

                                             *            *
      8476 TATTACCATTTTATCCCC-AAACTCCCAAAAATCTCATTTTTGACCCTAAACCT
        66 -ATTACCATTTTA-CCCCTAAACTCCCAAAAATCCCATTTTTGACCCCAAACCT

                                  *                  *
      8529 TCCAAAAATTACCATTTTACCCCCAAAC-TCCGAAAAATCCCCTTTT
         1 TCCAAAAATTACCATTTTA-CCCTAAACTTCC-AAAAATCCCATTTT

      8575 CGACCACGAA

Statistics
Matches: 146, Mismatches: 13, Indels: 8
   0.87          0.08        0.05

Matches are distributed among these distances:
  116      6  0.04
  117     94  0.64
  118     46  0.32

ACGTcount: A:0.34, C:0.31, G:0.03, T:0.32

Consensus pattern (117 bp):
TCCAAAAATTACCATTTTACCCTAAACTTCCAAAAATCCCATTTTTAACCCTGAACCTTTCCAAA
ATTACCATTTTACCCCTAAACTCCCAAAAATCCCATTTTTGACCCCAAACCT


Found at i:9807 original size:49 final size:49

Alignment explanation

Indices: 9743--10086  Score: 189
Period size: 49  Copynumber: 7.0  Consensus size: 49

      9733 CAAGAAGCAT

                *                 *
      9743 GAAGGGAAAGATTTAAGCCGCAATGGCAAATCTAGTACCACGAAGATATG
         1 GAAGGAAAAG-TTTAAGCCGCAACGGCAAATCTAGTACCACGAAGATATG

                           *      * **  *  *     *       *
      9793 GAAGGAAAAGTTTAAGTCGCAACAGTGAACCTTGTACCTCAGAA-ACAT-
         1 GAAGGAAAAGTTTAAGCCGCAACGGCAAATCTAGTACCAC-GAAGATATG

                                   * *
      9841 GAAGGGAAATA-TTTAAGCCGCAATGACAAATCTAGTACCACGAAGATATG
         1 GAA-GGAAA-AGTTTAAGCCGCAACGGCAAATCTAGTACCACGAAGATATG

             *     *       *  *            *     *    *  *
      9891 GAGGGAAAGGTTTAAGTCGTAACGGCAAATCTTGTACCTC-AAAAGCAT-
         1 GAAGGAAAAGTTTAAGCCGCAACGGCAAATCTAGTACCACGAAGA-TATG

                *              *     **    *       *
      9939 GAAGGGAAAGATTTAAGCCGTAACGGTTAATCCAGTACCATGAAGATATG
         1 GAAGGAAAAG-TTTAAGCCGCAACGGCAAATCTAGTACCACGAAGATATG

             *     *       * *     * *  *  *     *
      9989 GAGGGAAAGGTTTAAGTCACAACGACGAACCTTGTACCTCAGAAG--ATG
         1 GAAGGAAAAGTTTAAGCCGCAACGGCAAATCTAGTACCAC-GAAGATATG

              *  *           *  *      *          *
     10037 AGATGGGAAAGATTTAAGTCGTAACGGCGAATCTAGTACGACGAAGATAT
         1 -GAAGGAAAAG-TTTAAGCCGCAACGGCAAATCTAGTACCACGAAGATAT

     10087 AAGTCGCAAC

Statistics
Matches: 213, Mismatches: 66, Indels: 29
   0.69          0.21        0.09

Matches are distributed among these distances:
   48     19  0.09
   49    140  0.66
   50     52  0.24
   51      2  0.01

ACGTcount: A:0.38, C:0.16, G:0.25, T:0.21

Consensus pattern (49 bp):
GAAGGAAAAGTTTAAGCCGCAACGGCAAATCTAGTACCACGAAGATATG


Found at i:9909 original size:98 final size:97

Alignment explanation

Indices: 9735--10086  Score: 438
Period size: 98  Copynumber: 3.6  Consensus size: 97

      9725 TTCATTACCA

               *                          *                            *
      9735 AGAAGCATGAAGGGAAAGATTTAAGCCGCAATGGCAAATCTAGTACCACGAAGATATGGAAGGAA
         1 AGAAACATGAAGGGAAAGATTTAAGCCGCAACGGCAAATCTAGTACCACGAAGATATGGAGGGAA

            *                *
      9800 AAGTTTAAGTCGCAACAGTGAACCTTGTACCTC
        66 AGGTTTAAGTCGCAAC-GCGAACCTTGTACCTC

                            *             * *
      9833 AGAAACATGAAGGGAAATATTTAAGCCGCAATGACAAATCTAGTACCACGAAGATATGGAGGGAA
         1 AGAAACATGAAGGGAAAGATTTAAGCCGCAACGGCAAATCTAGTACCACGAAGATATGGAGGGAA

                       *      *  *
      9898 AGGTTTAAGTCGTAACGGCAAATCTTGTACCTC
        66 AGGTTTAAGTCGCAAC-GCGAACCTTGTACCTC

                                        *     **    *       *
      9931 A-AAAGCATGAAGGGAAAGATTTAAGCCGTAACGGTTAATCCAGTACCATGAAGATATGGAGGGA
         1 AGAAA-CATGAAGGGAAAGATTTAAGCCGCAACGGCAAATCTAGTACCACGAAGATATGGAGGGA

                       *
      9995 AAGGTTTAAGTCACAACGACGAACCTTGTACCTC
        65 AAGGTTTAAGTCGCAACG-CGAACCTTGTACCTC

                *                     *  *      *          *
     10029 AG-AAGATGAGATGGGAAAGATTTAAGTCGTAACGGCGAATCTAGTACGACGAAGATAT
         1 AGAAACATGA-A-GGGAAAGATTTAAGCCGCAACGGCAAATCTAGTACCACGAAGATAT

     10087 AAGTCGCAAC

Statistics
Matches: 220, Mismatches: 29, Indels: 9
   0.85          0.11        0.03

Matches are distributed among these distances:
   97      8  0.04
   98    172  0.78
   99     40  0.18

ACGTcount: A:0.39, C:0.16, G:0.25, T:0.20

Consensus pattern (97 bp):
AGAAACATGAAGGGAAAGATTTAAGCCGCAACGGCAAATCTAGTACCACGAAGATATGGAGGGAA
AGGTTTAAGTCGCAACGCGAACCTTGTACCTC


Found at i:10176 original size:49 final size:49

Alignment explanation

Indices: 10123--10238  Score: 119
Period size: 49  Copynumber: 2.4  Consensus size: 49

     10113 AAAGAACATG

                                         *                *
     10123 AAGGGAAAGATTGAAG-CCGCAACGGTGAGTCC-GGTACCAGAAAGATTTC
         1 AAGGGAAAGATT-AAGACCGCAACGGTGA-ACCAGGTACCAGAAAGACTTC

                    *   *     *                   *        *
     10172 AAGGGAAAGGTTACGACCGTAACGGTGAACCAGGTACCATAAAGACTTG
         1 AAGGGAAAGATTAAGACCGCAACGGTGAACCAGGTACCAGAAAGACTTC

                    *   *
     10221 AAGGGAAAGGTTACGACC
         1 AAGGGAAAGATTAAGACC

     10239 ACGACAGCGA

Statistics
Matches: 58, Mismatches: 7, Indels: 4
   0.84          0.10        0.06

Matches are distributed among these distances:
   48      4  0.07
   49     54  0.93

ACGTcount: A:0.36, C:0.18, G:0.30, T:0.16

Consensus pattern (49 bp):
AAGGGAAAGATTAAGACCGCAACGGTGAACCAGGTACCAGAAAGACTTC


Found at i:10251 original size:49 final size:49

Alignment explanation

Indices: 10155--10251  Score: 122
Period size: 49  Copynumber: 2.0  Consensus size: 49

     10145 CGGTGAGTCC

                        *                     **   * *
     10155 GGTACCAGAAAGATTTCAAGGGAAAGGTTACGACCGTAACGGTGAACCA
         1 GGTACCAGAAAGACTTCAAGGGAAAGGTTACGACCACAACAGCGAACCA

                  *        *                    *
     10204 GGTACCATAAAGACTTGAAGGGAAAGGTTACGACCACGACAGCGAACC
         1 GGTACCAGAAAGACTTCAAGGGAAAGGTTACGACCACAACAGCGAACC

     10252 CAATACCTTA

Statistics
Matches: 40, Mismatches: 8, Indels: 0
   0.83          0.17        0.00

Matches are distributed among these distances:
   49     40  1.00

ACGTcount: A:0.37, C:0.21, G:0.28, T:0.14

Consensus pattern (49 bp):
GGTACCAGAAAGACTTCAAGGGAAAGGTTACGACCACAACAGCGAACCA


Found at i:10497 original size:91 final size:91

Alignment explanation

Indices: 10341--10514  Score: 240
Period size: 91  Copynumber: 1.9  Consensus size: 91

     10331 ATAGCAAATA

                      *            *                   *               *
     10341 TTTATCTCTCTTAAGTTACAGTAAGAGCAAGATAAAGCTTCAACGTCAAACCCTATCCTCTTGAA
         1 TTTATCTCTCTGAAGTTACAGTAAAAGCAAGATAAAGCTTCAACATCAAACCCTATCCTCCTGAA

                *
     10406 GTTATGGTAAAGTTGGATAACAAGTC
        66 GTTATAGTAAAGTTGGATAACAAGTC

                            *             * *                *      *
     10432 TTTATCTCTCTGAAGTTGCAGTAAAAGCAAGGTGAAGCTTCAACATCAAATCCTATCTTCCTGAA
         1 TTTATCTCTCTGAAGTTACAGTAAAAGCAAGATAAAGCTTCAACATCAAACCCTATCCTCCTGAA

              *    *
     10497 GTTGTAGTGAAGTTGGAT
        66 GTTATAGTAAAGTTGGAT

     10515 TAAAAACAAA

Statistics
Matches: 71, Mismatches: 12, Indels: 0
   0.86          0.14        0.00

Matches are distributed among these distances:
   91     71  1.00

ACGTcount: A:0.32, C:0.18, G:0.18, T:0.32

Consensus pattern (91 bp):
TTTATCTCTCTGAAGTTACAGTAAAAGCAAGATAAAGCTTCAACATCAAACCCTATCCTCCTGAA
GTTATAGTAAAGTTGGATAACAAGTC


Found at i:10611 original size:104 final size:104

Alignment explanation

Indices: 10429--10623  Score: 273
Period size: 104  Copynumber: 1.9  Consensus size: 104

     10419 TGGATAACAA

                               *             * *  *                *
     10429 GTCTTTATCTCTCTGAAGTTGCAGTAAAAGCAAGGTGAAGCTTCAACATCAAATCCTATCTTCCT
         1 GTCTTTATCTCTCTGAAGTTACAGTAAAAGCAAGATAAAACTTCAACATCAAATCCAATCTTCCT

                  *             *
     10494 GAAGTTGTAGTGAAGTTGGATTAAAAACAAAAATAATAG
        66 GAAGTTGCAGTGAAGTTGGATCAAAAACAAAAATAATAG

                   *                  *               *   *      *
     10533 GTCTTTATTTCTCTGAAGTTACAGTAAGAGCAAGATAAAACTTTAACTTCAAATTCAATCTTCCT
         1 GTCTTTATCTCTCTGAAGTTACAGTAAAAGCAAGATAAAACTTCAACATCAAATCCAATCTTCCT

                   *
     10598 GAAGTTGCGGTGAAGTTGGATCAAAA
        66 GAAGTTGCAGTGAAGTTGGATCAAAA

     10624 CCACAGTAGC

Statistics
Matches: 78, Mismatches: 13, Indels: 0
   0.86          0.14        0.00

Matches are distributed among these distances:
  104     78  1.00

ACGTcount: A:0.35, C:0.15, G:0.18, T:0.31

Consensus pattern (104 bp):
GTCTTTATCTCTCTGAAGTTACAGTAAAAGCAAGATAAAACTTCAACATCAAATCCAATCTTCCT
GAAGTTGCAGTGAAGTTGGATCAAAAACAAAAATAATAG


Found at i:11836 original size:24 final size:25

Alignment explanation

Indices: 11786--11844  Score: 75
Period size: 26  Copynumber: 2.4  Consensus size: 25

     11776 TTATTTATTT

                                **
     11786 ATATATATATATATACACGAATGTAC
         1 ATATATATATATATACAC-AACATAC

                         *
     11812 ATATATATATATATGCA-AACATAC
         1 ATATATATATATATACACAACATAC

     11836 ATATATATA
         1 ATATATATA

     11845 CACGTATGTA

Statistics
Matches: 30, Mismatches: 3, Indels: 2
   0.86          0.09        0.06

Matches are distributed among these distances:
   24     14  0.47
   26     16  0.53

ACGTcount: A:0.49, C:0.10, G:0.05, T:0.36

Consensus pattern (25 bp):
ATATATATATATATACACAACATAC

Done.