Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: NTFQ01009315.1 Kokia drynarioides strain JFW-HI SEQ_124022, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 26182
ACGTcount: A:0.35, C:0.16, G:0.16, T:0.34


Found at i:2805 original size:14 final size:14

Alignment explanation

Indices: 2786--2837  Score: 50
Period size: 14  Copynumber: 3.5  Consensus size: 14

      2776 TCTTGAAAAA

                   **
      2786 AAAAATAAGGAAGT
         1 AAAAATAAAAAAGT

      2800 AAAAATAAAAAAGT
         1 AAAAATAAAAAAGT

      2814 AAAAAAGTAAAAAAAGT
         1 -AAAAA-T-AAAAAAGT

           *
      2831 GAAAATA
         1 AAAAATA

      2838 CAAGACCCTC


Statistics
Matches: 32,  Mismatches: 3, Indels: 6
        0.78            0.07        0.15

Matches are distributed among these distances:
  14       13  0.41
  15        6  0.19
  16        5  0.16
  17        8  0.25

ACGTcount: A:0.73, C:0.00, G:0.13, T:0.13

Consensus pattern (14 bp):
AAAAATAAAAAAGT


Found at i:2835 original size:8 final size:8

Alignment explanation

Indices: 2796--2827  Score: 50
Period size: 8  Copynumber: 4.2  Consensus size: 8

      2786 AAAAATAAGG

      2796 AAGT-AAA
         1 AAGTAAAA

      2803 AA-TAAAA
         1 AAGTAAAA

      2810 AAGTAAAA
         1 AAGTAAAA

      2818 AAGTAAAA
         1 AAGTAAAA

      2826 AA
         1 AA

      2828 AGTGAAAATA


Statistics
Matches: 23,  Mismatches: 0, Indels: 3
        0.88            0.00        0.12

Matches are distributed among these distances:
   6        1  0.04
   7        7  0.30
   8       15  0.65

ACGTcount: A:0.78, C:0.00, G:0.09, T:0.12

Consensus pattern (8 bp):
AAGTAAAA


Found at i:5369 original size:12 final size:12

Alignment explanation

Indices: 5356--5456  Score: 84
Period size: 12  Copynumber: 8.8  Consensus size: 12

      5346 CTAAATAACC

                      *
      5356 AAAATAACAACC
         1 AAAATAACAACA

               *
      5368 AAAACAACAACA
         1 AAAATAACAACA

            *
      5380 AGAAT---AACA
         1 AAAATAACAACA

            *
      5389 ATAA-AACAACA
         1 AAAATAACAACA

                **
      5400 AAAATTGCAACA
         1 AAAATAACAACA

                    * *
      5412 AAAATAACAGCT
         1 AAAATAACAACA

      5424 AAAATAACAACA
         1 AAAATAACAACA

               *     *
      5436 AAAACAACAATA
         1 AAAATAACAACA

      5448 AAAATAACA
         1 AAAATAACA

      5457 TCATCAAAAC


Statistics
Matches: 68,  Mismatches: 17, Indels: 8
        0.73            0.18        0.09

Matches are distributed among these distances:
   9        7  0.10
  11        7  0.10
  12       54  0.79

ACGTcount: A:0.69, C:0.18, G:0.03, T:0.10

Consensus pattern (12 bp):
AAAATAACAACA


Found at i:5385 original size:24 final size:23

Alignment explanation

Indices: 5355--5456  Score: 93
Period size: 24  Copynumber: 4.4  Consensus size: 23

      5345 TCTAAATAAC

                       *
      5355 CAAAATAACAACCAAAACAACAA
         1 CAAAATAACAACAAAAACAACAA

                        *
      5378 CAAGAATAACAA-TAAAACAAC-A
         1 CAA-AATAACAACAAAAACAACAA

                 **         *    *
      5400 -AAAATTGCAACAAAAATAACAG
         1 CAAAATAACAACAAAAACAACAA

      5422 CTAAAATAACAACAAAAACAACAA
         1 C-AAAATAACAACAAAAACAACAA

            *
      5446 TAAAAATAACA
         1 -CAAAATAACA

      5457 TCATCAAAAC


Statistics
Matches: 62,  Mismatches: 11, Indels: 11
        0.74            0.13        0.13

Matches are distributed among these distances:
  20        6  0.10
  21        9  0.15
  22        1  0.02
  23       11  0.18
  24       35  0.56

ACGTcount: A:0.69, C:0.19, G:0.03, T:0.10

Consensus pattern (23 bp):
CAAAATAACAACAAAAACAACAA


Found at i:13904 original size:63 final size:62

Alignment explanation

Indices: 13772--13905  Score: 164
Period size: 63  Copynumber: 2.1  Consensus size: 62

     13762 TTTAAGTTAT

                                                **                *
     13772 ATTTGCATTTTCATTTTCATTTTGGAGAAAAGCACATTCTTTTTCAAACGATTCTGTTTCTCC
         1 ATTT-CATTTTCATTTTCATTTTGGAGAAAAGCACATAATTTTTCAAACGATTCTATTTCTCC

                                                             * *
     13835 ATTTTCATTTTCATTTTCATTTTGGAGAAAA-C-CATAATTATTTCCAAATGTTTCATATTTCTC
         1 A-TTTCATTTTCATTTTCATTTTGGAGAAAAGCACATAATT-TTT-CAAACGATTC-TATTTCTC

     13898 C
        62 C

     13899 ATTTCAT
         1 ATTTCAT

     13906 CATTTATATT


Statistics
Matches: 62,  Mismatches: 5, Indels: 8
        0.83            0.07        0.11

Matches are distributed among these distances:
  61        5  0.08
  62        4  0.06
  63       41  0.66
  64       12  0.19

ACGTcount: A:0.26, C:0.18, G:0.08, T:0.48

Consensus pattern (62 bp):
ATTTCATTTTCATTTTCATTTTGGAGAAAAGCACATAATTTTTCAAACGATTCTATTTCTCC


Found at i:21727 original size:29 final size:27

Alignment explanation

Indices: 21657--21731  Score: 87
Period size: 29  Copynumber: 2.6  Consensus size: 27

     21647 TTTTTGATTC

                       *  *
     21657 AATTTGATACTTGAATTTGACATTTTTT
         1 AATTTG-TACTTAAACTTGACATTTTTT

                    *
     21685 AATTTGGTAATTAAACTTGACAATTTTTT
         1 AATTT-GTACTTAAACTTGAC-ATTTTTT

     21714 ATATTTGTACTTAAACTT
         1 A-ATTTGTACTTAAACTT

     21732 TTTGGGGTCC


Statistics
Matches: 40,  Mismatches: 4, Indels: 5
        0.82            0.08        0.10

Matches are distributed among these distances:
  28       16  0.40
  29       20  0.50
  30        4  0.10

ACGTcount: A:0.32, C:0.08, G:0.09, T:0.51

Consensus pattern (27 bp):
AATTTGTACTTAAACTTGACATTTTTT


Found at i:22070 original size:60 final size:59

Alignment explanation

Indices: 21936--22120  Score: 218
Period size: 60  Copynumber: 3.2  Consensus size: 59

     21926 AATAAATTTT

                       **  *         *     *            *      *
     21936 GGTACCAAATTGAATCTAAAAAAAA-CTTAGGAACCAAATTAGGAAAAAATGCCAAGTTCA
         1 GGTACCAAATTGGGTCCAAAAAAAAGTTTAGGTACCAAATTA--AGAAAATGTCAAGTTCA

           *   *    *
     21996 AGTATCAAACTGGGTCCAAAAAAAAGTTTAGGTACCAAATTAAGAAAAGTGTCAAGTTCA
         1 GGTACCAAATTGGGTCCAAAAAAAAGTTTAGGTACCAAATTAAGAAAA-TGTCAAGTTCA

     22056 GGTACCAAATTGGGT-C--AAAAAAGTTTAGGTACCAAATT-AGAAAATGTCAAGTTCA
         1 GGTACCAAATTGGGTCCAAAAAAAAGTTTAGGTACCAAATTAAGAAAATGTCAAGTTCA

     22111 GGTACCAAAT
         1 GGTACCAAAT

     22121 GTTATATAAA


Statistics
Matches: 110,  Mismatches: 13, Indels: 9
        0.83            0.10        0.07

Matches are distributed among these distances:
  55       21  0.19
  56        6  0.05
  57       22  0.20
  59        6  0.05
  60       41  0.37
  61       14  0.13

ACGTcount: A:0.45, C:0.14, G:0.18, T:0.23

Consensus pattern (59 bp):
GGTACCAAATTGGGTCCAAAAAAAAGTTTAGGTACCAAATTAAGAAAATGTCAAGTTCA


Found at i:25006 original size:97 final size:97

Alignment explanation

Indices: 24831--25006  Score: 248
Period size: 97  Copynumber: 1.8  Consensus size: 97

     24821 AACTTTGGAA

                                          **                        *
     24831 AAGGATATTCGATTATCTCGATTTGAAGAAAGGTCGCACCTAGTAAGTTAAGGCACAGATTTTCA
         1 AAGGATATTCGATTATCTCGATTTGAAGAAAAATCGCACCTAGTAAGTTAAGGCACAAATTTTCA

           *       *
     24896 GAATCAGAGATAAAGAAACATTGCCTCGATTT
        66 AAATCAGAAATAAAGAAACATTGCCTCGATTT

               *                                *                  *
     24928 AAGGGTATTCGATTAT-TCCGATTTGAAGAAAAATCGTACCTAGTAAGTTAAGGCATAAATTTTC
         1 AAGGATATTCGATTATCT-CGATTTGAAGAAAAATCGCACCTAGTAAGTTAAGGCACAAATTTTC

     24992 AAAACTC-GAAATAAA
        65 AAAA-TCAGAAATAAA

     25007 AGAATATTAC


Statistics
Matches: 69,  Mismatches: 8, Indels: 4
        0.85            0.10        0.05

Matches are distributed among these distances:
  96        1  0.01
  97       66  0.96
  98        2  0.03

ACGTcount: A:0.39, C:0.14, G:0.19, T:0.28

Consensus pattern (97 bp):
AAGGATATTCGATTATCTCGATTTGAAGAAAAATCGCACCTAGTAAGTTAAGGCACAAATTTTCA
AAATCAGAAATAAAGAAACATTGCCTCGATTT


Found at i:25382 original size:28 final size:29

Alignment explanation

Indices: 25351--25430  Score: 78
Period size: 28  Copynumber: 2.8  Consensus size: 29

     25341 ACTTTGGGGG

     25351 CAAAATGGTTATTTTTGGAA-AAAAGGGT
         1 CAAAATGGTTATTTTTGGAAGAAAAGGGT

                    *             **
     25379 CAAAAATGGAT-TTTTTGGAAGTTTAAGGGT
         1 C-AAAATGGTTATTTTTGGAAG-AAAAGGGT

                    *
     25409 -AAAATGGTAATTTTTGG-AGAAA
         1 CAAAATGGTTATTTTTGGAAGAAA

     25431 TCATGGTCAA


Statistics
Matches: 41,  Mismatches: 7, Indels: 9
        0.72            0.12        0.16

Matches are distributed among these distances:
  27        1  0.02
  28       19  0.46
  29       15  0.37
  30        6  0.15

ACGTcount: A:0.39, C:0.03, G:0.25, T:0.34

Consensus pattern (29 bp):
CAAAATGGTTATTTTTGGAAGAAAAGGGT


Found at i:25539 original size:30 final size:30

Alignment explanation

Indices: 25390--25541  Score: 93
Period size: 30  Copynumber: 5.1  Consensus size: 30

     25380 AAAAATGGAT

                         *
     25390 TTTTTGGAAGTTTAAGGGTAAAATGGT-AA
         1 TTTTTGGAAGTTTAGGGGTAAAATGGTAAA

                     ** *  *
     25419 TTTTTGGAGAAATCA-TGGTCAAAAAT-G-AAA
         1 TTTTTGGA-AGTTTAGGGGT--AAAATGGTAAA

                        *         *     *
     25449 TTTTTGGAAGTTTGGGGGTAAAACGGT-AT
         1 TTTTTGGAAGTTTAGGGGTAAAATGGTAAA

                     ** * *  *
     25478 TTTTTGGAGAAATCATGGTTAAAAAT-G-AAA
         1 TTTTTGGA-AGTTTAGGGGT-AAAATGGTAAA

     25508 TTTTTGGAAGTTTAGGGGTAAAATGGTAAA
         1 TTTTTGGAAGTTTAGGGGTAAAATGGTAAA

     25538 TTTT
         1 TTTT

     25542 GGAGAAATCG


Statistics
Matches: 87,  Mismatches: 24, Indels: 23
        0.65            0.18        0.17

Matches are distributed among these distances:
  28        9  0.10
  29       30  0.34
  30       39  0.45
  31        9  0.10

ACGTcount: A:0.35, C:0.03, G:0.26, T:0.37

Consensus pattern (30 bp):
TTTTTGGAAGTTTAGGGGTAAAATGGTAAA


Found at i:25541 original size:29 final size:29

Alignment explanation

Indices: 25391--25544  Score: 75
Period size: 29  Copynumber: 5.2  Consensus size: 29

     25381 AAAATGGATT

                        *              *
     25391 TTTTGGAAGTTTAAGGGTAAAATGGTAAT
         1 TTTTGGAAGTTTAGGGGTAAAATGGTAAA

                    ** *  *
     25420 TTTTGGAGAAATCA-TGGTCAAAAAT-G-AAA
         1 TTTTGGA-AGTTTAGGGGT--AAAATGGTAAA

                        *         *    **
     25449 TTTTTGGAAGTTTGGGGGTAAAACGGTATT
         1 -TTTTGGAAGTTTAGGGGTAAAATGGTAAA

                    ** * *  *
     25479 TTTTGGAGAAATCATGGTTAAAAAT-G-AAA
         1 TTTTGGA-AGTTTAGGGGT-AAAATGGTAAA

     25508 TTTTTGGAAGTTTAGGGGTAAAATGGTAAA
         1 -TTTTGGAAGTTTAGGGGTAAAATGGTAAA

     25538 TTTTGGA
         1 TTTTGGA

     25545 GAAATCGGGG


Statistics
Matches: 86,  Mismatches: 27, Indels: 24
        0.63            0.20        0.18

Matches are distributed among these distances:
  28        9  0.10
  29       37  0.43
  30       31  0.36
  31        9  0.10

ACGTcount: A:0.35, C:0.03, G:0.27, T:0.36

Consensus pattern (29 bp):
TTTTGGAAGTTTAGGGGTAAAATGGTAAA


Found at i:25568 original size:29 final size:29

Alignment explanation

Indices: 25479--25570  Score: 73
Period size: 29  Copynumber: 3.1  Consensus size: 29

     25469 AAACGGTATT

                        **           *
     25479 TTTTGGAGAAATCATGGTTAAAAATGAAA
         1 TTTTGGAGAAATCGGGGTTAAAAATGGAA

                      ** *
     25508 TTTTTGGA-AGTTTAGGGG-T-AAAATGGTAAA
         1 -TTTTGGAGA-AATCGGGGTTAAAAATGG--AA

     25538 TTTTGGAGAAATCGGGGTTAAAAATGGAA
         1 TTTTGGAGAAATCGGGGTTAAAAATGGAA

     25567 TTTT
         1 TTTT

     25571 TGAAAAGTTT


Statistics
Matches: 47,  Mismatches: 9, Indels: 13
        0.68            0.13        0.19

Matches are distributed among these distances:
  28        6  0.13
  29       20  0.43
  30       14  0.30
  31        7  0.15

ACGTcount: A:0.37, C:0.02, G:0.26, T:0.35

Consensus pattern (29 bp):
TTTTGGAGAAATCGGGGTTAAAAATGGAA


Found at i:25572 original size:30 final size:31

Alignment explanation

Indices: 25409--25572  Score: 110
Period size: 30  Copynumber: 5.5  Consensus size: 31

     25399 GTTTAAGGGT

                                   **   *
     25409 AAAATGGTAATTTTTGGAGAAATCATGGTCA
         1 AAAATGGTAATTTTTGGAGAAATCGGGGTTA

                  *            ** *    *
     25440 AAAAT-GAAATTTTTGGA-AGTTTGGGGGT-
         1 AAAATGGTAATTTTTGGAGAAATCGGGGTTA

               *    *              **
     25468 AAAACGGTATTTTTTGGAGAAATCATGGTTA
         1 AAAATGGTAATTTTTGGAGAAATCGGGGTTA

                  *             ** *
     25499 AAAAT-GAAATTTTTGGA-AGTTTAGGGG-T-
         1 AAAATGGTAATTTTTGGAGA-AATCGGGGTTA

                     *
     25527 AAAATGGTAAATTTTGGAGAAATCGGGGTTA
         1 AAAATGGTAATTTTTGGAGAAATCGGGGTTA

     25558 AAAATGG-AATTTTTG
         1 AAAATGGTAATTTTTG

     25573 AAAAGTTTGG


Statistics
Matches: 94,  Mismatches: 31, Indels: 17
        0.66            0.22        0.12

Matches are distributed among these distances:
  28        9  0.10
  29       31  0.33
  30       38  0.40
  31       16  0.17

ACGTcount: A:0.37, C:0.03, G:0.26, T:0.34

Consensus pattern (31 bp):
AAAATGGTAATTTTTGGAGAAATCGGGGTTA


Found at i:25585 original size:59 final size:59

Alignment explanation

Indices: 25290--25585  Score: 359
Period size: 59  Copynumber: 5.1  Consensus size: 59

     25280 GGACACTTTG

                                  *                          *   *   *
     25290 GGGGTAAAA-GGTAA-TTTTGGAAAAATCAGGGTCAAAAATGAAATTTTTAGAACTTTG
         1 GGGGTAAAATGGTAATTTTTGGAGAAATCAGGGTCAAAAATGAAATTTTTGGAAGTTTA

               *        *              *             * *
     25347 GGGGCAAAATGGTTATTTTTGGA-AAA-AAGGGTCAAAAATGGATTTTTTGGAAGTTTA
         1 GGGGTAAAATGGTAATTTTTGGAGAAATCAGGGTCAAAAATGAAATTTTTGGAAGTTTA

           *                             *                           *
     25404 AGGGTAAAATGGTAATTTTTGGAGAAATCATGGTCAAAAATGAAATTTTTGGAAGTTTG
         1 GGGGTAAAATGGTAATTTTTGGAGAAATCAGGGTCAAAAATGAAATTTTTGGAAGTTTA

                    *    *               *   *
     25463 GGGGTAAAACGGTATTTTTTGGAGAAATCATGGTTAAAAATGAAATTTTTGGAAGTTTA
         1 GGGGTAAAATGGTAATTTTTGGAGAAATCAGGGTCAAAAATGAAATTTTTGGAAGTTTA

                          *             *    *       *         *
     25522 GGGGTAAAATGGTAAATTTTGGAGAAATCGGGGTTAAAAATGGAATTTTTGAAAAGTTT-
         1 GGGGTAAAATGGTAATTTTTGGAGAAATCAGGGTCAAAAATGAAATTTTTG-GAAGTTTA

     25581 GGGGT
         1 GGGGT

     25586 CGAAAATATG


Statistics
Matches: 206,  Mismatches: 28, Indels: 8
        0.85            0.12        0.03

Matches are distributed among these distances:
  57       53  0.26
  58       10  0.05
  59      137  0.67
  60        6  0.03

ACGTcount: A:0.36, C:0.03, G:0.27, T:0.33

Consensus pattern (59 bp):
GGGGTAAAATGGTAATTTTTGGAGAAATCAGGGTCAAAAATGAAATTTTTGGAAGTTTA


Done.