Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: NTFQ01013933.1 Kokia drynarioides strain JFW-HI SEQ_128963, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 81255
ACGTcount: A:0.33, C:0.18, G:0.17, T:0.33

Warning! 22 characters in sequence are not A, C, G, or T


Found at i:6442 original size:41 final size:41

Alignment explanation

Indices: 6397--6480  Score: 141
Period size: 41  Copynumber: 2.0  Consensus size: 41

      6387 GCATGATACT

                                                 *
      6397 AACATTGATGAAGATGCAGTTTTAAGTTGTTCATAGCTGTA
         1 AACATTGATGAAGATGCAGTTTTAAGTTGTTCATAGCTATA

                        *                 *
      6438 AACATTGATGAAGTTGCAGTTTTAAGTTGTTTATAGCTATA
         1 AACATTGATGAAGATGCAGTTTTAAGTTGTTCATAGCTATA

      6479 AA
         1 AA

      6481 TTGTCAGCGC


Statistics
Matches: 40, Mismatches: 3, Indels: 0
        0.93           0.07       0.00

Matches are distributed among these distances:
   41   40  1.00

ACGTcount: A:0.33, C:0.08, G:0.20, T:0.38

Consensus pattern (41 bp):
AACATTGATGAAGATGCAGTTTTAAGTTGTTCATAGCTATA


Found at i:9232 original size:34 final size:34

Alignment explanation

Indices: 9175--9240  Score: 80
Period size: 34  Copynumber: 1.9  Consensus size: 34

      9165 ATATATATGG

                          ** *
      9175 AAATGCAATGACAATGTAGATGC-AGCGACAATAA
         1 AAATGCAATGACAATAAAAATGCAAG-GACAATAA

                  *
      9209 AAATGCAGTGACAATAAAAATGCAAGGACAAT
         1 AAATGCAATGACAATAAAAATGCAAGGACAAT

      9241 TATACTATGG


Statistics
Matches: 27, Mismatches: 4, Indels: 2
        0.82           0.12       0.06

Matches are distributed among these distances:
   34   25  0.93
   35    2  0.07

ACGTcount: A:0.50, C:0.14, G:0.20, T:0.17

Consensus pattern (34 bp):
AAATGCAATGACAATAAAAATGCAAGGACAATAA


Found at i:9240 original size:17 final size:17

Alignment explanation

Indices: 9175--9232  Score: 71
Period size: 17  Copynumber: 3.4  Consensus size: 17

      9165 ATATATATGG

                  *       **
      9175 AAATGCAATGACAATGT
         1 AAATGCAGTGACAATAA

            *      *
      9192 AGATGCAGCGACAATAA
         1 AAATGCAGTGACAATAA

      9209 AAATGCAGTGACAATAA
         1 AAATGCAGTGACAATAA

      9226 AAATGCA
         1 AAATGCA

      9233 AGGACAATTA


Statistics
Matches: 34, Mismatches: 7, Indels: 0
        0.83           0.17       0.00

Matches are distributed among these distances:
   17   34  1.00

ACGTcount: A:0.50, C:0.14, G:0.19, T:0.17

Consensus pattern (17 bp):
AAATGCAGTGACAATAA


Found at i:11416 original size:16 final size:16

Alignment explanation

Indices: 11395--11425  Score: 53
Period size: 16  Copynumber: 1.9  Consensus size: 16

     11385 ATAATGTGAA

     11395 AATAAAGATAAAATGT
         1 AATAAAGATAAAATGT

                  *
     11411 AATAAAGGTAAAATG
         1 AATAAAGATAAAATG

     11426 GGATCCACAA


Statistics
Matches: 14, Mismatches: 1, Indels: 0
        0.93           0.07       0.00

Matches are distributed among these distances:
   16   14  1.00

ACGTcount: A:0.61, C:0.00, G:0.16, T:0.23

Consensus pattern (16 bp):
AATAAAGATAAAATGT


Found at i:11987 original size:245 final size:245

Alignment explanation

Indices: 11557--12039  Score: 842
Period size: 245  Copynumber: 2.0  Consensus size: 245

     11547 GATCTATGAC

                                                           *
     11557 TTTAGTATGCACACTAACCACACCATCCAGTGCTGGCAATGCAACACATTTAAGAACAAGTGGAC
         1 TTTAGTATGCACACTAACCACACCATCCAGTGCTGGCAATGCAACACACTTAAGAACAAGTGGAC

           *       *                                                      *
     11622 CAAATTCCTTCAATCCCAAAGCAACTTCCATTATCTTGCTTGTTAGTTTAAACTGTCACTGCAGC
        66 AAAATTCCATCAATCCCAAAGCAACTTCCATTATCTTGCTTGTTAGTTTAAACTGTCACTGCACC

              * *           *
     11687 TTCTAGTGACTATGACTGCAATAACACTTTCGTGCTAAAATCATTCAAACCCAAAGATTAAACAG
       131 TTCCAATGACTATGACTACAATAACACTTTCGTGCTAAAATCATTCAAACCCAAAGATTAAACAG

                                           *
     11752 TAAAAGAAAGCTTGAATAAGATATCATTTAGTCTAATAATCAATATATCA
       196 TAAAAGAAAGCTTGAATAAGATATCATTTAGTATAATAATCAATATATCA

                                   *       *
     11802 TTTAGTATGCACACTAACCACACCTTCCAGTGTTGGCAATGCAACACACTTAAGAACAAGTGGAC
         1 TTTAGTATGCACACTAACCACACCATCCAGTGCTGGCAATGCAACACACTTAAGAACAAGTGGAC

                                            *
     11867 AAAATTCCATCAATCCCAAAGCAACTT-CAGTTGTCTTGCTTGTTAGTTTAAACTGTCACTGCAC
        66 AAAATTCCATCAATCCCAAAGCAACTTCCA-TTATCTTGCTTGTTAGTTTAAACTGTCACTGCAC

                                                                          *
     11931 CTTCCAATGACTATGACTACAATAACACTTTCGTGCTAAAATCATTCAAACCCAAAGATTAAATA
       130 CTTCCAATGACTATGACTACAATAACACTTTCGTGCTAAAATCATTCAAACCCAAAGATTAAACA

     11996 GTAAAAGAAAGCTTGAATAAGATATCATTTAGTATAATAATCAA
       195 GTAAAAGAAAGCTTGAATAAGATATCATTTAGTATAATAATCAA

     12040 GAGTACTTCC


Statistics
Matches: 225, Mismatches: 12, Indels: 2
        0.94           0.05       0.01

Matches are distributed among these distances:
  244    2  0.01
  245  223  0.99

ACGTcount: A:0.37, C:0.22, G:0.13, T:0.28

Consensus pattern (245 bp):
TTTAGTATGCACACTAACCACACCATCCAGTGCTGGCAATGCAACACACTTAAGAACAAGTGGAC
AAAATTCCATCAATCCCAAAGCAACTTCCATTATCTTGCTTGTTAGTTTAAACTGTCACTGCACC
TTCCAATGACTATGACTACAATAACACTTTCGTGCTAAAATCATTCAAACCCAAAGATTAAACAG
TAAAAGAAAGCTTGAATAAGATATCATTTAGTATAATAATCAATATATCA


Found at i:12584 original size:37 final size:37

Alignment explanation

Indices: 12543--12631  Score: 151
Period size: 37  Copynumber: 2.4  Consensus size: 37

     12533 AAAATATTCC

                               *
     12543 TGCGGTGACAGTTTTGGGTGTAATCTGGAAGTGCTCA
         1 TGCGGTGACAGTTTTGGGTGCAATCTGGAAGTGCTCA

                 *
     12580 TGCGGTAACAGTTTTGGGTGCAATCTGGAAGTGCTCA
         1 TGCGGTGACAGTTTTGGGTGCAATCTGGAAGTGCTCA

             *
     12617 TGTGGTGACAGTTTT
         1 TGCGGTGACAGTTTT

     12632 AGGCACAAAT


Statistics
Matches: 48, Mismatches: 4, Indels: 0
        0.92           0.08       0.00

Matches are distributed among these distances:
   37   48  1.00

ACGTcount: A:0.19, C:0.13, G:0.34, T:0.34

Consensus pattern (37 bp):
TGCGGTGACAGTTTTGGGTGCAATCTGGAAGTGCTCA


Found at i:14283 original size:14 final size:14

Alignment explanation

Indices: 14264--14342  Score: 70
Period size: 14  Copynumber: 5.7  Consensus size: 14

     14254 CAAATTCGAC

     14264 GGTCAACAATCAAA
         1 GGTCAACAATCAAA

                   *    *
     14278 GGTCAA-AGTCAAC
         1 GGTCAACAATCAAA

                   * *
     14291 GGTCAACAGTTAAA
         1 GGTCAACAATCAAA

                   *    *
     14305 GGTCAACAGTCAAC
         1 GGTCAACAATCAAA

            * *  *
     14319 GATAAATAATCAAA
         1 GGTCAACAATCAAA

     14333 GGTCAACAAT
         1 GGTCAACAAT

     14343 TCGGTTAACA


Statistics
Matches: 50, Mismatches: 14, Indels: 2
        0.76           0.21       0.03

Matches are distributed among these distances:
   13   11  0.22
   14   39  0.78

ACGTcount: A:0.46, C:0.19, G:0.18, T:0.18

Consensus pattern (14 bp):
GGTCAACAATCAAA


Found at i:14284 original size:27 final size:27

Alignment explanation

Indices: 14247--14338  Score: 105
Period size: 27  Copynumber: 3.4  Consensus size: 27

     14237 AAAGTAAAAG

                       *  *
     14247 TCAAA-GTCAAATTCGACGGTCAACAA
         1 TCAAAGGTCAAAGTCAACGGTCAACAA

                                     *
     14273 TCAAAGGTCAAAGTCAACGGTCAACAG
         1 TCAAAGGTCAAAGTCAACGGTCAACAA

            *                  * *  *
     14300 TTAAAGGTCAACAGTCAACGATAAATAA
         1 TCAAAGGTCAA-AGTCAACGGTCAACAA

     14328 TCAAAGGTCAA
         1 TCAAAGGTCAA

     14339 CAATTCGGTT


Statistics
Matches: 55, Mismatches: 9, Indels: 2
        0.83           0.14       0.03

Matches are distributed among these distances:
   26    5  0.09
   27   28  0.51
   28   22  0.40

ACGTcount: A:0.45, C:0.20, G:0.17, T:0.18

Consensus pattern (27 bp):
TCAAAGGTCAAAGTCAACGGTCAACAA


Found at i:14333 original size:28 final size:28

Alignment explanation

Indices: 14268--14340  Score: 94
Period size: 28  Copynumber: 2.6  Consensus size: 28

     14258 TTCGACGGTC

                                    * *
     14268 AACAATCAAAGGTCAA-AGTCAACGGTC
         1 AACAATCAAAGGTCAACAGTCAACGATA

               * *
     14295 AACAGTTAAAGGTCAACAGTCAACGATA
         1 AACAATCAAAGGTCAACAGTCAACGATA

             *
     14323 AATAATCAAAGGTCAACA
         1 AACAATCAAAGGTCAACA

     14341 ATTCGGTTAA


Statistics
Matches: 38, Mismatches: 7, Indels: 1
        0.83           0.15       0.02

Matches are distributed among these distances:
   27   14  0.37
   28   24  0.63

ACGTcount: A:0.48, C:0.19, G:0.16, T:0.16

Consensus pattern (28 bp):
AACAATCAAAGGTCAACAGTCAACGATA


Found at i:15448 original size:23 final size:23

Alignment explanation

Indices: 15429--15535  Score: 151
Period size: 23  Copynumber: 4.7  Consensus size: 23

     15419 TTCATTTGCA

     15429 CATAAATCACATTAATATTTAAG
         1 CATAAATCACATTAATATTTAAG

     15452 CATAAATCACATTAATATTTAAG
         1 CATAAATCACATTAATATTTAAG

           *        *   * *
     15475 TATAAATCATATTGACATTTAAG
         1 CATAAATCACATTAATATTTAAG

             *   *
     15498 CACAAACCACATTAATATTTAAG
         1 CATAAATCACATTAATATTTAAG

             *
     15521 CACAAATCACATTAA
         1 CATAAATCACATTAA

     15536 CTTTTAACAC


Statistics
Matches: 73, Mismatches: 11, Indels: 0
        0.87           0.13       0.00

Matches are distributed among these distances:
   23   73  1.00

ACGTcount: A:0.48, C:0.16, G:0.05, T:0.32

Consensus pattern (23 bp):
CATAAATCACATTAATATTTAAG


Found at i:15823 original size:36 final size:36

Alignment explanation

Indices: 15783--15856  Score: 130
Period size: 36  Copynumber: 2.1  Consensus size: 36

     15773 TTCATTGTCA

                                          *
     15783 CATGGCATGAATAACATACCCACATATTTACTATCT
         1 CATGGCATGAATAACATACCCACATATTTACAATCT

                                *
     15819 CATGGCATGAATAACATACCCTCATATTTACAATCT
         1 CATGGCATGAATAACATACCCACATATTTACAATCT

     15855 CA
         1 CA

     15857 GTCTTTACTT


Statistics
Matches: 36, Mismatches: 2, Indels: 0
        0.95           0.05       0.00

Matches are distributed among these distances:
   36   36  1.00

ACGTcount: A:0.36, C:0.26, G:0.08, T:0.30

Consensus pattern (36 bp):
CATGGCATGAATAACATACCCACATATTTACAATCT


Found at i:21619 original size:17 final size:18

Alignment explanation

Indices: 21599--21636  Score: 53
Period size: 17  Copynumber: 2.2  Consensus size: 18

     21589 TAGAGCAAAT

     21599 AAAATGCAGA-GACAA-TA
         1 AAAATGCA-ATGACAATTA

     21616 AAAATGCAATGACAATTA
         1 AAAATGCAATGACAATTA

     21634 AAA
         1 AAA

     21637 GAAATGCATA


Statistics
Matches: 19, Mismatches: 0, Indels: 3
        0.86           0.00       0.14

Matches are distributed among these distances:
   16    1  0.05
   17   13  0.68
   18    5  0.26

ACGTcount: A:0.61, C:0.11, G:0.13, T:0.16

Consensus pattern (18 bp):
AAAATGCAATGACAATTA


Found at i:22023 original size:3 final size:3

Alignment explanation

Indices: 22015--22071  Score: 55
Period size: 3  Copynumber: 19.0  Consensus size: 3

     22005 CGGAGTAACG

                        *                        *
     22015 TCT TCT TCT TTT TCT TCAT TC- TCT TCT TTT TCT TCT TCT T-T TCTT
         1 TCT TCT TCT TCT TCT TC-T TCT TCT TCT TCT TCT TCT TCT TCT TC-T

               *
     22060 TCT CCT TCT TCT
         1 TCT TCT TCT TCT

     22072 ATCTCAATAC


Statistics
Matches: 44, Mismatches: 6, Indels: 8
        0.76           0.10       0.14

Matches are distributed among these distances:
    2    4  0.09
    3   34  0.77
    4    6  0.14

ACGTcount: A:0.02, C:0.30, G:0.00, T:0.68

Consensus pattern (3 bp):
TCT


Found at i:22041 original size:18 final size:17

Alignment explanation

Indices: 22018--22071  Score: 72
Period size: 18  Copynumber: 3.1  Consensus size: 17

     22008 AGTAACGTCT

     22018 TCTTCTTTTTCTTCATTC
         1 TCTTCTTTTTCTTC-TTC

     22036 TCTTCTTTTTCTTCTTC
         1 TCTTCTTTTTCTTCTTC

            *        *
     22053 TTTTCTTTCTCCTTCTTC
         1 TCTTCTTT-TTCTTCTTC

     22071 T
         1 T

     22072 ATCTCAATAC


Statistics
Matches: 33, Mismatches: 2, Indels: 2
        0.89           0.05       0.05

Matches are distributed among these distances:
   17   10  0.30
   18   23  0.70

ACGTcount: A:0.02, C:0.30, G:0.00, T:0.69

Consensus pattern (17 bp):
TCTTCTTTTTCTTCTTC


Found at i:23406 original size:100 final size:99

Alignment explanation

Indices: 23233--23431  Score: 380
Period size: 100  Copynumber: 2.0  Consensus size: 99

     23223 CATCCCAAAC

                                                                      *
     23233 GAGAAATTGTTTTGGTAACCTACTTCCCTAATGTCAGATGCGGGGACACCAAAATCCTTGTTCAG
         1 GAGAAATTGTTTTGGTAACCTACTTCCCTAATGTCAGATGCGGGGACACCAAAATCCTTCTTCAG

     23298 GATGACAGTTGCTTTTCATATCACCTCATGGTAA
        66 GATGACAGTTGCTTTTCATATCACCTCATGGTAA

     23332 NGAGAAATTGTTTTGGTAACCTACTTCCCTAATGTCAGATGCGGGGACACCAAAATCCTTCTTCA
         1 -GAGAAATTGTTTTGGTAACCTACTTCCCTAATGTCAGATGCGGGGACACCAAAATCCTTCTTCA

     23397 GGATGACAGTTGCTTTTCATATCACCTCATGGTAA
        65 GGATGACAGTTGCTTTTCATATCACCTCATGGTAA

     23432 CGGTTCTTCG


Statistics
Matches: 98, Mismatches: 1, Indels: 0
        0.99           0.01       0.00

Matches are distributed among these distances:
  100   98  1.00

ACGTcount: A:0.27, C:0.22, G:0.20, T:0.31

Consensus pattern (99 bp):
GAGAAATTGTTTTGGTAACCTACTTCCCTAATGTCAGATGCGGGGACACCAAAATCCTTCTTCAG
GATGACAGTTGCTTTTCATATCACCTCATGGTAA


Found at i:36939 original size:9 final size:9

Alignment explanation

Indices: 36925--36952  Score: 56
Period size: 9  Copynumber: 3.1  Consensus size: 9

     36915 CTGCCTTTGC

     36925 TTTCTTCTT
         1 TTTCTTCTT

     36934 TTTCTTCTT
         1 TTTCTTCTT

     36943 TTTCTTCTT
         1 TTTCTTCTT

     36952 T
         1 T

     36953 GGGCGCTCCC


Statistics
Matches: 19, Mismatches: 0, Indels: 0
        1.00           0.00       0.00

Matches are distributed among these distances:
    9   19  1.00

ACGTcount: A:0.00, C:0.21, G:0.00, T:0.79

Consensus pattern (9 bp):
TTTCTTCTT


Found at i:52861 original size:23 final size:23

Alignment explanation

Indices: 52701--52854  Score: 236
Period size: 23  Copynumber: 6.7  Consensus size: 23

     52691 AGTTCATTTA

     52701 CATATAATCACATTAATATTTAAG
         1 CATA-AATCACATTAATATTTAAG

                          *
     52725 CATAAATCACATTAACATTTAAG
         1 CATAAATCACATTAATATTTAAG

     52748 CATAAATCACATTAATATTTAAG
         1 CATAAATCACATTAATATTTAAG

                    *     *  *
     52771 CATAAATCATATTAACATCTAAG
         1 CATAAATCACATTAATATTTAAG

             *                 *
     52794 CACAAATCACATTAATATTTCAG
         1 CATAAATCACATTAATATTTAAG

             *
     52817 CACAAATCACATTAATATTTAAG
         1 CATAAATCACATTAATATTTAAG

     52840 CATAAATCACATTAA
         1 CATAAATCACATTAA

     52855 CTTTTAACAC


Statistics
Matches: 118, Mismatches: 12, Indels: 1
        0.90           0.09       0.01

Matches are distributed among these distances:
   23  114  0.97
   24    4  0.03

ACGTcount: A:0.47, C:0.17, G:0.04, T:0.32

Consensus pattern (23 bp):
CATAAATCACATTAATATTTAAG


Found at i:68881 original size:26 final size:26

Alignment explanation

Indices: 68845--68904  Score: 93
Period size: 26  Copynumber: 2.3  Consensus size: 26

     68835 AACATGGGAC

     68845 ATAAAAAATTAACATTAGAATATAAA
         1 ATAAAAAATTAACATTAGAATATAAA

               *                *
     68871 ATAAGAAATTAACATTAGAATTTAAA
         1 ATAAAAAATTAACATTAGAATATAAA

            *
     68897 ACAAAAAA
         1 ATAAAAAA

     68905 AAAACATTTT


Statistics
Matches: 30, Mismatches: 4, Indels: 0
        0.88           0.12       0.00

Matches are distributed among these distances:
   26   30  1.00

ACGTcount: A:0.65, C:0.05, G:0.05, T:0.25

Consensus pattern (26 bp):
ATAAAAAATTAACATTAGAATATAAA


Found at i:68910 original size:26 final size:26

Alignment explanation

Indices: 68855--68912  Score: 71
Period size: 26  Copynumber: 2.2  Consensus size: 26

     68845 ATAAAAAATT

                            *  *   **
     68855 AACATTAGAATATAAAATAAGAAATT
         1 AACATTAGAATATAAAACAAAAAAAA

                      *
     68881 AACATTAGAATTTAAAACAAAAAAAA
         1 AACATTAGAATATAAAACAAAAAAAA

     68907 AACATT
         1 AACATT

     68913 TTAAATCTTA


Statistics
Matches: 27, Mismatches: 5, Indels: 0
        0.84           0.16       0.00

Matches are distributed among these distances:
   26   27  1.00

ACGTcount: A:0.64, C:0.07, G:0.05, T:0.24

Consensus pattern (26 bp):
AACATTAGAATATAAAACAAAAAAAA


Found at i:72200 original size:13 final size:13

Alignment explanation

Indices: 72184--72222  Score: 51
Period size: 13  Copynumber: 2.8  Consensus size: 13

     72174 TTATGAAGGA

     72184 TATTAATCTAAAC
         1 TATTAATCTAAAC

                 *
     72197 TATTAAACTAAAC
         1 TATTAATCTAAAC

     72210 TAAATTAATCTAA
         1 T--ATTAATCTAA

     72223 GCTGAAGATA


Statistics
Matches: 22, Mismatches: 2, Indels: 2
        0.85           0.08       0.08

Matches are distributed among these distances:
   13   13  0.59
   15    9  0.41

ACGTcount: A:0.51, C:0.13, G:0.00, T:0.36

Consensus pattern (13 bp):
TATTAATCTAAAC


Found at i:76783 original size:80 final size:81

Alignment explanation

Indices: 76648--76802  Score: 251
Period size: 80  Copynumber: 1.9  Consensus size: 81

     76638 TTTCACGTGG

                                     *                       *          *
     76648 AAATAAGTTGTAAAAAATTTATTGATGTGACATAAAATAATTGAATGGAATCATATATGAGGTGA
         1 AAATAAGTTGTAAAAAATTTATTGATATGACATAAAATAATTGAATGGAACCATATATGAGATGA

     76713 TGGGAGTGGTACGTTA
        66 TGGGAGTGGTACGTTA

                                                                           *
     76729 AAATAAGTTGTAAAAAA-TTATTGATATGACATAAAATAATTTG-ATGGAACCATATATGAGATT
         1 AAATAAGTTGTAAAAAATTTATTGATATGACATAAAATAA-TTGAATGGAACCATATATGAGATG

     76792 ATGGGAGTGGT
        65 ATGGGAGTGGT

     76803 CGGTAGAATA


Statistics
Matches: 69, Mismatches: 4, Indels: 3
        0.91           0.05       0.04

Matches are distributed among these distances:
   80   49  0.71
   81   20  0.29

ACGTcount: A:0.42, C:0.04, G:0.22, T:0.32

Consensus pattern (81 bp):
AAATAAGTTGTAAAAAATTTATTGATATGACATAAAATAATTGAATGGAACCATATATGAGATGA
TGGGAGTGGTACGTTA


Found at i:76867 original size:82 final size:82

Alignment explanation

Indices: 76758--76911  Score: 231
Period size: 82  Copynumber: 1.9  Consensus size: 82

     76748 ATTGATATGA

                       *                    *   *  *
     76758 CATAAAATAATTTGATGGAACCATATATGAGATTATGGGAGTGG-TCGGTAGAATAATTTCTCAT
         1 CATAAAATAATTAGATGGAACCATATATGAGATGATGAGAATGGTTC-GTAGAATAATTTCTCAT

     76822 TATTATCTATTAATGTGG
        65 TATTATCTATTAATGTGG

                                           *
     76840 CATAAAATAATTAGAT-GATACCATATATGAGGTGATGAGAATGGTTCGTAGAATAATTTCTCAT
         1 CATAAAATAATTAGATGGA-ACCATATATGAGATGATGAGAATGGTTCGTAGAATAATTTCTCAT

     76904 TATTATCT
        65 TATTATCT

     76912 TTTGACCATT


Statistics
Matches: 65, Mismatches: 5, Indels: 4
        0.88           0.07       0.05

Matches are distributed among these distances:
   81    2  0.03
   82   61  0.94
   83    2  0.03

ACGTcount: A:0.36, C:0.09, G:0.19, T:0.36

Consensus pattern (82 bp):
CATAAAATAATTAGATGGAACCATATATGAGATGATGAGAATGGTTCGTAGAATAATTTCTCATT
ATTATCTATTAATGTGG


Found at i:79858 original size:6 final size:6

Alignment explanation

Indices: 79844--79886  Score: 77
Period size: 6  Copynumber: 7.2  Consensus size: 6

     79834 TGAGTTTGGT

               *
     79844 CTCCTA CTCCCA CTCCCA CTCCCA CTCCCA CTCCCA CTCCCA C
         1 CTCCCA CTCCCA CTCCCA CTCCCA CTCCCA CTCCCA CTCCCA C

     79887 CGTAGAATAA


Statistics
Matches: 36, Mismatches: 1, Indels: 0
        0.97           0.03       0.00

Matches are distributed among these distances:
    6   36  1.00

ACGTcount: A:0.16, C:0.65, G:0.00, T:0.19

Consensus pattern (6 bp):
CTCCCA


Done.