Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: NTFQ01005949.1 Kokia drynarioides strain JFW-HI SEQ_120338, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 60265
ACGTcount: A:0.34, C:0.17, G:0.16, T:0.33


Found at i:2379 original size:32 final size:32

Alignment explanation

Indices: 2343--2404  Score: 97
Period size: 32  Copynumber: 1.9  Consensus size: 32

      2333 TATATTGATA

                            *   *
      2343 TAATTTTATGATAATATCTATTTATATTTTTT
         1 TAATTTTATGATAATATATATATATATTTTTT

                  *
      2375 TAATTTTTTGATAATATATATATATATTTT
         1 TAATTTTATGATAATATATATATATATTTT

      2405 ATTCAAACAG


Statistics
Matches: 27,  Mismatches: 3,  Indels: 0
        0.90            0.10        0.00

Matches are distributed among these distances:
   32       27  1.00

ACGTcount: A:0.34, C:0.02, G:0.03, T:0.61

Consensus pattern (32 bp):
TAATTTTATGATAATATATATATATATTTTTT


Found at i:4862 original size:17 final size:17

Alignment explanation

Indices: 4840--4873  Score: 50
Period size: 17  Copynumber: 2.0  Consensus size: 17

      4830 AGGCTGATAC

      4840 TTAAATTGAGTTTCTTT
         1 TTAAATTGAGTTTCTTT

                  * *
      4857 TTAAATTTATTTTCTTT
         1 TTAAATTGAGTTTCTTT

      4874 AAATTCTTAG


Statistics
Matches: 15,  Mismatches: 2,  Indels: 0
        0.88            0.12        0.00

Matches are distributed among these distances:
   17       15  1.00

ACGTcount: A:0.24, C:0.06, G:0.06, T:0.65

Consensus pattern (17 bp):
TTAAATTGAGTTTCTTT


Found at i:8425 original size:18 final size:18

Alignment explanation

Indices: 8381--8443  Score: 65
Period size: 18  Copynumber: 3.5  Consensus size: 18

      8371 TAACAAAACT

                        * *
      8381 CATAAAATAAGTAGTGCA
         1 CATAAAATAAGTAATTCA

            *  *
      8399 CCTATAATAAGTAATTCA
         1 CATAAAATAAGTAATTCA

                       *
      8417 CATAAAATAAG-CATATCA
         1 CATAAAATAAGTAAT-TCA

      8435 CATAAAATA
         1 CATAAAATA

      8444 TCAATTCACA


Statistics
Matches: 37,  Mismatches: 7,  Indels: 2
        0.80            0.15        0.04

Matches are distributed among these distances:
   17        2  0.05
   18       35  0.95

ACGTcount: A:0.52, C:0.14, G:0.08, T:0.25

Consensus pattern (18 bp):
CATAAAATAAGTAATTCA


Found at i:8458 original size:18 final size:18

Alignment explanation

Indices: 8381--8454  Score: 62
Period size: 18  Copynumber: 4.2  Consensus size: 18

      8371 TAACAAAACT

                      * * *
      8381 CATAAAATAAGTAGTGCA
         1 CATAAAATAAGCAATTCA

            *  *      *
      8399 CCTATAATAAGTAATTCA
         1 CATAAAATAAGCAATTCA

      8417 CATAAAATAAGC-ATATCA
         1 CATAAAATAAGCAAT-TCA

                     *
      8435 CATAAAAT-ATCAATTCA
         1 CATAAAATAAGCAATTCA

      8452 CAT
         1 CAT

      8455 TAAACATTAG


Statistics
Matches: 46,  Mismatches: 8,  Indels: 5
        0.78            0.14        0.08

Matches are distributed among these distances:
   17       10  0.22
   18       36  0.78

ACGTcount: A:0.50, C:0.16, G:0.07, T:0.27

Consensus pattern (18 bp):
CATAAAATAAGCAATTCA


Found at i:16197 original size:298 final size:297

Alignment explanation

Indices: 15621--16813  Score: 2032
Period size: 298  Copynumber: 4.0  Consensus size: 297

     15611 GAAAGGAAGA

                                                                       *
     15621 AGGACAACCACCATAAGAAGAAAAGGACATTCACGATCAATCGGGTAACTTCTTTCCTTATTACT
         1 AGGACAACCACCATAAGAAGAAAAGGACATTCACGATCAATCGGGTAACTTCTTTCCTTAATACT

                                                                          *
     15686 TTTTGGGCTACACGTTAAGGACAATGTGTTTTCTAAAGTGTGGGAGGATAACATAGGAATTTTCG
        66 TTTTGGGCTACACGTTAAGGACAATGTGTTTTCTAAAGTGTGGG-GGATAACATAGGAATTTTTG

     15751 GCATATAG-TTTT-TTTTTATTTTGACTTTTTGATGTGTGCTTGAGCTTGTAGAAATACAAGTGA
       130 GCATATAGTTTTTATTTTTATTTTGACTTTTTGATGTGTGCTTGAGCTTGTAGAAATACAAGTGA

              *
     15814 TTGATTAGGAGCATGTATAATAGGTTGATTGTGTTTTCTTGATAATCTTGGCATGATGATAAGGA
       195 TTGGTTAGGAGCATGTATAATAGGTTGATTGTGTTTTCTTGATAATCTTGGCATGATGATAAGGA

                   *
     15879 ATTTTACAGTTTTGAGCATGAAATTTTTGAAGTTCTAT
       260 ATTTTACATTTTTGAGCATGAAATTTTTGAAGTTCTAT

            *         *                      *     *
     15917 AAGACAACCACAATAAGAAGAAAAGGACATTCACAATCAACCGGGTAACTTCTTTCCTTAATACT
         1 AGGACAACCACCATAAGAAGAAAAGGACATTCACGATCAATCGGGTAACTTCTTTCCTTAATACT

                                    *                                      *
     15982 TTTTGGGCTACACGTTAAGGACAATATGTTTTCTAAAGTGTAGGGGGATAACATAGGAATTTTTA
        66 TTTTGGGCTACACGTTAAGGACAATGTGTTTTCTAAAGTGT-GGGGGATAACATAGGAATTTTTG

     16047 GCATATAGTTTTTATTTTTATTTTGACTTTTTGATGTGTGCTTGAGCTTGTAGAAATACAAGTGA
       130 GCATATAGTTTTTATTTTTATTTTGACTTTTTGATGTGTGCTTGAGCTTGTAGAAATACAAGTGA

     16112 TTGGTTAGGAGCATGTATAATAGGTTGATTGTGTTTTCTTGATAATCTTGGCATGATGATAAGGA
       195 TTGGTTAGGAGCATGTATAATAGGTTGATTGTGTTTTCTTGATAATCTTGGCATGATGATAAGGA

                        *
     16177 ATTTTACATTTTTAAGCATGAAATTTTTGAAGTTCTAT
       260 ATTTTACATTTTTGAGCATGAAATTTTTGAAGTTCTAT

     16215 AGGACAACCACCATAAGAAGAAAAGGACATTCACGATCAATCGGGTAACTTCTTTCCTTAATACT
         1 AGGACAACCACCATAAGAAGAAAAGGACATTCACGATCAATCGGGTAACTTCTTTCCTTAATACT

                                                          *
     16280 TTTTGGGCTACACGTTAAGGACAATGTGTTTTCTAAAGTGTGGGGGGTTAACATAGGAATTTTTG
        66 TTTTGGGCTACACGTTAAGGACAATGTGTTTTCTAAAGTGT-GGGGGATAACATAGGAATTTTTG

                              *    *
     16345 GCATATAGTTTTTATTTTTGTTTTCACTTTTTGATGTGTGCTTGAGCTTGTAGAAATACAAGTGA
       130 GCATATAGTTTTTATTTTTATTTTGACTTTTTGATGTGTGCTTGAGCTTGTAGAAATACAAGTGA

                                 *                                    *    *
     16410 TTGGTTAGGAGCATGTATAATAAGTTGATTGTGTTTTCTTGATAATCTTGGCATGATGACAAGGT
       195 TTGGTTAGGAGCATGTATAATAGGTTGATTGTGTTTTCTTGATAATCTTGGCATGATGATAAGGA

                        *                     *
     16475 ATTTTACATTTTTTAGCATGAAATTTTTGAAGTTCGAT
       260 ATTTTACATTTTTGAGCATGAAATTTTTGAAGTTCTAT

                                                *
     16513 AGGACAACCACCATAAGAAGAAAAGGACATTCACGATTAATCGGGTAACTTCTTTCCTTAATACT
         1 AGGACAACCACCATAAGAAGAAAAGGACATTCACGATCAATCGGGTAACTTCTTTCCTTAATACT

                        *                  *        *  *      *
     16578 TTTTGGGCTACACATTAAGGACAATGTGTTTTTTAAAGTGCCGGAGGATAATATAGGAATTTTTG
        66 TTTTGGGCTACACGTTAAGGACAATGTGTTTTCTAAAGTG-TGGGGGATAACATAGGAATTTTTG

                                                         *
     16643 GCATATAGTTTTTATTTTTATTTTGACTTTTTGATGTGTGCTTGAGTTTGTAGAAATACAAGTGA
       130 GCATATAGTTTTTATTTTTATTTTGACTTTTTGATGTGTGCTTGAGCTTGTAGAAATACAAGTGA

                        *             *                  *    *         *
     16708 TTGGTTAGGAGCAAGTATAGA-AGGTTAATTGTGTTTTCTTGATAAACTTGACATGATGATGAGG
       195 TTGGTTAGGAGCATGTATA-ATAGGTTGATTGTGTTTTCTTGATAATCTTGGCATGATGATAAGG

                  *
     16772 AATTTTATATTTTTGAGCATGAAATTTTTGAAGTTCTAT
       259 AATTTTACATTTTTGAGCATGAAATTTTTGAAGTTCTAT

     16811 AGG
         1 AGG

     16814 TTACATTATG


Statistics
Matches: 845,  Mismatches: 47,  Indels: 8
        0.94            0.05        0.01

Matches are distributed among these distances:
  296      126  0.15
  297        7  0.01
  298      711  0.84
  299        1  0.00

ACGTcount: A:0.30, C:0.11, G:0.21, T:0.38

Consensus pattern (297 bp):
AGGACAACCACCATAAGAAGAAAAGGACATTCACGATCAATCGGGTAACTTCTTTCCTTAATACT
TTTTGGGCTACACGTTAAGGACAATGTGTTTTCTAAAGTGTGGGGGATAACATAGGAATTTTTGG
CATATAGTTTTTATTTTTATTTTGACTTTTTGATGTGTGCTTGAGCTTGTAGAAATACAAGTGAT
TGGTTAGGAGCATGTATAATAGGTTGATTGTGTTTTCTTGATAATCTTGGCATGATGATAAGGAA
TTTTACATTTTTGAGCATGAAATTTTTGAAGTTCTAT


Found at i:19081 original size:12 final size:12

Alignment explanation

Indices: 19064--19098  Score: 52
Period size: 12  Copynumber: 2.9  Consensus size: 12

     19054 AAACTTTTGC

                    *
     19064 TTTTATTTTCTA
         1 TTTTATTTTATA

                      *
     19076 TTTTATTTTATC
         1 TTTTATTTTATA

     19088 TTTTATTTTAT
         1 TTTTATTTTAT

     19099 GCTCTAAAAA


Statistics
Matches: 21,  Mismatches: 2,  Indels: 0
        0.91            0.09        0.00

Matches are distributed among these distances:
   12       21  1.00

ACGTcount: A:0.17, C:0.06, G:0.00, T:0.77

Consensus pattern (12 bp):
TTTTATTTTATA


Found at i:19086 original size:17 final size:17

Alignment explanation

Indices: 19064--19098  Score: 54
Period size: 17  Copynumber: 2.1  Consensus size: 17

     19054 AAACTTTTGC

     19064 TTTTAT-TTTCTATTTTA
         1 TTTTATCTTT-TATTTTA

     19081 TTTTATCTTTTATTTTA
         1 TTTTATCTTTTATTTTA

     19098 T
         1 T

     19099 GCTCTAAAAA


Statistics
Matches: 17,  Mismatches: 0,  Indels: 2
        0.89            0.00        0.11

Matches are distributed among these distances:
   17       14  0.82
   18        3  0.18

ACGTcount: A:0.17, C:0.06, G:0.00, T:0.77

Consensus pattern (17 bp):
TTTTATCTTTTATTTTA


Found at i:19405 original size:31 final size:32

Alignment explanation

Indices: 19348--19417  Score: 99
Period size: 31  Copynumber: 2.2  Consensus size: 32

     19338 TCATATATGC

                         *      *
     19348 ATATATATAATCAAGACCTACGCATATATATT
         1 ATATATATAATCAAAACCTACACATATATATT

     19380 ATATATATAA-CAAAA-CTCACACATATATATT
         1 ATATATATAATCAAAACCT-ACACATATATATT

     19411 ATATATA
         1 ATATATA

     19418 CACATACCTT


Statistics
Matches: 35,  Mismatches: 2,  Indels: 3
        0.88            0.05        0.08

Matches are distributed among these distances:
   30        2  0.06
   31       23  0.66
   32       10  0.29

ACGTcount: A:0.49, C:0.14, G:0.03, T:0.34

Consensus pattern (32 bp):
ATATATATAATCAAAACCTACACATATATATT


Found at i:23396 original size:26 final size:26

Alignment explanation

Indices: 23359--23436  Score: 86
Period size: 26  Copynumber: 3.0  Consensus size: 26

     23349 TACGAACCAA

                  *
     23359 TTCAGCACATCG-TACTTTCGAGCCAG
         1 TTCAGCATATCGCT-CTTTCGAGCCAG

                              *
     23385 TTCAGCATATCGCTCTTTCTAGCCAG
         1 TTCAGCATATCGCTCTTTCGAGCCAG

                *    **     *
     23411 TTCAGTATATTTCTCTTACGAGCCAG
         1 TTCAGCATATCGCTCTTTCGAGCCAG

     23437 ATGACATATC


Statistics
Matches: 44,  Mismatches: 7,  Indels: 2
        0.83            0.13        0.04

Matches are distributed among these distances:
   26       43  0.98
   27        1  0.02

ACGTcount: A:0.22, C:0.28, G:0.17, T:0.33

Consensus pattern (26 bp):
TTCAGCATATCGCTCTTTCGAGCCAG


Found at i:23653 original size:24 final size:24

Alignment explanation

Indices: 23567--23666  Score: 101
Period size: 24  Copynumber: 4.2  Consensus size: 24

     23557 TGCAAGTTTA

                    *    **    * *
     23567 GTACGTTTACGCTCACCAGCTAAT
         1 GTACGTTTATGCTCGTCAGCCACT

              *          *     *
     23591 GTAAGTTTATGCTCTTCAGCTACT
         1 GTACGTTTATGCTCGTCAGCCACT

           *
     23615 ATACGTTTATGCTCGTCAGCCACT
         1 GTACGTTTATGCTCGTCAGCCACT

                           * *
     23639 GTACGTTTATGCTCGTTAACCACT
         1 GTACGTTTATGCTCGTCAGCCACT

     23663 GTAC
         1 GTAC

     23667 ATTTGATACT


Statistics
Matches: 64,  Mismatches: 12,  Indels: 0
        0.84            0.16        0.00

Matches are distributed among these distances:
   24       64  1.00

ACGTcount: A:0.22, C:0.26, G:0.17, T:0.35

Consensus pattern (24 bp):
GTACGTTTATGCTCGTCAGCCACT


Found at i:24824 original size:71 final size:71

Alignment explanation

Indices: 24742--24882  Score: 273
Period size: 71  Copynumber: 2.0  Consensus size: 71

     24732 ATTTTAATAA

                                                    *
     24742 ATACTAATGACCTCGTAAATATTAAATAATTAATATTTACAGTCTAACACATCGAAGTTGTGGCT
         1 ATACTAATGACCTCGTAAATATTAAATAATTAATATTTACAATCTAACACATCGAAGTTGTGGCT

     24807 TCAAAC
        66 TCAAAC

     24813 ATACTAATGACCTCGTAAATATTAAATAATTAATATTTACAATCTAACACATCGAAGTTGTGGCT
         1 ATACTAATGACCTCGTAAATATTAAATAATTAATATTTACAATCTAACACATCGAAGTTGTGGCT

     24878 TCAAA
        66 TCAAA

     24883 ATCACCAATT


Statistics
Matches: 69,  Mismatches: 1,  Indels: 0
        0.99            0.01        0.00

Matches are distributed among these distances:
   71       69  1.00

ACGTcount: A:0.40, C:0.16, G:0.11, T:0.33

Consensus pattern (71 bp):
ATACTAATGACCTCGTAAATATTAAATAATTAATATTTACAATCTAACACATCGAAGTTGTGGCT
TCAAAC


Found at i:25967 original size:18 final size:18

Alignment explanation

Indices: 25944--25983  Score: 62
Period size: 18  Copynumber: 2.2  Consensus size: 18

     25934 ACGATTTTTA

                 *    *
     25944 TTTTATTTTTTGTTTATT
         1 TTTTATCTTTTCTTTATT

     25962 TTTTATCTTTTCTTTATT
         1 TTTTATCTTTTCTTTATT

     25980 TTTT
         1 TTTT

     25984 TAAGTTCAGT


Statistics
Matches: 20,  Mismatches: 2,  Indels: 0
        0.91            0.09        0.00

Matches are distributed among these distances:
   18       20  1.00

ACGTcount: A:0.10, C:0.05, G:0.03, T:0.82

Consensus pattern (18 bp):
TTTTATCTTTTCTTTATT


Found at i:30975 original size:13 final size:13

Alignment explanation

Indices: 30957--30981  Score: 50
Period size: 13  Copynumber: 1.9  Consensus size: 13

     30947 GAGAAATAGT

     30957 GAAAAAGAAAAAA
         1 GAAAAAGAAAAAA

     30970 GAAAAAGAAAAA
         1 GAAAAAGAAAAA

     30982 TGAGTAGAAA


Statistics
Matches: 12,  Mismatches: 0,  Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   13       12  1.00

ACGTcount: A:0.84, C:0.00, G:0.16, T:0.00

Consensus pattern (13 bp):
GAAAAAGAAAAAA


Found at i:39439 original size:181 final size:181

Alignment explanation

Indices: 39133--39486  Score: 548
Period size: 181  Copynumber: 2.0  Consensus size: 181

     39123 AATATTATTA

                   *                          *                  *       *
     39133 TACTTAAGTAATTGAGGATGAGAAGTAGAGACTATGAGCATTCTTTGTAGTAAATAAAAAATGAT
         1 TACTTAAGCAATTGAGGATGAGAAGTAGAGACTATAAGCATTCTTTGTAGTAAAGAAAAAATAAT

                     *            *                      *      *
     39198 CATCCCCAAGTAAATCTCAAGAATACACTCACGATTTTTATCCAAATAATTAATTTTCAATGCTT
        66 CATCCCCAAGCAAATCTCAAGAAGACACTCACGATTTTTATCCAAACAATTAAATTTCAATGCTT

     39263 GGAGTCCAACAAAATTTTAACGCAAACTAAGAACCTATTACTCATTAGACT
       131 GGAGTCCAACAAAATTTTAACGCAAACTAAGAACCTATTACTCATTAGACT

                         *              *                              *
     39314 TACTTAAGCAATTGGGGATGAGAAGTAGATACTATAAGCATTCTTTGTAGTAAAGAAAAAGTAAT
         1 TACTTAAGCAATTGAGGATGAGAAGTAGAGACTATAAGCATTCTTTGTAGTAAAGAAAAAATAAT

                                   *      *
     39379 CATCCCCAAGCAAATCTCAAG-AGGCACTCATGATTTTTTATCCAAACAATTAAATTTCAATGCT
        66 CATCCCCAAGCAAATCTCAAGAAGACACTCACGA-TTTTTATCCAAACAATTAAATTTCAATGCT

                              * *             *
     39443 TGGAGTCCAACAAAATTTTGATGCAAACTAAGAACTTATTACTC
       130 TGGAGTCCAACAAAATTTTAACGCAAACTAAGAACCTATTACTC

     39487 TTGAATAAAA


Statistics
Matches: 156,  Mismatches: 16,  Indels: 2
        0.90            0.09        0.01

Matches are distributed among these distances:
  180        9  0.06
  181      147  0.94

ACGTcount: A:0.39, C:0.17, G:0.14, T:0.30

Consensus pattern (181 bp):
TACTTAAGCAATTGAGGATGAGAAGTAGAGACTATAAGCATTCTTTGTAGTAAAGAAAAAATAAT
CATCCCCAAGCAAATCTCAAGAAGACACTCACGATTTTTATCCAAACAATTAAATTTCAATGCTT
GGAGTCCAACAAAATTTTAACGCAAACTAAGAACCTATTACTCATTAGACT


Found at i:42512 original size:26 final size:26

Alignment explanation

Indices: 42476--42530  Score: 110
Period size: 26  Copynumber: 2.1  Consensus size: 26

     42466 ACTTTCAAGA

     42476 ATACCTTATCGCCCACAGCATATTCG
         1 ATACCTTATCGCCCACAGCATATTCG

     42502 ATACCTTATCGCCCACAGCATATTCG
         1 ATACCTTATCGCCCACAGCATATTCG

     42528 ATA
         1 ATA

     42531 TCTCTCCGTT


Statistics
Matches: 29,  Mismatches: 0,  Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   26       29  1.00

ACGTcount: A:0.29, C:0.33, G:0.11, T:0.27

Consensus pattern (26 bp):
ATACCTTATCGCCCACAGCATATTCG


Found at i:47484 original size:17 final size:17

Alignment explanation

Indices: 47462--47528  Score: 82
Period size: 17  Copynumber: 3.9  Consensus size: 17

     47452 GACAAATGGA

     47462 AATGCAATGACAATAAG
         1 AATGCAATGACAATAAG

                 *
     47479 AATGCAGTGACAATTAA-
         1 AATGCAATGACAA-TAAG

                         *
     47496 AATGCAATGACAATGAG
         1 AATGCAATGACAATAAG

               **
     47513 AATGGGATGACAATAA
         1 AATGCAATGACAATAA

     47529 TTACTAACAG


Statistics
Matches: 42,  Mismatches: 6,  Indels: 4
        0.81            0.12        0.08

Matches are distributed among these distances:
   16        2  0.05
   17       37  0.88
   18        3  0.07

ACGTcount: A:0.49, C:0.10, G:0.21, T:0.19

Consensus pattern (17 bp):
AATGCAATGACAATAAG


Found at i:47982 original size:21 final size:21

Alignment explanation

Indices: 47956--47995  Score: 55
Period size: 21  Copynumber: 1.9  Consensus size: 21

     47946 TTTACCTCGA

                          *
     47956 CTTCTT-TTCCTTCCTCTTGTT
         1 CTTCTTCTT-CTTCCGCTTGTT

     47977 CTTCTTCTTCTTCCGCTTG
         1 CTTCTTCTTCTTCCGCTTG

     47996 CATTTTTCCC


Statistics
Matches: 17,  Mismatches: 1,  Indels: 2
        0.85            0.05        0.10

Matches are distributed among these distances:
   21       15  0.88
   22        2  0.12

ACGTcount: A:0.00, C:0.35, G:0.07, T:0.57

Consensus pattern (21 bp):
CTTCTTCTTCTTCCGCTTGTT


Found at i:49748 original size:6 final size:6

Alignment explanation

Indices: 49739--49763  Score: 50
Period size: 6  Copynumber: 4.2  Consensus size: 6

     49729 AATTAAACTG

     49739 AAACAA AAACAA AAACAA AAACAA A
         1 AAACAA AAACAA AAACAA AAACAA A

     49764 CTAAAATTAC


Statistics
Matches: 19,  Mismatches: 0,  Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
    6       19  1.00

ACGTcount: A:0.84, C:0.16, G:0.00, T:0.00

Consensus pattern (6 bp):
AAACAA


Found at i:53628 original size:18 final size:18

Alignment explanation

Indices: 53592--53635  Score: 52
Period size: 18  Copynumber: 2.4  Consensus size: 18

     53582 CATCAGGTTA

                      * **
     53592 CATTGACATAGTTTTTTG
         1 CATTGACATAGGTAGTTG

     53610 CATTGACATAGGTAGTTG
         1 CATTGACATAGGTAGTTG

              *
     53628 CATCGACA
         1 CATTGACA

     53636 ATTTTGTCAA


Statistics
Matches: 22,  Mismatches: 4,  Indels: 0
        0.85            0.15        0.00

Matches are distributed among these distances:
   18       22  1.00

ACGTcount: A:0.27, C:0.16, G:0.20, T:0.36

Consensus pattern (18 bp):
CATTGACATAGGTAGTTG


Found at i:60179 original size:18 final size:20

Alignment explanation

Indices: 60135--60184  Score: 68
Period size: 18  Copynumber: 2.5  Consensus size: 20

     60125 TAATTTAAAC

     60135 AAGTTTAGCATACTTATATAA
         1 AAGTTTAG-ATACTTATATAA

                              *
     60156 AAGTTTAGATA-TTA-ATAG
         1 AAGTTTAGATACTTATATAA

     60174 AAGTTTAGATA
         1 AAGTTTAGATA

     60185 GCATTCCCAA


Statistics
Matches: 28,  Mismatches: 1,  Indels: 3
        0.88            0.03        0.09

Matches are distributed among these distances:
   18       14  0.50
   19        3  0.11
   20        3  0.11
   21        8  0.29

ACGTcount: A:0.44, C:0.04, G:0.14, T:0.38

Consensus pattern (20 bp):
AAGTTTAGATACTTATATAA


Done.