Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: NTFQ01014218.1 Kokia drynarioides strain JFW-HI SEQ_129251, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 48549
ACGTcount: A:0.35, C:0.15, G:0.16, T:0.34


Found at i:29 original size:17 final size:18

Alignment explanation

Indices: 1--83  Score: 111
Period size: 17  Copynumber: 4.8  Consensus size: 18

                 *
          1 TAAAATAAATTTAAATTT
          1 TAAAACAAATTTAAATTT

         19 T-AAACAAATTT-AATTT
          1 TAAAACAAATTTAAATTT

                 *
         35 TAAAATAAATTTAAATTT
          1 TAAAACAAATTTAAATTT

         53 T-AAACAAATTT-AATTT
          1 TAAAACAAATTTAAATTT

                 *
         69 TAAAATAAATTTAAA
          1 TAAAACAAATTTAAA

         84 GGGAGTATGG

Statistics
Matches: 57, Mismatches: 4, Indels: 8
         0.83            0.06       0.12

Matches are distributed among these distances:
   16   12  0.21
   17   36  0.63
   18    9  0.16

ACGTcount: A:0.55, C:0.02, G:0.00, T:0.42

Consensus pattern (18 bp):
TAAAACAAATTTAAATTT

Found at i:44 original size:34 final size:34

Alignment explanation

Indices: 1--83  Score: 166
Period size: 34  Copynumber: 2.4  Consensus size: 34

          1 TAAAATAAATTTAAATTTTAAACAAATTTAATTT
          1 TAAAATAAATTTAAATTTTAAACAAATTTAATTT

         35 TAAAATAAATTTAAATTTTAAACAAATTTAATTT
          1 TAAAATAAATTTAAATTTTAAACAAATTTAATTT

         69 TAAAATAAATTTAAA
          1 TAAAATAAATTTAAA

         84 GGGAGTATGG

Statistics
Matches: 49, Mismatches: 0, Indels: 0
         1.00            0.00       0.00

Matches are distributed among these distances:
   34   49  1.00

ACGTcount: A:0.55, C:0.02, G:0.00, T:0.42

Consensus pattern (34 bp):
TAAAATAAATTTAAATTTTAAACAAATTTAATTT

Found at i:2463 original size:14 final size:14

Alignment explanation

Indices: 2444--2486  Score: 54
Period size: 14  Copynumber: 3.2  Consensus size: 14

       2434 TGAAGGAAAA

       2444 AAGAAAGAAGGAAG
          1 AAGAAAGAAGGAAG

                  *
       2458 AAGAAAAAAGGAAG
          1 AAGAAAGAAGGAAG

                *
       2472 AAG-CAGAA-GAAG
          1 AAGAAAGAAGGAAG

       2484 AAG
          1 AAG

       2487 GAGAAGGAGA

Statistics
Matches: 26, Mismatches: 3, Indels: 2
         0.84            0.10       0.06

Matches are distributed among these distances:
   12    7  0.27
   13    3  0.12
   14   16  0.62

ACGTcount: A:0.65, C:0.02, G:0.33, T:0.00

Consensus pattern (14 bp):
AAGAAAGAAGGAAG

Found at i:2463 original size:21 final size:22

Alignment explanation

Indices: 2439--2485  Score: 69
Period size: 21  Copynumber: 2.1  Consensus size: 22

       2429 TAATTTGAAG

       2439 GAAAAAAGAAAGAAG-GAAGAA
          1 GAAAAAAGAAAGAAGAGAAGAA

                    *
       2460 GAAAAAAGGAAGAAGCAGAAGAA
          1 GAAAAAAGAAAGAAG-AGAAGAA

       2483 GAA
          1 GAA

       2486 GGAGAAGGAG

Statistics
Matches: 23, Mismatches: 1, Indels: 2
         0.88            0.04       0.08

Matches are distributed among these distances:
   21   14  0.61
   23    9  0.39

ACGTcount: A:0.68, C:0.02, G:0.30, T:0.00

Consensus pattern (22 bp):
GAAAAAAGAAAGAAGAGAAGAA

Found at i:2665 original size:18 final size:18

Alignment explanation

Indices: 2642--2676  Score: 61
Period size: 18  Copynumber: 1.9  Consensus size: 18

       2632 AAAAAAAAGA

       2642 TAAGTTTGATTAATTTTT
          1 TAAGTTTGATTAATTTTT

                    *
       2660 TAAGTTTGGTTAATTTT
          1 TAAGTTTGATTAATTTT

       2677 AAATTTATTT

Statistics
Matches: 16, Mismatches: 1, Indels: 0
         0.94            0.06       0.00

Matches are distributed among these distances:
   18   16  1.00

ACGTcount: A:0.26, C:0.00, G:0.14, T:0.60

Consensus pattern (18 bp):
TAAGTTTGATTAATTTTT

Found at i:3629 original size:16 final size:16

Alignment explanation

Indices: 3583--3631  Score: 61
Period size: 16  Copynumber: 3.2  Consensus size: 16

       3573 TAAACCTAGC

       3583 TAATTAATTACCAAAA
          1 TAATTAATTACCAAAA

       3599 T-A-TAATATA--AAAA
          1 TAATTAAT-TACCAAAA

       3612 TAATTAATTACCAAAA
          1 TAATTAATTACCAAAA

       3628 TAAT
          1 TAAT

       3632 AACACCATCA

Statistics
Matches: 28, Mismatches: 0, Indels: 10
         0.74            0.00       0.26

Matches are distributed among these distances:
   13    5  0.18
   14    7  0.25
   15    7  0.25
   16    9  0.32

ACGTcount: A:0.59, C:0.08, G:0.00, T:0.33

Consensus pattern (16 bp):
TAATTAATTACCAAAA

Found at i:5004 original size:43 final size:43

Alignment explanation

Indices: 4922--5006  Score: 109
Period size: 43  Copynumber: 2.0  Consensus size: 43

       4912 ATTAACATGT

                                  *        *  *
       4922 TAAATTATATTACTTAACTCGTGTTAATATGGTTTCATGTTAC
          1 TAAATTATATTACTTAACTCGTATTAATATGCTTACATGTTAC

                           *    *
       4965 TAAATTATATTACTTTACTCTTATTAATAT-CTTGACATGTTA
          1 TAAATTATATTACTTAACTCGTATTAATATGCTT-ACATGTTA

       5007 TTAATTGTGC

Statistics
Matches: 36, Mismatches: 5, Indels: 2
         0.84            0.12       0.05

Matches are distributed among these distances:
   42    2  0.06
   43   34  0.94

ACGTcount: A:0.32, C:0.12, G:0.08, T:0.48

Consensus pattern (43 bp):
TAAATTATATTACTTAACTCGTATTAATATGCTTACATGTTAC

Found at i:5638 original size:45 final size:45

Alignment explanation

Indices: 5574--5710  Score: 184
Period size: 45  Copynumber: 3.0  Consensus size: 45

       5564 CCCTAGCTCA

                                 *                *
       5574 TCAAGCCAAGGATATCAGCCTTAGTTTGACGAGCCACCACAATAC
          1 TCAAGCCAAGGATATCAGCCTCAGTTTGACGAGCCACCGCAATAC

                  *                       *
       5619 TCAAGCTAAGGATATCAGCCTCAGTTTGACAAGCCACCGCAATAC
          1 TCAAGCCAAGGATATCAGCCTCAGTTTGACGAGCCACCGCAATAC

                 **           *  *       *   *
       5664 TCAAGGGAAGGATATCAGGCTGAGTTTGATGAGTCACCGCAATAC
          1 TCAAGCCAAGGATATCAGCCTCAGTTTGACGAGCCACCGCAATAC

       5709 TC
          1 TC

       5711 TACTCCTCCT

Statistics
Matches: 81, Mismatches: 11, Indels: 0
         0.88            0.12       0.00

Matches are distributed among these distances:
   45   81  1.00

ACGTcount: A:0.32, C:0.26, G:0.21, T:0.21

Consensus pattern (45 bp):
TCAAGCCAAGGATATCAGCCTCAGTTTGACGAGCCACCGCAATAC

Found at i:5747 original size:21 final size:21

Alignment explanation

Indices: 5702--5744  Score: 59
Period size: 21  Copynumber: 2.0  Consensus size: 21

       5692 ATGAGTCACC

                             *
       5702 GCAATACTCTACTCCTCCTGG
          1 GCAATACTCTACTCCTCATGG

                    *
       5723 GCAATACTTTACTCCTTCATGG
          1 GCAATACTCTACTCC-TCATGG

       5745 CAAATGAACC

Statistics
Matches: 19, Mismatches: 2, Indels: 1
         0.86            0.09       0.05

Matches are distributed among these distances:
   21   14  0.74
   22    5  0.26

ACGTcount: A:0.21, C:0.33, G:0.14, T:0.33

Consensus pattern (21 bp):
GCAATACTCTACTCCTCATGG

Found at i:5994 original size:21 final size:21

Alignment explanation

Indices: 5970--6028  Score: 66
Period size: 21  Copynumber: 2.8  Consensus size: 21

       5960 CCTAACATTA

                *
       5970 AACCCTAAACCTTAAACTTAG
          1 AACCATAAACCTTAAACTTAG

                            *
       5991 AACCATAAACCTTGAATCTTAG
          1 AACCATAAACCTT-AAACTTAG

               **
       6013 -ACTTTAAACCTTAAAC
          1 AACCATAAACCTTAAAC

       6029 ACTAAATCCT

Statistics
Matches: 32, Mismatches: 5, Indels: 3
         0.80            0.12       0.08

Matches are distributed among these distances:
   20    3  0.09
   21   22  0.69
   22    7  0.22

ACGTcount: A:0.42, C:0.25, G:0.05, T:0.27

Consensus pattern (21 bp):
AACCATAAACCTTAAACTTAG

Found at i:6001 original size:7 final size:7

Alignment explanation

Indices: 5967--6028  Score: 54
Period size: 7  Copynumber: 8.9  Consensus size: 7

       5957 TTTCCTAACA

       5967 TTAAACC
          1 TTAAACC

            *
       5974 CTAAACC
          1 TTAAACC

       5981 TTAAA-C
          1 TTAAACC

       5987 TTAGAACC
          1 TTA-AACC

            *
       5995 ATAAACC
          1 TTAAACC

              *  *
       6002 TTGAATC
          1 TTAAACC

               *  *
       6009 TTAGACT
          1 TTAAACC

       6016 TTAAACC
          1 TTAAACC

       6023 TTAAAC
          1 TTAAAC

       6029 ACTAAATCCT

Statistics
Matches: 41, Mismatches: 12, Indels: 4
         0.72            0.21       0.07

Matches are distributed among these distances:
    6    4  0.10
    7   34  0.83
    8    3  0.07

ACGTcount: A:0.42, C:0.24, G:0.05, T:0.29

Consensus pattern (7 bp):
TTAAACC

Found at i:8706 original size:4 final size:4

Alignment explanation

Indices: 8699--8750  Score: 52
Period size: 4  Copynumber: 13.0  Consensus size: 4

       8689 ATGCATTACA

                                            *   *    *    *
       8699 TTTC TTTC TTTC -TTC TTTC TTTCC TCTC ATTC ATTC ATTC TTTC TTTC
          1 TTTC TTTC TTTC TTTC TTTC TTT-C TTTC TTTC TTTC TTTC TTTC TTTC

       8747 TTTC
          1 TTTC

       8751 CCGTTTATTT

Statistics
Matches: 42, Mismatches: 4, Indels: 4
         0.84            0.08       0.08

Matches are distributed among these distances:
    3    3  0.07
    4   36  0.86
    5    3  0.07

ACGTcount: A:0.06, C:0.29, G:0.00, T:0.65

Consensus pattern (4 bp):
TTTC

Found at i:8917 original size:26 final size:27

Alignment explanation

Indices: 8888--8939  Score: 63
Period size: 27  Copynumber: 2.0  Consensus size: 27

       8878 GTCAAGTGGC

                                 *
       8888 AAAACACCT-CTT-AGTGCCGTCACTTG
          1 AAAA-ACCTCCTTCAGTGCCGCCACTTG

                *
       8914 AAAATCCTCCTTCAGTGCCGCCACTT
          1 AAAAACCTCCTTCAGTGCCGCCACTT

       8940 TGTGTCCTTC

Statistics
Matches: 22, Mismatches: 2, Indels: 3
         0.81            0.07       0.11

Matches are distributed among these distances:
   25    3  0.14
   26    7  0.32
   27   12  0.55

ACGTcount: A:0.25, C:0.35, G:0.13, T:0.27

Consensus pattern (27 bp):
AAAAACCTCCTTCAGTGCCGCCACTTG

Found at i:10506 original size:17 final size:18

Alignment explanation

Indices: 10484--10520  Score: 58
Period size: 18  Copynumber: 2.1  Consensus size: 18

      10474 TAATAAAAAT

      10484 ATAATTTA-ATTATTATA
          1 ATAATTTATATTATTATA

                        *
      10501 ATAATTTATATTTTTATA
          1 ATAATTTATATTATTATA

      10519 AT
          1 AT

      10521 TTTTAAAAAA

Statistics
Matches: 18, Mismatches: 1, Indels: 1
         0.90            0.05       0.05

Matches are distributed among these distances:
   17    8  0.44
   18   10  0.56

ACGTcount: A:0.43, C:0.00, G:0.00, T:0.57

Consensus pattern (18 bp):
ATAATTTATATTATTATA

Found at i:11435 original size:27 final size:29

Alignment explanation

Indices: 11377--11457  Score: 85
Period size: 27  Copynumber: 2.7  Consensus size: 29

      11367 TTAGTAAAAA

                                      **
      11377 AGTTTAGTTTTAATATTTAATTTAATTTTTG
          1 AGTTTA-TTTTAATA-TTAATTTAATACTTG

                                 *
      11408 A-TTT-TTTTAATATTAATTTGATACTTG
          1 AGTTTATTTTAATATTAATTTAATACTTG

      11435 AGTTTAATTTTAATATTCAATTT
          1 AGTTT-ATTTTAATATT-AATTT

      11458 TATATTCAAA

Statistics
Matches: 43, Mismatches: 3, Indels: 8
         0.80            0.06       0.15

Matches are distributed among these distances:
   27   13  0.30
   28   11  0.26
   30   13  0.30
   31    6  0.14

ACGTcount: A:0.31, C:0.02, G:0.07, T:0.59

Consensus pattern (29 bp):
AGTTTATTTTAATATTAATTTAATACTTG

Found at i:17517 original size:30 final size:30

Alignment explanation

Indices: 17481--17543  Score: 99
Period size: 30  Copynumber: 2.1  Consensus size: 30

      17471 TGGTGGGTTT

                             *    *
      17481 GATTTTAAAATAAATAAGTTATTGTGGATC
          1 GATTTTAAAATAAATAAATTATCGTGGATC

                                       *
      17511 GATTTTAAAATAAATAAATTATCGTGGCTC
          1 GATTTTAAAATAAATAAATTATCGTGGATC

      17541 GAT
          1 GAT

      17544 AATAAAAGAT

Statistics
Matches: 30, Mismatches: 3, Indels: 0
         0.91            0.09       0.00

Matches are distributed among these distances:
   30   30  1.00

ACGTcount: A:0.40, C:0.06, G:0.16, T:0.38

Consensus pattern (30 bp):
GATTTTAAAATAAATAAATTATCGTGGATC

Found at i:17550 original size:30 final size:30

Alignment explanation

Indices: 17486--17550  Score: 85
Period size: 30  Copynumber: 2.2  Consensus size: 30

      17476 GGTTTGATTT

                        *    *          **
      17486 TAAAATAAATAAGTTATTGTGGATCGATTT
          1 TAAAATAAATAAATTATCGTGGATCGATAA

                                  *
      17516 TAAAATAAATAAATTATCGTGGCTCGATAA
          1 TAAAATAAATAAATTATCGTGGATCGATAA

      17546 TAAAA
          1 TAAAA

      17551 GATGTTTTGG

Statistics
Matches: 30, Mismatches: 5, Indels: 0
         0.86            0.14       0.00

Matches are distributed among these distances:
   30   30  1.00

ACGTcount: A:0.46, C:0.06, G:0.14, T:0.34

Consensus pattern (30 bp):
TAAAATAAATAAATTATCGTGGATCGATAA

Found at i:24803 original size:29 final size:30

Alignment explanation

Indices: 24751--24807  Score: 80
Period size: 29  Copynumber: 1.9  Consensus size: 30

      24741 ATCTTTTTAG

                 *          *
      24751 TTGATTCATTTTAATAGTACAGGGACTAAA
          1 TTGATCCATTTTAATAATACAGGGACTAAA

                               *
      24781 TTGATCCA-TTTAATAATAGAGGGACTA
          1 TTGATCCATTTTAATAATACAGGGACTA

      24808 CTATGACCCG

Statistics
Matches: 24, Mismatches: 3, Indels: 1
         0.86            0.11       0.04

Matches are distributed among these distances:
   29   17  0.71
   30    7  0.29

ACGTcount: A:0.37, C:0.11, G:0.18, T:0.35

Consensus pattern (30 bp):
TTGATCCATTTTAATAATACAGGGACTAAA

Found at i:32741 original size:10 final size:10

Alignment explanation

Indices: 32726--32778  Score: 51
Period size: 10  Copynumber: 5.6  Consensus size: 10

      32716 TAAAAATTTG

      32726 TTAAATATAT
          1 TTAAATATAT

      32736 TTAAATATAT
          1 TTAAATATAT

            *
      32746 AT-AAT-TAT
          1 TTAAATATAT

             *
      32754 TCAAAT-T-T
          1 TTAAATATAT

      32762 TATAAATATAT
          1 T-TAAATATAT

      32773 TTAAAT
          1 TTAAAT

      32779 CACAATAAAA

Statistics
Matches: 35, Mismatches: 4, Indels: 8
         0.74            0.09       0.17

Matches are distributed among these distances:
    8    5  0.14
    9   11  0.31
   10   17  0.49
   11    2  0.06

ACGTcount: A:0.49, C:0.02, G:0.00, T:0.49

Consensus pattern (10 bp):
TTAAATATAT

Found at i:32741 original size:19 final size:18

Alignment explanation

Indices: 32719--32778  Score: 59
Period size: 19  Copynumber: 3.2  Consensus size: 18

      32709 TTTTATATAA

      32719 AAATTTGTTAAATATATTT
          1 AAATTT-TTAAATATATTT

                * *           *
      32738 AAATATATATAAT-TATTC
          1 AAATTTTTA-AATATATTT

      32756 AAATTTTATAAATATATTT
          1 AAATTTT-TAAATATATTT

      32775 AAAT
          1 AAAT

      32779 CACAATAAAA

Statistics
Matches: 32, Mismatches: 6, Indels: 6
         0.73            0.14       0.14

Matches are distributed among these distances:
   18   14  0.44
   19   18  0.56

ACGTcount: A:0.48, C:0.02, G:0.02, T:0.48

Consensus pattern (18 bp):
AAATTTTTAAATATATTT

Found at i:33086 original size:19 final size:20

Alignment explanation

Indices: 33059--33096  Score: 51
Period size: 19  Copynumber: 1.9  Consensus size: 20

      33049 TTTTGTTATT

                *
      33059 GTTTTGGCC-ATTTCAATCC
          1 GTTTCGGCCTATTTCAATCC

                      *
      33078 GTTTCGGCCTGTTTCAATC
          1 GTTTCGGCCTATTTCAATC

      33097 AATTTCAATT

Statistics
Matches: 16, Mismatches: 2, Indels: 1
         0.84            0.11       0.05

Matches are distributed among these distances:
   19    8  0.50
   20    8  0.50

ACGTcount: A:0.13, C:0.26, G:0.18, T:0.42

Consensus pattern (20 bp):
GTTTCGGCCTATTTCAATCC

Found at i:33539 original size:9 final size:9

Alignment explanation

Indices: 33525--33558  Score: 50
Period size: 9  Copynumber: 3.8  Consensus size: 9

      33515 AATATTATGA

      33525 ATTTTTTAT
          1 ATTTTTTAT

      33534 ATTTTTTAT
          1 ATTTTTTAT

            *      *
      33543 TTTTTTTTT
          1 ATTTTTTAT

      33552 ATTTTTT
          1 ATTTTTT

      33559 TTAGAAAGGC

Statistics
Matches: 22, Mismatches: 3, Indels: 0
         0.88            0.12       0.00

Matches are distributed among these distances:
    9   22  1.00

ACGTcount: A:0.15, C:0.00, G:0.00, T:0.85

Consensus pattern (9 bp):
ATTTTTTAT

Found at i:33542 original size:20 final size:20

Alignment explanation

Indices: 33519--33561  Score: 50
Period size: 20  Copynumber: 2.1  Consensus size: 20

      33509 TTTACAAATA

      33519 TTATGAATTTTTTATATTTT
          1 TTATGAATTTTTTATATTTT

                ***        *
      33539 TTATTTTTTTTTTATTTTTT
          1 TTATGAATTTTTTATATTTT

      33559 TTA
          1 TTA

      33562 GAAAGGCCAA

Statistics
Matches: 19, Mismatches: 4, Indels: 0
         0.83            0.17       0.00

Matches are distributed among these distances:
   20   19  1.00

ACGTcount: A:0.19, C:0.00, G:0.02, T:0.79

Consensus pattern (20 bp):
TTATGAATTTTTTATATTTT

Found at i:34424 original size:7 final size:7

Alignment explanation

Indices: 34412--34445  Score: 50
Period size: 7  Copynumber: 4.7  Consensus size: 7

      34402 CAATCCATTC

      34412 ATAAATA
          1 ATAAATA

                  *
      34419 ATAAATT
          1 ATAAATA

      34426 ATAAATTA
          1 ATAAA-TA

      34434 ATAAATA
          1 ATAAATA

      34441 ATAAA
          1 ATAAA

      34446 ATTATATTTT

Statistics
Matches: 24, Mismatches: 2, Indels: 2
         0.86            0.07       0.07

Matches are distributed among these distances:
    7   18  0.75
    8    6  0.25

ACGTcount: A:0.68, C:0.00, G:0.00, T:0.32

Consensus pattern (7 bp):
ATAAATA

Found at i:34437 original size:15 final size:14

Alignment explanation

Indices: 34412--34445  Score: 50
Period size: 15  Copynumber: 2.4  Consensus size: 14

      34402 CAATCCATTC

                         *
      34412 ATAAATAATAAATT
          1 ATAAATAATAAATA

      34426 ATAAATTAATAAATA
          1 ATAAA-TAATAAATA

      34441 ATAAA
          1 ATAAA

      34446 ATTATATTTT

Statistics
Matches: 18, Mismatches: 1, Indels: 1
         0.90            0.05       0.05

Matches are distributed among these distances:
   14    5  0.28
   15   13  0.72

ACGTcount: A:0.68, C:0.00, G:0.00, T:0.32

Consensus pattern (14 bp):
ATAAATAATAAATA

Found at i:37725 original size:23 final size:25

Alignment explanation

Indices: 37699--37745  Score: 62
Period size: 25  Copynumber: 2.0  Consensus size: 25

      37689 TACTAAATCC

                           *
      37699 ATTTATTT-ATT-ATTTATATAGTT
          1 ATTTATTTCATTCATATATATAGTT

                *
      37722 ATTTTTTTCATTCATATATATAGT
          1 ATTTATTTCATTCATATATATAGT

      37746 GGACCCGCCC

Statistics
Matches: 20, Mismatches: 2, Indels: 2
         0.83            0.08       0.08

Matches are distributed among these distances:
   23    7  0.35
   24    3  0.15
   25   10  0.50

ACGTcount: A:0.30, C:0.04, G:0.04, T:0.62

Consensus pattern (25 bp):
ATTTATTTCATTCATATATATAGTT

Found at i:38338 original size:24 final size:24

Alignment explanation

Indices: 38306--38353  Score: 96
Period size: 24  Copynumber: 2.0  Consensus size: 24

      38296 ATATTATAAG

      38306 TATATAAATAATCAACACTACCAC
          1 TATATAAATAATCAACACTACCAC

      38330 TATATAAATAATCAACACTACCAC
          1 TATATAAATAATCAACACTACCAC

      38354 AAAGAGCTAC

Statistics
Matches: 24, Mismatches: 0, Indels: 0
         1.00            0.00       0.00

Matches are distributed among these distances:
   24   24  1.00

ACGTcount: A:0.50, C:0.25, G:0.00, T:0.25

Consensus pattern (24 bp):
TATATAAATAATCAACACTACCAC

Found at i:42449 original size:14 final size:13

Alignment explanation

Indices: 42424--42457  Score: 52
Period size: 13  Copynumber: 2.6  Consensus size: 13

      42414 TTCAAGGTTT

      42424 AAGTTTAGATTTG
          1 AAGTTTAGATTTG

      42437 AAGTTTGAGATTTG
          1 AAGTTT-AGATTTG

      42451 -AGTTTAG
          1 AAGTTTAG

      42458 GGTTTAGGAT

Statistics
Matches: 20, Mismatches: 0, Indels: 3
         0.87            0.00       0.13

Matches are distributed among these distances:
   12    2  0.10
   13   11  0.55
   14    7  0.35

ACGTcount: A:0.29, C:0.00, G:0.26, T:0.44

Consensus pattern (13 bp):
AAGTTTAGATTTG

Found at i:45489 original size:6 final size:6

Alignment explanation

Indices: 45478--45507  Score: 60
Period size: 6  Copynumber: 5.0  Consensus size: 6

      45468 AAAAGGGGTT

      45478 GGGATG GGGATG GGGATG GGGATG GGGATG
          1 GGGATG GGGATG GGGATG GGGATG GGGATG

      45508 AGGTTTAAGA

Statistics
Matches: 24, Mismatches: 0, Indels: 0
         1.00            0.00       0.00

Matches are distributed among these distances:
    6   24  1.00

ACGTcount: A:0.17, C:0.00, G:0.67, T:0.17

Consensus pattern (6 bp):
GGGATG

Done.