Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01024420.1 Corchorus olitorius cultivar O-4 contig24453, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 33041
ACGTcount: A:0.33, C:0.17, G:0.17, T:0.33


Found at i:4179 original size:1 final size:1

Alignment explanation

Indices: 4173--4199  Score: 54
Period size: 1  Copynumber: 27.0  Consensus size: 1

      4163 AGACTCGGTC

      4173 AAAAAAAAAAAAAAAAAAAAAAAAAAA
         1 AAAAAAAAAAAAAAAAAAAAAAAAAAA

      4200 CTGCTGAAGA


Statistics
Matches: 26,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  1   26  1.00

ACGTcount: A:1.00, C:0.00, G:0.00, T:0.00

Consensus pattern (1 bp):
A


Found at i:5324 original size:14 final size:14

Alignment explanation

Indices: 5305--5331  Score: 54
Period size: 14  Copynumber: 1.9  Consensus size: 14

      5295 AATATATCTA

      5305 TATTTATCTATACC
         1 TATTTATCTATACC

      5319 TATTTATCTATAC
         1 TATTTATCTATAC

      5332 TATATATAAT


Statistics
Matches: 13,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
 14   13  1.00

ACGTcount: A:0.30, C:0.19, G:0.00, T:0.52

Consensus pattern (14 bp):
TATTTATCTATACC


Found at i:5656 original size:21 final size:21

Alignment explanation

Indices: 5632--5751  Score: 102
Period size: 21  Copynumber: 5.6  Consensus size: 21

      5622 AATTTTGGAA

                       *  *
      5632 GTTATCAAAATTCATTGTGTG
         1 GTTATCAAAATTTATAGTGTG

      5653 GTTA-CTAAAATTTATAGTGTG
         1 GTTATC-AAAATTTATAGTGTG

              *
      5674 GTTCTCAAAATTTTATAGTGTG
         1 GTTATCAAAA-TTTATAGTGTG

               *               *
      5696 GTTACCAAAATTTCATAGGTATG
         1 GTTATCAAAATTT-ATA-GTGTG

                      *
      5719 ATGTTA--AAATTTTATAGTGTG
         1 --GTTATCAAAATTTATAGTGTG

                  *
      5740 GTTATCATAATT
         1 GTTATCAAAATT

      5752 CCATGGGATG


Statistics
Matches: 80,  Mismatches: 10, Indels: 18
        0.74            0.09        0.17

Matches are distributed among these distances:
 19    4  0.05
 20    1  0.01
 21   35  0.44
 22   26  0.32
 23   10  0.12
 25    4  0.05

ACGTcount: A:0.32, C:0.07, G:0.17, T:0.43

Consensus pattern (21 bp):
GTTATCAAAATTTATAGTGTG


Found at i:5799 original size:27 final size:27

Alignment explanation

Indices: 5767--5865  Score: 99
Period size: 27  Copynumber: 4.0  Consensus size: 27

      5757 GGATGTTATC

                                *
      5767 AAAATTTCATAAGGAGGTTATTAAAAT
         1 AAAATTTCATAAGGAGGTTATCAAAAT

                          *
      5794 AAAATTTCATAAGGATGTTATCAAAAT
         1 AAAATTTCATAAGGAGGTTATCAAAAT

                      *
      5821 ----TTTCATATGGAGGTTATC-----
         1 AAAATTTCATAAGGAGGTTATCAAAAT

                                 *
      5839 AAAATTTCATAAGGAGGTTATCGAAAT
         1 AAAATTTCATAAGGAGGTTATCAAAAT

      5866 TCATGGCAAT


Statistics
Matches: 58,  Mismatches: 5, Indels: 18
        0.72            0.06        0.22

Matches are distributed among these distances:
 22   17  0.29
 23   16  0.28
 27   25  0.43

ACGTcount: A:0.42, C:0.07, G:0.16, T:0.34

Consensus pattern (27 bp):
AAAATTTCATAAGGAGGTTATCAAAAT


Found at i:6000 original size:22 final size:21

Alignment explanation

Indices: 5632--6000  Score: 180
Period size: 22  Copynumber: 16.9  Consensus size: 21

      5622 AATTTTGGAA

                           *   *
      5632 GTTATCAAAA-TTCATTGTGTG
         1 GTTATCAAAATTTCATAG-GAG

                                *
      5653 GTTA-CTAAAATTT-ATAGTGTG
         1 GTTATC-AAAATTTCATAG-GAG

              *         *      *
      5674 GTTCTCAAAATTTTATAGTGTG
         1 GTTATCAAAATTTCATAG-GAG

               *                *
      5696 GTTACCAAAATTTCATAGGTAT
         1 GTTATCAAAATTTCATAGG-AG

            * * *       *      *
      5718 GATGTTAAAATTTTATAGTGTG
         1 GTTATCAAAATTTCATAG-GAG

                  *    *   *   *
      5740 GTTATCATAATTCCATGGGAT
         1 GTTATCAAAATTTCATAGGAG

      5761 GTTATCAAAATTTCATAAGGAG
         1 GTTATCAAAATTTCAT-AGGAG

                     *               *
      5783 GTTATTAAAATAAAATTTCATAAGGAT
         1 GTTA-T----CAAAATTTCAT-AGGAG

      5810 GTTATCAAAATTTTCATATGGAG
         1 GTTATCAAAA-TTTCATA-GGAG

      5833 GTTATCAAAATTTCATAAGGAG
         1 GTTATCAAAATTTCAT-AGGAG

                 *
      5855 GTTATCGAAA-TTCAT-GGCA-
         1 GTTATCAAAATTTCATAGG-AG

            * *            *
      5874 -ATGTCAAAATTTCACAAGGAG
         1 GTTATCAAAATTTCA-TAGGAG

                              ***
      5895 GTTA-CTAAAATTTCATACTTTG
         1 GTTATC-AAAATTTCATA-GGAG

                               *
      5917 GTTATCAAAATTTCATAGGGCG
         1 GTTATCAAAATTTCATA-GGAG

            *    *
      5939 GCTATCGAAATCTT-ATATGGAG
         1 GTTATCAAAAT-TTCATA-GGAG

                *
      5961 GTTATTAAAATTTCATAGGAAG
         1 GTTATCAAAATTTCATAGG-AG

           *
      5983 ATTATCAAAATTTCATAG
         1 GTTATCAAAATTTCATAG

      6001 TGTGCTTATA


Statistics
Matches: 266,  Mismatches: 55, Indels: 53
        0.71            0.15        0.14

Matches are distributed among these distances:
 18    6  0.02
 19    6  0.02
 20    3  0.01
 21   54  0.20
 22  152  0.57
 23   25  0.09
 26    1  0.00
 27   19  0.07

ACGTcount: A:0.36, C:0.10, G:0.17, T:0.37

Consensus pattern (21 bp):
GTTATCAAAATTTCATAGGAG


Found at i:6009 original size:22 final size:22

Alignment explanation

Indices: 5794--6026  Score: 73
Period size: 22  Copynumber: 10.7  Consensus size: 22

      5784 TTATTAAAAT

      5794 AAAATTTCATAAG-GATG-TTATC
         1 AAAATTTCAT-AGTGA-GCTTATC

                             *
      5816 AAAATTTTCATA-TGGAGGTTATC
         1 AAAA-TTTCATAGT-GAGCTTATC

                            *
      5839 AAAATTTCATAAG-GAGGTTATC
         1 AAAATTTCAT-AGTGAGCTTATC

           *                 * *
      5861 GAAA-TTC--A-TG-GCAATGTC
         1 AAAATTTCATAGTGAGC-TTATC

                     *      *
      5879 AAAATTTCACAAG-GAGGTTA-C
         1 AAAATTTCA-TAGTGAGCTTATC

                       * ** *
      5900 TAAAATTTCATACTTTGGTTATC
         1 -AAAATTTCATAGTGAGCTTATC

                       *  *
      5923 AAAATTTCATAGGGCGGC-TATC
         1 AAAATTTCATAGTG-AGCTTATC

           *                 *    *
      5945 GAAATCTT-ATA-TGGAGGTTATT
         1 AAAAT-TTCATAGT-GAGCTTATC

                            *
      5967 AAAATTTCATAG-GAAGATTATC
         1 AAAATTTCATAGTG-AGCTTATC

                         *      *
      5989 AAAATTTCATAGTGTGCTTATA
         1 AAAATTTCATAGTGAGCTTATC

                 *
      6011 AAAATTACATAGTGAG
         1 AAAATTTCATAGTGAG

      6027 ATAGAGTGAG


Statistics
Matches: 156,  Mismatches: 30, Indels: 50
        0.66            0.13        0.21

Matches are distributed among these distances:
 17    1  0.01
 18    8  0.05
 19    3  0.02
 21    9  0.06
 22  111  0.71
 23   24  0.15

ACGTcount: A:0.38, C:0.11, G:0.17, T:0.34

Consensus pattern (22 bp):
AAAATTTCATAGTGAGCTTATC


Found at i:6174 original size:22 final size:22

Alignment explanation

Indices: 6038--6744  Score: 159
Period size: 22  Copynumber: 32.5  Consensus size: 22

      6028 TAGAGTGAGC

                               *
      6038 TTATCAAAATTTCA-AGTGTTG
         1 TTATCAAAATTTCATAGTGTGG

              *                **
      6059 TTACCAAAATTTCATAGTGTAA
         1 TTATCAAAATTTCATAGTGTGG

            *    *            *
      6081 TAATCACAATTTCATA--GAGG
         1 TTATCAAAATTTCATAGTGTGG

              *           * * * *
      6101 TTAACAAAATTTCATGGGGAGA
         1 TTATCAAAATTTCATAGTGTGG

                *           * *
      6123 TTATCGAAATTT--TAGAGGGG
         1 TTATCAAAATTTCATAGTGTGG

      6143 ATAATATCAAAATTTCATAGTGTGG
         1 -T--TATCAAAATTTCATAGTGTGG

                       *   *
      6168 TTATCAAAATTTTATAATGTGG
         1 TTATCAAAATTTCATAGTGTGG

                         *    * *
      6190 ---T----ATTTCAGAGGGAGAGG
         1 TTATCAAAATTTCATA--GTGTGG

                   *      *
      6207 TTATCAAATTTTCATTGTGTGG
         1 TTATCAAAATTTCATAGTGTGG

                *           *  * *
      6229 -TAGTTAAAATTTCATAATGAGT
         1 TTA-TCAAAATTTCATAGTGTGG

                              * *
      6251 TTATCAAAATTT-ATAGTGAGA
         1 TTATCAAAATTTCATAGTGTGG

              *        *
      6272 TTAACAAAATTTGATAGTGTGG
         1 TTATCAAAATTTCATAGTGTGG

             *         *    * * *
      6294 TTCTCAAAATTTTATAGGGAGA
         1 TTATCAAAATTTCATAGTGTGG

              *                 *
      6316 TTAACAAAATTTCATAG-GTAAG
         1 TTATCAAAATTTCATAGTGT-GG

               *** *   *  *        **
      6338 TTATTGTAGTTTTATGGTGTAATTAA
         1 TTATCAAAATTTCATAGTG----TGG

                          *   *
      6364 TTATCAAAATTTCATTG-GGGG
         1 TTATCAAAATTTCATAGTGTGG

                       *       **
      6385 TTATCAAAATTTAATAGTGTTC
         1 TTATCAAAATTTCATAGTGTGG

                       **     *
      6407 TTATCAAAATTTTGTAGT-AGG
         1 TTATCAAAATTTCATAGTGTGG

              * *      *       * *
      6428 TTAACGAAATTTTATAAG-GAGA
         1 TTATCAAAATTTCAT-AGTGTGG

                *          * *
      6450 TTATAAAAAATTT-ATCGGGAT-G
         1 TTAT-CAAAATTTCATAGTG-TGG

                              *
      6472 TTATCAAAATTTCATAAG-ATGG
         1 TTATCAAAATTTCAT-AGTGTGG

      6494 TTATCAAAATTTCATGAG-GTGG
         1 TTATCAAAATTTCAT-AGTGTGG

             *             *
      6516 TTTTCAAAATTTCA-AATG-GAG
         1 TTATCAAAATTTCATAGTGTG-G

                         *    * *
      6537 ATTTATCAAAATTTTATAGGGAGG
         1 --TTATCAAAATTTCATAGTGTGG

                *              *
      6561 TTTATAAAAATTTCATAGTGAGG
         1 -TTATCAAAATTTCATAGTGTGG

                 *     *      *
      6584 -TATCACAATTTTAT-GATATGG
         1 TTATCAAAATTTCATAG-TGTGG

                           *    *
      6605 TTATCAAAATTTCATAATGTGA
         1 TTATCAAAATTTCATAGTGTGG

                     *         *
      6627 TTA-CAAACACTT--TA-T-CGG
         1 TTATCAAA-ATTTCATAGTGTGG

                     *     *
      6645 TTATCAAAATATCATAATGTGCG
         1 TTATCAAAATTTCATAGTGTG-G

               *   *       * * *
      6668 CTTAACAACATTTCATTGGGAGG
         1 -TTATCAAAATTTCATAGTGTGG

           *                *
      6691 CTATCAAAATTTCAT-GGGTGG
         1 TTATCAAAATTTCATAGTGTGG

              *               *
      6712 TT-GCAAAATTTCATTAG-ATGG
         1 TTATCAAAATTTCA-TAGTGTGG

               *
      6733 TTATTAAAATTT
         1 TTATCAAAATTT

      6745 TTATAGGGAT


Statistics
Matches: 489,  Mismatches: 142, Indels: 109
        0.66            0.19        0.15

Matches are distributed among these distances:
 15    6  0.01
 17    3  0.01
 18    6  0.01
 19    6  0.01
 20   35  0.07
 21  104  0.21
 22  231  0.47
 23   54  0.11
 24   23  0.05
 25    8  0.02
 26   12  0.02
 27    1  0.00

ACGTcount: A:0.36, C:0.08, G:0.17, T:0.38

Consensus pattern (22 bp):
TTATCAAAATTTCATAGTGTGG


Found at i:6606 original size:89 final size:89

Alignment explanation

Indices: 6434--6623  Score: 217
Period size: 89  Copynumber: 2.1  Consensus size: 89

      6424 TAGGTTAACG

                 *                         *    *     *                 *
      6434 AAATTTTATAA-GGAGATTATAAAAAATTTATCGGGATGTTATCAAAATTTCATAAGATGGTTAT
         1 AAATTTCATAATGGAGATTATAAAAAATTTATAGGGAGGTTATAAAAATTTCATAAGATGGGTAT

                         *     *
      6498 CAAAATTTCATGAGGTGGTTTTCA
        66 CAAAATTTCATGAGATGGTTATCA

                                 *    *
      6522 AAATTTCA-AATGGAGATTTATCAAAATTTTATAGGGAGGTTTATAAAAATTTCAT-AG-TGAGG
         1 AAATTTCATAATGGAGA-TTATAAAAAATTTATAGGGAGG-TTATAAAAATTTCATAAGATG-GG

                *     *    *
      6584 TATCACAATTTTATGATATGGTTATCA
        63 TATCAAAATTTCATGAGATGGTTATCA

      6611 AAATTTCATAATG
         1 AAATTTCATAATG

      6624 TGATTACAAA


Statistics
Matches: 85,  Mismatches: 12, Indels: 8
        0.81            0.11        0.08

Matches are distributed among these distances:
 87    2  0.02
 88   14  0.16
 89   51  0.60
 90   18  0.21

ACGTcount: A:0.39, C:0.07, G:0.16, T:0.38

Consensus pattern (89 bp):
AAATTTCATAATGGAGATTATAAAAAATTTATAGGGAGGTTATAAAAATTTCATAAGATGGGTAT
CAAAATTTCATGAGATGGTTATCA


Found at i:6791 original size:22 final size:22

Alignment explanation

Indices: 6763--6822  Score: 66
Period size: 22  Copynumber: 2.7  Consensus size: 22

      6753 ATCTGGAGTG

      6763 TAACAAAATTTTATAGGGAAGT
         1 TAACAAAATTTTATAGGGAAGT

                      *   *   *
      6785 TAACAAAATTTCATATGGAGGT
         1 TAACAAAATTTTATAGGGAAGT

            **       *
      6807 TTTCAAAATTCTATAG
         1 TAACAAAATTTTATAG

      6823 TATCATCATA


Statistics
Matches: 30,  Mismatches: 8, Indels: 0
        0.79            0.21        0.00

Matches are distributed among these distances:
 22   30  1.00

ACGTcount: A:0.42, C:0.08, G:0.15, T:0.35

Consensus pattern (22 bp):
TAACAAAATTTTATAGGGAAGT


Found at i:6972 original size:2 final size:2

Alignment explanation

Indices: 6965--6991  Score: 54
Period size: 2  Copynumber: 13.5  Consensus size: 2

      6955 CTAAAACTAG

      6965 TA TA TA TA TA TA TA TA TA TA TA TA TA T
         1 TA TA TA TA TA TA TA TA TA TA TA TA TA T

      6992 TATTATTATT


Statistics
Matches: 25,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  2   25  1.00

ACGTcount: A:0.48, C:0.00, G:0.00, T:0.52

Consensus pattern (2 bp):
TA


Found at i:9167 original size:24 final size:26

Alignment explanation

Indices: 9135--9189  Score: 78
Period size: 27  Copynumber: 2.2  Consensus size: 26

      9125 ACAAGTCTAA

      9135 TGATGTTTC-TAC-TGATCATGTTGT
         1 TGATGTTTCTTACATGATCATGTTGT

                       *
      9159 TGATGTTTCTTTTCATGATCATGTTGT
         1 TGATGTTTC-TTACATGATCATGTTGT

      9186 TGAT
         1 TGAT

      9190 TCTTTTAAAC


Statistics
Matches: 27,  Mismatches: 1, Indels: 3
        0.87            0.03        0.10

Matches are distributed among these distances:
 24    9  0.33
 26    2  0.07
 27   16  0.59

ACGTcount: A:0.16, C:0.11, G:0.20, T:0.53

Consensus pattern (26 bp):
TGATGTTTCTTACATGATCATGTTGT


Found at i:9192 original size:24 final size:25

Alignment explanation

Indices: 9147--9195  Score: 73
Period size: 27  Copynumber: 1.9  Consensus size: 25

      9137 ATGTTTCTAC

      9147 TGATCATGTTGTTGATGTTTCTTTTCA
         1 TGATCATGTTGTTGA--TTTCTTTTCA

      9174 TGATCATGTTGTTGA-TTCTTTT
         1 TGATCATGTTGTTGATTTCTTTT

      9196 AAACATTTTG


Statistics
Matches: 22,  Mismatches: 0, Indels: 3
        0.88            0.00        0.12

Matches are distributed among these distances:
 24    7  0.32
 27   15  0.68

ACGTcount: A:0.14, C:0.10, G:0.18, T:0.57

Consensus pattern (25 bp):
TGATCATGTTGTTGATTTCTTTTCA


Found at i:13099 original size:35 final size:34

Alignment explanation

Indices: 13033--13100  Score: 93
Period size: 35  Copynumber: 2.0  Consensus size: 34

     13023 ACAACAACAT

                              * *
     13033 AAAAACCAACAACAAATTTGATGTAAAAAAAAAA
         1 AAAAACCAACAACAAATTTAAAGTAAAAAAAAAA

                                       *
     13067 AAAAGACCAACAACAAATTTAAAG-AAAGAAAAAA
         1 AAAA-ACCAACAACAAATTTAAAGTAAAAAAAAAA

     13101 TGTACCTTAC


Statistics
Matches: 30,  Mismatches: 3, Indels: 2
        0.86            0.09        0.06

Matches are distributed among these distances:
 34   13  0.43
 35   17  0.57

ACGTcount: A:0.69, C:0.12, G:0.07, T:0.12

Consensus pattern (34 bp):
AAAAACCAACAACAAATTTAAAGTAAAAAAAAAA


Found at i:16199 original size:34 final size:35

Alignment explanation

Indices: 16146--16216  Score: 101
Period size: 34  Copynumber: 2.1  Consensus size: 35

     16136 ACAAATATAC

               *
     16146 AACAGTTTAATCTTATCCTT-GTAA-ATTTCTTGGA
         1 AACAATTTAATCTTATCCTTAGTAATATTT-TTGGA

                         *
     16180 AACAATTTAATCTTGTCCTTAGTAATATTTTTGGA
         1 AACAATTTAATCTTATCCTTAGTAATATTTTTGGA

     16215 AA
         1 AA

     16217 AAACAGAAAT


Statistics
Matches: 33,  Mismatches: 2, Indels: 3
        0.87            0.05        0.08

Matches are distributed among these distances:
 34   18  0.55
 35   11  0.33
 36    4  0.12

ACGTcount: A:0.32, C:0.13, G:0.11, T:0.44

Consensus pattern (35 bp):
AACAATTTAATCTTATCCTTAGTAATATTTTTGGA


Found at i:21409 original size:23 final size:23

Alignment explanation

Indices: 21377--21433  Score: 96
Period size: 23  Copynumber: 2.5  Consensus size: 23

     21367 TGTGATAATA

                     *
     21377 ACGAAAAAACTAATATTGAAATG
         1 ACGAAAAAACCAATATTGAAATG

                *
     21400 ACGAAGAAACCAATATTGAAATG
         1 ACGAAAAAACCAATATTGAAATG

     21423 ACGAAAAAACC
         1 ACGAAAAAACC

     21434 CATCTAAATA


Statistics
Matches: 31,  Mismatches: 3, Indels: 0
        0.91            0.09        0.00

Matches are distributed among these distances:
 23   31  1.00

ACGTcount: A:0.56, C:0.14, G:0.14, T:0.16

Consensus pattern (23 bp):
ACGAAAAAACCAATATTGAAATG


Found at i:23073 original size:3 final size:3

Alignment explanation

Indices: 23065--23098  Score: 61
Period size: 3  Copynumber: 11.7  Consensus size: 3

     23055 TTTCTGAGCT

     23065 TTC TTC TTC -TC TTC TTC TTC TTC TTC TTC TTC TT
         1 TTC TTC TTC TTC TTC TTC TTC TTC TTC TTC TTC TT

     23099 TATCTTTATA


Statistics
Matches: 30,  Mismatches: 0, Indels: 2
        0.94            0.00        0.06

Matches are distributed among these distances:
  2    2  0.07
  3   28  0.93

ACGTcount: A:0.00, C:0.32, G:0.00, T:0.68

Consensus pattern (3 bp):
TTC


Found at i:27610 original size:29 final size:31

Alignment explanation

Indices: 27573--27634  Score: 83
Period size: 30  Copynumber: 2.1  Consensus size: 31

     27563 TTTTAGGGAC

                     *
     27573 TTGGCCCTT-GAACTTTAGATTTTGGACAAT
         1 TTGGCCCTTCAAACTTTAGATTTTGGACAAT

                             **
     27603 TTGG-CCTTCAAACTTTATTTTTTGGACAAT
         1 TTGGCCCTTCAAACTTTAGATTTTGGACAAT

     27633 TT
         1 TT

     27635 AGCACGAGTT


Statistics
Matches: 28,  Mismatches: 3, Indels: 2
        0.85            0.09        0.06

Matches are distributed among these distances:
 29    4  0.14
 30   24  0.86

ACGTcount: A:0.23, C:0.16, G:0.16, T:0.45

Consensus pattern (31 bp):
TTGGCCCTTCAAACTTTAGATTTTGGACAAT


Found at i:29302 original size:21 final size:21

Alignment explanation

Indices: 29273--29312  Score: 71
Period size: 21  Copynumber: 1.9  Consensus size: 21

     29263 TTTGGCGGGA

     29273 ATTTACTTTTTTCCTTTTTCC
         1 ATTTACTTTTTTCCTTTTTCC

               *
     29294 ATTTTCTTTTTTCCTTTTT
         1 ATTTACTTTTTTCCTTTTT

     29313 TATAGCAAGA


Statistics
Matches: 18,  Mismatches: 1, Indels: 0
        0.95            0.05        0.00

Matches are distributed among these distances:
 21   18  1.00

ACGTcount: A:0.07, C:0.20, G:0.00, T:0.72

Consensus pattern (21 bp):
ATTTACTTTTTTCCTTTTTCC


Found at i:29827 original size:19 final size:20

Alignment explanation

Indices: 29788--29828  Score: 57
Period size: 19  Copynumber: 2.1  Consensus size: 20

     29778 TTAATTTTTG

                           *
     29788 AAATTAAAAATAAAATTACA
         1 AAATTAAAAATAAAATCACA

                   *
     29808 AAATTAAACA-AAAATCACA
         1 AAATTAAAAATAAAATCACA

     29827 AA
         1 AA

     29829 CACTCTTCAA


Statistics
Matches: 19,  Mismatches: 2, Indels: 1
        0.86            0.09        0.05

Matches are distributed among these distances:
 19   10  0.53
 20    9  0.47

ACGTcount: A:0.71, C:0.10, G:0.00, T:0.20

Consensus pattern (20 bp):
AAATTAAAAATAAAATCACA


Found at i:32946 original size:2 final size:2

Alignment explanation

Indices: 32939--32972  Score: 68
Period size: 2  Copynumber: 17.0  Consensus size: 2

     32929 ATTATTACTT

     32939 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA
         1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA

     32973 GTTTCTCTCC


Statistics
Matches: 32,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  2   32  1.00

ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50

Consensus pattern (2 bp):
TA


Done.