Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01012793.1 Corchorus capsularis cultivar CVL-1 contig12814, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 61544
ACGTcount: A:0.32, C:0.18, G:0.18, T:0.31


Found at i:7694 original size:1 final size:1

Alignment explanation

Indices: 7688--7713 Score: 52 Period size: 1 Copynumber: 26.0 Consensus size: 1 7678 ACAGTTCAAG 7688 TTTTTTTTTTTTTTTTTTTTTTTTTT 1 TTTTTTTTTTTTTTTTTTTTTTTTTT 7714 ATCAAGAGGA Statistics Matches: 25, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 1 25 1.00 ACGTcount: A:0.00, C:0.00, G:0.00, T:1.00 Consensus pattern (1 bp): T Found at i:8188 original size:19 final size:19 Alignment explanation

Indices: 8164--8203 Score: 80 Period size: 19 Copynumber: 2.1 Consensus size: 19 8154 ATTCAAAAAG 8164 TTGGGTTTGGATCTGTATC 1 TTGGGTTTGGATCTGTATC 8183 TTGGGTTTGGATCTGTATC 1 TTGGGTTTGGATCTGTATC 8202 TT 1 TT 8204 CAGTATAAAC Statistics Matches: 21, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 19 21 1.00 ACGTcount: A:0.10, C:0.10, G:0.30, T:0.50 Consensus pattern (19 bp): TTGGGTTTGGATCTGTATC Found at i:10298 original size:12 final size:12 Alignment explanation

Indices: 10281--10305 Score: 50 Period size: 12 Copynumber: 2.1 Consensus size: 12 10271 TAAACTATAG 10281 TTTTTAGACGTT 1 TTTTTAGACGTT 10293 TTTTTAGACGTT 1 TTTTTAGACGTT 10305 T 1 T 10306 GACCCCTTTT Statistics Matches: 13, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 12 13 1.00 ACGTcount: A:0.16, C:0.08, G:0.16, T:0.60 Consensus pattern (12 bp): TTTTTAGACGTT Found at i:10626 original size:19 final size:21 Alignment explanation

Indices: 10597--10638 Score: 61 Period size: 20 Copynumber: 2.1 Consensus size: 21 10587 ATAAACTATG 10597 AACTAAAATTG-AAATAATTA 1 AACTAAAATTGCAAATAATTA * 10617 AACT-AAATTGCAAGTAATTA 1 AACTAAAATTGCAAATAATTA 10637 AA 1 AA 10639 ATAGAAGAAA Statistics Matches: 20, Mismatches: 1, Indels: 2 0.87 0.04 0.09 Matches are distributed among these distances: 19 6 0.30 20 14 0.70 ACGTcount: A:0.57, C:0.07, G:0.07, T:0.29 Consensus pattern (21 bp): AACTAAAATTGCAAATAATTA Found at i:16650 original size:13 final size:13 Alignment explanation

Indices: 16634--16669 Score: 54 Period size: 13 Copynumber: 2.8 Consensus size: 13 16624 AACATCCAAA 16634 AACAACAAAGCAG 1 AACAACAAAGCAG * 16647 AACAACAAAGTAG 1 AACAACAAAGCAG * 16660 GACAACAAAG 1 AACAACAAAG 16670 GGATGGGCAA Statistics Matches: 21, Mismatches: 2, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 13 21 1.00 ACGTcount: A:0.61, C:0.19, G:0.17, T:0.03 Consensus pattern (13 bp): AACAACAAAGCAG Found at i:18541 original size:3 final size:3 Alignment explanation

Indices: 18533--18569 Score: 74 Period size: 3 Copynumber: 12.3 Consensus size: 3 18523 TAGTTGTGTG 18533 ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA A 1 ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA A 18570 GCTTTGATTA Statistics Matches: 34, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 3 34 1.00 ACGTcount: A:0.68, C:0.00, G:0.00, T:0.32 Consensus pattern (3 bp): ATA Found at i:19521 original size:16 final size:16 Alignment explanation

Indices: 19500--19534 Score: 61 Period size: 16 Copynumber: 2.2 Consensus size: 16 19490 ACAATTCAGA * 19500 AAGCAGAAGAGCTCTG 1 AAGCAGAAAAGCTCTG 19516 AAGCAGAAAAGCTCTG 1 AAGCAGAAAAGCTCTG 19532 AAG 1 AAG 19535 TATTTCAGAT Statistics Matches: 18, Mismatches: 1, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 16 18 1.00 ACGTcount: A:0.43, C:0.17, G:0.29, T:0.11 Consensus pattern (16 bp): AAGCAGAAAAGCTCTG Found at i:20365 original size:16 final size:16 Alignment explanation

Indices: 20344--20378 Score: 61 Period size: 16 Copynumber: 2.2 Consensus size: 16 20334 ACAATTCAGA * 20344 AAGCAGAAGAGCTCTG 1 AAGCAGAAAAGCTCTG 20360 AAGCAGAAAAGCTCTG 1 AAGCAGAAAAGCTCTG 20376 AAG 1 AAG 20379 TATTTCAGAT Statistics Matches: 18, Mismatches: 1, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 16 18 1.00 ACGTcount: A:0.43, C:0.17, G:0.29, T:0.11 Consensus pattern (16 bp): AAGCAGAAAAGCTCTG Found at i:20734 original size:844 final size:843 Alignment explanation

Indices: 19104--20797 Score: 3271 Period size: 844 Copynumber: 2.0 Consensus size: 843 19094 AATTTCGACC * 19104 AATCAGAAATTTTCTACAAATTTTCTTTTGAGAATTTATCCTTAAATTTATTGGATATTTTCTAA 1 AATCAAAAATTTTCTACAAATTTTCTTTTGAGAATTTATCCTTAAATTTATTGGATATTTTCTAA 19169 AGAAAAATGGTAGATAAATATGACACATGGTCTAACCAATTAAAATTTCAATGTTTAATGGTTTG 66 AGAAAAATGGTAGATAAATATGACACATGGTCTAACCAATTAAAATTTCAATGTTTAATGGTTTG * * 19234 ACCAATCAAAATTAAGTTGTGAGTCATCACCTTTTATCTCTAATTGAATTGAGATAATTGGTAAA 131 ACCAATCAAAACTAAGTTGTGAGTCATCACCTTTTATCTCTAATTAAATTGAGATAATTGGTAAA * * 19299 TAAATTGCATTATCTGGCCAATCAAATCTCAAGGTTTCAAGCATTAACCAATCAGAAATTACTGT 196 TAAATTGCATCATCTGGCCAATCAAATCTCAAGGTTTCAAGCATTAACCAATCAGAAATCACTGT 19364 TGTGTCATCTTATTTATCTCTAATTAAAATTAATTATTTAGAGATAAATAACAAGGAGAAATTAA 261 TGTGTCATCTTATTTATCTCTAATTAAAATTAATTATTTAGAGATAAATAACAAGGAGAAATTAA 19429 GTTATCTTGAAACTCTCCATCCCCCACTATAAATACCAAGCTCCCAAGTCATTTTCAAGAGACAA 326 GTTATCTTGAAACTCTCCATCCCCCACTATAAATACCAAGCTCCCAAGTCATTTTCAAGAGACAA 19494 TTCAGAAAGCAGAAGAGCTCTGAAGCAGAAAAGCTCTGAAGTATTTCAGATGTTCTTATTCTGAT 391 TTCAGAAAGCAGAAGAGCTCTGAAGCAGAAAAGCTCTGAAGTATTTCAGATGTTCTTATTCTGAT * 19559 CTTCACCAAAGCTTAAGAAGATTGAAAGAAGATCCACGTATGTGGAAAATTCTTCTTTCAAAGAA 456 ATTCACCAAAGCTTAAGAAGATTGAAAGAAGATCCACGTATGTGGAAAATTCTTCTTTCAAAGAA * 19624 GATTCAATTATCGGAGAATTACTGAAGACCCAGTTATTGGGAAATTATTGAAAGAAGATCCACGT 521 GATTCAATTATCGGAGAATTACTGAAAACCCAGTTATTGGGAAATTATTGAAAGAAGATCCACGT 19689 ATGTGGAGGATTCTTCTTTCAAAGAAGATCCAAGGAAGATTTTCAAAGATTTACTGAAGCTCGCT 586 ATGTGGAGGATTCTTCTTTCAAAGAAGATCCAAGGAAGATTTTCAAAGATTTACTGAAGCTCGCT * 19754 TTCAAGAAATTATTATTTTGATCTTCATCAACATATTTGAAGAAGATCATTTGTGTTTGTTCAAG 651 TTCAAGAAATTATTATTTTGATCTTCATCAACATATTTGAAGAAGATCATTTGTGTTCGTTCAAG 19819 ATCAAGTCATTCGACCATTGAATCAAATTATCATCAATTCGAGATCAAGTCATCAAAGACCCTCG 716 ATCAAGTCATTCGACCATTGAATCAAATTATCATCAATTCGAGATCAAGTCATCAAAGACCCTCG 19884 AATCAAATCAAATTCCCAAGTCATCAATTCAAGATTAAGTCATTCGACTCTTGAATCAAATCA 781 AATCAAATCAAATTCCCAAGTCATCAATTCAAGATTAAGTCATTCGACTCTTGAATCAAATCA 19947 AATCAAAAATTTTCTACAAATTTTCTTTTGAGAATTTATCCTTAAATTTATTGGATATTTTCTAA 1 AATCAAAAATTTTCTACAAATTTTCTTTTGAGAATTTATCCTTAAATTTATTGGATATTTTCTAA 20012 AGAAAAATGGTAGATAAATATGACACATGGTCTAACCAATTAAAATTTCAATGTTTAATGGTTTG 66 AGAAAAATGGTAGATAAATATGACACATGGTCTAACCAATTAAAATTTCAATGTTTAATGGTTTG 20077 ACCAATCAAAACTAAGTTGTGAGTCATCACCTTTTATCTCTAATTAAATTGAGATAATTGGTAAA 131 ACCAATCAAAACTAAGTTGTGAGTCATCACCTTTTATCTCTAATTAAATTGAGATAATTGGTAAA 20142 TAAATTGCATCATCTGGCCAATCAAATCTCAAGGTTTCAAGCATTAACCAATCAGAAATCACTGT 196 TAAATTGCATCATCTGGCCAATCAAATCTCAAGGTTTCAAGCATTAACCAATCAGAAATCACTGT 20207 TGTGTCATCTTATTTATCTCTAATTAAAATTAATTATTTAGAGATAAATAACAAGGAGAAATTAA 261 TGTGTCATCTTATTTATCTCTAATTAAAATTAATTATTTAGAGATAAATAACAAGGAGAAATTAA 20272 GTTATCTTGAAACTCTCCATCCCCCCACTATAAATACCAAGCTCCCAAGTCATTTTCAAGAGACA 326 GTTATCTTGAAACTCTCCAT-CCCCCACTATAAATACCAAGCTCCCAAGTCATTTTCAAGAGACA 20337 ATTCAGAAAGCAGAAGAGCTCTGAAGCAGAAAAGCTCTGAAGTATTTCAGATGTTCTTATTCTGA 390 ATTCAGAAAGCAGAAGAGCTCTGAAGCAGAAAAGCTCTGAAGTATTTCAGATGTTCTTATTCTGA 20402 TATTCACCAAAGCTTAAGAAGATTGAAAGAAGATCCACGTATGTGGAAAATTCTTCTTTCAAAGA 455 TATTCACCAAAGCTTAAGAAGATTGAAAGAAGATCCACGTATGTGGAAAATTCTTCTTTCAAAGA 20467 AGATTCAATTATCGGAGAATTACTGAAAACCCAGTTATTGGGAAATTATTGAAAGAAGATCCACG 520 AGATTCAATTATCGGAGAATTACTGAAAACCCAGTTATTGGGAAATTATTGAAAGAAGATCCACG * 20532 TATGTGGAGGATTCTTCTTTCAAAGAAGATCCAAGGAAGATTTTCAAAGATTTACTGAAGCTCGT 585 TATGTGGAGGATTCTTCTTTCAAAGAAGATCCAAGGAAGATTTTCAAAGATTTACTGAAGCTCGC 20597 TTTCAAGAAATTATTATTTTGATCTTCATCAACATATTTGAAGAAGATCATTTGTGTTCGTTCAA 650 TTTCAAGAAATTATTATTTTGATCTTCATCAACATATTTGAAGAAGATCATTTGTGTTCGTTCAA * 20662 GATCAAGTCATTCGACCCTTGAATCAAATTATCATCAATTCGAGATCAAGTCATCAAAGACCCTC 715 GATCAAGTCATTCGACCATTGAATCAAATTATCATCAATTCGAGATCAAGTCATCAAAGACCCTC * * 20727 GAATCAAATCAAATTCCCAAGTCATTAATTCAAGATTAAGTCATTCGACTTTTGAATCAAATCA 780 GAATCAAATCAAATTCCCAAGTCATCAATTCAAGATTAAGTCATTCGACTCTTGAATCAAATCA 20791 AATCAAA 1 AATCAAA 20798 CTCTCAAATT Statistics Matches: 838, Mismatches: 12, Indels: 1 0.98 0.01 0.00 Matches are distributed among these distances: 843 340 0.41 844 498 0.59 ACGTcount: A:0.37, C:0.16, G:0.14, T:0.32 Consensus pattern (843 bp): AATCAAAAATTTTCTACAAATTTTCTTTTGAGAATTTATCCTTAAATTTATTGGATATTTTCTAA AGAAAAATGGTAGATAAATATGACACATGGTCTAACCAATTAAAATTTCAATGTTTAATGGTTTG ACCAATCAAAACTAAGTTGTGAGTCATCACCTTTTATCTCTAATTAAATTGAGATAATTGGTAAA TAAATTGCATCATCTGGCCAATCAAATCTCAAGGTTTCAAGCATTAACCAATCAGAAATCACTGT TGTGTCATCTTATTTATCTCTAATTAAAATTAATTATTTAGAGATAAATAACAAGGAGAAATTAA GTTATCTTGAAACTCTCCATCCCCCACTATAAATACCAAGCTCCCAAGTCATTTTCAAGAGACAA TTCAGAAAGCAGAAGAGCTCTGAAGCAGAAAAGCTCTGAAGTATTTCAGATGTTCTTATTCTGAT ATTCACCAAAGCTTAAGAAGATTGAAAGAAGATCCACGTATGTGGAAAATTCTTCTTTCAAAGAA GATTCAATTATCGGAGAATTACTGAAAACCCAGTTATTGGGAAATTATTGAAAGAAGATCCACGT ATGTGGAGGATTCTTCTTTCAAAGAAGATCCAAGGAAGATTTTCAAAGATTTACTGAAGCTCGCT TTCAAGAAATTATTATTTTGATCTTCATCAACATATTTGAAGAAGATCATTTGTGTTCGTTCAAG ATCAAGTCATTCGACCATTGAATCAAATTATCATCAATTCGAGATCAAGTCATCAAAGACCCTCG AATCAAATCAAATTCCCAAGTCATCAATTCAAGATTAAGTCATTCGACTCTTGAATCAAATCA Found at i:22695 original size:71 final size:73 Alignment explanation

Indices: 22598--22740 Score: 236 Period size: 71 Copynumber: 2.0 Consensus size: 73 22588 GTCAAACCAA * ** * 22598 AAAAAAAAAGAGCTCGCTAAGTTGAAAATCCTGCAAAGGACGGTTTAGGCAAAAGTTAGAGCA-A 1 AAAAAAAAAGAGCTCGCTAAGTTGAAAATCCTGAAAAGGACGACTTAGGCAAAACTTAGAGCACA 22662 AAAAAAAG 66 AAAAAAAG 22670 AAAAAAAAAG-GCTCGCTAAGTTGAAAATCCTGAAAAGGACGACTTAGGCAAAACTTAGAGCACA 1 AAAAAAAAAGAGCTCGCTAAGTTGAAAATCCTGAAAAGGACGACTTAGGCAAAACTTAGAGCACA 22734 AAAAAAA 66 AAAAAAA 22741 AAGTGAACTA Statistics Matches: 66, Mismatches: 4, Indels: 2 0.92 0.06 0.03 Matches are distributed among these distances: 71 48 0.73 72 18 0.27 ACGTcount: A:0.51, C:0.14, G:0.20, T:0.15 Consensus pattern (73 bp): AAAAAAAAAGAGCTCGCTAAGTTGAAAATCCTGAAAAGGACGACTTAGGCAAAACTTAGAGCACA AAAAAAAG Found at i:23925 original size:5 final size:5 Alignment explanation

Indices: 23915--23941 Score: 54 Period size: 5 Copynumber: 5.4 Consensus size: 5 23905 TCGACTCTTG 23915 AATCA AATCA AATCA AATCA AATCA AA 1 AATCA AATCA AATCA AATCA AATCA AA 23942 CTCTCAAATT Statistics Matches: 22, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 5 22 1.00 ACGTcount: A:0.63, C:0.19, G:0.00, T:0.19 Consensus pattern (5 bp): AATCA Found at i:27881 original size:21 final size:21 Alignment explanation

Indices: 27856--27904 Score: 89 Period size: 21 Copynumber: 2.3 Consensus size: 21 27846 GCTTAATCGT 27856 GAAGGAAAGAAGACATGTCGC 1 GAAGGAAAGAAGACATGTCGC * 27877 GAAGGAAATAAGACATGTCGC 1 GAAGGAAAGAAGACATGTCGC 27898 GAAGGAA 1 GAAGGAA 27905 GCCTGTCGAC Statistics Matches: 27, Mismatches: 1, Indels: 0 0.96 0.04 0.00 Matches are distributed among these distances: 21 27 1.00 ACGTcount: A:0.45, C:0.12, G:0.33, T:0.10 Consensus pattern (21 bp): GAAGGAAAGAAGACATGTCGC Found at i:28774 original size:21 final size:22 Alignment explanation

Indices: 28750--28794 Score: 56 Period size: 21 Copynumber: 2.1 Consensus size: 22 28740 AATAACTGAA * 28750 TTGCTAAACACCGCCCCA-TTT 1 TTGCTAAACACCACCCCACTTT ** 28771 TTGCTATTCACCACCCCACTTT 1 TTGCTAAACACCACCCCACTTT 28793 TT 1 TT 28795 ACACTTTTGC Statistics Matches: 20, Mismatches: 3, Indels: 1 0.83 0.12 0.04 Matches are distributed among these distances: 21 15 0.75 22 5 0.25 ACGTcount: A:0.20, C:0.38, G:0.07, T:0.36 Consensus pattern (22 bp): TTGCTAAACACCACCCCACTTT Found at i:29008 original size:14 final size:14 Alignment explanation

Indices: 28985--29021 Score: 56 Period size: 14 Copynumber: 2.6 Consensus size: 14 28975 TTTGCAGATC 28985 TAGATCTAGAGAAA 1 TAGATCTAGAGAAA * 28999 TAGATGTAGAGAAA 1 TAGATCTAGAGAAA * 29013 TAAATCTAG 1 TAGATCTAG 29022 GGTTTTGGAT Statistics Matches: 20, Mismatches: 3, Indels: 0 0.87 0.13 0.00 Matches are distributed among these distances: 14 20 1.00 ACGTcount: A:0.49, C:0.05, G:0.22, T:0.24 Consensus pattern (14 bp): TAGATCTAGAGAAA Found at i:29115 original size:32 final size:32 Alignment explanation

Indices: 29042--29116 Score: 87 Period size: 32 Copynumber: 2.3 Consensus size: 32 29032 TGGCTGTGCT * 29042 GCCCCAGGGGGGTGGCAGGCCGTGGCAAGGCC 1 GCCCCAGGGGGGCGGCAGGCCGTGGCAAGGCC * * * * * 29074 ACCCCAGGGGGGCGGCATGTCGTTGCAAGGTC 1 GCCCCAGGGGGGCGGCAGGCCGTGGCAAGGCC * 29106 GCCCTAGGGGG 1 GCCCCAGGGGG 29117 ATGGTTGTGC Statistics Matches: 35, Mismatches: 8, Indels: 0 0.81 0.19 0.00 Matches are distributed among these distances: 32 35 1.00 ACGTcount: A:0.13, C:0.29, G:0.47, T:0.11 Consensus pattern (32 bp): GCCCCAGGGGGGCGGCAGGCCGTGGCAAGGCC Found at i:45907 original size:15 final size:15 Alignment explanation

Indices: 45855--45898 Score: 61 Period size: 15 Copynumber: 2.9 Consensus size: 15 45845 ATGATTATTT * 45855 GCACCATAGTTGTTC 1 GCACCATTGTTGTTC * * 45870 GCACCATTGTGGTTT 1 GCACCATTGTTGTTC 45885 GCACCATTGTTGTT 1 GCACCATTGTTGTT 45899 GGCGCCATTC Statistics Matches: 25, Mismatches: 4, Indels: 0 0.86 0.14 0.00 Matches are distributed among these distances: 15 25 1.00 ACGTcount: A:0.16, C:0.23, G:0.23, T:0.39 Consensus pattern (15 bp): GCACCATTGTTGTTC Found at i:52267 original size:17 final size:17 Alignment explanation

Indices: 52244--52294 Score: 56 Period size: 17 Copynumber: 3.2 Consensus size: 17 52234 TGTCTTTTAA 52244 ATATTTTTTTCAATTAC 1 ATATTTTTTTCAATTAC * 52261 AT-TTTTTCTT---TTAA 1 ATATTTTT-TTCAATTAC 52275 ATATTTTTTTCAATTAC 1 ATATTTTTTTCAATTAC 52292 ATA 1 ATA 52295 ATAATAAAGT Statistics Matches: 27, Mismatches: 2, Indels: 10 0.69 0.05 0.26 Matches are distributed among these distances: 14 7 0.26 15 5 0.19 16 5 0.19 17 10 0.37 ACGTcount: A:0.29, C:0.10, G:0.00, T:0.61 Consensus pattern (17 bp): ATATTTTTTTCAATTAC Found at i:52272 original size:31 final size:32 Alignment explanation

Indices: 52230--52293 Score: 121 Period size: 31 Copynumber: 2.0 Consensus size: 32 52220 TCATCTTTCT 52230 TTTTTGTCTTTTAAATATTTTTTTCAATTACA 1 TTTTTGTCTTTTAAATATTTTTTTCAATTACA 52262 TTTTT-TCTTTTAAATATTTTTTTCAATTACA 1 TTTTTGTCTTTTAAATATTTTTTTCAATTACA 52293 T 1 T 52294 AATAATAAAG Statistics Matches: 32, Mismatches: 0, Indels: 1 0.97 0.00 0.03 Matches are distributed among these distances: 31 27 0.84 32 5 0.16 ACGTcount: A:0.25, C:0.09, G:0.02, T:0.64 Consensus pattern (32 bp): TTTTTGTCTTTTAAATATTTTTTTCAATTACA Found at i:59777 original size:21 final size:21 Alignment explanation

Indices: 59748--59790 Score: 77 Period size: 21 Copynumber: 2.0 Consensus size: 21 59738 ATAGAGGTGG 59748 CAAACGATTAAGTCTAGCAAT 1 CAAACGATTAAGTCTAGCAAT * 59769 CAAATGATTAAGTCTAGCAAT 1 CAAACGATTAAGTCTAGCAAT 59790 C 1 C 59791 TCTACAGTCA Statistics Matches: 21, Mismatches: 1, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 21 21 1.00 ACGTcount: A:0.42, C:0.19, G:0.14, T:0.26 Consensus pattern (21 bp): CAAACGATTAAGTCTAGCAAT Done.