Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01007570.1 Corchorus capsularis cultivar CVL-1 contig07591, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 21019
ACGTcount: A:0.33, C:0.18, G:0.17, T:0.33


Found at i:2978 original size:27 final size:28

Alignment explanation

Indices: 2944--3011 Score: 88 Period size: 27 Copynumber: 2.5 Consensus size: 28 2934 ATCCTAGGGA 2944 ACTAGCTTTGAATGGGA-AACTGT-TCTG 1 ACTAGCTTTGAATGGGAGAACTGTCT-TG 2971 ACTAGCTTTGAAT-GGAGAACTGTCTTG 1 ACTAGCTTTGAATGGGAGAACTGTCTTG * * 2998 ACTAACTTGGAATG 1 ACTAGCTTTGAATG 3012 AGAGTCTGAC Statistics Matches: 36, Mismatches: 2, Indels: 5 0.84 0.05 0.12 Matches are distributed among these distances: 26 3 0.08 27 32 0.89 28 1 0.03 ACGTcount: A:0.28, C:0.15, G:0.25, T:0.32 Consensus pattern (28 bp): ACTAGCTTTGAATGGGAGAACTGTCTTG Found at i:5360 original size:28 final size:28 Alignment explanation

Indices: 5329--5395 Score: 125 Period size: 28 Copynumber: 2.4 Consensus size: 28 5319 AAATATTTAC * 5329 ATAAAAAAGTAATTGACAATAAAGATTT 1 ATAAAAAAATAATTGACAATAAAGATTT 5357 ATAAAAAAATAATTGACAATAAAGATTT 1 ATAAAAAAATAATTGACAATAAAGATTT 5385 ATAAAAAAATA 1 ATAAAAAAATA 5396 GTAATTAAAA Statistics Matches: 38, Mismatches: 1, Indels: 0 0.97 0.03 0.00 Matches are distributed among these distances: 28 38 1.00 ACGTcount: A:0.63, C:0.03, G:0.07, T:0.27 Consensus pattern (28 bp): ATAAAAAAATAATTGACAATAAAGATTT Found at i:5393 original size:30 final size:29 Alignment explanation

Indices: 5317--5393 Score: 111 Period size: 28 Copynumber: 2.7 Consensus size: 29 5307 AGAAGTTAAG * * * 5317 ATAAATATTTACATAAAAAAGTAATTGACA 1 ATAAAGATTTATA-AAAAAAATAATTGACA 5347 ATAAAGATTTAT-AAAAAAATAATTGACA 1 ATAAAGATTTATAAAAAAAATAATTGACA 5375 ATAAAGATTTATAAAAAAA 1 ATAAAGATTTATAAAAAAA 5394 TAGTAATTAA Statistics Matches: 43, Mismatches: 3, Indels: 3 0.88 0.06 0.06 Matches are distributed among these distances: 28 27 0.63 29 6 0.14 30 10 0.23 ACGTcount: A:0.61, C:0.04, G:0.06, T:0.29 Consensus pattern (29 bp): ATAAAGATTTATAAAAAAAATAATTGACA Found at i:7777 original size:86 final size:86 Alignment explanation

Indices: 7670--7841 Score: 335 Period size: 86 Copynumber: 2.0 Consensus size: 86 7660 TATTATTAAG * 7670 GGTAAAGTAGTATATTTATATTTATTTTTTTTATTTATGAAATTCATGCGCAATTCCATTTCTGA 1 GGTAAAGTAGTATATTTATATTTATTTTTTTTATTTATGAAATTCATGCGCAATTCCATCTCTGA 7735 ACCAAACTTATGAAATAACAA 66 ACCAAACTTATGAAATAACAA 7756 GGTAAAGTAGTATATTTATATTTATTTTTTTTATTTATGAAATTCATGCGCAATTCCATCTCTGA 1 GGTAAAGTAGTATATTTATATTTATTTTTTTTATTTATGAAATTCATGCGCAATTCCATCTCTGA 7821 ACCAAACTTATGAAATAACAA 66 ACCAAACTTATGAAATAACAA 7842 TTTAATTCCT Statistics Matches: 85, Mismatches: 1, Indels: 0 0.99 0.01 0.00 Matches are distributed among these distances: 86 85 1.00 ACGTcount: A:0.36, C:0.12, G:0.10, T:0.41 Consensus pattern (86 bp): GGTAAAGTAGTATATTTATATTTATTTTTTTTATTTATGAAATTCATGCGCAATTCCATCTCTGA ACCAAACTTATGAAATAACAA Found at i:8671 original size:31 final size:32 Alignment explanation

Indices: 8608--8683 Score: 100 Period size: 31 Copynumber: 2.4 Consensus size: 32 8598 GGCATTTACC * 8608 GAAGAAAACGCCATAGAGTATGAGTGTTTATT 1 GAAGAAAACGCCATAGAGTATGAGCGTTTATT * * 8640 GAAGAAAACGCCTTAGAGTA-GGGCGTTTATT 1 GAAGAAAACGCCATAGAGTATGAGCGTTTATT * * 8671 GAACAAAATGCCA 1 GAAGAAAACGCCA 8684 CTATTTTTGG Statistics Matches: 38, Mismatches: 6, Indels: 1 0.84 0.13 0.02 Matches are distributed among these distances: 31 19 0.50 32 19 0.50 ACGTcount: A:0.38, C:0.13, G:0.25, T:0.24 Consensus pattern (32 bp): GAAGAAAACGCCATAGAGTATGAGCGTTTATT Found at i:11985 original size:28 final size:29 Alignment explanation

Indices: 11945--12023 Score: 108 Period size: 28 Copynumber: 2.8 Consensus size: 29 11935 GTTTTACAAA * ** 11945 ATAATATATATATATATATATGTATTAT- 1 ATAATATATATATATATATATATAACATC 11973 ATAATATATATATATA-ATATATAACATC 1 ATAATATATATATATATATATATAACATC * 12001 ACAATATATATATATATATATAT 1 ATAATATATATATATATATATAT 12024 TATAATATAT Statistics Matches: 45, Mismatches: 4, Indels: 3 0.87 0.08 0.06 Matches are distributed among these distances: 27 8 0.18 28 31 0.69 29 6 0.13 ACGTcount: A:0.51, C:0.04, G:0.01, T:0.44 Consensus pattern (29 bp): ATAATATATATATATATATATATAACATC Found at i:12042 original size:2 final size:2 Alignment explanation

Indices: 11948--12034 Score: 89 Period size: 2 Copynumber: 46.5 Consensus size: 2 11938 TTACAAAATA * 11948 AT AT AT AT AT AT AT AT AT GT AT -T AT AT A- AT AT AT AT AT AT 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT * * 11988 A- AT AT AT A- AC AT CAC A- AT AT AT AT AT AT AT AT AT AT -T AT 1 AT AT AT AT AT AT AT -AT AT AT AT AT AT AT AT AT AT AT AT AT AT 12027 A- AT AT AT A 1 AT AT AT AT A 12035 AAATATATTT Statistics Matches: 73, Mismatches: 4, Indels: 16 0.78 0.04 0.17 Matches are distributed among these distances: 1 7 0.10 2 65 0.89 3 1 0.01 ACGTcount: A:0.51, C:0.03, G:0.01, T:0.45 Consensus pattern (2 bp): AT Found at i:16693 original size:22 final size:22 Alignment explanation

Indices: 16668--16950 Score: 126 Period size: 22 Copynumber: 13.1 Consensus size: 22 16658 CAATTGTTAG * 16668 TAATCACACTCTGAAATTTTGA 1 TAATCACACTATGAAATTTTGA * 16690 TAATCACACTATGAAATTGTGA 1 TAATCACACTATGAAATTTTGA * * * 16712 TAACCTCGCTATGAAATTTTGA 1 TAATCACACTATGAAATTTTGA * * 16734 TTAATCTTC-CTATAAAATTTTGA 1 -TAATC-ACACTATGAAATTTTGA 16757 TAAACCTC-CA-TAT-AAATTTTGA 1 T-AA--TCACACTATGAAATTTTGA ** * * 16779 TAACTTTC-TTATGAAATCTTG- 1 TAA-TCACACTATGAAATTTTGA * 16800 --AT-A-ACTA-CAAATTTTGA 1 TAATCACACTATGAAATTTTGA * * * ** 16817 TAACCTCCCTATGATTTTTTGA 1 TAATCACACTATGAAATTTTGA * * * * 16839 TAACCTCATTATGAAATTTTGT 1 TAATCACACTATGAAATTTTGA * * 16861 TAATCTCCCTATGAAATTTTGA 1 TAATCACACTATGAAATTTTGA * * 16883 T-CTGCATACTATGAAATTTTGA 1 TAAT-CACACTATGAAATTTTGA * * 16905 TAA-CCCTCTTATGAAATTTTGA 1 TAATCACAC-TATGAAATTTTGA * * 16927 -AAACTAAACTATGAAATTTTGA 1 TAATC-ACACTATGAAATTTTGA 16949 TA 1 TA 16951 TCCTCCCTGA Statistics Matches: 199, Mismatches: 40, Indels: 43 0.71 0.14 0.15 Matches are distributed among these distances: 16 7 0.04 17 2 0.01 18 1 0.01 19 2 0.01 20 2 0.01 21 14 0.07 22 142 0.71 23 25 0.13 24 2 0.01 25 2 0.01 ACGTcount: A:0.35, C:0.16, G:0.09, T:0.40 Consensus pattern (22 bp): TAATCACACTATGAAATTTTGA Found at i:16752 original size:45 final size:46 Alignment explanation

Indices: 16679--16779 Score: 122 Period size: 45 Copynumber: 2.3 Consensus size: 46 16669 AATCACACTC * 16679 TGAAATTTTGA-TAATCACACTATGAAATTGTGAT-AACCTCGC-TA 1 TGAAATTTTGATTAATCACACTATAAAATTGTGATAAACCTC-CATA * * 16723 TGAAATTTTGATTAATCTTC-CTATAAAATTTTGATAAACCTCCATA 1 TGAAATTTTGATTAATC-ACACTATAAAATTGTGATAAACCTCCATA 16769 T-AAATTTTGAT 1 TGAAATTTTGAT 16780 AACTTTCTTA Statistics Matches: 50, Mismatches: 3, Indels: 7 0.83 0.05 0.12 Matches are distributed among these distances: 44 11 0.22 45 29 0.58 46 10 0.20 ACGTcount: A:0.37, C:0.14, G:0.10, T:0.40 Consensus pattern (46 bp): TGAAATTTTGATTAATCACACTATAAAATTGTGATAAACCTCCATA Found at i:16915 original size:126 final size:127 Alignment explanation

Indices: 16681--16932 Score: 310 Period size: 126 Copynumber: 2.0 Consensus size: 127 16671 TCACACTCTG * * * 16681 AAATTTTGATAATCACACTATGAAATTGTGATAACCTCGCTATGAAATTTTGATTAATCTTCCTA 1 AAATTTTGATAACCACACTATGAAATTGTGATAACCTCACTATGAAATTTTGATTAATCTCCCTA * * ** * 16746 TAAAATTTTGATAAACCTCCATATAAATTTTGATAACTTTCTTATGAAATCTTGATAACTAC 66 TAAAATTTTGATAAACATACATATAAATTTTGATAACCCTCTTATGAAATCTTGAAAACTAC * * ** * * 16808 AAATTTTGATAACCTCCCTATGATTTTTTGATAACCTCATTATGAAATTTTG-TTAATCTCCCTA 1 AAATTTTGATAACCACACTATGAAATTGTGATAACCTCACTATGAAATTTTGATTAATCTCCCTA * *** * 16872 TGAAATTTTGATCTGCATAC-TATGAAATTTTGATAACCCTCTTATGAAATTTTGAAAACTA 66 TAAAATTTTGATAAACATACATAT-AAATTTTGATAACCCTCTTATGAAATCTTGAAAACTA 16933 AACTATGAAA Statistics Matches: 105, Mismatches: 19, Indels: 3 0.83 0.15 0.02 Matches are distributed among these distances: 125 3 0.03 126 58 0.55 127 44 0.42 ACGTcount: A:0.35, C:0.15, G:0.09, T:0.40 Consensus pattern (127 bp): AAATTTTGATAACCACACTATGAAATTGTGATAACCTCACTATGAAATTTTGATTAATCTCCCTA TAAAATTTTGATAAACATACATATAAATTTTGATAACCCTCTTATGAAATCTTGAAAACTAC Found at i:17168 original size:66 final size:66 Alignment explanation

Indices: 17063--17198 Score: 157 Period size: 66 Copynumber: 2.1 Consensus size: 66 17053 GAAATTTTTG * * * * ** * 17063 TAATCACATTCTGAAAATTTGATAACCTCTTTATAAAATTTTGTTGACCCCTCTATGAAATTCTG 1 TAATCACATTATGAAAATTTGATAACCTCCTTATAAAATTTTGGTAACAACACTATGAAATTCTG 17128 A 66 A * * * * 17129 TAATCACATTATGTAATTTTGATAACCTCGCTT-TGAAATTTTGGTAACAACACTATGAAATTTT 1 TAATCACATTATGAAAATTTGATAACCTC-CTTATAAAATTTTGGTAACAACACTATGAAATTCT 17193 GA 65 GA 17195 TAAT 1 TAAT 17199 TTGATCTCTA Statistics Matches: 58, Mismatches: 11, Indels: 2 0.82 0.15 0.03 Matches are distributed among these distances: 66 56 0.97 67 2 0.03 ACGTcount: A:0.35, C:0.15, G:0.10, T:0.40 Consensus pattern (66 bp): TAATCACATTATGAAAATTTGATAACCTCCTTATAAAATTTTGGTAACAACACTATGAAATTCTG A Found at i:17232 original size:22 final size:22 Alignment explanation

Indices: 17113--17238 Score: 94 Period size: 22 Copynumber: 5.6 Consensus size: 22 17103 TTGTTGACCC * 17113 CTCTATGAAATTCTGATAATCA 1 CTCTATGAAATTTTGATAATCA * * * 17135 CAT-TATGTAATTTTGATAACCT 1 C-TCTATGAAATTTTGATAATCA * * * 17157 CGCTTTGAAATTTTGGTAA-CAA 1 CTCTATGAAATTTTGATAATC-A * * 17179 CACTATGAAATTTTGATAATTTGA 1 CTCTATGAAATTTTGATAA--TCA * 17203 TCTCTATGAAATTTCGATAATCA 1 -CTCTATGAAATTTTGATAATCA * 17226 CTCTATGAGATTT 1 CTCTATGAAATTT 17239 AATAACCTTC Statistics Matches: 80, Mismatches: 17, Indels: 14 0.72 0.15 0.13 Matches are distributed among these distances: 21 1 0.01 22 58 0.73 23 3 0.04 24 1 0.01 25 17 0.21 ACGTcount: A:0.33, C:0.14, G:0.12, T:0.40 Consensus pattern (22 bp): CTCTATGAAATTTTGATAATCA Found at i:17312 original size:22 final size:22 Alignment explanation

Indices: 17283--17640 Score: 153 Period size: 22 Copynumber: 16.5 Consensus size: 22 17273 AAATTGAGAC 17283 TTTT-ATAACCTTCA-TATGAAA 1 TTTTGATAACC-TCACTATGAAA * 17304 TTTTGATAACCAT-ACTATAAAA 1 TTTTGATAACC-TCACTATGAAA * * 17326 TTTTGATAACCTCCCCATGAAA 1 TTTTGATAACCTCACTATGAAA * 17348 TATT-AGTAACCTC-CTAATGAAA 1 TTTTGA-TAACCTCACT-ATGAAA * * 17370 TTTTGTTAACCACACTATGAAA 1 TTTTGATAACCTCACTATGAAA * 17392 TTCTT-ATAACCTCACTATGACA 1 TT-TTGATAACCTCACTATGAAA * * 17414 TTTTGATAA--TCTCTTTGATAA 1 TTTTGATAACCTCACTATGA-AA * * * 17435 CCTTTCT-ATAA---AATTGTGAAA 1 --TTT-TGATAACCTCACTATGAAA * * * 17456 --AT--TAACCACCCTATGAAA 1 TTTTGATAACCTCACTATGAAA ** * * 17474 TTTCAATAACC-AACTTAAGAAA 1 TTTTGATAACCTCAC-TATGAAA * * 17496 TTTTAATAACCTGATCCTATGAAA 1 TTTTGATAACCTCA--CTATGAAA * ** * 17520 TTTTGGTTTCCACACTATGAAA 1 TTTTGATAACCTCACTATGAAA * 17542 TTTTGATAACTTC-CATATGAAA 1 TTTTGATAACCTCAC-TATGAAA * * * 17564 TTTTGGTAACCACACTATGGAA 1 TTTTGATAACCTCACTATGAAA 17586 TTTTGATAACCTC-CTCATGAAA 1 TTTTGATAACCTCACT-ATGAAA * * * 17608 TTATAATAACCATC-TTATGAAA 1 TTTTGATAACC-TCACTATGAAA 17630 TTTTGATAACC 1 TTTTGATAACC 17641 ACATAGAGAC Statistics Matches: 253, Mismatches: 56, Indels: 55 0.70 0.15 0.15 Matches are distributed among these distances: 15 3 0.01 16 1 0.00 18 6 0.02 20 7 0.03 21 17 0.07 22 187 0.74 23 16 0.06 24 15 0.06 25 1 0.00 ACGTcount: A:0.37, C:0.18, G:0.08, T:0.37 Consensus pattern (22 bp): TTTTGATAACCTCACTATGAAA Found at i:17346 original size:44 final size:44 Alignment explanation

Indices: 17098--17640 Score: 196 Period size: 44 Copynumber: 12.3 Consensus size: 44 17088 CCTCTTTATA * * * * * * 17098 AAATTTTGTTGACC-CCTCTATGAAATTCTGATAATCACATTATG 1 AAATTTTGATAACCTCC-CTATGAAATTTTGATAACCATACTATG * * * * * * 17142 TAATTTTGATAACCTCGCTTTGAAATTTTGGTAACAACACTATG 1 AAATTTTGATAACCTCCCTATGAAATTTTGATAACCATACTATG ** * * * 17186 AAATTTTGATAATTTGATCTCTATGAAATTTCGATAATCACT-CTATG 1 AAATTTTGATAA---CCTCCCTATGAAATTTTGATAACCA-TACTATG * * * * * * * 17233 AGA-TTTAATAACCT-TCTATCAAATTTTGGTACTCCTTATGA-AATTG 1 AAATTTTGATAACCTCCCTATGAAATTTTGATA-ACC--AT-ACTA-TG * * * * 17279 AGACTTTT-ATAACCTTCATATGAAATTTTGATAACCATACTATA 1 A-AATTTTGATAACCTCCCTATGAAATTTTGATAACCATACTATG * * * 17323 AAATTTTGATAACCTCCCCATGAAATATT-AGTAACC-TCCTAATG 1 AAATTTTGATAACCTCCCTATGAAATTTTGA-TAACCATACT-ATG * * * 17367 AAATTTTGTTAACCACACTATGAAATTCTT-ATAACC-TCACTATG 1 AAATTTTGATAACCTCCCTATGAAATT-TTGATAACCAT-ACTATG * * * * * 17411 ACATTTTGATAA--T--C--T----CTTTGATAACCTTTCTATA 1 AAATTTTGATAACCTCCCTATGAAATTTTGATAACCATACTATG * * ** * 17445 AAATTGTGAAAATTAACCACCCTATGAAATTTCAATAACCA-ACTTAAG 1 AAATTTTG---A-TAACCTCCCTATGAAATTTTGATAACCATAC-TATG * * * ** * 17493 AAATTTTAATAACCTGATCCTATGAAATTTTGGTTTCCACACTATG 1 AAATTTTGATAACCT--CCCTATGAAATTTTGATAACCATACTATG * * * * 17539 AAATTTTGATAACTTCCATATGAAATTTTGGTAACCACACTATG 1 AAATTTTGATAACCTCCCTATGAAATTTTGATAACCATACTATG * * * 17583 GAATTTTGATAACCT-CCTCATGAAATTATAATAACCAT-CTTATG 1 AAATTTTGATAACCTCCCT-ATGAAATTTTGATAACCATAC-TATG 17627 AAATTTTGATAACC 1 AAATTTTGATAACC 17641 ACATAGAGAC Statistics Matches: 365, Mismatches: 92, Indels: 84 0.67 0.17 0.16 Matches are distributed among these distances: 33 2 0.01 34 17 0.05 35 1 0.00 37 1 0.00 38 4 0.01 40 1 0.00 42 15 0.04 43 14 0.04 44 188 0.52 45 12 0.03 46 42 0.12 47 36 0.10 48 32 0.09 ACGTcount: A:0.36, C:0.17, G:0.10, T:0.38 Consensus pattern (44 bp): AAATTTTGATAACCTCCCTATGAAATTTTGATAACCATACTATG Found at i:17996 original size:15 final size:15 Alignment explanation

Indices: 17978--18024 Score: 55 Period size: 15 Copynumber: 3.3 Consensus size: 15 17968 TATATAATCT 17978 AATAATTAATAATGG 1 AATAATTAATAATGG * * 17993 AATAATTTATAAT-T 1 AATAATTAATAATGG 18007 AA-AA-TAATAATGG 1 AATAATTAATAATGG 18020 AATAA 1 AATAA 18025 AAATACTATT Statistics Matches: 26, Mismatches: 4, Indels: 5 0.74 0.11 0.14 Matches are distributed among these distances: 12 6 0.23 13 4 0.15 14 4 0.15 15 12 0.46 ACGTcount: A:0.57, C:0.00, G:0.09, T:0.34 Consensus pattern (15 bp): AATAATTAATAATGG Found at i:18096 original size:30 final size:32 Alignment explanation

Indices: 18051--18116 Score: 100 Period size: 31 Copynumber: 2.1 Consensus size: 32 18041 TGGTAATTTA * 18051 GAAATATGATTTTAAAAA-AAGGGTACAATTG 1 GAAATATGATTTTAAAAATAAGGGTACAATCG * 18082 GAAATATG-TTTTAAAAATAAGGGTATAATCG 1 GAAATATGATTTTAAAAATAAGGGTACAATCG 18113 GAAA 1 GAAA 18117 ACATAAAGTT Statistics Matches: 32, Mismatches: 2, Indels: 2 0.89 0.06 0.06 Matches are distributed among these distances: 30 9 0.28 31 23 0.72 ACGTcount: A:0.48, C:0.03, G:0.20, T:0.29 Consensus pattern (32 bp): GAAATATGATTTTAAAAATAAGGGTACAATCG Found at i:18955 original size:29 final size:27 Alignment explanation

Indices: 18882--18973 Score: 112 Period size: 27 Copynumber: 3.3 Consensus size: 27 18872 ATTCCCAAGC 18882 CTCACCTAACTTGGAGCTTCTTTGAGG 1 CTCACCTAACTTGGAGCTTCTTTGAGG * * * 18909 CTCACCTAACCTGAAGCTTCTTTGAGCCT 1 CTCACCTAACTTGGAGCTTCTTTGAG--G * * 18938 CTCACATAACTTGGAGCTTCTTTTAGG 1 CTCACCTAACTTGGAGCTTCTTTGAGG * 18965 TTCACCTAA 1 CTCACCTAA 18974 AACTCTAGAC Statistics Matches: 53, Mismatches: 10, Indels: 4 0.79 0.15 0.06 Matches are distributed among these distances: 27 31 0.58 29 22 0.42 ACGTcount: A:0.22, C:0.28, G:0.16, T:0.34 Consensus pattern (27 bp): CTCACCTAACTTGGAGCTTCTTTGAGG Found at i:20782 original size:21 final size:21 Alignment explanation

Indices: 20758--20806 Score: 62 Period size: 21 Copynumber: 2.3 Consensus size: 21 20748 CAACTGGGGG * * * 20758 CCCATGTGGTATGCTTGGCGC 1 CCCATGTGGTATGCCTCGCGA * 20779 CCCATGTGGTTTGCCTCGCGA 1 CCCATGTGGTATGCCTCGCGA 20800 CCCATGT 1 CCCATGT 20807 CCTCCAGTGC Statistics Matches: 24, Mismatches: 4, Indels: 0 0.86 0.14 0.00 Matches are distributed among these distances: 21 24 1.00 ACGTcount: A:0.10, C:0.33, G:0.29, T:0.29 Consensus pattern (21 bp): CCCATGTGGTATGCCTCGCGA Done.