Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01014488.1 Corchorus capsularis cultivar CVL-1 contig14509, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 14861
ACGTcount: A:0.33, C:0.17, G:0.16, T:0.34


Found at i:100 original size:22 final size:22

Alignment explanation

Indices: 75--267 Score: 70 Period size: 22 Copynumber: 8.8 Consensus size: 22 65 TGATCCCATC 75 ATGAAATTTTGATAACCTTCCT 1 ATGAAATTTTGATAACCTTCCT * ** * 97 ATGAAATTTTAATAACAATACT 1 ATGAAATTTTGATAACCTTCCT * ** * * ** 119 ATGGAATTTCAAGAATCTTTTT 1 ATGAAATTTTGATAACCTTCCT ** ** * * 141 AT-ATGTTTTTTTAACTTTCTT 1 ATGAAATTTTGATAACCTTCCT * * 162 ATGAAATTTTGTTAACCTCCCT 1 ATGAAATTTTGATAACCTTCCT * * * 184 AAGGAATTTTGA-AGACC-TCAAT 1 ATGAAATTTTGATA-ACCTTC-CT ** 206 ATGAAATTTTGATAACCAACACT 1 ATGAAATTTTGATAACCTTC-CT * * 229 ATGAGATGTTGATAACC-TCCAT 1 ATGAAATTTTGATAACCTTCC-T * * 251 ATGATATATTGATAACC 1 ATGAAATTTTGATAACC 268 ACTTTATAAG Statistics Matches: 123, Mismatches: 42, Indels: 12 0.69 0.24 0.07 Matches are distributed among these distances: 21 14 0.11 22 90 0.73 23 19 0.15 ACGTcount: A:0.35, C:0.15, G:0.11, T:0.39 Consensus pattern (22 bp): ATGAAATTTTGATAACCTTCCT Found at i:240 original size:23 final size:22 Alignment explanation

Indices: 204--268 Score: 67 Period size: 23 Copynumber: 2.9 Consensus size: 22 194 GAAGACCTCA * 204 ATATGAAATTTTGATAACCAAC 1 ATATGAAATATTGATAACCAAC * * ** 226 ACTATGAGATGTTGATAACCTCC 1 A-TATGAAATATTGATAACCAAC * 249 ATATGATATATTGATAACCA 1 ATATGAAATATTGATAACCA 269 CTTTATAAGA Statistics Matches: 35, Mismatches: 7, Indels: 2 0.80 0.16 0.05 Matches are distributed among these distances: 22 17 0.49 23 18 0.51 ACGTcount: A:0.40, C:0.15, G:0.12, T:0.32 Consensus pattern (22 bp): ATATGAAATATTGATAACCAAC Found at i:373 original size:22 final size:21 Alignment explanation

Indices: 161--594 Score: 168 Period size: 22 Copynumber: 20.0 Consensus size: 21 151 TTAACTTTCT * 161 TATGAAATTTTGTTAACCTCCC 1 TATGAAATTTTGATAACCT-CC * * * 183 TAAGGAATTTTGA-AGACCTCAA 1 TATGAAATTTTGATA-ACCTC-C * 205 TATGAAATTTTGATAACCAACAC 1 TATGAAATTTTGATAACC-TC-C * * 228 TATGAGATGTTGATAACCTCC 1 TATGAAATTTTGATAACCTCC * * * * 249 ATATGATATATTGATAACCACTT 1 -TATGAAATTTTGATAACCTC-C * * * 272 TAT-AAGAATTTAAAAACCTCC 1 TATGAA-ATTTTGATAACCTCC * * 293 ATATG-AATTGTT-AGTAATCACAC 1 -TATGAAATT-TTGA-TAACCTC-C * * * * * 316 TTTAAAATTTTGACAATCACAC 1 TATGAAATTTTGATAACCTC-C * 338 TATGAAATTGTGATAACCTCGC 1 TATGAAATTTTGATAACCTC-C * 360 TATGAAATTTTGATAAATCTTCC 1 TATGAAATTTTGAT-AA-CCTCC * * 383 TATAAAATTTTAATAAACCTCCC 1 TATGAAATTTTGAT-AACCT-CC * * * * 406 AATAAAATTTTGATAACTTTCT 1 TATGAAATTTTGATAAC-CTCC * 428 TATGAAATCTTGATAA----C 1 TATGAAATTTTGATAACCTCC * * 445 TA-CAAATTTTGATAAGCTCCC 1 TATGAAATTTTGATAACCT-CC ** * * 466 TATGATTTTTTGATTACCTCAT 1 TATGAAATTTTGATAACCTC-C * * 488 TATGAAATTTTG-TTATCTCCC 1 TATGAAATTTTGATAACCT-CC * * * 509 TATGAAATTTTGATCTACATAC 1 TATGAAATTTTGAT-AACCTCC * 531 TATGAAATTTTGATAACCCTCT 1 TATGAAATTTTGATAA-CCTCC * * 553 TATGAAAATTTGATAACCTTCA 1 TATGAAATTTTGATAACC-TCC * 575 TATGAAATTTTGATATCCTC 1 TATGAAATTTTGATAACCTC 595 ACTGAATTTC Statistics Matches: 310, Mismatches: 72, Indels: 61 0.70 0.16 0.14 Matches are distributed among these distances: 16 11 0.04 17 2 0.01 21 33 0.11 22 200 0.65 23 61 0.20 24 3 0.01 ACGTcount: A:0.36, C:0.16, G:0.10, T:0.38 Consensus pattern (21 bp): TATGAAATTTTGATAACCTCC Found at i:390 original size:23 final size:23 Alignment explanation

Indices: 359--443 Score: 82 Period size: 23 Copynumber: 3.7 Consensus size: 23 349 GATAACCTCG * 359 CTATGAAATTTTGATAAATCTTC 1 CTATAAAATTTTGATAAATCTTC * * * 382 CTATAAAATTTTAATAAACCTCC 1 CTATAAAATTTTGATAAATCTTC * * 405 CAATAAAATTTTGATAACT-TTC 1 CTATAAAATTTTGATAAATCTTC * * * 427 TTATGAAATCTTGATAA 1 CTATAAAATTTTGATAA 444 CTACAAATTT Statistics Matches: 49, Mismatches: 13, Indels: 1 0.78 0.21 0.02 Matches are distributed among these distances: 22 15 0.31 23 34 0.69 ACGTcount: A:0.40, C:0.14, G:0.06, T:0.40 Consensus pattern (23 bp): CTATAAAATTTTGATAAATCTTC Found at i:511 original size:43 final size:43 Alignment explanation

Indices: 461--542 Score: 112 Period size: 43 Copynumber: 1.9 Consensus size: 43 451 TTTTGATAAG ** * * 461 CTCCCTATGATTTTTTGAT-TACCTCATTATGAAATTTTGTTAT 1 CTCCCTATGAAATTTTGATCTACAT-ACTATGAAATTTTGTTAT 504 CTCCCTATGAAATTTTGATCTACATACTATGAAATTTTG 1 CTCCCTATGAAATTTTGATCTACATACTATGAAATTTTG 543 ATAACCCTCT Statistics Matches: 34, Mismatches: 4, Indels: 2 0.85 0.10 0.05 Matches are distributed among these distances: 43 30 0.88 44 4 0.12 ACGTcount: A:0.27, C:0.17, G:0.10, T:0.46 Consensus pattern (43 bp): CTCCCTATGAAATTTTGATCTACATACTATGAAATTTTGTTAT Found at i:643 original size:20 final size:19 Alignment explanation

Indices: 580--646 Score: 98 Period size: 19 Copynumber: 3.5 Consensus size: 19 570 CTTCATATGA * 580 AATTTTGATATCCTCACTG 1 AATTTTGATATCCTCCCTG * 599 AATTTCGATATCCTCCCTG 1 AATTTTGATATCCTCCCTG * 618 AATTTTGGTATCCTCCCTG 1 AATTTTGATATCCTCCCTG 637 AAATTTTGAT 1 -AATTTTGAT 647 TACTACATCA Statistics Matches: 42, Mismatches: 5, Indels: 1 0.88 0.10 0.02 Matches are distributed among these distances: 19 34 0.81 20 8 0.19 ACGTcount: A:0.24, C:0.22, G:0.12, T:0.42 Consensus pattern (19 bp): AATTTTGATATCCTCCCTG Found at i:785 original size:22 final size:22 Alignment explanation

Indices: 758--997 Score: 120 Period size: 22 Copynumber: 11.0 Consensus size: 22 748 ATTTTAAAAA 758 TTTGATAACCTCTTTATGAAAT 1 TTTGATAACCTCTTTATGAAAT * * 780 TTTGATAACATCTTTATAAAAT 1 TTTGATAACCTCTTTATGAAAT * * * * 802 TTTGTTGACCCCTCTATGAAAT 1 TTTGATAACCTCTTTATGAAAT * * * * * * 824 TTTGAAAATCACATTAT-TATT 1 TTTGATAACCTCTTTATGAAAT * 845 TTTGATAACCTCGCTT-TGAAAT 1 TTTGATAACCTC-TTTATGAAAT ** ** 867 TTTGATAACAACACTATGAAAT 1 TTTGATAACCTCTTTATGAAAT * 889 TTTGATAA--TATTCATAT-AAAT 1 TTTGATAACCTCTT--TATGAAAT 910 TTTGATAATCCTATCTTTATGAAAT 1 TTTGATAA-CC--TCTTTATGAAAT * * * * * 935 TTCGATAATCACTCTATGAGA- 1 TTTGATAACCTCTTTATGAAAT * * 956 TTTGATAACCT-TCTATCAAAT 1 TTTGATAACCTCTTTATGAAAT * * 977 TTTGGT-A-CTCATTATGAAAT 1 TTTGATAACCTCTTTATGAAAT 997 T 1 T 998 GAGACTTTTA Statistics Matches: 159, Mismatches: 46, Indels: 28 0.68 0.20 0.12 Matches are distributed among these distances: 19 2 0.01 20 16 0.10 21 38 0.24 22 85 0.53 24 4 0.03 25 11 0.07 26 3 0.02 ACGTcount: A:0.34, C:0.14, G:0.09, T:0.43 Consensus pattern (22 bp): TTTGATAACCTCTTTATGAAAT Found at i:800 original size:44 final size:44 Alignment explanation

Indices: 727--997 Score: 138 Period size: 44 Copynumber: 6.2 Consensus size: 44 717 AGAAATACCA * 727 TTATGAAATTTTTG-TAATCACATTT-TAAAAATTTGATAACCTCT 1 TTATGAAA-TTTTGATAATCAC-TTTATAAAATTTTGATAACCTCT * * * 771 TTATGAAATTTTGATAA-CATCTTTATAAAATTTTGTTGACCCCT 1 TTATGAAATTTTGATAATCA-CTTTATAAAATTTTGATAACCTCT * * * * * * 815 CTATGAAATTTTGAAAATCACATTAT-TATTTTTGATAACCTCGC 1 TTATGAAATTTTGATAATCACTTTATAAAATTTTGATAACCTC-T ** * * 859 TT-TGAAATTTTGATAA-CAACACTATGAAATTTTGATAA--TAT 1 TTATGAAATTTTGATAATC-ACTTTATAAAATTTTGATAACCTCT * * * * 900 TCATAT-AAATTTTGATAATCCTATCTTTATGAAATTTCGATAATCACT 1 T--TATGAAATTTTGATAAT-C-A-CTTTATAAAATTTTGATAACCTCT * * * * * 948 CTATGAGA-TTTGATAA-C-CTTCTATCAAATTTTGGT-A-CTCA 1 TTATGAAATTTTGATAATCACTT-TATAAAATTTTGATAACCTCT 988 TTATGAAATT 1 TTATGAAATT 998 GAGACTTTTA Statistics Matches: 174, Mismatches: 35, Indels: 39 0.70 0.14 0.16 Matches are distributed among these distances: 40 8 0.05 41 6 0.03 42 13 0.07 43 53 0.30 44 60 0.34 45 4 0.02 46 27 0.16 47 2 0.01 48 1 0.01 ACGTcount: A:0.35, C:0.13, G:0.09, T:0.43 Consensus pattern (44 bp): TTATGAAATTTTGATAATCACTTTATAAAATTTTGATAACCTCT Found at i:1032 original size:22 final size:22 Alignment explanation

Indices: 844--1125 Score: 92 Period size: 22 Copynumber: 12.5 Consensus size: 22 834 ACATTATTAT * * 844 TTTTGATAACC-TCGCTTTGAAA 1 TTTTGATAACCTTC-ATATGAAA ** 866 TTTTGATAA-CAACACTATGAAA 1 TTTTGATAACCTTCA-TATGAAA ** 888 TTTTGATAATATTCATAT-AAA 1 TTTTGATAACCTTCATATGAAA * 909 TTTTGATAATCCTATCTTTATGAAA 1 TTTTGATAA-CCT-TC-ATATGAAA * * 934 TTTCGATAATCAC-TC-TATGAGA 1 TTTTGATAA-C-CTTCATATGAAA * 956 -TTTGATAACCTTC-TATCAAA 1 TTTTGATAACCTTCATATGAAA * 976 TTTTGGT-A-C-TCATTATGAAA 1 TTTTGATAACCTTCA-TATGAAA * 996 TTGAGACTTTTATAACCTTCATATGAAA 1 -T-----TTTGATAACCTTCATATGAAA * * 1024 TTTTGATAACC-ACACTATCAAA 1 TTTTGATAACCTTCA-TATGAAA * 1046 TTTTGATAACCTCCCA-ATGAAGCA 1 TTTTGATAACCT-TCATATGAA--A * 1070 -TTAG-TAACCTTC-TAATGAAA 1 TTTTGATAACCTTCAT-ATGAAA * * 1090 TTTTGTTAACC-ACACTATGAAA 1 TTTTGATAACCTTCA-TATGAAA 1112 TTTTTGTATAACCT 1 -TTTTG-ATAACCT 1126 CGTTATGGCA Statistics Matches: 197, Mismatches: 28, Indels: 67 0.67 0.10 0.23 Matches are distributed among these distances: 18 2 0.01 19 2 0.01 20 16 0.08 21 33 0.17 22 87 0.44 23 13 0.07 24 13 0.07 25 13 0.07 26 5 0.03 27 2 0.01 28 8 0.04 29 3 0.02 ACGTcount: A:0.35, C:0.16, G:0.10, T:0.39 Consensus pattern (22 bp): TTTTGATAACCTTCATATGAAA Found at i:1240 original size:24 final size:22 Alignment explanation

Indices: 1186--1244 Score: 64 Period size: 22 Copynumber: 2.6 Consensus size: 22 1176 AATTAAGCAC * 1186 CCTATGAAATTTCAATAATCAA 1 CCTATGAAATTTTAATAATCAA * * * 1208 CCTAAGAAATTTTAATAACCTGAT 1 CCTATGAAATTTTAATAA--TCAA 1232 CCTATGAAATTTT 1 CCTATGAAATTTT 1245 GGTAGCCACT Statistics Matches: 30, Mismatches: 5, Indels: 2 0.81 0.14 0.05 Matches are distributed among these distances: 22 16 0.53 24 14 0.47 ACGTcount: A:0.41, C:0.17, G:0.07, T:0.36 Consensus pattern (22 bp): CCTATGAAATTTTAATAATCAA Found at i:1261 original size:22 final size:22 Alignment explanation

Indices: 1233--1289 Score: 87 Period size: 22 Copynumber: 2.6 Consensus size: 22 1223 TAACCTGATC * * 1233 CTATGAAATTTTGGTAGCCACT 1 CTATGAAATTTTGGTAACCACA * 1255 CTATGAAATTTTGGTAACTACA 1 CTATGAAATTTTGGTAACCACA 1277 CTATGAAATTTTG 1 CTATGAAATTTTG 1290 ATCATGACTG Statistics Matches: 32, Mismatches: 3, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 22 32 1.00 ACGTcount: A:0.32, C:0.14, G:0.16, T:0.39 Consensus pattern (22 bp): CTATGAAATTTTGGTAACCACA Found at i:2078 original size:22 final size:22 Alignment explanation

Indices: 2051--2104 Score: 90 Period size: 22 Copynumber: 2.5 Consensus size: 22 2041 AAAAAATAAA * 2051 TTTGGTAACCATACTATGAAAT 1 TTTGGTAACCACACTATGAAAT 2073 TTTGGTAACCACACTATGAAAT 1 TTTGGTAACCACACTATGAAAT * 2095 TTTGATAACC 1 TTTGGTAACC 2105 TCCTCATAGA Statistics Matches: 30, Mismatches: 2, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 22 30 1.00 ACGTcount: A:0.35, C:0.17, G:0.13, T:0.35 Consensus pattern (22 bp): TTTGGTAACCACACTATGAAAT Found at i:2117 original size:22 final size:22 Alignment explanation

Indices: 2051--2149 Score: 80 Period size: 22 Copynumber: 4.5 Consensus size: 22 2041 AAAAAATAAA * 2051 TTTGGTAACCATACT-ATGAAAT 1 TTTGATAACCAT-CTCATGAAAT * * 2073 TTTGGTAACCA-CACTATGAAAT 1 TTTGATAACCATCTC-ATGAAAT 2095 TTTGATAACC-TCCTCAT-AGAAT 1 TTTGATAACCAT-CTCATGA-AAT * * * 2117 TATAATAACCATCTTATGAAAT 1 TTTGATAACCATCTCATGAAAT 2139 TTTGATAACCA 1 TTTGATAACCA 2150 CATAGAGATA Statistics Matches: 62, Mismatches: 8, Indels: 14 0.74 0.10 0.17 Matches are distributed among these distances: 20 1 0.02 21 1 0.02 22 56 0.90 23 4 0.06 ACGTcount: A:0.37, C:0.17, G:0.10, T:0.35 Consensus pattern (22 bp): TTTGATAACCATCTCATGAAAT Found at i:2482 original size:31 final size:31 Alignment explanation

Indices: 2425--2491 Score: 100 Period size: 31 Copynumber: 2.1 Consensus size: 31 2415 TGACAATTAA * 2425 GAAATATGTTTTTTAAAAAAAGGGTACAATTG 1 GAAATATG-TTTTTAAAAAAAGGGTACAATCG 2457 GAAATATG-TTTTAAAAATAAGGGTACAATCG 1 GAAATATGTTTTTAAAAA-AAGGGTACAATCG 2488 GAAA 1 GAAA 2492 ACATAAAGTT Statistics Matches: 33, Mismatches: 1, Indels: 3 0.89 0.03 0.08 Matches are distributed among these distances: 30 9 0.27 31 16 0.48 32 8 0.24 ACGTcount: A:0.46, C:0.04, G:0.19, T:0.30 Consensus pattern (31 bp): GAAATATGTTTTTAAAAAAAGGGTACAATCG Found at i:2658 original size:1 final size:1 Alignment explanation

Indices: 2652--2679 Score: 56 Period size: 1 Copynumber: 28.0 Consensus size: 1 2642 TTTATTTACT 2652 AAAAAAAAAAAAAAAAAAAAAAAAAAAA 1 AAAAAAAAAAAAAAAAAAAAAAAAAAAA 2680 CTTTGAACTA Statistics Matches: 27, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 1 27 1.00 ACGTcount: A:1.00, C:0.00, G:0.00, T:0.00 Consensus pattern (1 bp): A Found at i:8868 original size:206 final size:203 Alignment explanation

Indices: 8469--8881 Score: 485 Period size: 206 Copynumber: 2.0 Consensus size: 203 8459 TATCTTCATT * * * * 8469 TTTAAGAATCTGCTACAACTTTTTACTCATCTGTAATTATCATCGATTAGTAATTATATATGCTA 1 TTTAAGAATCTGCTACAACTTTTTACTCATATGTAACTATCATCAACTAGTAATTATATATGCTA ** 8534 AGCTTGTATGTATATAAATTAACAGGAATACCGGTACCATTTCCCATCATGCTCTATTTGCCATA 66 AGCTTGTATG-ATATAAA-TAACAGGAATACAAGTACCATTTCCCATCATGCTCTATTTGCCATA * ** 8599 AATTTGTTACTTTTCAAATTATCAGTGTTCTAAGGCATGCTTCTAATCACTTTAAACCATATTCC 129 AATATGAAACTTTTCAAATTATCAGTGTTCTAAGGCATGCTTCT-ATCACTTTAAACCATATTCC 8664 TGCATATGTAC 193 TGCATATGTAC * * * * * 8675 TTTAAGTATCTGCTGCAACTTTTTACTCATATGTAACTATGATTAACTAGTGATTATATATGCTA 1 TTTAAGAATCTGCTACAACTTTTTACTCATATGTAACTATCATCAACTAGTAATTATATATGCTA * * * 8740 AGCTTGTATG-TATACA-AACAGGAATGCAAGTACCATTTTTACCAATATCCATGCTCTATTTGC 66 AGCTTGTATGATATAAATAACAGGAATACAAGTACCA--TTT-CC--CAT-CATGCTCTATTTGC * * * * * 8803 CATCAATATGAAACTTTTCATATTATCACAGTGTTTTAAGGCATGCTT-T-T-ACTTTATAGCAT 125 CATAAATATGAAACTTTTCAAATTAT--CAGTGTTCTAAGGCATGCTTCTATCACTTTAAACCAT * 8865 ATTCCTGCATATTTAC 188 ATTCCTGCATATGTAC 8881 T 1 T 8882 GTAGTGCCTT Statistics Matches: 176, Mismatches: 23, Indels: 16 0.82 0.11 0.07 Matches are distributed among these distances: 202 16 0.09 204 8 0.05 205 2 0.01 206 92 0.52 207 3 0.02 208 35 0.20 209 1 0.01 210 19 0.11 ACGTcount: A:0.31, C:0.18, G:0.12, T:0.40 Consensus pattern (203 bp): TTTAAGAATCTGCTACAACTTTTTACTCATATGTAACTATCATCAACTAGTAATTATATATGCTA AGCTTGTATGATATAAATAACAGGAATACAAGTACCATTTCCCATCATGCTCTATTTGCCATAAA TATGAAACTTTTCAAATTATCAGTGTTCTAAGGCATGCTTCTATCACTTTAAACCATATTCCTGC ATATGTAC Found at i:13238 original size:2 final size:2 Alignment explanation

Indices: 13231--13261 Score: 53 Period size: 2 Copynumber: 15.5 Consensus size: 2 13221 AAATACATTC * 13231 TA TA TA TA TA TA TA TA TA TA TA TG TA TA TA T 1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 13262 GTATGTATTG Statistics Matches: 27, Mismatches: 2, Indels: 0 0.93 0.07 0.00 Matches are distributed among these distances: 2 27 1.00 ACGTcount: A:0.45, C:0.00, G:0.03, T:0.52 Consensus pattern (2 bp): TA Found at i:13266 original size:12 final size:12 Alignment explanation

Indices: 13231--13269 Score: 60 Period size: 12 Copynumber: 3.2 Consensus size: 12 13221 AAATACATTC * 13231 TATATATATATA 1 TATATATATATG 13243 TATATATATATG 1 TATATATATATG * 13255 TATATATGTATG 1 TATATATATATG 13267 TAT 1 TAT 13270 TGAAAGGTTG Statistics Matches: 25, Mismatches: 2, Indels: 0 0.93 0.07 0.00 Matches are distributed among these distances: 12 25 1.00 ACGTcount: A:0.41, C:0.00, G:0.08, T:0.51 Consensus pattern (12 bp): TATATATATATG Found at i:14364 original size:3 final size:3 Alignment explanation

Indices: 14356--14391 Score: 72 Period size: 3 Copynumber: 12.0 Consensus size: 3 14346 ATTACTATCC 14356 ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT 1 ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT 14392 TTTGTATATA Statistics Matches: 33, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 3 33 1.00 ACGTcount: A:0.33, C:0.00, G:0.00, T:0.67 Consensus pattern (3 bp): ATT Done.