Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01010282.1 Corchorus capsularis cultivar CVL-1 contig10303, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 23455
ACGTcount: A:0.32, C:0.20, G:0.18, T:0.29


Found at i:87 original size:32 final size:32

Alignment explanation

Indices: 51--130 Score: 90 Period size: 32 Copynumber: 2.5 Consensus size: 32 41 GACGCAATCG * * 51 GCAAATGGCGATGCCAAGGCAACCGACCATCA 1 GCAAATGACGACGCCAAGGCAACCGACCATCA * ** * 83 GCAAATGACGACGCCAAGGCCATGGACCATCG 1 GCAAATGACGACGCCAAGGCAACCGACCATCA 115 GCAAAT-ACCGACGCCA 1 GCAAATGA-CGACGCCA 131 CATCAGGACT Statistics Matches: 41, Mismatches: 6, Indels: 2 0.84 0.12 0.04 Matches are distributed among these distances: 31 1 0.02 32 40 0.98 ACGTcount: A:0.34, C:0.33, G:0.25, T:0.09 Consensus pattern (32 bp): GCAAATGACGACGCCAAGGCAACCGACCATCA Found at i:296 original size:22 final size:21 Alignment explanation

Indices: 264--342 Score: 70 Period size: 22 Copynumber: 3.7 Consensus size: 21 254 ACCCATGTTA * 264 GGGCCCAAAGGTGCCCCCGAG 1 GGGCCCCAAGGTGCCCCCGAG * 285 GAGGCCCCAAGGCGCCCCCGAG 1 G-GGCCCCAAGGTGCCCCCGAG * * * 307 GTTGCCCCAA-GTGGCCCCAAG 1 G-GGCCCCAAGGTGCCCCCGAG * 328 GCGCCACCAAGGTGC 1 GGGCC-CCAAGGTGC 343 GAGGTACATG Statistics Matches: 46, Mismatches: 9, Indels: 5 0.77 0.15 0.08 Matches are distributed among these distances: 20 3 0.07 21 14 0.30 22 29 0.63 ACGTcount: A:0.19, C:0.41, G:0.34, T:0.06 Consensus pattern (21 bp): GGGCCCCAAGGTGCCCCCGAG Found at i:1029 original size:19 final size:19 Alignment explanation

Indices: 1002--1041 Score: 71 Period size: 19 Copynumber: 2.1 Consensus size: 19 992 GCCAACATGA * 1002 CAAGGGACGTCATCAAATT 1 CAAGAGACGTCATCAAATT 1021 CAAGAGACGTCATCAAATT 1 CAAGAGACGTCATCAAATT 1040 CA 1 CA 1042 GACAAGGCCC Statistics Matches: 20, Mismatches: 1, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 19 20 1.00 ACGTcount: A:0.40, C:0.23, G:0.17, T:0.20 Consensus pattern (19 bp): CAAGAGACGTCATCAAATT Found at i:1092 original size:25 final size:25 Alignment explanation

Indices: 1057--1139 Score: 76 Period size: 25 Copynumber: 3.2 Consensus size: 25 1047 GGCCCCCATG 1057 GGCCATCGGCAAATGCCGACGCCAA 1 GGCCATCGGCAAATGCCGACGCCAA * *** ** * 1082 GGCCATGGGCATCCGCAAATGCCGCAA 1 GGCCATCGGCAAATGCCGACG-C-CAA * 1109 GGCCATCGGTAAATGCCGACGCCAA 1 GGCCATCGGCAAATGCCGACGCCAA 1134 GGCCAT 1 GGCCAT 1140 GGGCCATCGG Statistics Matches: 41, Mismatches: 15, Indels: 4 0.68 0.25 0.07 Matches are distributed among these distances: 25 23 0.56 26 2 0.05 27 16 0.39 ACGTcount: A:0.27, C:0.34, G:0.29, T:0.11 Consensus pattern (25 bp): GGCCATCGGCAAATGCCGACGCCAA Found at i:1135 original size:52 final size:53 Alignment explanation

Indices: 1057--1158 Score: 179 Period size: 52 Copynumber: 1.9 Consensus size: 53 1047 GGCCCCCATG 1057 GGCCATCGGCAAATGCCGACGCCAAGGCCATGGG-CATCCGCAAATGCCGCAA 1 GGCCATCGGCAAATGCCGACGCCAAGGCCATGGGCCATCCGCAAATGCCGCAA * * 1109 GGCCATCGGTAAATGCCGACGCCAAGGCCATGGGCCATCGGCAAATGCCG 1 GGCCATCGGCAAATGCCGACGCCAAGGCCATGGGCCATCCGCAAATGCCG 1159 GCACCACATC Statistics Matches: 47, Mismatches: 2, Indels: 1 0.94 0.04 0.02 Matches are distributed among these distances: 52 33 0.70 53 14 0.30 ACGTcount: A:0.25, C:0.33, G:0.30, T:0.11 Consensus pattern (53 bp): GGCCATCGGCAAATGCCGACGCCAAGGCCATGGGCCATCCGCAAATGCCGCAA Found at i:1891 original size:31 final size:32 Alignment explanation

Indices: 1744--2352 Score: 602 Period size: 32 Copynumber: 19.2 Consensus size: 32 1734 CACGTCGGCA * * 1744 CCAAGGGACATCGGCAAATGCCGACGACAAGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG 1776 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * 1808 CCAAGGGTCATCGGCAAATGCCGACGCCAAGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * 1840 CCATGGGCCATCGGCAAATGCCGACGCCAAGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * * 1872 CCATGGG-CATCCGCAAATGCCGACGCCAAGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * * * * * * ** 1903 CCAAGGGGACA-AGGGACAT--C-A-TCAAATT 1 CCAA-GGGCCATCGGCAAATGCCGACGCCAAGG * * * ** 1931 CCAA-CGCAATCGGTAAATGCCGATACCAAGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * 1962 -CAACGGGCCATCGGCAAATGCCGACGCCTAGG 1 CCAA-GGGCCATCGGCAAATGCCGACGCCAAGG * 1994 CCATGGG-CATCGGCAAATGCCGACGCCAAGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * * 2025 CCAAGGGCCATCGGCAAATGCCAACGCCAAAG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * * * 2057 CCATGGGCCATCGGCAAATGCCG-TGCCATGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * * 2088 CCATGGG-CATCGGTAAATGCCGATGCCAAGGCCAAGG 1 CCAAGGGCCATCGGCAAATGCCGA---C---GCCAAGG * * 2125 CCAAGGG-CATCGGCAAAAGCCGATGCCAAGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * * 2156 CCAAGGGCCATCGGCAAAATGCCCACACCAAGG 1 CCAAGGGCCATCGGC-AAATGCCGACGCCAAGG * * * * 2189 CCATGGG-CATCGTCAAATGCTGACGCCAAGA 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * 2220 CCAAGGGCCATCGGCAAATGCCGACGTCAAGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG * * * ** 2252 ACATGGG-CATCGGCAACTATCGACGCCAAGG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG 2283 CCAAGGG-CAT-GGACAAATGCCGACGCCAAGG 1 CCAAGGGCCATCGG-CAAATGCCGACGCCAAGG * * * * 2314 CCTAGGGCCATCGACAAATGCCGATGCCAATG 1 CCAAGGGCCATCGGCAAATGCCGACGCCAAGG 2346 CCAAGGG 1 CCAAGGG 2353 GACAAGGGAC Statistics Matches: 474, Mismatches: 79, Indels: 48 0.79 0.13 0.08 Matches are distributed among these distances: 26 2 0.00 27 5 0.01 28 7 0.01 29 2 0.00 30 21 0.04 31 152 0.32 32 235 0.50 33 24 0.05 37 26 0.05 ACGTcount: A:0.29, C:0.31, G:0.29, T:0.10 Consensus pattern (32 bp): CCAAGGGCCATCGGCAAATGCCGACGCCAAGG Found at i:2160 original size:37 final size:37 Alignment explanation

Indices: 2081--2161 Score: 126 Period size: 37 Copynumber: 2.2 Consensus size: 37 2071 CAAATGCCGT * * * * 2081 GCCATGGCCATGGGCATCGGTAAATGCCGATGCCAAG 1 GCCAAGGCCAAGGGCATCGGCAAAAGCCGATGCCAAG 2118 GCCAAGGCCAAGGGCATCGGCAAAAGCCGATGCCAAG 1 GCCAAGGCCAAGGGCATCGGCAAAAGCCGATGCCAAG 2155 GCCAAGG 1 GCCAAGG 2162 GCCATCGGCA Statistics Matches: 40, Mismatches: 4, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 37 40 1.00 ACGTcount: A:0.28, C:0.28, G:0.33, T:0.10 Consensus pattern (37 bp): GCCAAGGCCAAGGGCATCGGCAAAAGCCGATGCCAAG Found at i:2501 original size:31 final size:31 Alignment explanation

Indices: 2382--2520 Score: 134 Period size: 32 Copynumber: 4.4 Consensus size: 31 2372 TTTCGACGCA * * * * * 2382 ATCGGCAAATGGCGATGCCAAGGCAACGGACC 1 ATCGGCAAATGCCGACGCCAAGGCCATGG-GC * * * 2414 ATCAGCAAATGACGACGCCAAGGCCATGGACC 1 ATCGGCAAATGCCGACGCCAAGGCCATGG-GC * * 2446 ATCGGCCAATACCGACGCCAAGGCCATGGGC 1 ATCGGCAAATGCCGACGCCAAGGCCATGGGC * ** 2477 ATTGGCAAATGCCGATTCCAAGGCCATGGGCC 1 ATCGGCAAATGCCGACGCCAAGGCCATGGG-C 2509 ATCGGCAAATGC 1 ATCGGCAAATGC 2521 TGATGCCAGT Statistics Matches: 90, Mismatches: 16, Indels: 2 0.83 0.15 0.02 Matches are distributed among these distances: 31 26 0.29 32 64 0.71 ACGTcount: A:0.29, C:0.30, G:0.28, T:0.12 Consensus pattern (31 bp): ATCGGCAAATGCCGACGCCAAGGCCATGGGC Found at i:2591 original size:186 final size:184 Alignment explanation

Indices: 2226--2625 Score: 502 Period size: 186 Copynumber: 2.2 Consensus size: 184 2216 AAGACCAAGG * * * * 2226 GCCATCGGCAAATGCCGACGTCAAGGACATGGGCATCGGCAACTATCGACGCCAAGGCCAAGGGC 1 GCCATC-GCAAATGCCGACGCCAAGGCCATGGCCATCGGCAACTACCGACGCCAAGGCCAAGGGC 2291 ATGGACAAATGCCGACGCCAAGGCCTAGGGCCATCGACAAATGCCGATGCCAATGCCAAGGGGAC 65 ATGGACAAATGCCGACGCCAAGGCCTAGGGCCATCGACAAATGCCGATGCCAATGCCAAGGGGAC * * * ** 2356 AAGGGACATCATAAAATTTCGACGCAATCGGCAAATGGCGATGCCAAGGCAACG- 130 AAGGGACATCATAAAATTCCGACGCAATCAGCAAATGCCGACACCAAGGCAACGT * * 2410 GACCATCAGCAAATGACGACGCCAAGGCCATGGACCATCGGCCAA-TACCGACGCCAAGGCCATG 1 G-CCATC-GCAAATGCCGACGCCAAGGCCATGG-CCATCGG-CAACTACCGACGCCAAGGCCAAG ** * * * 2474 GGCATTGG-CAAATGCCGATTCCAAGGCC-ATGGGCCATCGGCAAATGCTGATGCCAGTGCCAAG 62 GGCA-TGGACAAATGCCGACGCCAAGGCCTA-GGGCCATCGACAAATGCCGATGCCAATGCCAAG * * * * 2537 TGGACAAGGGACATCATCAAATTCCGACGCAATCAGCCAATGCCGACACCAAGGCCACGT 125 GGGACAAGGGACATCATAAAATTCCGACGCAATCAGCAAATGCCGACACCAAGGCAACGT * * 2597 GCCATCGTCAAATGCCGATGCCAATGCCA 1 GCCATCG-CAAATGCCGACGCCAAGGCCA 2626 ACGTCACAAG Statistics Matches: 185, Mismatches: 24, Indels: 12 0.84 0.11 0.05 Matches are distributed among these distances: 184 1 0.01 185 29 0.16 186 148 0.80 187 7 0.04 ACGTcount: A:0.30, C:0.30, G:0.27, T:0.13 Consensus pattern (184 bp): GCCATCGCAAATGCCGACGCCAAGGCCATGGCCATCGGCAACTACCGACGCCAAGGCCAAGGGCA TGGACAAATGCCGACGCCAAGGCCTAGGGCCATCGACAAATGCCGATGCCAATGCCAAGGGGACA AGGGACATCATAAAATTCCGACGCAATCAGCAAATGCCGACACCAAGGCAACGT Found at i:2626 original size:91 final size:91 Alignment explanation

Indices: 2484--2655 Score: 256 Period size: 91 Copynumber: 1.9 Consensus size: 91 2474 GGCATTGGCA ** * * * * 2484 AATGCCGATTCCAAGGCCATGGGCCATCGGCAAATGCTGATGCCAGTGCCAA-GTGGACAAGGGA 1 AATGCCGACACCAAGGCCACGGGCCATCGGCAAATGCCGATGCCAATGCCAACGT-CACAAGGGA 2548 CATCATCAAATTCCGACGCAATCAGCC 65 CATCATCAAATTCCGACGCAATCAGCC * * 2575 AATGCCGACACCAAGGCCACGTGCCATCGTCAAATGCCGATGCCAATGCCAACGTCACAAGGGAC 1 AATGCCGACACCAAGGCCACGGGCCATCGGCAAATGCCGATGCCAATGCCAACGTCACAAGGGAC 2640 ATCATCAAATTCCGAC 66 ATCATCAAATTCCGAC 2656 AAGGCCATCG Statistics Matches: 72, Mismatches: 8, Indels: 2 0.88 0.10 0.02 Matches are distributed among these distances: 91 70 0.97 92 2 0.03 ACGTcount: A:0.31, C:0.31, G:0.23, T:0.16 Consensus pattern (91 bp): AATGCCGACACCAAGGCCACGGGCCATCGGCAAATGCCGATGCCAATGCCAACGTCACAAGGGAC ATCATCAAATTCCGACGCAATCAGCC Found at i:2699 original size:31 final size:30 Alignment explanation

Indices: 2661--2763 Score: 134 Period size: 31 Copynumber: 3.3 Consensus size: 30 2651 CCGACAAGGC ** 2661 CATCGGCAAATGTTGACGCCAAGGCCATGG 1 CATCGGCAAATGCCGACGCCAAGGCCATGG * * 2691 ACATCGGCAAATGCCAACGCCAAGGCCAAGGG 1 -CATCGGCAAATGCCGACGCCAAGGCC-ATGG * 2723 CATCGGCAAATGCCGACGCAAAGGCCATTGG 1 CATCGGCAAATGCCGACGCCAAGGCCA-TGG 2754 CATCGGCAAA 1 CATCGGCAAA 2764 AGACGATGCC Statistics Matches: 63, Mismatches: 7, Indels: 4 0.85 0.09 0.05 Matches are distributed among these distances: 30 1 0.02 31 59 0.94 32 3 0.05 ACGTcount: A:0.31, C:0.29, G:0.28, T:0.12 Consensus pattern (30 bp): CATCGGCAAATGCCGACGCCAAGGCCATGG Found at i:2883 original size:246 final size:247 Alignment explanation

Indices: 2412--2952 Score: 807 Period size: 246 Copynumber: 2.2 Consensus size: 247 2402 AGGCAACGGA * * * * * 2412 CCATCAGCAAATGACGACGCCAAGGCCATGGACCATCGGCCAATACCGACGCCAAGGCCATGGGC 1 CCATCGGCAAATGACGACGCCAAGGCCATGGA-CATCGGCAAATGCCAACGCCAAGGCCAAGGGC * ** * * 2477 ATTGGCAAATGCCGATTCCAAGGCCATGGGCCATCGGCAAATGCTGATGCCAGTGCCAAGTGGAC 65 ATCGGCAAATGCCGACGCAAAGGCCATGGGCCATCGGCAAAAGCTGATGCCAGTGCCAAGTGGAC * * * 2542 AAGGGACATCATCAAATTCCGACGCAATCAGCCAATGCCGACACCAAGGCCACGTGCCATCGTCA 130 AAGGGACATCATCAAATTCCAACGCAATCAGCAAATGCCGACACCAAGGCCACGTGCCATCGGCA * 2607 AATGCCGATGCCAATGCCAACGTCACAAGGGACATCATCAAATTCCGACAAGG 195 AATGCCGATGCCAATGCCAACGTCACAAGGGACATCATAAAATTCCGACAAGG ** 2660 CCATCGGCAAATGTTGACGCCAAGGCCATGGACATCGGCAAATGCCAACGCCAAGGCCAAGGGCA 1 CCATCGGCAAATGACGACGCCAAGGCCATGGACATCGGCAAATGCCAACGCCAAGGCCAAGGGCA * * 2725 TCGGCAAATGCCGACGCAAAGGCCATTGG-CATCGGCAAAAGAC-GATGCCAGTGCCAATTGGAC 66 TCGGCAAATGCCGACGCAAAGGCCATGGGCCATCGGCAAAAG-CTGATGCCAGTGCCAAGTGGAC * * * * 2788 AAGGGACATCATCAAATTCCAACGCAATCGGCAAATGCCGACGCCAAGGCTATGTGCCATCGGCA 130 AAGGGACATCATCAAATTCCAACGCAATCAGCAAATGCCGACACCAAGGCCACGTGCCATCGGCA * 2853 AATGCCGATGCCAATGCCAACGTGACAAGGGACATCATAAAATTCCGACAAGG 195 AATGCCGATGCCAATGCCAACGTCACAAGGGACATCATAAAATTCCGACAAGG * * * 2906 CCATCGGTAAATGCCGACGCCAAGGCCATGGGCCATCGGCAAATGCC 1 CCATCGGCAAATGACGACGCCAAGGCCAT-GGACATCGGCAAATGCC 2953 GGCACCACAT Statistics Matches: 264, Mismatches: 27, Indels: 5 0.89 0.09 0.02 Matches are distributed among these distances: 246 165 0.62 247 70 0.27 248 29 0.11 ACGTcount: A:0.31, C:0.30, G:0.25, T:0.14 Consensus pattern (247 bp): CCATCGGCAAATGACGACGCCAAGGCCATGGACATCGGCAAATGCCAACGCCAAGGCCAAGGGCA TCGGCAAATGCCGACGCAAAGGCCATGGGCCATCGGCAAAAGCTGATGCCAGTGCCAAGTGGACA AGGGACATCATCAAATTCCAACGCAATCAGCAAATGCCGACACCAAGGCCACGTGCCATCGGCAA ATGCCGATGCCAATGCCAACGTCACAAGGGACATCATAAAATTCCGACAAGG Found at i:2949 original size:94 final size:91 Alignment explanation

Indices: 2716--2953 Score: 309 Period size: 94 Copynumber: 2.6 Consensus size: 91 2706 AACGCCAAGG * * * * * * 2716 CCAAGGGCATCGGCAAATGCCGACGCAAAGGCCATTGG-CATCGGCAAAAGACGATGCCAGTGCC 1 CCAAGGCCATCGGCAAATGCCGACGCCAAGGCCATGGGCCATCGGCAAATGCCGATGCCAATGCC * * 2780 AATTGGACAAGGGACATCATCAAATT 66 AAGTGGACAAGGGACATCATAAAATT * * * * 2806 CCAACGCAATCGGCAAATGCCGACGCCAAGGCTATGTGCCATCGGCAAATGCCGATGCCAATGCC 1 CCAAGGCCATCGGCAAATGCCGACGCCAAGGCCATGGGCCATCGGCAAATGCCGATGCCAATGCC 2871 AACGT-GACAAGGGACATCATAAAATT 66 AA-GTGGACAAGGGACATCATAAAATT * 2897 CCGACAAGGCCATCGGTAAATGCCGACGCCAAGGCCATGGGCCATCGGCAAATGCCG 1 -C--CAAGGCCATCGGCAAATGCCGACGCCAAGGCCATGGGCCATCGGCAAATGCCG 2954 GCACCACATC Statistics Matches: 126, Mismatches: 17, Indels: 6 0.85 0.11 0.04 Matches are distributed among these distances: 90 31 0.25 91 45 0.36 92 2 0.02 94 48 0.38 ACGTcount: A:0.32, C:0.29, G:0.26, T:0.14 Consensus pattern (91 bp): CCAAGGCCATCGGCAAATGCCGACGCCAAGGCCATGGGCCATCGGCAAATGCCGATGCCAATGCC AAGTGGACAAGGGACATCATAAAATT Found at i:7992 original size:45 final size:45 Alignment explanation

Indices: 7928--8015 Score: 149 Period size: 45 Copynumber: 2.0 Consensus size: 45 7918 GAAACCTCAC 7928 TACCAACCAATCCGTGATTCCATTGACCATCTCCAAATCTGTAGG 1 TACCAACCAATCCGTGATTCCATTGACCATCTCCAAATCTGTAGG ** * 7973 TACCAACCAATTGGTGATTCCATTGACTATCTCCAAATCTGTA 1 TACCAACCAATCCGTGATTCCATTGACCATCTCCAAATCTGTA 8016 TGATATAAAA Statistics Matches: 40, Mismatches: 3, Indels: 0 0.93 0.07 0.00 Matches are distributed among these distances: 45 40 1.00 ACGTcount: A:0.30, C:0.28, G:0.12, T:0.30 Consensus pattern (45 bp): TACCAACCAATCCGTGATTCCATTGACCATCTCCAAATCTGTAGG Found at i:10568 original size:43 final size:43 Alignment explanation

Indices: 10503--10588 Score: 145 Period size: 43 Copynumber: 2.0 Consensus size: 43 10493 GGTGCTTCCC * 10503 CTTTCGCTTTCTGTTCACCTGATTTAATTTATGATGATGAATT 1 CTTTCGCTTTCTGTTCACCTGATTTAATCTATGATGATGAATT * * 10546 CTTTCGCTTTCTGTTTATCTGATTTAATCTATGATGATGAATT 1 CTTTCGCTTTCTGTTCACCTGATTTAATCTATGATGATGAATT 10589 TCACTACATC Statistics Matches: 40, Mismatches: 3, Indels: 0 0.93 0.07 0.00 Matches are distributed among these distances: 43 40 1.00 ACGTcount: A:0.21, C:0.15, G:0.14, T:0.50 Consensus pattern (43 bp): CTTTCGCTTTCTGTTCACCTGATTTAATCTATGATGATGAATT Found at i:18187 original size:6 final size:6 Alignment explanation

Indices: 18176--18201 Score: 52 Period size: 6 Copynumber: 4.3 Consensus size: 6 18166 CCAAAGCTTC 18176 CTGGGA CTGGGA CTGGGA CTGGGA CT 1 CTGGGA CTGGGA CTGGGA CTGGGA CT 18202 CTCCATGGTC Statistics Matches: 20, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 6 20 1.00 ACGTcount: A:0.15, C:0.19, G:0.46, T:0.19 Consensus pattern (6 bp): CTGGGA Done.