Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01006851.1 Corchorus capsularis cultivar CVL-1 contig06872, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 25677
ACGTcount: A:0.32, C:0.19, G:0.17, T:0.32


Found at i:2662 original size:37 final size:36

Alignment explanation

Indices: 2576--2792 Score: 260
Period size: 37 Copynumber: 5.9 Consensus size: 36

 2566 CCAGTACCCC

 2576 AATACACTAAGAGTCAAAATAATAGTAACCAGTAA-T
    1 AATA-ACTAAGAGTCAAAATAATAGTAACCAGTAATT

                         *
 2612 --TAACTAAGAGTCAAAATGATAGTAACCAGTAATT
    1 AATAACTAAGAGTCAAAATAATAGTAACCAGTAATT

                          *
 2646 AACTAACTAAGAGTCAAAATGATAGTAACCAGTAATT
    1 AA-TAACTAAGAGTCAAAATAATAGTAACCAGTAATT

                          *
 2683 AAGTAACTAAGAGTCAAAATGATAGTAACCAGTAATT
    1 AA-TAACTAAGAGTCAAAATAATAGTAACCAGTAATT

            *           *    *    *
 2720 AAGTAATTAAGAGTCAAAGTAATGGTAATCAGTAAAATT
    1 AA-TAACTAAGAGTCAAAATAATAGTAACCAGT--AATT

      *    *           *         *
 2759 GATAATTAAGAGTCAAAGTAATAGTAATCAGTAA
    1 AATAACTAAGAGTCAAAATAATAGTAACCAGTAA

 2793 ATCGATAGTT

Statistics
Matches: 166, Mismatches: 9, Indels: 12
         0.89             0.05       0.06

Matches are distributed among these distances:
 33   29  0.17
 34    3  0.02
 36    2  0.01
 37   98  0.59
 38   29  0.17
 39    5  0.03

ACGTcount: A:0.49, C:0.10, G:0.15, T:0.25

Consensus pattern (36 bp):
AATAACTAAGAGTCAAAATAATAGTAACCAGTAATT


Found at i:2741 original size:16 final size:16

Alignment explanation

Indices: 2720--2780 Score: 50
Period size: 19 Copynumber: 3.4 Consensus size: 16

 2710 ACCAGTAATT

 2720 AAGTAATTAAGAGTCA
    1 AAGTAATTAAGAGTCA

                   *   *
 2736 AAGTAATGGTAATCAGTAA
    1 AAGTAAT--TAA-GAGTCA

 2755 AATTGATAATTAAGAGTCA
    1 AA--G-TAATTAAGAGTCA

 2774 AAGTAAT
    1 AAGTAAT

 2781 AGTAATCAGT

Statistics
Matches: 35, Mismatches: 4, Indels: 12
         0.69             0.08       0.24

Matches are distributed among these distances:
 16   11  0.31
 17    1  0.03
 18    3  0.09
 19   12  0.34
 20    3  0.09
 21    1  0.03
 22    4  0.11

ACGTcount: A:0.49, C:0.05, G:0.18, T:0.28

Consensus pattern (16 bp):
AAGTAATTAAGAGTCA


Found at i:2774 original size:38 final size:37

Alignment explanation

Indices: 2583--2851 Score: 185
Period size: 37 Copynumber: 7.3 Consensus size: 37

 2573 CCCAATACAC

                 *         *              *
 2583 TAAGAGTCAAAATAATAGTAACCAGT-AAT---TAAC
    1 TAAGAGTCAAAGTAATAGTAATCAGTAAATCGATAAT

                 * *       *        **     *
 2616 TAAGAGTCAAAATGATAGTAACCAGT-AATTAACTAAC
    1 TAAGAGTCAAAGTAATAGTAATCAGTAAATCGA-TAAT

                 * *       *        **     *
 2653 TAAGAGTCAAAATGATAGTAACCAGT-AATTAAGTAAC
    1 TAAGAGTCAAAGTAATAGTAATCAGTAAATCGA-TAAT

                 * *       *        **
 2690 TAAGAGTCAAAATGATAGTAACCAGT-AATTAAGTAAT
    1 TAAGAGTCAAAGTAATAGTAATCAGTAAATCGA-TAAT

                      *              *
 2727 TAAGAGTCAAAGTAATGGTAATCAGTAAAATTGATAAT
    1 TAAGAGTCAAAGTAATAGTAATCAGT-AAATCGATAAT

                                         *
 2765 TAAGAGTCAAAGTAATAGTAATCAGTAAATCGATAGT
    1 TAAGAGTCAAAGTAATAGTAATCAGTAAATCGATAAT

                *
 2802 TAAGAGTCAAGGTAAAAATAGTAATCAGTAAATC-AGTAAT
    1 TAAGAGTCAAAGT---AATAGTAATCAGTAAATCGA-TAAT

           *
 2842 TAAGAATCAA
    1 TAAGAGTCAA

 2852 GGGATTAATC

Statistics
Matches: 212, Mismatches: 14, Indels: 13
         0.89             0.06       0.05

Matches are distributed among these distances:
 33   28  0.13
 37  119  0.56
 38   29  0.14
 39    6  0.03
 40   30  0.14

ACGTcount: A:0.49, C:0.09, G:0.16, T:0.26

Consensus pattern (37 bp):
TAAGAGTCAAAGTAATAGTAATCAGTAAATCGATAAT


Found at i:2829 original size:40 final size:37

Alignment explanation

Indices: 2723--2878 Score: 158
Period size: 40 Copynumber: 4.2 Consensus size: 37

 2713 AGTAATTAAG

                    *     *              *
 2723 TAATTAAGAGTCAAAGTAATGGTAATCAGTAAAATTGA
    1 TAATTAAGAGTCAAGGTAATAGTAATCAGT-AAATCGA

                    *
 2761 TAATTAAGAGTCAAAGTAATAGTAATCAGTAAATCGA
    1 TAATTAAGAGTCAAGGTAATAGTAATCAGTAAATCGA

        *
 2798 TAGTTAAGAGTCAAGGTAAAAATAGTAATCAGTAAATC-A
    1 TAATTAAGAGTCAAGGT---AATAGTAATCAGTAAATCGA

                *       *                *
 2837 GTAATTAAGAATCAAGG-GAT--TAATCAGTAAATTGA
    1 -TAATTAAGAGTCAAGGTAATAGTAATCAGTAAATCGA

        *
 2872 TACTTAA
    1 TAATTAA

 2879 AGGAGAAAGT

Statistics
Matches: 104, Mismatches: 9, Indels: 14
         0.82             0.07       0.11

Matches are distributed among these distances:
 34   18  0.17
 35    1  0.01
 36    2  0.02
 37   21  0.20
 38   29  0.28
 39    1  0.01
 40   32  0.31

ACGTcount: A:0.47, C:0.07, G:0.17, T:0.28

Consensus pattern (37 bp):
TAATTAAGAGTCAAGGTAATAGTAATCAGTAAATCGA


Found at i:2957 original size:23 final size:22

Alignment explanation

Indices: 2921--2966 Score: 56
Period size: 23 Copynumber: 2.0 Consensus size: 22

 2911 GAAAAGGAAG

                    *
 2921 TAAAAAGGACTAATCAGTAAAT
    1 TAAAAAGGACTAATAAGTAAAT

                *      *
 2943 TAAAAAGAGATTAATAATTAAAT
    1 TAAAAAG-GACTAATAAGTAAAT

 2966 T
    1 T

 2967 GGTAATCAAA

Statistics
Matches: 20, Mismatches: 3, Indels: 1
         0.83             0.12       0.04

Matches are distributed among these distances:
 22    7  0.35
 23   13  0.65

ACGTcount: A:0.57, C:0.04, G:0.11, T:0.28

Consensus pattern (22 bp):
TAAAAAGGACTAATAAGTAAAT


Found at i:3003 original size:18 final size:19

Alignment explanation

Indices: 2980--3020 Score: 59
Period size: 19 Copynumber: 2.2 Consensus size: 19

 2970 AATCAAATGG

 2980 TAAGAGT-AAAAA-GGATAT
    1 TAAGAGTGAAAAATGG-TAT

 2998 TAAGAGTGAAAAATGGTAT
    1 TAAGAGTGAAAAATGGTAT

 3017 TAAG
    1 TAAG

 3021 CAAAAAGAGT

Statistics
Matches: 21, Mismatches: 0, Indels: 3
         0.88             0.00       0.12

Matches are distributed among these distances:
 18    7  0.33
 19   12  0.57
 20    2  0.10

ACGTcount: A:0.51, C:0.00, G:0.24, T:0.24

Consensus pattern (19 bp):
TAAGAGTGAAAAATGGTAT


Found at i:3035 original size:26 final size:26

Alignment explanation

Indices: 2999--3280 Score: 245
Period size: 26 Copynumber: 10.8 Consensus size: 26

 2989 AAAGGATATT

            *               *
 2999 AAGAGTGAAAAATGGTATTAAGCAAA
    1 AAGAGTAAAAAATGGTATTAAGTAAA

                           *
 3025 AAGAGTAAAAAATGGTATTAAATAAA
    1 AAGAGTAAAAAATGGTATTAAGTAAA

      *            **   *  *    *
 3051 GAGAG-AAAAAATATTATCATGGTAAT
    1 AAGAGTAAAAAATGGTATTA-AGTAAA

                     *          *
 3077 AAGAGTAAAAAAAACCGGTATTAAGCAAAA
    1 AAGAGT--AAAAAA-TGGTATTAAG-TAAA

                              **
 3107 AAGAGTAAAAAATGGTATTAAGTATT
    1 AAGAGTAAAAAATGGTATTAAGTAAA

      *
 3133 GAGAGT-AAAAATGGTATTAAGTAAA
    1 AAGAGTAAAAAATGGTATTAAGTAAA

      *   **            *  *    *
 3158 GAGAAAAAAAAATGGTATGATGGTAAT
    1 AAGAGTAAAAAATGGTATTA-AGTAAA

                             *
 3185 AAGAGTAAAAAATGGTATTAAGCAAAA
    1 AAGAGTAAAAAATGGTATTAAG-TAAA

                              **
 3212 AAGAGT-AAAAATGGTATTAAGTATT
    1 AAGAGTAAAAAATGGTATTAAGTAAA

 3237 AAGAGT-AAAAATGGTATTAAGT-AA
    1 AAGAGTAAAAAATGGTATTAAGTAAA

 3261 AA-AGT-AAAAATGGTATTAAG
    1 AAGAGTAAAAAATGGTATTAAG

 3281 AGTAAAAAAA

Statistics
Matches: 205, Mismatches: 42, Indels: 21
         0.76             0.16       0.08

Matches are distributed among these distances:
 23   18  0.09
 24    2  0.01
 25   55  0.27
 26   68  0.33
 27   37  0.18
 28    6  0.03
 29    7  0.03
 30   12  0.06

ACGTcount: A:0.54, C:0.02, G:0.20, T:0.23

Consensus pattern (26 bp):
AAGAGTAAAAAATGGTATTAAGTAAA


Found at i:3087 original size:52 final size:51

Alignment explanation

Indices: 3006--3280 Score: 202
Period size: 51 Copynumber: 5.3 Consensus size: 51

 2996 ATTAAGAGTG

            *        *  *                          *
 3006 AAAAATGGTATTAAGCAAAAAGAGTAAAAAATGGTATTAAATAAAGAGAGA
    1 AAAAATAGTATTAAGTAATAAGAGTAAAAAATGGTATTAAATAAAAAGAGA

             *   *  *                    *
 3057 AAAAATATTATCATGGTAATAAGAGTAAAAAAAACCGGTATTAAGCA-AAAAAGAGTA
    1 AAAAATAGTATTA-AGTAATAAGAGT--AAAAAA-TGGTATTAA--ATAAAAAGAG-A

            *          * *                    *    *    *
 3114 AAAAATGGTATTAAGTATTGAGAGT-AAAAATGGTATTAAGTAAAGAGAAAA
    1 AAAAATAGTATTAAGTAATAAGAGTAAAAAATGGTATTAAATAAAAAG-AGA

            *    *  *                                      *
 3165 AAAAATGGTATGATGGTAATAAGAGTAAAAAATGGTATTAAGCA-AAAAAGAGT
    1 AAAAATAGTATTA-AGTAATAAGAGTAAAAAATGGTATTAA--ATAAAAAGAGA

            *          *                      *         *
 3218 AAAAATGGTATTAAGTATTAAGAGT-AAAAATGGTATTAAGT-AAAA-AGT
    1 AAAAATAGTATTAAGTAATAAGAGTAAAAAATGGTATTAAATAAAAAGAGA

            *
 3266 AAAAATGGTATTAAG
    1 AAAAATAGTATTAAG

 3281 AGTAAAAAAA

Statistics
Matches: 180, Mismatches: 30, Indels: 31
         0.75             0.12       0.13

Matches are distributed among these distances:
 48   18  0.10
 49    4  0.02
 51   42  0.23
 52   37  0.21
 53   32  0.18
 54   11  0.06
 55    8  0.04
 56   16  0.09
 57   12  0.07

ACGTcount: A:0.55, C:0.02, G:0.20, T:0.24

Consensus pattern (51 bp):
AAAAATAGTATTAAGTAATAAGAGTAAAAAATGGTATTAAATAAAAAGAGA


Found at i:3121 original size:108 final size:103

Alignment explanation

Indices: 2999--3270 Score: 375
Period size: 104 Copynumber: 2.6 Consensus size: 103

 2989 AAAGGATATT

                            * **                     *               *
 2999 AAGAGTGAAAAATGGTATTAAGCAAAAAGAGTAAAAAATGGTATTAAATAAAGAGAGAAAAAATA
    1 AAGAGT-AAAAATGGTATTAAGTATTAAGAGT-AAAAATGGTATTAAGTAAAGAGA-AAAAAAAA

 3064 T-TATCATGGTAATAAGAGTAAAAAAAACCGGTATTAAGCAAAA
   63 TGTATCATGGTAATAAGAGT--AAAAAA-CGGTATTAAGCAAAA

                                *
 3107 AAGAGTAAAAAATGGTATTAAGTATTGAGAGTAAAAATGGTATTAAGTAAAGAGAAAAAAAAATG
    1 AAGAGT-AAAAATGGTATTAAGTATTAAGAGTAAAAATGGTATTAAGTAAAGAGAAAAAAAAAT-

          *                    *
 3172 GTATGATGGTAATAAGAGTAAAAAATGGTATTAAGCAAAA
   64 GTATCATGGTAATAAGAGTAAAAAACGGTATTAAGCAAAA

                                                        *  *
 3212 AAGAGTAAAAATGGTATTAAGTATTAAGAGTAAAAATGGTATTAAGTAAAAAGTAAAAA
    1 AAGAGTAAAAATGGTATTAAGTATTAAGAGTAAAAATGGTATTAAGTAAAGAGAAAAAA

 3271 TGGTATTAAG

Statistics
Matches: 150, Mismatches: 12, Indels: 8
         0.88             0.07       0.05

Matches are distributed among these distances:
104   50  0.33
105   20  0.13
106   14  0.09
107   22  0.15
108   44  0.29

ACGTcount: A:0.55, C:0.02, G:0.20, T:0.23

Consensus pattern (103 bp):
AAGAGTAAAAATGGTATTAAGTATTAAGAGTAAAAATGGTATTAAGTAAAGAGAAAAAAAAATGT
ATCATGGTAATAAGAGTAAAAAACGGTATTAAGCAAAA


Found at i:3183 original size:27 final size:26

Alignment explanation

Indices: 3113--3270 Score: 74
Period size: 25 Copynumber: 6.1 Consensus size: 26

 3103 AAAAAAGAGT

                  *     * *    *
 3113 AAAAAATGGTATTAAGTATTGAG-AG
    1 AAAAAATGGTATGAAGTAATAAGAAA

      *           *
 3138 TAAAAATGGTATTAAGTAA-AGAGAAA
    1 AAAAAATGGTATGAAGTAATA-AGAAA

                     *         **
 3164 AAAAAATGGTATGATGGTAATAAGAGT
    1 AAAAAATGGTATGA-AGTAATAAGAAA

                  *   *
 3191 AAAAAATGGTATTAAGCAA-AA-AAGA
    1 AAAAAATGGTATGAAGTAATAAGAA-A

       *           *     *      *
 3216 GTAAAAATGGTATTAAGTATTAAG-AG
    1 -AAAAAATGGTATGAAGTAATAAGAAA

      *           *      *   *
 3242 TAAAAATGGTATTAAGTAAAAAGTAA
    1 AAAAAATGGTATGAAGTAATAAGAAA

 3268 AAA
    1 AAA

 3271 TGGTATTAAG

Statistics
Matches: 102, Mismatches: 22, Indels: 17
         0.72             0.16       0.12

Matches are distributed among these distances:
 24    1  0.01
 25   42  0.41
 26   35  0.34
 27   23  0.23
 28    1  0.01

ACGTcount: A:0.54, C:0.01, G:0.20, T:0.25

Consensus pattern (26 bp):
AAAAAATGGTATGAAGTAATAAGAAA


Found at i:3255 original size:18 final size:18

Alignment explanation

Indices: 3232--3288 Score: 69
Period size: 23 Copynumber: 2.9 Consensus size: 18

 3222 ATGGTATTAA

 3232 GTATTAAGAGTAAAAATG
    1 GTATTAAGAGTAAAAATG

 3250 GTATTAAGTAAAAAGTAAAAATG
    1 GTATTAAG-----AGTAAAAATG

 3273 GTATTAAGAGTAAAAA
    1 GTATTAAGAGTAAAAA

 3289 AAAATGGTGG

Statistics
Matches: 34, Mismatches: 0, Indels: 10
         0.77             0.00       0.23

Matches are distributed among these distances:
 18   16  0.47
 23   18  0.53

ACGTcount: A:0.54, C:0.00, G:0.19, T:0.26

Consensus pattern (18 bp):
GTATTAAGAGTAAAAATG


Found at i:3292 original size:23 final size:21

Alignment explanation

Indices: 3218--3296 Score: 79
Period size: 25 Copynumber: 3.4 Consensus size: 21

 3208 AAAAAAGAGT

 3218 AAAAATGGTATTAAGTATTAAGAG
    1 AAAAATGGTATTAAGTA--AA-AG

 3242 TAAAAATGGTATTAAGTAAAAAG
    1 -AAAAATGGTATTAAGT-AAAAG

 3265 TAAAAATGGTATTAAGAGTAAAA-
    1 -AAAAATGGTATT-A-AGTAAAAG

 3288 AAAAATGGT
    1 AAAAATGGT

 3297 GGAAAATGTT

Statistics
Matches: 51, Mismatches: 0, Indels: 9
         0.85             0.00       0.15

Matches are distributed among these distances:
 22    9  0.18
 23   15  0.29
 24    7  0.14
 25   19  0.37
 26    1  0.02

ACGTcount: A:0.54, C:0.00, G:0.19, T:0.27

Consensus pattern (21 bp):
AAAAATGGTATTAAGTAAAAG


Found at i:9021 original size:13 final size:14

Alignment explanation

Indices: 8998--9026 Score: 51
Period size: 13 Copynumber: 2.1 Consensus size: 14

 8988 ACTTTTACTT

 8998 AATGCATGAATGCA
    1 AATGCATGAATGCA

 9012 AATG-ATGAATGCA
    1 AATGCATGAATGCA

 9025 AA
    1 AA

 9027 GTCCGGTTAT

Statistics
Matches: 15, Mismatches: 0, Indels: 1
         0.94             0.00       0.06

Matches are distributed among these distances:
 13   11  0.73
 14    4  0.27

ACGTcount: A:0.48, C:0.10, G:0.21, T:0.21

Consensus pattern (14 bp):
AATGCATGAATGCA


Found at i:13229 original size:12 final size:12

Alignment explanation

Indices: 13212--13285 Score: 54
Period size: 12 Copynumber: 6.7 Consensus size: 12

13202 AGAAACCGAT

              *
13212 TATATAATTTTA
    1 TATATAATATTA

13224 TATATAATATTA
    1 TATATAATATTA

13236 TATAT-ATA-TA
    1 TATATAATATTA

       *    *  *
13246 TGT-TATTCGTT-
    1 TATATAAT-ATTA

13257 TATAT-ATA-TA
    1 TATATAATATTA

13267 TATAT-ATATTA
    1 TATATAATATTA

13278 TATATAAT
    1 TATATAAT

13286 CTATATAATA

Statistics
Matches: 48, Mismatches: 7, Indels: 14
         0.70             0.10       0.20

Matches are distributed among these distances:
  9    2  0.04
 10   13  0.27
 11   13  0.27
 12   20  0.42

ACGTcount: A:0.42, C:0.01, G:0.03, T:0.54

Consensus pattern (12 bp):
TATATAATATTA


Found at i:13311 original size:2 final size:2

Alignment explanation

Indices: 13222--13292 Score: 55
Period size: 2 Copynumber: 38.0 Consensus size: 2

13212 TATATAATTT

                                              *            *  *
13222 TA TA TA TA -A TA T- TA TA TA TA TA TA TG T- TA T- TCG TT TA TA
    1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T-A TA TA TA

                                              *
13261 TA TA TA TA TA TA TA T- TA TA TA TA -A TC TA TA TA
    1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA

13293 ATAAGTATTT

Statistics
Matches: 57, Mismatches: 5, Indels: 14
         0.75             0.07       0.18

Matches are distributed among these distances:
  1    6  0.11
  2   50  0.88
  3    1  0.02

ACGTcount: A:0.42, C:0.03, G:0.03, T:0.52

Consensus pattern (2 bp):
TA


Found at i:13312 original size:11 final size:12

Alignment explanation

Indices: 13211--13312 Score: 59
Period size: 13 Copynumber: 8.1 Consensus size: 12

13201 GAGAAACCGA

13211 TTATATA-AT-T
    1 TTATATATATAT

13221 TTATATATAATAT
    1 TTATATAT-ATAT

13234 TATATATATATAT
    1 T-TATATATATAT

             * *
13247 GTTATTCGTTTATAT
    1 -TTA-T-ATATATAT

      *
13262 ATATATATATAT
    1 TTATATATATAT

13274 ATTATATATA-AT
    1 -TTATATATATAT

      *
13286 CTATATAATAAGTAT
    1 TTATAT-AT-A-TAT

13301 TTA-ATATATAT
    1 TTATATATATAT

13312 T
    1 T

13313 AATTTAACCT

Statistics
Matches: 72, Mismatches: 8, Indels: 23
         0.70             0.08       0.22

Matches are distributed among these distances:
 10    7  0.10
 11    9  0.12
 12   13  0.18
 13   20  0.28
 14   13  0.18
 15   10  0.14

ACGTcount: A:0.42, C:0.02, G:0.03, T:0.53

Consensus pattern (12 bp):
TTATATATATAT


Found at i:13774 original size:46 final size:46

Alignment explanation

Indices: 13718--13806 Score: 160
Period size: 46 Copynumber: 1.9 Consensus size: 46

13708 TGGCTCTGTT

           *
13718 ATTTATTTGCAGATCTGGGTTTTGTTTATTTTTCAAGGATAGTTTG
    1 ATTTACTTGCAGATCTGGGTTTTGTTTATTTTTCAAGGATAGTTTG

                                       *
13764 ATTTACTTGCAGATCTGGGTTTTGTTTATTTTTTAAGGATAGT
    1 ATTTACTTGCAGATCTGGGTTTTGTTTATTTTTCAAGGATAGT

13807 GATCGGTGTT

Statistics
Matches: 41, Mismatches: 2, Indels: 0
         0.95             0.05       0.00

Matches are distributed among these distances:
 46   41  1.00

ACGTcount: A:0.20, C:0.07, G:0.21, T:0.52

Consensus pattern (46 bp):
ATTTACTTGCAGATCTGGGTTTTGTTTATTTTTCAAGGATAGTTTG


Found at i:16476 original size:28 final size:27

Alignment explanation

Indices: 16437--16527 Score: 128
Period size: 27 Copynumber: 3.3 Consensus size: 27

16427 AGGGTCACCT

           *     *
16437 AGGGGTATTTTGGTCATTTTTACATTC
    1 AGGGGCATTTTTGTCATTTTTACATTC

16464 AGGGGCATTTTTGTCATTTTTACATTC
    1 AGGGGCATTTTTGTCATTTTTACATTC

                        *  *    *
16491 AGGGGCATTTTTGTCATTCTTGCATTT
    1 AGGGGCATTTTTGTCATTTTTACATTC

16518 AGGGGGCATT
    1 A-GGGGCATT

16528 CAGGTCATTT

Statistics
Matches: 58, Mismatches: 5, Indels: 1
         0.91             0.08       0.02

Matches are distributed among these distances:
 27   50  0.86
 28    8  0.14

ACGTcount: A:0.18, C:0.13, G:0.24, T:0.45

Consensus pattern (27 bp):
AGGGGCATTTTTGTCATTTTTACATTC


Found at i:21996 original size:18 final size:18

Alignment explanation

Indices: 21973--22008 Score: 63
Period size: 18 Copynumber: 2.0 Consensus size: 18

21963 CAAGGATTGG

                   *
21973 AAGGAAGCATGGATAAGC
    1 AAGGAAGCATGGACAAGC

21991 AAGGAAGCATGGACAAGC
    1 AAGGAAGCATGGACAAGC

22009 TTAAAGGAGA

Statistics
Matches: 17, Mismatches: 1, Indels: 0
         0.94             0.06       0.00

Matches are distributed among these distances:
 18   17  1.00

ACGTcount: A:0.44, C:0.14, G:0.33, T:0.08

Consensus pattern (18 bp):
AAGGAAGCATGGACAAGC

Done.