Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01015074.1 Corchorus olitorius cultivar O-4 contig15107, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 39473
ACGTcount: A:0.33, C:0.17, G:0.17, T:0.34


Found at i:1001 original size:16 final size:16

Alignment explanation

Indices: 965--1055  Score: 80
Period size: 16  Copynumber: 5.8  Consensus size: 16

       955 TTCGGGCGGG

                   *      *
       965 TTCGGGTTTGGGTA-C
         1 TTCGGGTTCGGGTATT

       980 TTCGGGTTCGGGTATT
         1 TTCGGGTTCGGGTATT

                 *         *
       996 TTCGGGCTCGGGT-TAA
         1 TTCGGGTTCGGGTAT-T

           *
      1012 GTCGGGTTCGGGTATT
         1 TTCGGGTTCGGGTATT

                 *
      1028 TTCGGGCTCGGGT-TAT
         1 TTCGGGTTCGGGTAT-T

           *
      1044 GTCGGGTTCGGG
         1 TTCGGGTTCGGG

      1056 CTCTGGTAGG


Statistics
Matches: 61,  Mismatches: 11, Indels: 7
        0.77            0.14        0.09

Matches are distributed among these distances:
   15       15  0.25
   16       45  0.74
   17        1  0.02

ACGTcount: A:0.07, C:0.15, G:0.42, T:0.36

Consensus pattern (16 bp):
TTCGGGTTCGGGTATT


Found at i:1022 original size:32 final size:32

Alignment explanation

Indices: 981--1055  Score: 141
Period size: 32  Copynumber: 2.3  Consensus size: 32

       971 TTTGGGTACT

       981 TCGGGTTCGGGTATTTTCGGGCTCGGGTTAAG
         1 TCGGGTTCGGGTATTTTCGGGCTCGGGTTAAG

                                         *
      1013 TCGGGTTCGGGTATTTTCGGGCTCGGGTTATG
         1 TCGGGTTCGGGTATTTTCGGGCTCGGGTTAAG

      1045 TCGGGTTCGGG
         1 TCGGGTTCGGG

      1056 CTCTGGTAGG


Statistics
Matches: 42,  Mismatches: 1, Indels: 0
        0.98            0.02        0.00

Matches are distributed among these distances:
   32       42  1.00

ACGTcount: A:0.07, C:0.16, G:0.43, T:0.35

Consensus pattern (32 bp):
TCGGGTTCGGGTATTTTCGGGCTCGGGTTAAG


Found at i:1053 original size:22 final size:22

Alignment explanation

Indices: 1028--1107  Score: 76
Period size: 22  Copynumber: 3.6  Consensus size: 22

      1018 TTCGGGTATT

                          *
      1028 TTCGGGCTCGGGTTATGTCGGG
         1 TTCGGGCTCGGGTTAGGTCGGG

                    *
      1050 TTCGGGCTCTGG-TAGGGTTCCGGG
         1 TTCGGGCTCGGGTTA-GG-T-CGGG

                        **
      1074 -TCGGG-TCGGGTCGGGTCGGG
         1 TTCGGGCTCGGGTTAGGTCGGG

      1094 TTCGGGCTCGGGTT
         1 TTCGGGCTCGGGTT

      1108 TGATTTTGAT


Statistics
Matches: 46,  Mismatches: 6, Indels: 12
        0.72            0.09        0.19

Matches are distributed among these distances:
   20        4  0.09
   21        8  0.17
   22       24  0.52
   23        6  0.13
   24        4  0.09

ACGTcount: A:0.03, C:0.20, G:0.49, T:0.29

Consensus pattern (22 bp):
TTCGGGCTCGGGTTAGGTCGGG


Found at i:1104 original size:6 final size:5

Alignment explanation

Indices: 1070--1106  Score: 56
Period size: 5  Copynumber: 7.0  Consensus size: 5

      1060 GGTAGGGTTC

      1070 CGGGT CGGGT CGGGT CGGGT CGGGTT CGGGCT CGGGT
         1 CGGGT CGGGT CGGGT CGGGT CGGG-T CGGG-T CGGGT

      1107 TTGATTTTGA


Statistics
Matches: 30,  Mismatches: 1, Indels: 2
        0.91            0.03        0.06

Matches are distributed among these distances:
    5       20  0.67
    6       10  0.33

ACGTcount: A:0.00, C:0.22, G:0.57, T:0.22

Consensus pattern (5 bp):
CGGGT


Found at i:1255 original size:13 final size:12

Alignment explanation

Indices: 1232--1278  Score: 51
Period size: 13  Copynumber: 3.8  Consensus size: 12

      1222 AAGTTTATTG

      1232 ATAATATATAAT
         1 ATAATATATAAT

      1244 ATAATAATATAAT
         1 ATAAT-ATATAAT

               *     *
      1257 ATAACAT-TATT
         1 ATAATATATAAT

      1268 ATCAATATATA
         1 AT-AATATATA

      1279 TAAAGATTGA


Statistics
Matches: 29,  Mismatches: 3, Indels: 5
        0.78            0.08        0.14

Matches are distributed among these distances:
   11        5  0.17
   12       11  0.38
   13       13  0.45

ACGTcount: A:0.55, C:0.04, G:0.00, T:0.40

Consensus pattern (12 bp):
ATAATATATAAT


Found at i:1549 original size:21 final size:21

Alignment explanation

Indices: 1523--1566  Score: 88
Period size: 21  Copynumber: 2.1  Consensus size: 21

      1513 ATCAATTAAA

      1523 TATAAAATACATATACTTTAT
         1 TATAAAATACATATACTTTAT

      1544 TATAAAATACATATACTTTAT
         1 TATAAAATACATATACTTTAT

      1565 TA
         1 TA

      1567 ATAATTAATG


Statistics
Matches: 23,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   21       23  1.00

ACGTcount: A:0.48, C:0.09, G:0.00, T:0.43

Consensus pattern (21 bp):
TATAAAATACATATACTTTAT


Found at i:1623 original size:31 final size:31

Alignment explanation

Indices: 1588--1659  Score: 78
Period size: 31  Copynumber: 2.3  Consensus size: 31

      1578 TAAATTATTG

                           *
      1588 CAAATTAAAACAAAT-TAAG-CATTAAATTAAA
         1 CAAATTAAAA-AAATGAAAGTC-TTAAATTAAA

                    *
      1619 CAAA-TAATTAAAATGAAAGTCTTAAATTAAA
         1 CAAATTAA-AAAAATGAAAGTCTTAAATTAAA

      1650 CAAATTAAAA
         1 CAAATTAAAA

      1660 GCTGATAGAC


Statistics
Matches: 34,  Mismatches: 3, Indels: 8
        0.76            0.07        0.18

Matches are distributed among these distances:
   30        7  0.21
   31       23  0.68
   32        4  0.12

ACGTcount: A:0.61, C:0.08, G:0.04, T:0.26

Consensus pattern (31 bp):
CAAATTAAAAAAATGAAAGTCTTAAATTAAA


Found at i:1906 original size:2 final size:2

Alignment explanation

Indices: 1901--1929  Score: 58
Period size: 2  Copynumber: 14.5  Consensus size: 2

      1891 TATATAAGTT

      1901 TA TA TA TA TA TA TA TA TA TA TA TA TA TA T
         1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA T

      1930 TTAGTAGTTT


Statistics
Matches: 27,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
    2       27  1.00

ACGTcount: A:0.48, C:0.00, G:0.00, T:0.52

Consensus pattern (2 bp):
TA


Found at i:2038 original size:26 final size:22

Alignment explanation

Indices: 1981--2038  Score: 53
Period size: 26  Copynumber: 2.4  Consensus size: 22

      1971 AATTGTGATC

               *
      1981 ATTATTATATAAATTTTATTAT
         1 ATTAATATATAAATTTTATTAT

      2003 ATTCATATATATAATAGTTTTATTTAAT
         1 ATT-A-ATATATAA-A-TTTTA-TT-AT

      2031 ATTAATAT
         1 ATTAATAT

      2039 GTTTTTTTTC


Statistics
Matches: 29,  Mismatches: 1, Indels: 8
        0.76            0.03        0.21

Matches are distributed among these distances:
   22        3  0.10
   23        1  0.03
   24        7  0.24
   25        1  0.03
   26        9  0.31
   27        3  0.10
   28        5  0.17

ACGTcount: A:0.41, C:0.02, G:0.02, T:0.55

Consensus pattern (22 bp):
ATTAATATATAAATTTTATTAT


Found at i:2150 original size:22 final size:21

Alignment explanation

Indices: 2105--2152  Score: 60
Period size: 22  Copynumber: 2.2  Consensus size: 21

      2095 TATTTCGGGC

                              *
      2105 TCGGGTCGGGTTCGGGTAATT
         1 TCGGGTCGGGTTCGGGTAAGT

                             **
      2126 TCGGGTTCGGGTTCGGGTGGGT
         1 TCGGG-TCGGGTTCGGGTAAGT

      2148 TCGGG
         1 TCGGG

      2153 ACGTTGACTT


Statistics
Matches: 23,  Mismatches: 3, Indels: 1
        0.85            0.11        0.04

Matches are distributed among these distances:
   21        5  0.22
   22       18  0.78

ACGTcount: A:0.04, C:0.15, G:0.50, T:0.31

Consensus pattern (21 bp):
TCGGGTCGGGTTCGGGTAAGT


Found at i:8104 original size:20 final size:20

Alignment explanation

Indices: 8079--8116  Score: 67
Period size: 20  Copynumber: 1.9  Consensus size: 20

      8069 ATATCATGAG

                 *
      8079 TTGAATTATCACCAACTTTT
         1 TTGAATCATCACCAACTTTT

      8099 TTGAATCATCACCAACTT
         1 TTGAATCATCACCAACTT

      8117 AATAAGTATA


Statistics
Matches: 17,  Mismatches: 1, Indels: 0
        0.94            0.06        0.00

Matches are distributed among these distances:
   20       17  1.00

ACGTcount: A:0.32, C:0.24, G:0.05, T:0.39

Consensus pattern (20 bp):
TTGAATCATCACCAACTTTT


Found at i:9380 original size:10 final size:10

Alignment explanation

Indices: 9365--9391  Score: 54
Period size: 10  Copynumber: 2.7  Consensus size: 10

      9355 TAAATTAGTT

      9365 TATGTATGTA
         1 TATGTATGTA

      9375 TATGTATGTA
         1 TATGTATGTA

      9385 TATGTAT
         1 TATGTAT

      9392 AACTCTGAAT


Statistics
Matches: 17,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   10       17  1.00

ACGTcount: A:0.30, C:0.00, G:0.19, T:0.52

Consensus pattern (10 bp):
TATGTATGTA


Found at i:9830 original size:2 final size:2

Alignment explanation

Indices: 9823--9849  Score: 54
Period size: 2  Copynumber: 13.5  Consensus size: 2

      9813 GTATTGAAGA

      9823 AT AT AT AT AT AT AT AT AT AT AT AT AT A
         1 AT AT AT AT AT AT AT AT AT AT AT AT AT A

      9850 AGCTGTTGAA


Statistics
Matches: 25,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
    2       25  1.00

ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48

Consensus pattern (2 bp):
AT


Found at i:14301 original size:17 final size:17

Alignment explanation

Indices: 14279--14313  Score: 70
Period size: 17  Copynumber: 2.1  Consensus size: 17

     14269 AAGAAGATGC

     14279 CAATTCAACGTGAATGA
         1 CAATTCAACGTGAATGA

     14296 CAATTCAACGTGAATGA
         1 CAATTCAACGTGAATGA

     14313 C
         1 C

     14314 TTTATTTAAT


Statistics
Matches: 18,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   17       18  1.00

ACGTcount: A:0.40, C:0.20, G:0.17, T:0.23

Consensus pattern (17 bp):
CAATTCAACGTGAATGA


Found at i:15066 original size:6 final size:6

Alignment explanation

Indices: 15055--15081  Score: 54
Period size: 6  Copynumber: 4.5  Consensus size: 6

     15045 CTATTTCAAG

     15055 AAAGAA AAAGAA AAAGAA AAAGAA AAA
         1 AAAGAA AAAGAA AAAGAA AAAGAA AAA

     15082 TCTCTTTAAG


Statistics
Matches: 21,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
    6       21  1.00

ACGTcount: A:0.85, C:0.00, G:0.15, T:0.00

Consensus pattern (6 bp):
AAAGAA


Found at i:18122 original size:29 final size:29

Alignment explanation

Indices: 18076--18148  Score: 98
Period size: 29  Copynumber: 2.6  Consensus size: 29

     18066 TCATATCATT

     18076 TTGCAAAATGATT-ATTTTTT-TTAGAA-C
         1 TTGCAAAATGATTAATTTTTTGTT-GAAGC

                                       *
     18103 TTGCAAAATGATTAATTTTTTGTTGAAGG
         1 TTGCAAAATGATTAATTTTTTGTTGAAGC

           *
     18132 ATGCAAAATGATTAATT
         1 TTGCAAAATGATTAATT

     18149 AATTGCAATG


Statistics
Matches: 41,  Mismatches: 2, Indels: 4
        0.87            0.04        0.09

Matches are distributed among these distances:
   27       13  0.32
   28       10  0.24
   29       18  0.44

ACGTcount: A:0.36, C:0.05, G:0.15, T:0.44

Consensus pattern (29 bp):
TTGCAAAATGATTAATTTTTTGTTGAAGC


Found at i:23279 original size:34 final size:32

Alignment explanation

Indices: 23202--23329  Score: 130
Period size: 32  Copynumber: 3.9  Consensus size: 32

     23192 TTTTGAAAGG

                        *    **
     23202 TAAAATCATGACAACTTCTGGTGTCAATTGAA
         1 TAAAATCATGACATCTTCAAGTGTCAATTGAA

                 *                  *
     23234 TAAAATTATGACATCTTCAAGTGTCTATTGGAAA
         1 TAAAATCATGACATCTTCAAGTGTCAATT-G-AA

            **          *    **
     23268 TTTAATCATGACAACTTCTGGTGTCAATTGAA
         1 TAAAATCATGACATCTTCAAGTGTCAATTGAA

                 *               *
     23300 TAAAATTATGACATCTTCAAGTATCAATTG
         1 TAAAATCATGACATCTTCAAGTGTCAATTG

     23330 CAAGATCATG


Statistics
Matches: 75,  Mismatches: 19, Indels: 4
        0.77            0.19        0.04

Matches are distributed among these distances:
   32       49  0.65
   33        2  0.03
   34       24  0.32

ACGTcount: A:0.37, C:0.14, G:0.14, T:0.35

Consensus pattern (32 bp):
TAAAATCATGACATCTTCAAGTGTCAATTGAA


Found at i:23282 original size:66 final size:62

Alignment explanation

Indices: 23203--23417  Score: 245
Period size: 66  Copynumber: 3.4  Consensus size: 62

     23193 TTTGAAAGGT

                                                                *  *
     23203 AAAATCATGACAACTTCTGGTGTCAATTGAATAAAATTATGACATCTTCAAGTGTCTATTGGA
         1 AAAATCATGACAACTTCTGGTGTCAATTGAATAAAATTATGACATCTTCAAGTATCAATT-GA

                                                                           *
     23266 AATTTAATCATGACAACTTCTGGTGTCAATTGAATAAAATTATGACATCTTCAAGTATCAATTGC
         1 AA---AATCATGACAACTTCTGGTGTCAATTGAATAAAATTATGACATCTTCAAGTATCAATTGA

             *             *              *  *  *      *   ***  *       *
     23331 AAGATCATGACAACTTTTGGTGTCAATTG--CAACATCATGACAACTTTTGGTGTCAATTGC
         1 AAAATCATGACAACTTCTGGTGTCAATTGAATAAAATTATGACATCTTCAAGTATCAATTGA

                       *
     23391 AAAATCATGACAGCTTCTGGTGTCAAT
         1 AAAATCATGACAACTTCTGGTGTCAAT

     23418 AGCAAGACCA


Statistics
Matches: 133,  Mismatches: 16, Indels: 9
        0.84            0.10        0.06

Matches are distributed among these distances:
   60       47  0.35
   62       25  0.19
   63        2  0.02
   65        3  0.02
   66       56  0.42

ACGTcount: A:0.34, C:0.16, G:0.16, T:0.33

Consensus pattern (62 bp):
AAAATCATGACAACTTCTGGTGTCAATTGAATAAAATTATGACATCTTCAAGTATCAATTGA


Found at i:23357 original size:30 final size:30

Alignment explanation

Indices: 23274--23655  Score: 487
Period size: 30  Copynumber: 12.7  Consensus size: 30

     23264 GAAATTTAAT

                                     *  * *
     23274 CATGACAACTTCTGGTGTCAATTGAATAAAAT
         1 CATGACAACTTCTGGTGTCAATTG--CAAGAC

           *      *    **  *            *
     23306 TATGACATCTTCAAGTATCAATTGCAAGAT
         1 CATGACAACTTCTGGTGTCAATTGCAAGAC

                      *               * *
     23336 CATGACAACTTTTGGTGTCAATTGCAACAT
         1 CATGACAACTTCTGGTGTCAATTGCAAGAC

                      *               * *
     23366 CATGACAACTTTTGGTGTCAATTGCAAAAT
         1 CATGACAACTTCTGGTGTCAATTGCAAGAC

                  *              *
     23396 CATGACAGCTTCTGGTGTCAATAGCAAGAC
         1 CATGACAACTTCTGGTGTCAATTGCAAGAC

                                       *
     23426 CATGACAACTTCTGGTGTCAATTGCAAGGC
         1 CATGACAACTTCTGGTGTCAATTGCAAGAC

                               *
     23456 CATGACAACTTCTGGTGTCATTTGCAAGAC
         1 CATGACAACTTCTGGTGTCAATTGCAAGAC

                               *
     23486 CATGACAACTTCTGGTGTCATTTGCAAGAC
         1 CATGACAACTTCTGGTGTCAATTGCAAGAC

     23516 CATGACAACTTCTGGTGTCAATTGCAAGAC
         1 CATGACAACTTCTGGTGTCAATTGCAAGAC

                      *
     23546 CATGACAACTTATGGTGTCAATTGCAAGAC
         1 CATGACAACTTCTGGTGTCAATTGCAAGAC

               *                    *    *
     23576 CATGGCAACTTCTGGTGTC-ATCTGTAAGAT
         1 CATGACAACTTCTGGTGTCAAT-TGCAAGAC

                              *    *  *
     23606 CATGACAACTTCTGGTGTCGATTGTAAAAC
         1 CATGACAACTTCTGGTGTCAATTGCAAGAC

     23636 CATGACAACTTCTGGTGTCA
         1 CATGACAACTTCTGGTGTCA

     23656 TTTAGAGAGT


Statistics
Matches: 313,  Mismatches: 35, Indels: 6
        0.88            0.10        0.02

Matches are distributed among these distances:
   29        2  0.01
   30      290  0.93
   31        2  0.01
   32       19  0.06

ACGTcount: A:0.30, C:0.21, G:0.19, T:0.30

Consensus pattern (30 bp):
CATGACAACTTCTGGTGTCAATTGCAAGAC


Found at i:25027 original size:18 final size:18

Alignment explanation

Indices: 24992--25029  Score: 53
Period size: 16  Copynumber: 2.2  Consensus size: 18

     24982 ATTAATTAAT

     24992 TATATAAATATAATTATA
         1 TATATAAATATAATTATA

                         *
     25010 TATATAAA-AT-ATGATA
         1 TATATAAATATAATTATA

     25026 TATA
         1 TATA

     25030 CTACCTATAA


Statistics
Matches: 19,  Mismatches: 1, Indels: 2
        0.86            0.05        0.09

Matches are distributed among these distances:
   16        9  0.47
   17        2  0.11
   18        8  0.42

ACGTcount: A:0.55, C:0.00, G:0.03, T:0.42

Consensus pattern (18 bp):
TATATAAATATAATTATA


Found at i:26796 original size:16 final size:17

Alignment explanation

Indices: 26768--26799  Score: 57
Period size: 16  Copynumber: 1.9  Consensus size: 17

     26758 TTTAGGCTAA

     26768 AAAGGATAAAAGAAATG
         1 AAAGGATAAAAGAAATG

     26785 AAAGGA-AAAAGAAAT
         1 AAAGGATAAAAGAAAT

     26800 AAAATTAATG


Statistics
Matches: 15,  Mismatches: 0, Indels: 1
        0.94            0.00        0.06

Matches are distributed among these distances:
   16        9  0.60
   17        6  0.40

ACGTcount: A:0.69, C:0.00, G:0.22, T:0.09

Consensus pattern (17 bp):
AAAGGATAAAAGAAATG


Found at i:29583 original size:31 final size:31

Alignment explanation

Indices: 29545--29607  Score: 117
Period size: 31  Copynumber: 2.0  Consensus size: 31

     29535 TCTAACATAA

     29545 TTAAATTGCTGGAAAAAAACATAATTTCTTT
         1 TTAAATTGCTGGAAAAAAACATAATTTCTTT

                   *
     29576 TTAAATTGTTGGAAAAAAACATAATTTCTTT
         1 TTAAATTGCTGGAAAAAAACATAATTTCTTT

     29607 T
         1 T

     29608 GAAAGAATAC


Statistics
Matches: 31,  Mismatches: 1, Indels: 0
        0.97            0.03        0.00

Matches are distributed among these distances:
   31       31  1.00

ACGTcount: A:0.41, C:0.08, G:0.10, T:0.41

Consensus pattern (31 bp):
TTAAATTGCTGGAAAAAAACATAATTTCTTT


Found at i:35894 original size:3 final size:3

Alignment explanation

Indices: 35886--35991  Score: 212
Period size: 3  Copynumber: 35.3  Consensus size: 3

     35876 ATATTCCATC

     35886 ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT
         1 ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT

     35934 ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT
         1 ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT ATT

     35982 ATT ATT ATT A
         1 ATT ATT ATT A

     35992 CCTCATTCTG


Statistics
Matches: 103,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
    3      103  1.00

ACGTcount: A:0.34, C:0.00, G:0.00, T:0.66

Consensus pattern (3 bp):
ATT


Found at i:37143 original size:15 final size:15

Alignment explanation

Indices: 37123--37153  Score: 53
Period size: 15  Copynumber: 2.1  Consensus size: 15

     37113 GTTAAACAAA

                 *
     37123 GCCCAATGAGGAAAT
         1 GCCCAAGGAGGAAAT

     37138 GCCCAAGGAGGAAAT
         1 GCCCAAGGAGGAAAT

     37153 G
         1 G

     37154 GGAAAATTAC


Statistics
Matches: 15,  Mismatches: 1, Indels: 0
        0.94            0.06        0.00

Matches are distributed among these distances:
   15       15  1.00

ACGTcount: A:0.39, C:0.19, G:0.32, T:0.10

Consensus pattern (15 bp):
GCCCAAGGAGGAAAT


Found at i:37904 original size:38 final size:38

Alignment explanation

Indices: 37862--37935  Score: 139
Period size: 38  Copynumber: 1.9  Consensus size: 38

     37852 GTTACAAAAT

     37862 CATTAATATTAATTTGAAGGTTATATACATAATATTAC
         1 CATTAATATTAATTTGAAGGTTATATACATAATATTAC

                           *
     37900 CATTAATATTAATTTGGAGGTTATATACATAATATT
         1 CATTAATATTAATTTGAAGGTTATATACATAATATT

     37936 TGGGAGGTTA


Statistics
Matches: 35,  Mismatches: 1, Indels: 0
        0.97            0.03        0.00

Matches are distributed among these distances:
   38       35  1.00

ACGTcount: A:0.41, C:0.07, G:0.09, T:0.43

Consensus pattern (38 bp):
CATTAATATTAATTTGAAGGTTATATACATAATATTAC


Found at i:38641 original size:19 final size:18

Alignment explanation

Indices: 38617--38652  Score: 54
Period size: 19  Copynumber: 1.9  Consensus size: 18

     38607 TGAAGATTTA

     38617 TTGAAGACAATTTGAAGAT
         1 TTGAAGACAA-TTGAAGAT

                   *
     38636 TTGAAGACCATTGAAGA
         1 TTGAAGACAATTGAAGA

     38653 ATAATTTCAA


Statistics
Matches: 16,  Mismatches: 1, Indels: 1
        0.89            0.06        0.06

Matches are distributed among these distances:
   18        7  0.44
   19        9  0.56

ACGTcount: A:0.42, C:0.08, G:0.22, T:0.28

Consensus pattern (18 bp):
TTGAAGACAATTGAAGAT


Done.