Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01012992.1 Corchorus capsularis cultivar CVL-1 contig13013, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 35163
ACGTcount: A:0.34, C:0.18, G:0.17, T:0.31


Found at i:7 original size:2 final size:2

Alignment explanation

Indices: 1--75  Score: 150
Period size: 2  Copynumber: 37.5  Consensus size: 2

         1 CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT
         1 CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT

        43 CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT C
         1 CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT C

        76 ATATCTAGCA

Statistics
Matches: 73,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  2  73  1.00

ACGTcount: A:0.00, C:0.51, G:0.00, T:0.49

Consensus pattern (2 bp):
CT


Found at i:3572 original size:63 final size:63

Alignment explanation

Indices: 3494--3620  Score: 245
Period size: 63  Copynumber: 2.0  Consensus size: 63

      3484 AGTATAATAA

           *
      3494 TCCTAAGACAGAAGAAATTTATCACCCTCCAGCTAACAGGTAACTAGTATTGTATGGTAAGAC
         1 TCCTAAGACAGAAGAAATTTATCACCCTCCAGCTAACAGATAACTAGTATTGTATGGTAAGAC

      3557 TCCTAAGACAGAAGAAATTTATCACCCTCCAGCTAACAGATAACTAGTATTGTATGGTAAGAC
         1 TCCTAAGACAGAAGAAATTTATCACCCTCCAGCTAACAGATAACTAGTATTGTATGGTAAGAC

      3620 T
         1 T

      3621 TCTTACTTCA

Statistics
Matches: 63,  Mismatches: 1, Indels: 0
        0.98            0.02        0.00

Matches are distributed among these distances:
  63  63  1.00

ACGTcount: A:0.37, C:0.20, G:0.17, T:0.26

Consensus pattern (63 bp):
TCCTAAGACAGAAGAAATTTATCACCCTCCAGCTAACAGATAACTAGTATTGTATGGTAAGAC


Found at i:5197 original size:1 final size:1

Alignment explanation

Indices: 5191--5217  Score: 54
Period size: 1  Copynumber: 27.0  Consensus size: 1

      5181 CTAAGCTTTA

      5191 TTTTTTTTTTTTTTTTTTTTTTTTTTT
         1 TTTTTTTTTTTTTTTTTTTTTTTTTTT

      5218 GCCTACATAT

Statistics
Matches: 26,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  1  26  1.00

ACGTcount: A:0.00, C:0.00, G:0.00, T:1.00

Consensus pattern (1 bp):
T


Found at i:5756 original size:133 final size:133

Alignment explanation

Indices: 5517--5785  Score: 502
Period size: 133  Copynumber: 2.0  Consensus size: 133

      5507 ATATAAAAAA

      5517 ATAGTATAATTAATTCAAGTGCATAATTTCTACATCAAATGAAAAATATTACAATGATTCTCTAA
         1 ATAGTATAATTAATTCAAGTGCATAATTTCTACATCAAATGAAAAATATTACAATGATTCTCTAA

           * * *
      5582 GAGTTTTCATGACTTGAAATCGTTGCAAAATGTTTTCACTATCAATACTATAAAACACATTTTTT
        66 GAGTTTTCATAACTTGAAATCGTTACAAAATGTTTTCACTATCAATACTACAAAACACATTTTTT

      5647 TCT
       131 TCT

      5650 ATAGTATAATTAATTCAAGTGCATAATTTCTACATCAAATGAAAAATATTACAATGATTCTCTAA
         1 ATAGTATAATTAATTCAAGTGCATAATTTCTACATCAAATGAAAAATATTACAATGATTCTCTAA

           *
      5715 GAGTTTTCATAACTTGAAATCGTTACAAAATGTTTTTACTATCAATACTACAAAACACATTTTTT
        66 GAGTTTTCATAACTTGAAATCGTTACAAAATGTTTTCACTATCAATACTACAAAACACATTTTTT

      5780 TCT
       131 TCT

      5783 ATA
         1 ATA

      5786 TATGTAACCA

Statistics
Matches: 132,  Mismatches: 4, Indels: 0
        0.97            0.03        0.00

Matches are distributed among these distances:
  133  132  1.00

ACGTcount: A:0.39, C:0.14, G:0.08, T:0.38

Consensus pattern (133 bp):
ATAGTATAATTAATTCAAGTGCATAATTTCTACATCAAATGAAAAATATTACAATGATTCTCTAA
GAGTTTTCATAACTTGAAATCGTTACAAAATGTTTTCACTATCAATACTACAAAACACATTTTTT
TCT


Found at i:6973 original size:20 final size:21

Alignment explanation

Indices: 6948--6986  Score: 71
Period size: 20  Copynumber: 1.9  Consensus size: 21

      6938 TTTAGAAGCA

      6948 ATTAATTAAAAA-CATTAAAC
         1 ATTAATTAAAAACCATTAAAC

      6968 ATTAATTAAAAACCATTAA
         1 ATTAATTAAAAACCATTAA

      6987 GGAAGGGAAA

Statistics
Matches: 18,  Mismatches: 0, Indels: 1
        0.95            0.00        0.05

Matches are distributed among these distances:
  20  12  0.67
  21  6  0.33

ACGTcount: A:0.59, C:0.10, G:0.00, T:0.31

Consensus pattern (21 bp):
ATTAATTAAAAACCATTAAAC


Found at i:7079 original size:74 final size:74

Alignment explanation

Indices: 6993--7149  Score: 280
Period size: 74  Copynumber: 2.1  Consensus size: 74

      6983 TTAAGGAAGG

           * *
      6993 GAAATGTGTAATTACGAAAAAGGGTAGAAGGAAAAGGAATGGGGGAAACTCATAGAGGGGCTTTT
         1 GAAAAGTGTAATTACGAAAAAGGGTAGAAGGAAAAGGAATAGGGGAAACTCATAGAGGGGCTTTT

      7058 TAGTCATCC-
        66 TAGTCA-CCT

      7067 GAAAAGTGTAATTACGAAAAAGGGTAGAAGGAAAAGGAATAGGGGAAACTCATAGAGGGGCTTTT
         1 GAAAAGTGTAATTACGAAAAAGGGTAGAAGGAAAAGGAATAGGGGAAACTCATAGAGGGGCTTTT

      7132 TAGTCACCT
        66 TAGTCACCT

      7141 GAAAAGTGT
         1 GAAAAGTGT

      7150 GAAAAGACCA

Statistics
Matches: 80,  Mismatches: 2, Indels: 2
        0.95            0.02        0.02

Matches are distributed among these distances:
  73  2  0.03
  74  78  0.98

ACGTcount: A:0.39, C:0.09, G:0.31, T:0.21

Consensus pattern (74 bp):
GAAAAGTGTAATTACGAAAAAGGGTAGAAGGAAAAGGAATAGGGGAAACTCATAGAGGGGCTTTT
TAGTCACCT


Found at i:10035 original size:30 final size:30

Alignment explanation

Indices: 10001--10057  Score: 89
Period size: 30  Copynumber: 1.9  Consensus size: 30

      9991 GTTCTTTTTC

           *
     10001 CTTGAAATCTTTCTTCAATG-ATCTTCATAA
         1 CTTGAAAT-TATCTTCAATGAATCTTCATAA

     10031 CTTGAAATTATCTTCAATGAATCTTCA
         1 CTTGAAATTATCTTCAATGAATCTTCA

     10058 ATCACAAACT

Statistics
Matches: 25,  Mismatches: 1, Indels: 2
        0.89            0.04        0.07

Matches are distributed among these distances:
  29  10  0.40
  30  15  0.60

ACGTcount: A:0.32, C:0.19, G:0.07, T:0.42

Consensus pattern (30 bp):
CTTGAAATTATCTTCAATGAATCTTCATAA


Found at i:12548 original size:18 final size:18

Alignment explanation

Indices: 12484--12548  Score: 51
Period size: 18  Copynumber: 3.3  Consensus size: 18

     12474 AATGGATCGA

     12484 ATGGCCGGTTGTGGCCGG
         1 ATGGCCGGTTGTGGCCGG

           *
     12502 ATGGCGCATGCGTTG-GCCCGTGCG
         1 ATGGC-C--G-GTTGTGGCC--G-G

     12526 ATGGCCGGTTGTGGCCGG
         1 ATGGCCGGTTGTGGCCGG

     12544 ATGGC
         1 ATGGC

     12549 TTGTGCGATG

Statistics
Matches: 37,  Mismatches: 2, Indels: 16
        0.67            0.04        0.29

Matches are distributed among these distances:
  18  11  0.30
  19  2  0.05
  20  4  0.11
  21  8  0.22
  22  4  0.11
  23  2  0.05
  24  6  0.16

ACGTcount: A:0.08, C:0.25, G:0.46, T:0.22

Consensus pattern (18 bp):
ATGGCCGGTTGTGGCCGG


Found at i:16349 original size:21 final size:21

Alignment explanation

Indices: 16316--16364  Score: 62
Period size: 21  Copynumber: 2.3  Consensus size: 21

     16306 ATGATGCACC

           * * * *
     16316 TGGGCACATAAGCCAAGTGCT
         1 TGGGCGCACAAGCCAAATGCA

     16337 TGGGCGCACAAGCCAAATGCA
         1 TGGGCGCACAAGCCAAATGCA

     16358 TGGGCGC
         1 TGGGCGC

     16365 CAGGAGGAGT

Statistics
Matches: 24,  Mismatches: 4, Indels: 0
        0.86            0.14        0.00

Matches are distributed among these distances:
  21  24  1.00

ACGTcount: A:0.27, C:0.27, G:0.33, T:0.14

Consensus pattern (21 bp):
TGGGCGCACAAGCCAAATGCA


Found at i:17679 original size:2 final size:2

Alignment explanation

Indices: 17672--17704  Score: 66
Period size: 2  Copynumber: 16.5  Consensus size: 2

     17662 ATCAGGATAC

     17672 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A
         1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A

     17705 CGAGTACATT

Statistics
Matches: 31,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  2  31  1.00

ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48

Consensus pattern (2 bp):
AT


Found at i:21292 original size:6 final size:6

Alignment explanation

Indices: 21276--21306  Score: 55
Period size: 6  Copynumber: 5.3  Consensus size: 6

     21266 CTAAGCAAAG

     21276 TAAAT- TAAATC TAAATC TAAATC TAAATC TA
         1 TAAATC TAAATC TAAATC TAAATC TAAATC TA

     21307 TAGCAATTAT

Statistics
Matches: 25,  Mismatches: 0, Indels: 1
        0.96            0.00        0.04

Matches are distributed among these distances:
  5  5  0.20
  6  20  0.80

ACGTcount: A:0.52, C:0.13, G:0.00, T:0.35

Consensus pattern (6 bp):
TAAATC


Found at i:25780 original size:33 final size:33

Alignment explanation

Indices: 25730--25922  Score: 205
Period size: 33  Copynumber: 5.7  Consensus size: 33

     25720 CCGCGCAACA

     25730 CCGGCCACAAGACCGGCCACGCGACATGGACATGT
         1 CCGGCCAC-A-ACCGGCCACGCGACATGGACATGT

     25765 CCGGCCATC-ACCGGCCACGCGACATGGACATGT
         1 CCGGCCA-CAACCGGCCACGCGACATGGACATGT

           * **
     25798 CCGGCTACAACCGGCCAAACGAC-TCGGCCAACATGT
         1 CCGGCCACAACCGGCCACGCGACAT-GG---ACATGT

     25834 CCGGCCATC-ACCGGCCACGCGACATGGACATGT
         1 CCGGCCA-CAACCGGCCACGCGACATGGACATGT

           * ** * *
     25867 CCGGCTACAACCGGCCAAACGAC-TCGGCCATGC
         1 CCGGCCACAACCGGCCACGCGACAT-GGACATGT

     25900 CCGGCCACAACCGGCCACGCGAC
         1 CCGGCCACAACCGGCCACGCGAC

     25923 CCTTTGTCTA

Statistics
Matches: 134,  Mismatches: 14, Indels: 22
        0.79            0.08        0.13

Matches are distributed among these distances:
  32  4  0.03
  33  94  0.70
  35  7  0.05
  36  27  0.20
  37  2  0.01

ACGTcount: A:0.24, C:0.41, G:0.26, T:0.09

Consensus pattern (33 bp):
CCGGCCACAACCGGCCACGCGACATGGACATGT


Found at i:25840 original size:69 final size:67

Alignment explanation

Indices: 25759--25922  Score: 278
Period size: 69  Copynumber: 2.4  Consensus size: 67

     25749 CGCGACATGG

     25759 ACATGTCCGGCCATCACCGGCCACGCGACATGGACATGTCCGGCTACAACCGGCCAAACGACTCG
         1 ACATGTCCGGCCATCACCGGCCACGCGACATGGACATGTCCGGCTACAACCGGCCAAACGACTCG

     25824 GCC
        66 G-C

     25827 AACATGTCCGGCCATCACCGGCCACGCGACATGGACATGTCCGGCTACAACCGGCCAAACGACTC
         1 -ACATGTCCGGCCATCACCGGCCACGCGACATGGACATGTCCGGCTACAACCGGCCAAACGACTC

     25892 GGC
        65 GGC

           *
     25895 -CATGCCCGGCCA-CAACCGGCCACGCGAC
         1 ACATGTCCGGCCATC-ACCGGCCACGCGAC

     25923 CCTTTGTCTA

Statistics
Matches: 93,  Mismatches: 1, Indels: 5
        0.94            0.01        0.05

Matches are distributed among these distances:
  65  1  0.01
  66  25  0.27
  68  1  0.01
  69  66  0.71

ACGTcount: A:0.24, C:0.41, G:0.25, T:0.10

Consensus pattern (67 bp):
ACATGTCCGGCCATCACCGGCCACGCGACATGGACATGTCCGGCTACAACCGGCCAAACGACTCG
GC


Found at i:26838 original size:8 final size:8

Alignment explanation

Indices: 26825--26858  Score: 50
Period size: 8  Copynumber: 4.1  Consensus size: 8

     26815 CACCTTCTTG

     26825 AAAAATTC
         1 AAAAATTC

     26833 AAAAATTC
         1 AAAAATTC

           *
     26841 AGAAACTTC
         1 A-AAAATTC

     26850 AAAAATTC
         1 AAAAATTC

     26858 A
         1 A

     26859 TAGCTGATTC

Statistics
Matches: 23,  Mismatches: 2, Indels: 2
        0.85            0.07        0.07

Matches are distributed among these distances:
  8  16  0.70
  9  7  0.30

ACGTcount: A:0.59, C:0.15, G:0.03, T:0.24

Consensus pattern (8 bp):
AAAAATTC


Found at i:26944 original size:10 final size:9

Alignment explanation

Indices: 26931--26965  Score: 52
Period size: 10  Copynumber: 3.7  Consensus size: 9

     26921 AGTTATATCG

     26931 AAAAATATAA
         1 AAAAATA-AA

     26941 AAAAATAAA
         1 AAAAATAAA

     26950 ATAAAATAAA
         1 A-AAAATAAA

     26960 AAAAAT
         1 AAAAAT

     26966 TTCGACCAGA

Statistics
Matches: 24,  Mismatches: 0, Indels: 3
        0.89            0.00        0.11

Matches are distributed among these distances:
  9  8  0.33
  10  16  0.67

ACGTcount: A:0.83, C:0.00, G:0.00, T:0.17

Consensus pattern (9 bp):
AAAAATAAA


Found at i:29127 original size:33 final size:33

Alignment explanation

Indices: 29032--29133  Score: 138
Period size: 33  Copynumber: 3.2  Consensus size: 33

     29022 ATCAGATTTA

           * *
     29032 TTTTCAATGC--T-ATCAACCAAAACAGGATTA
         1 TTTTCAATGCTATGATCAACCAAAACAGAATTG

           *
     29062 TTTGCAATGCTATGATCAACCAAAACAGAATTG
         1 TTTTCAATGCTATGATCAACCAAAACAGAATTG

           * *
     29095 TTTTTAATGCTATGTTCAACCAAAACAGAATTG
         1 TTTTCAATGCTATGATCAACCAAAACAGAATTG

     29128 TTTTCA
         1 TTTTCA

     29134 TCACAATTAG

Statistics
Matches: 62,  Mismatches: 7, Indels: 3
        0.86            0.10        0.04

Matches are distributed among these distances:
  30  9  0.15
  32  1  0.02
  33  52  0.84

ACGTcount: A:0.37, C:0.18, G:0.12, T:0.33

Consensus pattern (33 bp):
TTTTCAATGCTATGATCAACCAAAACAGAATTG


Found at i:29194 original size:33 final size:33

Alignment explanation

Indices: 29157--29277  Score: 127
Period size: 33  Copynumber: 3.7  Consensus size: 33

     29147 CCAAAACAGA

           * * *
     29157 TTTAGTTTTATTGCAAACAACACTCAAGTTAGG
         1 TTTAGTATCATTGCAAACAACACTCAAATTAGG

           * ** *
     29190 TTTAGTATCATCGCAAACAACA-TCTAAAACAGA
         1 TTTAGTATCATTGCAAACAACACTC-AAATTAGG

           * *
     29223 TTTAGTGTCATTGCAAACAACACTCAATTTAGG
         1 TTTAGTATCATTGCAAACAACACTCAAATTAGG

           **
     29256 TTTAGTATCACCGCAAACAACA
         1 TTTAGTATCATTGCAAACAACA

     29278 TCTAAAAGAC

Statistics
Matches: 70,  Mismatches: 16, Indels: 4
        0.78            0.18        0.04

Matches are distributed among these distances:
  32  2  0.03
  33  66  0.94
  34  2  0.03

ACGTcount: A:0.38, C:0.20, G:0.12, T:0.30

Consensus pattern (33 bp):
TTTAGTATCATTGCAAACAACACTCAAATTAGG


Found at i:29201 original size:66 final size:66

Alignment explanation

Indices: 29144--29284  Score: 210
Period size: 66  Copynumber: 2.1  Consensus size: 66

     29134 TCACAATTAG

           * * *
     29144 CATCCAAAACAGATTTAGTTTTATTGCAAACAACACTCAAGTTAGGTTTAGTATCATCGCAAACA
         1 CATCCAAAACAGATTTAGTGTCATTGCAAACAACACTCAAGTTAGGTTTAGTATCATCACAAACA

     29209 A
        66 A

           * * * *
     29210 CATCTAAAACAGATTTAGTGTCATTGCAAACAACACTCAATTTAGGTTTAGTATCACCGCAAACA
         1 CATCCAAAACAGATTTAGTGTCATTGCAAACAACACTCAAGTTAGGTTTAGTATCATCACAAACA

     29275 A
        66 A

           *
     29276 CATCTAAAA
         1 CATCCAAAA

     29285 GACTCTTTTC

Statistics
Matches: 70,  Mismatches: 5, Indels: 0
        0.93            0.07        0.00

Matches are distributed among these distances:
  66  70  1.00

ACGTcount: A:0.40, C:0.21, G:0.11, T:0.28

Consensus pattern (66 bp):
CATCCAAAACAGATTTAGTGTCATTGCAAACAACACTCAAGTTAGGTTTAGTATCATCACAAACA
A


Found at i:30793 original size:8 final size:8

Alignment explanation

Indices: 30765--30798  Score: 50
Period size: 8  Copynumber: 4.1  Consensus size: 8

     30755 GAATCGGCTA

     30765 TGAATTTT
         1 TGAATTTT

           *
     30773 TGAAGTTTC
         1 TGAA-TTTT

     30782 TGAATTTT
         1 TGAATTTT

     30790 TGAATTTT
         1 TGAATTTT

     30798 T
         1 T

     30799 CAAGAAGGTG

Statistics
Matches: 23,  Mismatches: 2, Indels: 2
        0.85            0.07        0.07

Matches are distributed among these distances:
  8  16  0.70
  9  7  0.30

ACGTcount: A:0.24, C:0.03, G:0.15, T:0.59

Consensus pattern (8 bp):
TGAATTTT


Found at i:32648 original size:33 final size:33

Alignment explanation

Indices: 32598--32715  Score: 121
Period size: 33  Copynumber: 3.6  Consensus size: 33

     32588 TTTCAGATGC

           *
     32598 TGTTTGCGATGATACTAAACCTAAATTGAGTGT
         1 TGTTTGCAATGATACTAAACCTAAATTGAGTGT

           * * ** *
     32631 TGTTTGCAATGACACTAAATCT-GTTTTAGATGT
         1 TGTTTGCAATGATACTAAACCTAAATTGAG-TGT

           * * *
     32664 TGTTTGCGATGATACTAAACCTAACTTGAGTGG
         1 TGTTTGCAATGATACTAAACCTAAATTGAGTGT

           * *
     32697 TGTTTGCAATAAAACTAAA
         1 TGTTTGCAATGATACTAAA

     32716 TCTGTTTTGG

Statistics
Matches: 67,  Mismatches: 16, Indels: 4
        0.77            0.18        0.05

Matches are distributed among these distances:
  32  4  0.06
  33  59  0.88
  34  4  0.06

ACGTcount: A:0.31, C:0.13, G:0.19, T:0.36

Consensus pattern (33 bp):
TGTTTGCAATGATACTAAACCTAAATTGAGTGT


Found at i:32701 original size:66 final size:66

Alignment explanation

Indices: 32592--32730  Score: 224
Period size: 66  Copynumber: 2.1  Consensus size: 66

     32582 AGAGTCTTTC

           * * *
     32592 AGATGCTGTTTGCGATGATACTAAACCTAAATTGAGTGTTGTTTGCAATGACACTAAATCTGTTT
         1 AGATGCTGTTTGCGATGATACTAAACCTAAATTGAGTGGTGTTTGCAATAAAACTAAATCTGTTT

     32657 T
        66 T

           * *
     32658 AGATGTTGTTTGCGATGATACTAAACCTAACTTGAGTGGTGTTTGCAATAAAACTAAATCTGTTT
         1 AGATGCTGTTTGCGATGATACTAAACCTAAATTGAGTGGTGTTTGCAATAAAACTAAATCTGTTT

     32723 T
        66 T

           *
     32724 GGATGCT
         1 AGATGCT

     32731 AATTGTGATG

Statistics
Matches: 66,  Mismatches: 7, Indels: 0
        0.90            0.10        0.00

Matches are distributed among these distances:
  66  66  1.00

ACGTcount: A:0.29, C:0.13, G:0.21, T:0.37

Consensus pattern (66 bp):
AGATGCTGTTTGCGATGATACTAAACCTAAATTGAGTGGTGTTTGCAATAAAACTAAATCTGTTT
T


Found at i:32778 original size:33 final size:33

Alignment explanation

Indices: 32741--32828  Score: 122
Period size: 33  Copynumber: 2.7  Consensus size: 33

     32731 AATTGTGATG

     32741 AAAACAATTCTGTTTTGGTTGAACATAGCATTA
         1 AAAACAATTCTGTTTTGGTTGAACATAGCATTA

           ** *
     32774 AAAACAATTCTGTTTTGGTTGATTATAGCATTG
         1 AAAACAATTCTGTTTTGGTTGAACATAGCATTA

           * * *
     32807 CAAATAATCCTGTTTTGGTTGA
         1 AAAACAATTCTGTTTTGGTTGA

     32829 TAGCATTGAA

Statistics
Matches: 49,  Mismatches: 6, Indels: 0
        0.89            0.11        0.00

Matches are distributed among these distances:
  33  49  1.00

ACGTcount: A:0.32, C:0.11, G:0.17, T:0.40

Consensus pattern (33 bp):
AAAACAATTCTGTTTTGGTTGAACATAGCATTA


Found at i:32834 original size:30 final size:31

Alignment explanation

Indices: 32739--32843  Score: 113
Period size: 33  Copynumber: 3.3  Consensus size: 31

     32729 CTAATTGTGA

           * *
     32739 TGAAAACAATTCTGTTTTGGTTGAACATAGCAT
         1 TGAAAATAATCCTGTTTTGGTTG-A-ATAGCAT

           * * *
     32772 TAAAAACAATTCTGTTTTGGTTGATTATAGCAT
         1 TGAAAATAATCCTGTTTTGGTTGA--ATAGCAT

           *
     32805 TGCAAATAATCCTGTTTTGGTTG-ATAGCAT
         1 TGAAAATAATCCTGTTTTGGTTGAATAGCAT

     32835 TGAAAATAA
         1 TGAAAATAA

     32844 ATCTGATTCA

Statistics
Matches: 64,  Mismatches: 7, Indels: 5
        0.84            0.09        0.07

Matches are distributed among these distances:
  30  15  0.23
  32  1  0.02
  33  48  0.75

ACGTcount: A:0.34, C:0.10, G:0.17, T:0.38

Consensus pattern (31 bp):
TGAAAATAATCCTGTTTTGGTTGAATAGCAT


Done.