Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01012440.1 Corchorus olitorius cultivar O-4 contig12473, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 44543
ACGTcount: A:0.34, C:0.17, G:0.17, T:0.33


Found at i:791 original size:15 final size:16

Alignment explanation

Indices: 761--798 Score: 60 Period size: 15 Copynumber: 2.4 Consensus size: 16 751 TTACTTTGCT * 761 TTGTTTTCTAGTTTAA 1 TTGTTTTATAGTTTAA 777 TTGTTTTAT-GTTTAA 1 TTGTTTTATAGTTTAA 792 TTGTTTT 1 TTGTTTT 799 CTGTCAACCT Statistics Matches: 21, Mismatches: 1, Indels: 1 0.91 0.04 0.04 Matches are distributed among these distances: 15 13 0.62 16 8 0.38 ACGTcount: A:0.16, C:0.03, G:0.13, T:0.68 Consensus pattern (16 bp): TTGTTTTATAGTTTAA Found at i:1360 original size:12 final size:13 Alignment explanation

Indices: 1328--1360 Score: 50 Period size: 13 Copynumber: 2.6 Consensus size: 13 1318 AATAACCCTC * 1328 ACTCAATCTTACA 1 ACTCAATCTAACA 1341 ACTCAATCTAACA 1 ACTCAATCTAACA 1354 A-TCAATC 1 ACTCAATC 1361 ATAAATCAAA Statistics Matches: 19, Mismatches: 1, Indels: 1 0.90 0.05 0.05 Matches are distributed among these distances: 12 6 0.32 13 13 0.68 ACGTcount: A:0.42, C:0.30, G:0.00, T:0.27 Consensus pattern (13 bp): ACTCAATCTAACA Found at i:1459 original size:2 final size:2 Alignment explanation

Indices: 1452--1490 Score: 69 Period size: 2 Copynumber: 19.5 Consensus size: 2 1442 ACTAAACAAT * 1452 CA CA CA CA CA CA CA CA CC CA CA CA CA CA CA CA CA CA CA C 1 CA CA CA CA CA CA CA CA CA CA CA CA CA CA CA CA CA CA CA C 1491 CATCCTCTTG Statistics Matches: 35, Mismatches: 2, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 2 35 1.00 ACGTcount: A:0.46, C:0.54, G:0.00, T:0.00 Consensus pattern (2 bp): CA Found at i:3060 original size:20 final size:20 Alignment explanation

Indices: 3035--3079 Score: 81 Period size: 20 Copynumber: 2.2 Consensus size: 20 3025 CCCATGTTAA * 3035 GTAAAATTGGGGGTAGTATT 1 GTAAAATTGAGGGTAGTATT 3055 GTAAAATTGAGGGTAGTATT 1 GTAAAATTGAGGGTAGTATT 3075 GTAAA 1 GTAAA 3080 GTATAGTAAA Statistics Matches: 24, Mismatches: 1, Indels: 0 0.96 0.04 0.00 Matches are distributed among these distances: 20 24 1.00 ACGTcount: A:0.36, C:0.00, G:0.31, T:0.33 Consensus pattern (20 bp): GTAAAATTGAGGGTAGTATT Found at i:5659 original size:25 final size:25 Alignment explanation

Indices: 5631--5846 Score: 283 Period size: 25 Copynumber: 8.6 Consensus size: 25 5621 CGCTCATGTT * * 5631 CTTGTGTTTGGAAAACGAGCCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG 5656 CTTGCGTTTGGAAAACGAACCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG 5681 CTTGCGTTTGGAAAACGAACCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG 5706 CTTGCGTTTGGAAAACGAACCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG * * * * 5731 CATGCGTTTGGCAAGCGAGCCT-TAG 1 CTTGCGTTTGGAAAACGAACCTGT-G * 5756 CTTGCGTTTTGAAAACGAACCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG * * 5781 CTTGCGTTTGGCAAGCGAACCT-TAG 1 CTTGCGTTTGGAAAACGAACCTGT-G * * * 5806 CTTGCGTTTAGCAAACGAGCCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG * 5831 CTTGCGTTTAGAAAAC 1 CTTGCGTTTGGAAAAC 5847 ACATAGGCTA Statistics Matches: 169, Mismatches: 18, Indels: 8 0.87 0.09 0.04 Matches are distributed among these distances: 24 2 0.01 25 165 0.98 26 2 0.01 ACGTcount: A:0.23, C:0.21, G:0.28, T:0.29 Consensus pattern (25 bp): CTTGCGTTTGGAAAACGAACCTGTG Found at i:8500 original size:15 final size:15 Alignment explanation

Indices: 8480--8535 Score: 67 Period size: 15 Copynumber: 3.5 Consensus size: 15 8470 CACCAGATGA 8480 TGTTTCTGCAACGGT 1 TGTTTCTGCAACGGT * 8495 TGTTTCTGAAACGGAT 1 TGTTTCTGCAACGG-T * 8511 GATGTTTTTGCAACGGT 1 --TGTTTCTGCAACGGT 8528 TGTTTCTG 1 TGTTTCTG 8536 GAACAGTGCC Statistics Matches: 34, Mismatches: 4, Indels: 6 0.77 0.09 0.14 Matches are distributed among these distances: 15 20 0.59 16 1 0.03 17 1 0.03 18 12 0.35 ACGTcount: A:0.16, C:0.14, G:0.27, T:0.43 Consensus pattern (15 bp): TGTTTCTGCAACGGT Found at i:8525 original size:18 final size:18 Alignment explanation

Indices: 8464--8526 Score: 69 Period size: 18 Copynumber: 3.7 Consensus size: 18 8454 CATTTGCAAT * * 8464 TTTCTGCACCAGATGATG 1 TTTCTGCAACGGATGATG 8482 TTTCTGCAACGG-T--TG 1 TTTCTGCAACGGATGATG * 8497 TTTCTGAAACGGATGATG 1 TTTCTGCAACGGATGATG * 8515 TTTTTGCAACGG 1 TTTCTGCAACGG 8527 TTGTTTCTGG Statistics Matches: 37, Mismatches: 5, Indels: 6 0.77 0.10 0.12 Matches are distributed among these distances: 15 13 0.35 16 1 0.03 17 1 0.03 18 22 0.59 ACGTcount: A:0.21, C:0.17, G:0.25, T:0.37 Consensus pattern (18 bp): TTTCTGCAACGGATGATG Found at i:8643 original size:27 final size:27 Alignment explanation

Indices: 8605--8715 Score: 186 Period size: 27 Copynumber: 4.1 Consensus size: 27 8595 GGCCATTCAA * * 8605 TTGGGGTTGCGGATGAGGCACAGCCAC 1 TTGGGGTTGCGGATGAAGCGCAGCCAC 8632 TTGGGGTTGCGGATGAAGCGCAGCCAC 1 TTGGGGTTGCGGATGAAGCGCAGCCAC * * 8659 CTGGGGTTGCGGATGAAGCGCAACCAC 1 TTGGGGTTGCGGATGAAGCGCAGCCAC 8686 TTGGGGTTGCGGATGAAGCGCAGCCAC 1 TTGGGGTTGCGGATGAAGCGCAGCCAC 8713 TTG 1 TTG 8716 AGGTGGCGCC Statistics Matches: 78, Mismatches: 6, Indels: 0 0.93 0.07 0.00 Matches are distributed among these distances: 27 78 1.00 ACGTcount: A:0.19, C:0.23, G:0.40, T:0.19 Consensus pattern (27 bp): TTGGGGTTGCGGATGAAGCGCAGCCAC Found at i:10212 original size:25 final size:25 Alignment explanation

Indices: 10178--10418 Score: 315 Period size: 25 Copynumber: 9.6 Consensus size: 25 10168 TGCTCATGTT * * 10178 CTTGTGTTTGGAAAACGAGCCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG 10203 CTTGCGTTTGGAAAACGAACCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG 10228 CTTGCGTTTGGAAAACGAACCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG * 10253 CTTGCGTTTGGAAAGCGAACCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG 10278 CTTGCGTTTGGAAAACGAACCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG * * * 10303 CTTGCGTTTGGCAAGCGAGCCT-TAG 1 CTTGCGTTTGGAAAACGAACCTGT-G * * 10328 CTTGCGTTTTGAAAACGAACTTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG * * * 10353 CTTGCGTTTGGCAAGCGAGCCT-TAG 1 CTTGCGTTTGGAAAACGAACCTGT-G * * * 10378 CTTGCGTTTAGCAAACGAGCCTGTG 1 CTTGCGTTTGGAAAACGAACCTGTG * 10403 CTTGCGTTTAGAAAAC 1 CTTGCGTTTGGAAAAC 10419 ACATAGGCTA Statistics Matches: 192, Mismatches: 20, Indels: 8 0.87 0.09 0.04 Matches are distributed among these distances: 24 2 0.01 25 188 0.98 26 2 0.01 ACGTcount: A:0.22, C:0.20, G:0.29, T:0.29 Consensus pattern (25 bp): CTTGCGTTTGGAAAACGAACCTGTG Found at i:15511 original size:188 final size:181 Alignment explanation

Indices: 15043--15523 Score: 520 Period size: 198 Copynumber: 2.6 Consensus size: 181 15033 GTGGATAATC * * * * 15043 GAGGCAAGTGAGTGTTATTTTGATCCTTTAATCTCAGTCGTTTAAAGCTAATCTTAATATATAGG 1 GAGGCAAGTGAGGGTTATCTTGATCCTTCAATCTCAGTCGTTTAAAGGTAATCTTAATATATAGG * * * * * * 15108 TGTCAAGAATCAGGGACTATATTGGAAATGATTT--TATAAGGTATATTGGAGGGGAGTTGTACT 66 AGTCAAGAATCCGAGACTATATTGGAAATGATTTGATATAAGGTATATTGGAGAGGAGTTGGACC ** * * * * ** * 15171 TGTCAAAGAACAAGATGAGAAGAGTAAGGGAAGGAAAGGGTGTGGATAACT 131 CATCAAAAAACAAGAAGAGAAAAGTAAGGGAAGGAAACGGCATGAATAACT * * * * 15222 AAGGCAAGTGAGGGTTATCTTGATCCTTTAGTCTTTTAATCTTAGCCATTTAAAGGTAATCTTAA 1 GAGGCAAGTGAGGGTTATCTTGATCC--T--TC----AATCTCAGTCGTTTAAAGGTAATCTTAA * * 15287 TATATAGGTTGGTCAAGAATCCGAGACTATATTGGAAATGATTTGATATAAGGTTTATTGGAGAG 58 TATATAGG--AGTCAAGAATCCGAGACTATATTGGAAATGATTTGATATAAGGTATATTGGAGAG 15352 GAGTTGGACCCAT-AAAAAACAAGTTAAAGAAGGTGACAAAGTGAA-GGAAGGAAACGGCATGAA 121 GAGTTGGACCCATCAAAAAACAAG---AAG-A---GA-AAAGT-AAGGGAAGGAAACGGCATGAA 15415 TAACT 177 TAACT * 15420 GAGGCAATTGAGGGTTATCTTGATCCTTCAATCTCAGTCGTTTAAAGGTAATCTTAATATATAGG 1 GAGGCAAGTGAGGGTTATCTTGATCCTTCAATCTCAGTCGTTTAAAGGTAATCTTAATATATAGG * 15485 AGTCAAGAATCCCAGACTATATTGGAAATGATTTGATAT 66 AGTCAAGAATCCGAGACTATATTGGAAATGATTTGATAT 15524 GTGTATCGGA Statistics Matches: 250, Mismatches: 31, Indels: 33 0.80 0.10 0.11 Matches are distributed among these distances: 179 23 0.09 181 1 0.00 183 1 0.00 187 32 0.13 188 37 0.15 189 31 0.12 190 42 0.17 191 26 0.10 193 2 0.01 194 3 0.01 196 1 0.00 197 2 0.01 198 47 0.19 199 2 0.01 ACGTcount: A:0.35, C:0.10, G:0.24, T:0.30 Consensus pattern (181 bp): GAGGCAAGTGAGGGTTATCTTGATCCTTCAATCTCAGTCGTTTAAAGGTAATCTTAATATATAGG AGTCAAGAATCCGAGACTATATTGGAAATGATTTGATATAAGGTATATTGGAGAGGAGTTGGACC CATCAAAAAACAAGAAGAGAAAAGTAAGGGAAGGAAACGGCATGAATAACT Found at i:15602 original size:3 final size:3 Alignment explanation

Indices: 15594--15674 Score: 137 Period size: 3 Copynumber: 27.0 Consensus size: 3 15584 AATCAACATT 15594 ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA 1 ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA ATA * 15642 ATA ATA ATA ATA ATA ATA TTA AT- ATA TATA ATA 1 ATA ATA ATA ATA ATA ATA ATA ATA ATA -ATA ATA 15675 TCTAATTAGA Statistics Matches: 74, Mismatches: 2, Indels: 4 0.93 0.03 0.05 Matches are distributed among these distances: 2 2 0.03 3 69 0.93 4 3 0.04 ACGTcount: A:0.64, C:0.00, G:0.00, T:0.36 Consensus pattern (3 bp): ATA Found at i:15687 original size:15 final size:15 Alignment explanation

Indices: 15594--15688 Score: 65 Period size: 15 Copynumber: 6.3 Consensus size: 15 15584 AATCAACATT * 15594 ATAATAATAATAATA- 1 ATAATATTAAT-ATAG * 15609 ATAATAATAATAATA- 1 ATAATATTAAT-ATAG * 15624 ATAATAATAATAATA- 1 ATAATATTAAT-ATAG * 15639 ATAATAATAATAATA- 1 ATAATATTAAT-ATAG * 15654 ATAATATTAATATAT 1 ATAATATTAATATAG 15669 ATAATATCTAAT-TAG 1 ATAATAT-TAATATAG 15684 ATAAT 1 ATAAT 15689 GTAAAATACA Statistics Matches: 76, Mismatches: 2, Indels: 4 0.93 0.02 0.05 Matches are distributed among these distances: 14 3 0.04 15 69 0.91 16 4 0.05 ACGTcount: A:0.61, C:0.01, G:0.01, T:0.37 Consensus pattern (15 bp): ATAATATTAATATAG Found at i:16404 original size:6 final size:6 Alignment explanation

Indices: 16393--16423 Score: 62 Period size: 6 Copynumber: 5.2 Consensus size: 6 16383 CATCTTCTGC 16393 TGCTGT TGCTGT TGCTGT TGCTGT TGCTGT T 1 TGCTGT TGCTGT TGCTGT TGCTGT TGCTGT T 16424 ACCCCTCAGG Statistics Matches: 25, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 6 25 1.00 ACGTcount: A:0.00, C:0.16, G:0.32, T:0.52 Consensus pattern (6 bp): TGCTGT Found at i:18214 original size:30 final size:28 Alignment explanation

Indices: 18145--18215 Score: 74 Period size: 27 Copynumber: 2.5 Consensus size: 28 18135 AAAATGTTAT * 18145 TGTGATTAAAATTCTACTAAAAAGA-TA 1 TGTGATTATAATTCTACTAAAAAGATTA * * 18172 T-TAGAATATAATTCTACTAGAAAGAATTA 1 TGT-GATTATAATTCTACTAAAAAG-ATTA 18201 TAGTGATTATAATTC 1 T-GTGATTATAATTC 18216 AATATTTTTA Statistics Matches: 35, Mismatches: 4, Indels: 7 0.76 0.09 0.15 Matches are distributed among these distances: 26 1 0.03 27 19 0.54 28 1 0.03 29 3 0.09 30 10 0.29 31 1 0.03 ACGTcount: A:0.45, C:0.07, G:0.11, T:0.37 Consensus pattern (28 bp): TGTGATTATAATTCTACTAAAAAGATTA Found at i:20277 original size:128 final size:128 Alignment explanation

Indices: 20114--20370 Score: 360 Period size: 128 Copynumber: 2.0 Consensus size: 128 20104 ACTTGGTTAA * 20114 TTACTAAGAAGGCCCTCAATTAGTTTCTCAACATT-CTCTTCTCCTCCAGCCCTTTTT-AGTAAT 1 TTACTAAGAAGGCCATCAATTAGTTTCTCAACATTCCT-TTCTCCTCC-GCCCTTTTTCAGTAAT * * 20177 TGCAAAGGTTTTTAACCAGTTGAATGTGAAAGC-C-TTTATTGACCATAAAATCATGTATTATTT 64 TACAAAGGTTTTTAACCAGTTCAATGT-AAA-CACATTTATTGACCATAAAATCATGTATTATTT 20240 AT 127 AT * * * * ** 20242 TTACTAAGAAGGCCATCAATTAGTTTCTCACCATTCCTTTTTCTTCCGGCCTTTTTCTTTAATTA 1 TTACTAAGAAGGCCATCAATTAGTTTCTCAACATTCCTTTCTCCTCCGCCCTTTTTCAGTAATTA * 20307 CAAGGGTTTTTAACCAGTTCAATGTAAACACATTTATTGACCATAAAATCATGTATTATTTAT 66 CAAAGGTTTTTAACCAGTTCAATGTAAACACATTTATTGACCATAAAATCATGTATTATTTAT 20370 T 1 T 20371 CACTTCATCC Statistics Matches: 115, Mismatches: 10, Indels: 8 0.86 0.08 0.06 Matches are distributed among these distances: 126 1 0.01 127 12 0.10 128 100 0.87 129 2 0.02 ACGTcount: A:0.29, C:0.20, G:0.11, T:0.40 Consensus pattern (128 bp): TTACTAAGAAGGCCATCAATTAGTTTCTCAACATTCCTTTCTCCTCCGCCCTTTTTCAGTAATTA CAAAGGTTTTTAACCAGTTCAATGTAAACACATTTATTGACCATAAAATCATGTATTATTTAT Found at i:21398 original size:21 final size:23 Alignment explanation

Indices: 21374--21429 Score: 57 Period size: 21 Copynumber: 2.6 Consensus size: 23 21364 CATAGTATTA 21374 AAATTATTATATAATA-AT-AAC 1 AAATTATTATATAATATATAAAC * * 21395 AAATT-TT-TTTAATATTATAAAT 1 AAATTATTATATAATA-TATAAAC 21417 AAATTATTATATA 1 AAATTATTATATA 21430 TGTGATAACT Statistics Matches: 27, Mismatches: 3, Indels: 7 0.73 0.08 0.19 Matches are distributed among these distances: 19 6 0.22 20 2 0.07 21 7 0.26 22 7 0.26 23 2 0.07 24 3 0.11 ACGTcount: A:0.52, C:0.02, G:0.00, T:0.46 Consensus pattern (23 bp): AAATTATTATATAATATATAAAC Found at i:25650 original size:22 final size:22 Alignment explanation

Indices: 25625--25667 Score: 59 Period size: 22 Copynumber: 2.0 Consensus size: 22 25615 TTTTATTTGA * 25625 GTAAAACTATAAAAGTAAAATT 1 GTAAAACTATAAAAATAAAATT * * 25647 GTAAAATTGTAAAAATAAAAT 1 GTAAAACTATAAAAATAAAAT 25668 AGTTATAAGG Statistics Matches: 18, Mismatches: 3, Indels: 0 0.86 0.14 0.00 Matches are distributed among these distances: 22 18 1.00 ACGTcount: A:0.60, C:0.02, G:0.09, T:0.28 Consensus pattern (22 bp): GTAAAACTATAAAAATAAAATT Found at i:25723 original size:93 final size:93 Alignment explanation

Indices: 25563--25731 Score: 284 Period size: 93 Copynumber: 1.8 Consensus size: 93 25553 AGTATTATCT * * 25563 TAAAAATAAAATAGGTATAAGGATATTAGATTTAATTAAATAAAAATAGAGTTTTTATTTGAGTA 1 TAAAAATAAAATAGGTATAAGGATATTAGATTTAATTAAATAAAAATAGAGTTTTCAGTTGAGTA * 25628 AAACTATAAAAGTAAAATTGTAAAATTG 66 AAACAATAAAAGTAAAATTGTAAAATTG * * * 25656 TAAAAATAAAATAGTTATAAGGATATTAGATTTAATTAAATCAAAATAGAGTTTTCAGTTGATTA 1 TAAAAATAAAATAGGTATAAGGATATTAGATTTAATTAAATAAAAATAGAGTTTTCAGTTGAGTA 25721 AAACAATAAAA 66 AAACAATAAAA 25732 ATTTAAATAA Statistics Matches: 70, Mismatches: 6, Indels: 0 0.92 0.08 0.00 Matches are distributed among these distances: 93 70 1.00 ACGTcount: A:0.52, C:0.02, G:0.12, T:0.34 Consensus pattern (93 bp): TAAAAATAAAATAGGTATAAGGATATTAGATTTAATTAAATAAAAATAGAGTTTTCAGTTGAGTA AAACAATAAAAGTAAAATTGTAAAATTG Found at i:25805 original size:31 final size:32 Alignment explanation

Indices: 25767--25830 Score: 112 Period size: 31 Copynumber: 2.0 Consensus size: 32 25757 ATATTCAAAA * 25767 AATAAGGGTATGATAGGCGATTCAAAA-GTTT 1 AATAAGGGTATAATAGGCGATTCAAAAGGTTT 25798 AATAAGGGTATAATAGGCGATTCAAAAGGTTT 1 AATAAGGGTATAATAGGCGATTCAAAAGGTTT 25830 A 1 A 25831 CAAAACTCGT Statistics Matches: 31, Mismatches: 1, Indels: 1 0.94 0.03 0.03 Matches are distributed among these distances: 31 26 0.84 32 5 0.16 ACGTcount: A:0.41, C:0.06, G:0.25, T:0.28 Consensus pattern (32 bp): AATAAGGGTATAATAGGCGATTCAAAAGGTTT Found at i:27221 original size:32 final size:32 Alignment explanation

Indices: 27180--27240 Score: 104 Period size: 32 Copynumber: 1.9 Consensus size: 32 27170 AAATATGTTT * 27180 GAAAAATAAGAGTATAATGGTCGATTCAATTA 1 GAAAAATAAGAGTATAATAGTCGATTCAATTA * 27212 GAAAAATAAGGGTATAATAGTCGATTCAA 1 GAAAAATAAGAGTATAATAGTCGATTCAA 27241 AAGTTTTACA Statistics Matches: 27, Mismatches: 2, Indels: 0 0.93 0.07 0.00 Matches are distributed among these distances: 32 27 1.00 ACGTcount: A:0.48, C:0.07, G:0.20, T:0.26 Consensus pattern (32 bp): GAAAAATAAGAGTATAATAGTCGATTCAATTA Found at i:28664 original size:2 final size:2 Alignment explanation

Indices: 28657--28696 Score: 73 Period size: 2 Copynumber: 20.5 Consensus size: 2 28647 TAATATTTAG 28657 TA TA TA TA TA TA TA T- TA TA TA TA TA TA TA TA TA TA TA TA T 1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 28697 TTTGTTCTAT Statistics Matches: 37, Mismatches: 0, Indels: 2 0.95 0.00 0.05 Matches are distributed among these distances: 1 1 0.03 2 36 0.97 ACGTcount: A:0.47, C:0.00, G:0.00, T:0.53 Consensus pattern (2 bp): TA Done.