Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01010409.1 Corchorus capsularis cultivar CVL-1 contig10430, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 111665
ACGTcount: A:0.33, C:0.17, G:0.17, T:0.33


Found at i:6199 original size:25 final size:26

Alignment explanation

Indices: 6151--6200 Score: 75 Period size: 25 Copynumber: 2.0 Consensus size: 26 6141 TGGAGTTAGT ** 6151 TAGTTAGTTAATTTTTTTGTTGGCAA 1 TAGTTAGTTAATTTTTTTAATGGCAA 6177 TAGTTAGTT-ATTTTTTTAATGGCA 1 TAGTTAGTTAATTTTTTTAATGGCA 6201 CAAATTATTT Statistics Matches: 22, Mismatches: 2, Indels: 1 0.88 0.08 0.04 Matches are distributed among these distances: 25 13 0.59 26 9 0.41 ACGTcount: A:0.24, C:0.04, G:0.18, T:0.54 Consensus pattern (26 bp): TAGTTAGTTAATTTTTTTAATGGCAA Found at i:17734 original size:18 final size:18 Alignment explanation

Indices: 17711--17750 Score: 64 Period size: 18 Copynumber: 2.3 Consensus size: 18 17701 TTTATTACAT * 17711 TATTATTATATTATATTA 1 TATTATTATATTAAATTA 17729 TATTATTATATTAAATTA 1 TATTATTATATTAAATTA 17747 -ATTA 1 TATTA 17751 ATTTCTTCGT Statistics Matches: 21, Mismatches: 1, Indels: 1 0.91 0.04 0.04 Matches are distributed among these distances: 17 4 0.19 18 17 0.81 ACGTcount: A:0.42, C:0.00, G:0.00, T:0.57 Consensus pattern (18 bp): TATTATTATATTAAATTA Found at i:17812 original size:15 final size:16 Alignment explanation

Indices: 17792--17821 Score: 53 Period size: 15 Copynumber: 1.9 Consensus size: 16 17782 CACGATTAAC 17792 TAAAATAAA-ACTAAT 1 TAAAATAAATACTAAT 17807 TAAAATAAATACTAA 1 TAAAATAAATACTAA 17822 ATAAGAACAA Statistics Matches: 14, Mismatches: 0, Indels: 1 0.93 0.00 0.07 Matches are distributed among these distances: 15 9 0.64 16 5 0.36 ACGTcount: A:0.67, C:0.07, G:0.00, T:0.27 Consensus pattern (16 bp): TAAAATAAATACTAAT Found at i:18283 original size:2 final size:2 Alignment explanation

Indices: 18276--18306 Score: 62 Period size: 2 Copynumber: 15.5 Consensus size: 2 18266 TGATATAGTG 18276 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 18307 GTGACATTGA Statistics Matches: 29, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 29 1.00 ACGTcount: A:0.48, C:0.00, G:0.00, T:0.52 Consensus pattern (2 bp): TA Found at i:20126 original size:45 final size:45 Alignment explanation

Indices: 20077--20166 Score: 171 Period size: 45 Copynumber: 2.0 Consensus size: 45 20067 CTTATGAATA * 20077 TATTTTGTCAATTATATGGATTTGTTTAGGACACCATGTTGAGTT 1 TATTTTATCAATTATATGGATTTGTTTAGGACACCATGTTGAGTT 20122 TATTTTATCAATTATATGGATTTGTTTAGGACACCATGTTGAGTT 1 TATTTTATCAATTATATGGATTTGTTTAGGACACCATGTTGAGTT 20167 GAGATCATAT Statistics Matches: 44, Mismatches: 1, Indels: 0 0.98 0.02 0.00 Matches are distributed among these distances: 45 44 1.00 ACGTcount: A:0.26, C:0.09, G:0.19, T:0.47 Consensus pattern (45 bp): TATTTTATCAATTATATGGATTTGTTTAGGACACCATGTTGAGTT Found at i:25325 original size:32 final size:31 Alignment explanation

Indices: 25289--25391 Score: 120 Period size: 32 Copynumber: 3.2 Consensus size: 31 25279 TCCGAACTGA * * 25289 GACCCGCAACCCAGATGACCTGAGACCCGAAT 1 GACCCG-AACCCAGATGACCCGAAACCCGAAT 25321 GACCCGTAACCCAGATGACCCGAAACCC-AGAT 1 GACCCG-AACCCAGATGACCCGAAACCCGA-AT 25353 GACCCGAGACCC-GTATGACCCGAAACCCGAAT 1 GACCCGA-ACCCAG-ATGACCCGAAACCCGAAT * 25385 AACCCGA 1 GACCCGA 25392 GAAGTTAACT Statistics Matches: 63, Mismatches: 4, Indels: 8 0.84 0.05 0.11 Matches are distributed among these distances: 31 3 0.05 32 59 0.94 33 1 0.02 ACGTcount: A:0.33, C:0.38, G:0.20, T:0.09 Consensus pattern (31 bp): GACCCGAACCCAGATGACCCGAAACCCGAAT Found at i:25341 original size:48 final size:48 Alignment explanation

Indices: 25289--25380 Score: 148 Period size: 48 Copynumber: 1.9 Consensus size: 48 25279 TCCGAACTGA * * * 25289 GACCCGCAACCCAGATGACCTGAGACCCGAATGACCCGTAACCCAGAT 1 GACCCGAAACCCAGATGACCCGAGACCCGAATGACCCGAAACCCAGAT * 25337 GACCCGAAACCCAGATGACCCGAGACCCGTATGACCCGAAACCC 1 GACCCGAAACCCAGATGACCCGAGACCCGAATGACCCGAAACCC 25381 GAATAACCCG Statistics Matches: 40, Mismatches: 4, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 48 40 1.00 ACGTcount: A:0.32, C:0.39, G:0.21, T:0.09 Consensus pattern (48 bp): GACCCGAAACCCAGATGACCCGAGACCCGAATGACCCGAAACCCAGAT Found at i:25381 original size:16 final size:16 Alignment explanation

Indices: 25289--25380 Score: 107 Period size: 16 Copynumber: 5.8 Consensus size: 16 25279 TCCGAACTGA * 25289 GACCCGCAACCCAGAT 1 GACCCGAAACCCAGAT * * 25305 GACCTGAGACCC-GAAT 1 GACCCGAAACCCAG-AT * 25321 GACCCGTAACCCAGAT 1 GACCCGAAACCCAGAT 25337 GACCCGAAACCCAGAT 1 GACCCGAAACCCAGAT * 25353 GACCCGAGACCC-GTAT 1 GACCCGAAACCCAG-AT 25369 GACCCGAAACCC 1 GACCCGAAACCC 25381 GAATAACCCG Statistics Matches: 64, Mismatches: 9, Indels: 6 0.81 0.11 0.08 Matches are distributed among these distances: 15 2 0.03 16 61 0.95 17 1 0.02 ACGTcount: A:0.32, C:0.39, G:0.21, T:0.09 Consensus pattern (16 bp): GACCCGAAACCCAGAT Found at i:25389 original size:16 final size:16 Alignment explanation

Indices: 25289--25391 Score: 95 Period size: 16 Copynumber: 6.4 Consensus size: 16 25279 TCCGAACTGA * 25289 GACCCGCAACCC-AGAT 1 GACCCGAAACCCGA-AT * * 25305 GACCTGAGACCCGAAT 1 GACCCGAAACCCGAAT * 25321 GACCCGTAACCC-AGAT 1 GACCCGAAACCCGA-AT 25337 GACCCGAAACCC-AGAT 1 GACCCGAAACCCGA-AT * * 25353 GACCCGAGACCCGTAT 1 GACCCGAAACCCGAAT 25369 GACCCGAAACCCGAAT 1 GACCCGAAACCCGAAT * 25385 AACCCGA 1 GACCCGA 25392 GAAGTTAACT Statistics Matches: 72, Mismatches: 12, Indels: 6 0.80 0.13 0.07 Matches are distributed among these distances: 15 1 0.01 16 70 0.97 17 1 0.01 ACGTcount: A:0.33, C:0.38, G:0.20, T:0.09 Consensus pattern (16 bp): GACCCGAAACCCGAAT Found at i:25390 original size:48 final size:48 Alignment explanation

Indices: 25290--25391 Score: 143 Period size: 48 Copynumber: 2.1 Consensus size: 48 25280 CCGAACTGAG * * * * 25290 ACCCGCAACCCAGATGACCTGAGACCCGAATGACCCGTAACCCAGATG 1 ACCCGAAACCCAGATGACCCGAGACCCGAATGACCCGAAACCCAGATA * 25338 ACCCGAAACCCAGATGACCCGAGACCCGTATGACCCGAAACCC-GAATA 1 ACCCGAAACCCAGATGACCCGAGACCCGAATGACCCGAAACCCAG-ATA 25386 ACCCGA 1 ACCCGA 25392 GAAGTTAACT Statistics Matches: 48, Mismatches: 5, Indels: 2 0.87 0.09 0.04 Matches are distributed among these distances: 47 1 0.02 48 47 0.98 ACGTcount: A:0.33, C:0.38, G:0.20, T:0.09 Consensus pattern (48 bp): ACCCGAAACCCAGATGACCCGAGACCCGAATGACCCGAAACCCAGATA Found at i:27620 original size:11 final size:11 Alignment explanation

Indices: 27595--27641 Score: 60 Period size: 11 Copynumber: 4.3 Consensus size: 11 27585 ATTTATTCAT 27595 TATTAATTA-A 1 TATTAATTAGA * 27605 CTATTAGTTAGA 1 -TATTAATTAGA * 27617 TATTAATTAGC 1 TATTAATTAGA 27628 TATTAATTAGA 1 TATTAATTAGA 27639 TAT 1 TAT 27642 AGTATAATGA Statistics Matches: 31, Mismatches: 4, Indels: 2 0.84 0.11 0.05 Matches are distributed among these distances: 11 30 0.97 12 1 0.03 ACGTcount: A:0.40, C:0.04, G:0.09, T:0.47 Consensus pattern (11 bp): TATTAATTAGA Found at i:27621 original size:22 final size:22 Alignment explanation

Indices: 27595--27641 Score: 76 Period size: 22 Copynumber: 2.1 Consensus size: 22 27585 ATTTATTCAT * 27595 TATTAATTAACTATTAGTTAGA 1 TATTAATTAACTATTAATTAGA * 27617 TATTAATTAGCTATTAATTAGA 1 TATTAATTAACTATTAATTAGA 27639 TAT 1 TAT 27642 AGTATAATGA Statistics Matches: 23, Mismatches: 2, Indels: 0 0.92 0.08 0.00 Matches are distributed among these distances: 22 23 1.00 ACGTcount: A:0.40, C:0.04, G:0.09, T:0.47 Consensus pattern (22 bp): TATTAATTAACTATTAATTAGA Found at i:28612 original size:12 final size:12 Alignment explanation

Indices: 28595--28621 Score: 54 Period size: 12 Copynumber: 2.2 Consensus size: 12 28585 TATGGATTAT 28595 TTATTTATTTAC 1 TTATTTATTTAC 28607 TTATTTATTTAC 1 TTATTTATTTAC 28619 TTA 1 TTA 28622 AGTTTAATTT Statistics Matches: 15, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 12 15 1.00 ACGTcount: A:0.26, C:0.07, G:0.00, T:0.67 Consensus pattern (12 bp): TTATTTATTTAC Found at i:32490 original size:14 final size:16 Alignment explanation

Indices: 32471--32508 Score: 53 Period size: 16 Copynumber: 2.5 Consensus size: 16 32461 TCGATCAAAT 32471 GTCGGGTC-ATT-TGG 1 GTCGGGTCAATTCTGG 32485 GTCGGGTCAATTCTGG 1 GTCGGGTCAATTCTGG * 32501 GTTGGGTC 1 GTCGGGTC 32509 GTTTTCTGTT Statistics Matches: 21, Mismatches: 1, Indels: 2 0.88 0.04 0.08 Matches are distributed among these distances: 14 8 0.38 15 3 0.14 16 10 0.48 ACGTcount: A:0.08, C:0.16, G:0.42, T:0.34 Consensus pattern (16 bp): GTCGGGTCAATTCTGG Found at i:32524 original size:17 final size:17 Alignment explanation

Indices: 32504--32542 Score: 69 Period size: 17 Copynumber: 2.3 Consensus size: 17 32494 ATTCTGGGTT * 32504 GGGTCGTTTTCTGTTTC 1 GGGTCGTTTTCGGTTTC 32521 GGGTCGTTTTCGGTTTC 1 GGGTCGTTTTCGGTTTC 32538 GGGTC 1 GGGTC 32543 ATACGGTTTG Statistics Matches: 21, Mismatches: 1, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 17 21 1.00 ACGTcount: A:0.00, C:0.18, G:0.36, T:0.46 Consensus pattern (17 bp): GGGTCGTTTTCGGTTTC Found at i:33538 original size:30 final size:28 Alignment explanation

Indices: 33478--33539 Score: 88 Period size: 28 Copynumber: 2.1 Consensus size: 28 33468 AATAGCACAA * 33478 TATAATCCATTTTATTTATATTATAACT 1 TATAATCCATTTTATTTATATTAAAACT * 33506 TATAATCCATTTTATTTACCTCTTAAAACT 1 TATAATCCATTTTATTTA--TATTAAAACT 33536 TATA 1 TATA 33540 TAACTTATAA Statistics Matches: 30, Mismatches: 2, Indels: 2 0.88 0.06 0.06 Matches are distributed among these distances: 28 18 0.60 30 12 0.40 ACGTcount: A:0.35, C:0.15, G:0.00, T:0.50 Consensus pattern (28 bp): TATAATCCATTTTATTTATATTAAAACT Found at i:33950 original size:30 final size:30 Alignment explanation

Indices: 33914--33986 Score: 146 Period size: 30 Copynumber: 2.4 Consensus size: 30 33904 TTTGACTCAT 33914 TTCGGGTTCGGGTTGTTTGGATTCGGGTAA 1 TTCGGGTTCGGGTTGTTTGGATTCGGGTAA 33944 TTCGGGTTCGGGTTGTTTGGATTCGGGTAA 1 TTCGGGTTCGGGTTGTTTGGATTCGGGTAA 33974 TTCGGGTTCGGGT 1 TTCGGGTTCGGGT 33987 ACCCAAAAAT Statistics Matches: 43, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 30 43 1.00 ACGTcount: A:0.08, C:0.11, G:0.41, T:0.40 Consensus pattern (30 bp): TTCGGGTTCGGGTTGTTTGGATTCGGGTAA Found at i:33983 original size:15 final size:15 Alignment explanation

Indices: 33914--33987 Score: 76 Period size: 15 Copynumber: 4.9 Consensus size: 15 33904 TTTGACTCAT ** 33914 TTCGGGTTCGGGTTG 1 TTCGGGTTCGGGTAA * * 33929 TTTGGATTCGGGTAA 1 TTCGGGTTCGGGTAA ** 33944 TTCGGGTTCGGGTTG 1 TTCGGGTTCGGGTAA * * 33959 TTTGGATTCGGGTAA 1 TTCGGGTTCGGGTAA 33974 TTCGGGTTCGGGTA 1 TTCGGGTTCGGGTA 33988 CCCAAAAATT Statistics Matches: 45, Mismatches: 14, Indels: 0 0.76 0.24 0.00 Matches are distributed among these distances: 15 45 1.00 ACGTcount: A:0.09, C:0.11, G:0.41, T:0.39 Consensus pattern (15 bp): TTCGGGTTCGGGTAA Found at i:50372 original size:24 final size:25 Alignment explanation

Indices: 50340--50403 Score: 105 Period size: 24 Copynumber: 2.6 Consensus size: 25 50330 AAAGTCAATT * 50340 AAAATAACTTAATTATTTTCACC-A 1 AAAAAAACTTAATTATTTTCACCAA 50364 AAAAAAACTTAATTATTTTCACCAA 1 AAAAAAACTTAATTATTTTCACCAA 50389 AAAAAAAC-TAATTAT 1 AAAAAAACTTAATTAT 50404 GATTGAAATA Statistics Matches: 38, Mismatches: 1, Indels: 2 0.93 0.02 0.05 Matches are distributed among these distances: 24 29 0.76 25 9 0.24 ACGTcount: A:0.53, C:0.14, G:0.00, T:0.33 Consensus pattern (25 bp): AAAAAAACTTAATTATTTTCACCAA Found at i:62221 original size:22 final size:22 Alignment explanation

Indices: 62179--62222 Score: 70 Period size: 22 Copynumber: 2.0 Consensus size: 22 62169 TATTCATATG * * 62179 AAATTATGATAATTTCTCTATT 1 AAATTATGATAATTACACTATT 62201 AAATTATGATAATTACACTATT 1 AAATTATGATAATTACACTATT 62223 TTTTATGATC Statistics Matches: 20, Mismatches: 2, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 22 20 1.00 ACGTcount: A:0.41, C:0.09, G:0.05, T:0.45 Consensus pattern (22 bp): AAATTATGATAATTACACTATT Found at i:62262 original size:22 final size:22 Alignment explanation

Indices: 62237--62975 Score: 226 Period size: 22 Copynumber: 33.8 Consensus size: 22 62227 ATGATCCCAT 62237 TATGAAATTTTGATAACCTTCC 1 TATGAAATTTTGATAACCTTCC * *** * 62259 TATGAAATTTTAATAATGATAC 1 TATGAAATTTTGATAACCTTCC * * ** 62281 TATG-AATTTCGAGAACCTTTT 1 TATGAAATTTTGATAACCTTCC * ** * * 62302 TAT-AATTTTTTTTAACGTTCT 1 TATGAAATTTTGATAACCTTCC * * 62323 TATGAAATTTTGTTAACCTCCC 1 TATGAAATTTTGATAACCTTCC * * * 62345 TAAGGAATTTTGA-AGACC-TCAA 1 TATGAAATTTTGATA-ACCTTC-C 62367 TATGAAATTTTGATAA-CTTCGC 1 TATGAAATTTTGATAACCTTC-C * ** 62389 AATGAAATTTTGATAACCAACAC 1 TATGAAATTTTGATAACCTTC-C * * * 62412 AATGAGA-TGTGATAACC-TCC 1 TATGAAATTTTGATAACCTTCC * * * * 62432 ATATGATATATTGATAACC-ACGT 1 -TATGAAATTTTGATAACCTTC-C * * * 62455 TATGAAAATTTAAAAACC-TCC 1 TATGAAATTTTGATAACCTTCC * * 62476 ATATG-AATTGTT-AGTAATC-ACAC 1 -TATGAAATT-TTGA-TAACCTTC-C * * * 62499 TCTGAAATTTTGATAATC-ACAC 1 TATGAAATTTTGATAACCTTC-C * 62521 TATGAAATTGTGATAACC-TCGC 1 TATGAAATTTTGATAACCTTC-C * * 62543 TATGAAATTTTGTTAAATCTTCC 1 TATGAAATTTTGAT-AACCTTCC * ** ** 62566 TATAAAATTTTGATAATTTTTT 1 TATGAAATTTTGATAACCTTCC * * 62588 TATGAAATCTTGATAA-C-TAC 1 TATGAAATTTTGATAACCTTCC 62608 -A-G--ATTTTGATAACC-TCC 1 TATGAAATTTTGATAACCTTCC * * * 62625 ATATGAATTTTTAATAACC-TCAT 1 -TATGAAATTTTGATAACCTTC-C * * 62648 TATGAAATTTTGTTAACCTCCC 1 TATGAAATTTTGATAACCTTCC * * * 62670 TATGAAATTTTGATCTA-CATAC 1 TATGAAATTTTGAT-AACCTTCC * 62692 TATGAAATTTTGATAA-CTCTCT 1 TATGAAATTTTGATAACCT-TCC ** * * ** 62714 TATGACGTTTT-AAAAACTAAAC 1 TATGAAATTTTGATAACCT-TCC * 62736 TATGAAAATTTTGATAACCTTCA 1 TATG-AAATTTTGATAACCTTCC * * * * 62759 TAGGATATTTTGATATCCTCCC 1 TATGAAATTTTGATAACCTTCC * ** ** 62781 TATGTAATTTCAATAACCAACC 1 TATGAAATTTTGATAACCTTCC * * * 62803 TAAGAAATTTTAATAACTTGATCC 1 TATGAAATTTTGATAACCT--TCC * * ** 62827 TATGAAATTTTGGTAACCATAT 1 TATGAAATTTTGATAACCTTCC * * 62849 TATGAAA-TTTGGTAACC-ACAC 1 TATGAAATTTTGATAACCTTC-C * * 62870 TATGGAATTTTGATAA-CTTCAA 1 TATGAAATTTTGATAACCTTC-C * 62892 TATGAAATTTT-AGTAACC-ACAC 1 TATGAAATTTTGA-TAACCTTC-C 62914 TATGAAATTTTGATAACC-TCC 1 TATGAAATTTTGATAACCTTCC * * * * * 62935 TCATGAAATTATAATAAACATCT 1 T-ATGAAATTTTGATAACCTTCC 62958 TATGAAATTTTGATAACC 1 TATGAAATTTTGATAACC 62976 ACATAGAGAA Statistics Matches: 522, Mismatches: 154, Indels: 82 0.69 0.20 0.11 Matches are distributed among these distances: 16 9 0.02 17 3 0.01 18 1 0.00 19 2 0.00 20 3 0.01 21 63 0.12 22 371 0.71 23 47 0.09 24 23 0.04 ACGTcount: A:0.37, C:0.15, G:0.10, T:0.38 Consensus pattern (22 bp): TATGAAATTTTGATAACCTTCC Found at i:67983 original size:3 final size:3 Alignment explanation

Indices: 67975--68006 Score: 64 Period size: 3 Copynumber: 10.7 Consensus size: 3 67965 TTAAAACGGT 67975 TTA TTA TTA TTA TTA TTA TTA TTA TTA TTA TT 1 TTA TTA TTA TTA TTA TTA TTA TTA TTA TTA TT 68007 TTGAAATAGG Statistics Matches: 29, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 3 29 1.00 ACGTcount: A:0.31, C:0.00, G:0.00, T:0.69 Consensus pattern (3 bp): TTA Found at i:92526 original size:17 final size:17 Alignment explanation

Indices: 92485--92534 Score: 57 Period size: 17 Copynumber: 2.9 Consensus size: 17 92475 CCATGTAATC * 92485 TTTGATCACCGGTGA-T 1 TTTGATCACTGGTGATT * 92501 CTTGCATCACTGGTGATT 1 TTTG-ATCACTGGTGATT * 92519 TTTGATCACTAGTGAT 1 TTTGATCACTGGTGAT 92535 CTGGGGGTAT Statistics Matches: 28, Mismatches: 4, Indels: 3 0.80 0.11 0.09 Matches are distributed among these distances: 16 3 0.11 17 21 0.75 18 4 0.14 ACGTcount: A:0.20, C:0.18, G:0.22, T:0.40 Consensus pattern (17 bp): TTTGATCACTGGTGATT Found at i:94058 original size:12 final size:12 Alignment explanation

Indices: 94043--94067 Score: 50 Period size: 12 Copynumber: 2.1 Consensus size: 12 94033 TGAGACACAT 94043 TTAACAAATCAA 1 TTAACAAATCAA 94055 TTAACAAATCAA 1 TTAACAAATCAA 94067 T 1 T 94068 GAGGTGCTAC Statistics Matches: 13, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 12 13 1.00 ACGTcount: A:0.56, C:0.16, G:0.00, T:0.28 Consensus pattern (12 bp): TTAACAAATCAA Done.