Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01007497.1 Corchorus capsularis cultivar CVL-1 contig07518, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 85510
ACGTcount: A:0.32, C:0.17, G:0.18, T:0.33


Found at i:1181 original size:24 final size:27

Alignment explanation

Indices: 1154--1206  Score: 76
Period size: 26  Copynumber: 2.1  Consensus size: 27

      1144 TTCTAGTGAT

                                *
      1154 GAGAAT-AAG-GAAGGAAAA-GAGGAG
         1 GAGAATCAAGAGAAGGAAAATAAGGAG

      1178 GAGAATCAAGAGAAGGAAAATAAGGAG
         1 GAGAATCAAGAGAAGGAAAATAAGGAG

      1205 GA
         1 GA

      1207 ATTTCATCCT


Statistics
Matches: 25,  Mismatches: 1, Indels: 3
        0.86            0.03        0.10

Matches are distributed among these distances:
  24      6    0.24
  25      3    0.12
  26      9    0.36
  27      7    0.28

ACGTcount: A:0.55, C:0.02, G:0.38, T:0.06

Consensus pattern (27 bp):
GAGAATCAAGAGAAGGAAAATAAGGAG


Found at i:1781 original size:14 final size:14

Alignment explanation

Indices: 1762--1788  Score: 54
Period size: 14  Copynumber: 1.9  Consensus size: 14

      1752 TTTTCCCCAA

      1762 ATTTTTTGAAAAAG
         1 ATTTTTTGAAAAAG

      1776 ATTTTTTGAAAAA
         1 ATTTTTTGAAAAA

      1789 ATTGATTTTT


Statistics
Matches: 13,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  14     13    1.00

ACGTcount: A:0.44, C:0.00, G:0.11, T:0.44

Consensus pattern (14 bp):
ATTTTTTGAAAAAG


Found at i:2990 original size:19 final size:18

Alignment explanation

Indices: 2966--3001  Score: 54
Period size: 19  Copynumber: 1.9  Consensus size: 18

      2956 TGAAGATTTC

      2966 TTGAAGATAATTTGAAGAT
         1 TTGAAGATAA-TTGAAGAT

                   *
      2985 TTGAAGATCATTGAAGA
         1 TTGAAGATAATTGAAGA

      3002 ATTATTTCAA


Statistics
Matches: 16,  Mismatches: 1, Indels: 1
        0.89            0.06        0.06

Matches are distributed among these distances:
  18      7    0.44
  19      9    0.56

ACGTcount: A:0.42, C:0.03, G:0.22, T:0.33

Consensus pattern (18 bp):
TTGAAGATAATTGAAGAT


Found at i:11094 original size:41 final size:41

Alignment explanation

Indices: 11032--11119  Score: 122
Period size: 41  Copynumber: 2.1  Consensus size: 41

     11022 CAATTTATTG

               *                   *           *
     11032 AACTGTTGAAGATAAAGAACTTCATACAACACAAGCTAAAA
         1 AACTATTGAAGATAAAGAACTTCACACAACACAAGCCAAAA

           *            *             *
     11073 CACTATTGAAGATGAAGAACTTCACACTACACAAGCCAAAA
         1 AACTATTGAAGATAAAGAACTTCACACAACACAAGCCAAAA

     11114 AACTAT
         1 AACTAT

     11120 GTCCAAGCAT


Statistics
Matches: 40,  Mismatches: 7, Indels: 0
        0.85            0.15        0.00

Matches are distributed among these distances:
  41     40    1.00

ACGTcount: A:0.49, C:0.20, G:0.11, T:0.19

Consensus pattern (41 bp):
AACTATTGAAGATAAAGAACTTCACACAACACAAGCCAAAA


Found at i:20032 original size:32 final size:31

Alignment explanation

Indices: 19985--20069  Score: 107
Period size: 32  Copynumber: 2.7  Consensus size: 31

     19975 ATCTGGCTAA

                   * *
     19985 AACCCAAACTGAACCCGAACCTGAATTAACCT
         1 AACCCAAATTCAACCCGAACC-GAATTAACCT

           *         *                *
     20017 GACCCAAATTTAACCCGAATCCGAATTGACCT
         1 AACCCAAATTCAACCCGAA-CCGAATTAACCT

     20049 AACCCAAATTCAACCCGAACC
         1 AACCCAAATTCAACCCGAACC

     20070 CAACTTAAAC


Statistics
Matches: 46,  Mismatches: 6, Indels: 3
        0.84            0.11        0.05

Matches are distributed among these distances:
  31      2    0.04
  32     42    0.91
  33      2    0.04

ACGTcount: A:0.39, C:0.35, G:0.09, T:0.16

Consensus pattern (31 bp):
AACCCAAATTCAACCCGAACCGAATTAACCT


Found at i:20071 original size:17 final size:16

Alignment explanation

Indices: 19985--20104  Score: 82
Period size: 17  Copynumber: 7.3  Consensus size: 16

     19975 ATCTGGCTAA

                     *
     19985 AACCCAAACTGAACCCG
         1 AACCCAAA-TTAACCCG

               **        *
     20002 AACCTGAATTAACCTG
         1 AACCCAAATTAACCCG

     20018 -ACCCAAATTTAACCCG
         1 AACCCAAA-TTAACCCG

             *  *    *    *
     20034 AATCCGAATTGA-CCT
         1 AACCCAAATTAACCCG

     20049 AACCCAAATTCAACCCG
         1 AACCCAAATT-AACCCG

                  *
     20066 AACCCAACTTAAACCCG
         1 AACCCAAATT-AACCCG

                    *   *
     20083 AACCCGAAAATAATCCG
         1 AACCC-AAATTAACCCG

     20100 AACCC
         1 AACCC

     20105 GAACCCAACC


Statistics
Matches: 78,  Mismatches: 20, Indels: 10
        0.72            0.19        0.09

Matches are distributed among these distances:
  15     15    0.19
  16     17    0.22
  17     43    0.55
  18      3    0.04

ACGTcount: A:0.40, C:0.36, G:0.09, T:0.15

Consensus pattern (16 bp):
AACCCAAATTAACCCG


Found at i:27099 original size:37 final size:38

Alignment explanation

Indices: 27021--27099  Score: 115
Period size: 38  Copynumber: 2.1  Consensus size: 38

     27011 ATAATTACCC

                                    *         **
     27021 ATTTAATTTTGCCTTTTGTCTTTGTTTCCAATCGTTGT
         1 ATTTAATTTTGCCTTTTGTCTTTGTCTCCAATCGTCCT

                       *
     27059 ATTTAATTTTGCTTTTTGTCTTTGTCTCCAA-CGTCCT
         1 ATTTAATTTTGCCTTTTGTCTTTGTCTCCAATCGTCCT

     27096 ATTT
         1 ATTT

     27100 GGGCTTAGAT


Statistics
Matches: 37,  Mismatches: 4, Indels: 1
        0.88            0.10        0.02

Matches are distributed among these distances:
  37      8    0.22
  38     29    0.78

ACGTcount: A:0.14, C:0.18, G:0.11, T:0.57

Consensus pattern (38 bp):
ATTTAATTTTGCCTTTTGTCTTTGTCTCCAATCGTCCT


Found at i:28354 original size:22 final size:22

Alignment explanation

Indices: 28329--28431  Score: 61
Period size: 22  Copynumber: 4.7  Consensus size: 22

     28319 TTCCTTAGAG

     28329 AGGTTAATAAAATTTCATAAGA
         1 AGGTTAATAAAATTTCATAAGA

                  *    *       *
     28351 AGGTTAAAAAAAATT-ATAAAA
         1 AGGTTAATAAAATTTCATAAGA

             *   *  *
     28372 AGATT-TTCGAAATTTCAT-AGTA
         1 AGGTTAAT-AAAATTTCATAAG-A

           **    *            *
     28394 TCGTTATTAAAATTTCATAGGA
         1 AGGTTAATAAAATTTCATAAGA

     28416 AGGTT-ATCAAAATTTC
         1 AGGTTAAT-AAAATTTC

     28432 GTAATGGGAT


Statistics
Matches: 58,  Mismatches: 17, Indels: 12
        0.67            0.20        0.14

Matches are distributed among these distances:
  21     16    0.28
  22     39    0.67
  23      3    0.05

ACGTcount: A:0.46, C:0.07, G:0.13, T:0.35

Consensus pattern (22 bp):
AGGTTAATAAAATTTCATAAGA


Found at i:28561 original size:2 final size:2

Alignment explanation

Indices: 28554--28579  Score: 52
Period size: 2  Copynumber: 13.0  Consensus size: 2

     28544 ACTAAAACTA

     28554 AT AT AT AT AT AT AT AT AT AT AT AT AT
         1 AT AT AT AT AT AT AT AT AT AT AT AT AT

     28580 TAATATGTAA


Statistics
Matches: 24,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   2     24    1.00

ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50

Consensus pattern (2 bp):
AT


Found at i:37308 original size:21 final size:21

Alignment explanation

Indices: 37279--37318  Score: 62
Period size: 21  Copynumber: 1.9  Consensus size: 21

     37269 ATAAAATAAC

               *
     37279 ATATTATAAATATTTTTTAGA
         1 ATATAATAAATATTTTTTAGA

                    *
     37300 ATATAATAATTATTTTTTA
         1 ATATAATAAATATTTTTTA

     37319 TATAAGGGTA


Statistics
Matches: 17,  Mismatches: 2, Indels: 0
        0.89            0.11        0.00

Matches are distributed among these distances:
  21     17    1.00

ACGTcount: A:0.42, C:0.00, G:0.03, T:0.55

Consensus pattern (21 bp):
ATATAATAAATATTTTTTAGA


Found at i:37876 original size:32 final size:31

Alignment explanation

Indices: 37810--37878  Score: 93
Period size: 32  Copynumber: 2.2  Consensus size: 31

     37800 GTAGTACTCA

                             **
     37810 GCTCGACTCTAAAATAAAGTCTCGAAAGTCG
         1 GCTCGACTCTAAAATAAAAACTCGAAAGTCG

                                      *  *
     37841 GCTCGACTCTAAAATAGAAAACTCGAAGGTTG
         1 GCTCGACTCTAAAATA-AAAACTCGAAAGTCG

     37873 GCTCGA
         1 GCTCGA

     37879 GTTCCACTGA


Statistics
Matches: 33,  Mismatches: 4, Indels: 1
        0.87            0.11        0.03

Matches are distributed among these distances:
  31     16    0.48
  32     17    0.52

ACGTcount: A:0.35, C:0.22, G:0.22, T:0.22

Consensus pattern (31 bp):
GCTCGACTCTAAAATAAAAACTCGAAAGTCG


Found at i:38143 original size:129 final size:129

Alignment explanation

Indices: 37908--38166  Score: 338
Period size: 129  Copynumber: 2.0  Consensus size: 129

     37898 TGGAGGCTTG

               *                                       * **
     37908 AGCTTGAGCTTAAACAAACTGTTCGATTAAGCTAGTCTTTCGATCTTTTGAAAAGCAGACGAACA
         1 AGCTCGAGCTTAAACAAACTGTTCGATTAAGCTAGTCTTTCGATATCCTGAAAAGCAGACGAACA

           * ***          *                               *     *
     37973 CATGTCTTCATTCCATATAACCAAACTTAAGTTAACTTTAATTTACCCTAAAATTCGTTATGCT
        66 AAAACCTTCATTCCAGATAACCAAACTTAAGTTAACTTTAATTTACCATAAAACTCGTTATGCT

                          *       *         *                  *    *
     38037 AGCTCGAGCTTAAACGAACTGTTTGATTAAGCTTGTCTTTCGATATCCTGAAGAGCATACGAACA
         1 AGCTCGAGCTTAAACAAACTGTTCGATTAAGCTAGTCTTTCGATATCCTGAAAAGCAGACGAACA

                        *          *                   *     *
     38102 AAAACCTTCATTCTAGATAACCAAGCTTAAGTTAACTTTAATTTGCCATAGAACTCGTTATGCT
        66 AAAACCTTCATTCCAGATAACCAAACTTAAGTTAACTTTAATTTACCATAAAACTCGTTATGCT

     38166 A
         1 A

     38167 AAAACAAAAA


Statistics
Matches: 110,  Mismatches: 20, Indels: 0
        0.85            0.15        0.00

Matches are distributed among these distances:
 129    110    1.00

ACGTcount: A:0.33, C:0.20, G:0.14, T:0.33

Consensus pattern (129 bp):
AGCTCGAGCTTAAACAAACTGTTCGATTAAGCTAGTCTTTCGATATCCTGAAAAGCAGACGAACA
AAAACCTTCATTCCAGATAACCAAACTTAAGTTAACTTTAATTTACCATAAAACTCGTTATGCT


Found at i:45764 original size:132 final size:132

Alignment explanation

Indices: 45544--45807  Score: 415
Period size: 131  Copynumber: 2.0  Consensus size: 132

     45534 AAGAATTATT

                     *        *                 *
     45544 TTTAAAAATTCTAATATATGTAAGTTTTTTAATTAAATTAGTAAAATGGTAAAAATAAAATAAAA
         1 TTTAAAAATTATAATATATATAAGTTTTTTAATTAAAATAGTAAAATGGTAAAAAT---ATAAAA

                                                          *    *
     45609 TAGGTATAAGGATATTAGATTTAATTAAATAAAAAATAGAGTTTTTAGTTGAGTAAAACTATAAA
        63 TAGGTATAAGGATATTAGATTTAATTAAAT-AAAAATAGAGTTTTTAATTGACTAAAACTATAAA

     45674 AGTATA
       127 AGTATA

     45680 TTTAAAAATTATAATATATATAAGTTTTTTTAATTAAAATAGTAAAATGGTAAAAAT-T-AAATA
         1 TTTAAAAATTATAATATATATAAG-TTTTTTAATTAAAATAGTAAAATGGTAAAAATATAAAATA

            *
     45743 GTTATAAGGATATTAGATTTAATTAAATAAAAATAGAGTTTTTAATTGACTAAAACTATAAAAGT
        65 GGTATAAGGATATTAGATTTAATTAAATAAAAATAGAGTTTTTAATTGACTAAAACTATAAAAGT

     45808 TTAAACAATG


Statistics
Matches: 121,  Mismatches: 6, Indels: 7
        0.90            0.04        0.05

Matches are distributed among these distances:
 131     35    0.29
 132     32    0.26
 133      1    0.01
 136     22    0.18
 137     31    0.26

ACGTcount: A:0.50, C:0.02, G:0.11, T:0.38

Consensus pattern (132 bp):
TTTAAAAATTATAATATATATAAGTTTTTTAATTAAAATAGTAAAATGGTAAAAATATAAAATAG
GTATAAGGATATTAGATTTAATTAAATAAAAATAGAGTTTTTAATTGACTAAAACTATAAAAGTA
TA


Found at i:55648 original size:20 final size:20

Alignment explanation

Indices: 55625--55706  Score: 55
Period size: 20  Copynumber: 4.2  Consensus size: 20

     55615 ATTGCTGTGA

     55625 TTGATTGCCTATTCCTTGAT
         1 TTGATTGCCTATTCCTTGAT

               *          *   *
     55645 TTGAAT---TATTGCATTGCT
         1 TTGATTGCCTATT-CCTTGAT

                  **    **
     55663 TTGATTGATTATTGTTGTGA-
         1 TTGATTGCCTATTCCT-TGAT

     55683 TTGATTGCCTATTCCTTGAT
         1 TTGATTGCCTATTCCTTGAT

     55703 TTGA
         1 TTGA

     55707 ATTATTGCAT


Statistics
Matches: 45,  Mismatches: 11, Indels: 12
        0.66            0.16        0.18

Matches are distributed among these distances:
  17      4    0.09
  18     10    0.22
  19      3    0.07
  20     22    0.49
  21      6    0.13

ACGTcount: A:0.18, C:0.12, G:0.18, T:0.51

Consensus pattern (20 bp):
TTGATTGCCTATTCCTTGAT


Found at i:55688 original size:58 final size:58

Alignment explanation

Indices: 55598--55719  Score: 226
Period size: 58  Copynumber: 2.1  Consensus size: 58

     55588 CTTTTTCTTA

                    *
     55598 CATTGCTTTTATTGATTATTGCTGTGATTGATTGCCTATTCCTTGATTTGAATTATTG
         1 CATTGCTTTGATTGATTATTGCTGTGATTGATTGCCTATTCCTTGATTTGAATTATTG

                                *
     55656 CATTGCTTTGATTGATTATTGTTGTGATTGATTGCCTATTCCTTGATTTGAATTATTG
         1 CATTGCTTTGATTGATTATTGCTGTGATTGATTGCCTATTCCTTGATTTGAATTATTG

     55714 CATTGC
         1 CATTGC

     55720 ATTTTTCATA


Statistics
Matches: 62,  Mismatches: 2, Indels: 0
        0.97            0.03        0.00

Matches are distributed among these distances:
  58     62    1.00

ACGTcount: A:0.19, C:0.12, G:0.18, T:0.51

Consensus pattern (58 bp):
CATTGCTTTGATTGATTATTGCTGTGATTGATTGCCTATTCCTTGATTTGAATTATTG


Found at i:60895 original size:28 final size:28

Alignment explanation

Indices: 60855--60911  Score: 105
Period size: 28  Copynumber: 2.0  Consensus size: 28

     60845 TTCGTATCCC

                                *
     60855 TTTGGAGATTTCAATCAATTATTTATGG
         1 TTTGGAGATTTCAATCAATTAATTATGG

     60883 TTTGGAGATTTCAATCAATTAATTATGG
         1 TTTGGAGATTTCAATCAATTAATTATGG

     60911 T
         1 T

     60912 GAGATTTGTG


Statistics
Matches: 28,  Mismatches: 1, Indels: 0
        0.97            0.03        0.00

Matches are distributed among these distances:
  28     28    1.00

ACGTcount: A:0.30, C:0.07, G:0.18, T:0.46

Consensus pattern (28 bp):
TTTGGAGATTTCAATCAATTAATTATGG


Found at i:62702 original size:21 final size:19

Alignment explanation

Indices: 62676--62714  Score: 51
Period size: 19  Copynumber: 1.9  Consensus size: 19

     62666 ACCACATCTA

                         *
     62676 TTCCAAACACTCTCCAAACTT
         1 TTCC-AACAC-CTCAAAACTT

     62697 TTCCAACACCTCAAAACT
         1 TTCCAACACCTCAAAACT

     62715 CAAACACTAC


Statistics
Matches: 17,  Mismatches: 1, Indels: 2
        0.85            0.05        0.10

Matches are distributed among these distances:
  19      8    0.47
  20      5    0.29
  21      4    0.24

ACGTcount: A:0.36, C:0.38, G:0.00, T:0.26

Consensus pattern (19 bp):
TTCCAACACCTCAAAACTT


Found at i:63526 original size:14 final size:14

Alignment explanation

Indices: 63509--63535  Score: 54
Period size: 14  Copynumber: 1.9  Consensus size: 14

     63499 TGGGGTGGGG

     63509 GGGGGGGGGGGGAA
         1 GGGGGGGGGGGGAA

     63523 GGGGGGGGGGGGA
         1 GGGGGGGGGGGGA

     63536 GATGAGTCTA


Statistics
Matches: 13,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  14     13    1.00

ACGTcount: A:0.11, C:0.00, G:0.89, T:0.00

Consensus pattern (14 bp):
GGGGGGGGGGGGAA


Found at i:69954 original size:26 final size:27

Alignment explanation

Indices: 69916--69974  Score: 66
Period size: 26  Copynumber: 2.2  Consensus size: 27

     69906 CAGGACGTCA

               *
     69916 CCCTCTGGCATGGGAGACGG-AAATTT
         1 CCCTTTGGCATGGGAGACGGAAAATTT

                   *     *  *
     69942 CCCTTTGGTATGGGTGATGGAAAATTT
         1 CCCTTTGGCATGGGAGACGGAAAATTT

     69969 CTCCTT
         1 C-CCTT

     69975 CTGCCTTGTC


Statistics
Matches: 27,  Mismatches: 4, Indels: 2
        0.82            0.12        0.06

Matches are distributed among these distances:
  26     16    0.59
  27      7    0.26
  28      4    0.15

ACGTcount: A:0.20, C:0.20, G:0.27, T:0.32

Consensus pattern (27 bp):
CCCTTTGGCATGGGAGACGGAAAATTT


Found at i:75606 original size:111 final size:109

Alignment explanation

Indices: 75472--75684  Score: 318
Period size: 111  Copynumber: 1.9  Consensus size: 109

     75462 GTTGATGGCA

               *                                                   *
     75472 CATTTTTTTTGTTGATAACTAATTTGGTATATCCTTTAAACAACATTCTGCTATCATGTATGTAA
         1 CATTCTTTTTGTTGATAACTAATTTGGTATATCCTTTAAACAACATTCTGCTATCACG--TGTAA

                                *   *
     75537 AGCAGTGTAAAAGGTTACCGATTTAGTACTTAGCTTAATCTTTAGC
        64 AGCAGTGTAAAAGGTTACCGACTTAATACTTAGCTTAATCTTTAGC

                                                    * *
     75583 CATTCTTTTTGTTGATAACTAATTTGGTATATCCTTTAAACTATATTCTGCTATCACGTGTAAAG
         1 CATTCTTTTTGTTGATAACTAATTTGGTATATCCTTTAAACAACATTCTGCTATCACGTGTAAAG

           *        *       *             *
     75648 TAGTGTAAAGGGTTACCTACTTAATACTTAGGTTAAT
        66 CAGTGTAAAAGGTTACCGACTTAATACTTAGCTTAAT

     75685 GTTTTTAAGC


Statistics
Matches: 92,  Mismatches: 10, Indels: 2
        0.88            0.10        0.02

Matches are distributed among these distances:
 109     38    0.41
 111     54    0.59

ACGTcount: A:0.30, C:0.14, G:0.15, T:0.42

Consensus pattern (109 bp):
CATTCTTTTTGTTGATAACTAATTTGGTATATCCTTTAAACAACATTCTGCTATCACGTGTAAAG
CAGTGTAAAAGGTTACCGACTTAATACTTAGCTTAATCTTTAGC


Found at i:79889 original size:21 final size:20

Alignment explanation

Indices: 79863--79902  Score: 53
Period size: 20  Copynumber: 1.9  Consensus size: 20

     79853 GATTTTCTTT

                          *
     79863 CTCTTTCCTCTTCCCGAAAAA
         1 CTCTTT-CTCTTCCCCAAAAA

                 *
     79884 CTCTTTTTCTTCCCCAAAA
         1 CTCTTTCTCTTCCCCAAAA

     79903 TTCTTCTTTT


Statistics
Matches: 17,  Mismatches: 2, Indels: 1
        0.85            0.10        0.05

Matches are distributed among these distances:
  20     11    0.65
  21      6    0.35

ACGTcount: A:0.23, C:0.38, G:0.03, T:0.38

Consensus pattern (20 bp):
CTCTTTCTCTTCCCCAAAAA


Found at i:84921 original size:33 final size:32

Alignment explanation

Indices: 84870--84974  Score: 97
Period size: 33  Copynumber: 3.2  Consensus size: 32

     84860 TTGCAAAGAG

                                    *
     84870 TGTTTT-AGATGTTGTTTGCGATGATACTAATCC
         1 TGTTTTAAG-TGTTGTTTGCGATGAAACTAAT-C

                  *            *    *
     84903 T-TATTTGAGTGTTGTTTGCAATGACACTAAATC
         1 TGT-TTTAAGTGTTGTTTGCGATGAAACT-AATC

                             *             *
     84936 TGTTTTAAGTGTTGTTTGTGATGAAACTAAATT
         1 TGTTTTAAGTGTTGTTTGCGATGAAACT-AATC

     84969 TGTTTT
         1 TGTTTT

     84975 GGATGCTAAT


Statistics
Matches: 61,  Mismatches: 7, Indels: 8
        0.80            0.09        0.11

Matches are distributed among these distances:
  32      1    0.02
  33     54    0.89
  34      6    0.10

ACGTcount: A:0.24, C:0.09, G:0.20, T:0.48

Consensus pattern (32 bp):
TGTTTTAAGTGTTGTTTGCGATGAAACTAATC


Found at i:85041 original size:33 final size:33

Alignment explanation

Indices: 85004--85108  Score: 174
Period size: 33  Copynumber: 3.2  Consensus size: 33

     84994 AACAAATCTA

                *
     85004 TTTTGATTGATCATAGCATTGCAAATAATTCTG
         1 TTTTGGTTGATCATAGCATTGCAAATAATTCTG

                                           *
     85037 TTTTGGTTGATCATAGCATTGCAAATAATTCTA
         1 TTTTGGTTGATCATAGCATTGCAAATAATTCTG

                          *     *
     85070 TTTTGGTTGATCATAACATTGAAAATAATTCTG
         1 TTTTGGTTGATCATAGCATTGCAAATAATTCTG

     85103 TTTTGG
         1 TTTTGG

     85109 GTGAAAAGAA


Statistics
Matches: 67,  Mismatches: 5, Indels: 0
        0.93            0.07        0.00

Matches are distributed among these distances:
  33     67    1.00

ACGTcount: A:0.30, C:0.10, G:0.16, T:0.44

Consensus pattern (33 bp):
TTTTGGTTGATCATAGCATTGCAAATAATTCTG


Done.