Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01016013.1 Corchorus olitorius cultivar O-4 contig16046, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 26427
ACGTcount: A:0.32, C:0.17, G:0.19, T:0.31


Found at i:551 original size:12 final size:12

Alignment explanation

Indices: 534--605 Score: 101 Period size: 12 Copynumber: 5.9 Consensus size: 12 524 CATCGATACC 534 TCGATATAT-TGG 1 TCGATATATCT-G 546 TCGATATATCTG 1 TCGATATATCTG * 558 TCGATATATCGG 1 TCGATATATCTG 570 TCGATATATCTG 1 TCGATATATCTG * 582 TCGATATATCTAT 1 TCGATATATCT-G 595 TCGATATATCT 1 TCGATATATCT 606 ATAGATGCTT Statistics Matches: 55, Mismatches: 3, Indels: 3 0.90 0.05 0.05 Matches are distributed among these distances: 12 43 0.78 13 12 0.22 ACGTcount: A:0.26, C:0.15, G:0.17, T:0.42 Consensus pattern (12 bp): TCGATATATCTG Found at i:688 original size:35 final size:35 Alignment explanation

Indices: 641--712 Score: 126 Period size: 35 Copynumber: 2.1 Consensus size: 35 631 CTTCAAGGTG 641 ACAACGCTCCGATATTAGGGATCAATCACGTGACA 1 ACAACGCTCCGATATTAGGGATCAATCACGTGACA * * 676 ACAATGCTCCGATATTAGGGATCGATCACGTGACA 1 ACAACGCTCCGATATTAGGGATCAATCACGTGACA 711 AC 1 AC 713 GCTCTTGTTA Statistics Matches: 35, Mismatches: 2, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 35 35 1.00 ACGTcount: A:0.33, C:0.25, G:0.21, T:0.21 Consensus pattern (35 bp): ACAACGCTCCGATATTAGGGATCAATCACGTGACA Found at i:1727 original size:14 final size:14 Alignment explanation

Indices: 1708--1738 Score: 53 Period size: 14 Copynumber: 2.2 Consensus size: 14 1698 CTAAAAAGAT * 1708 AAAAAAAAACGCAG 1 AAAAAAAAACACAG 1722 AAAAAAAAACACAG 1 AAAAAAAAACACAG 1736 AAA 1 AAA 1739 TAGCGGGTCG Statistics Matches: 16, Mismatches: 1, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 14 16 1.00 ACGTcount: A:0.77, C:0.13, G:0.10, T:0.00 Consensus pattern (14 bp): AAAAAAAAACACAG Found at i:1876 original size:21 final size:20 Alignment explanation

Indices: 1842--1880 Score: 60 Period size: 20 Copynumber: 1.9 Consensus size: 20 1832 ATTTCTGCGT 1842 TTTTTTTAAATCTTTTTAGA 1 TTTTTTTAAATCTTTTTAGA * 1862 TTTTTTTAATTTCTTTTTA 1 TTTTTTTAA-ATCTTTTTA 1881 TCTTTAATAA Statistics Matches: 17, Mismatches: 1, Indels: 1 0.89 0.05 0.05 Matches are distributed among these distances: 20 9 0.53 21 8 0.47 ACGTcount: A:0.21, C:0.05, G:0.03, T:0.72 Consensus pattern (20 bp): TTTTTTTAAATCTTTTTAGA Found at i:3024 original size:154 final size:154 Alignment explanation

Indices: 2479--4042 Score: 2138 Period size: 154 Copynumber: 10.2 Consensus size: 154 2469 TTAAGAAATT * * ** * 2479 ATTCAAAACAACACTAATGGGCCACGAAAGGTCCAAAATAACAAGTGTTCAAAATGAGCGAAAAA 1 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA * * * * * 2544 TTTCACAGTGAACTAATCTCACCAAAATGATTATATTTAGACCATAAAAACAATGGGAAAAAAAA 66 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCAT--AAACAAT-GG-AAAGAAA * * * * 2609 AGCTTTGTGGTTTACTATATCGAAGACA 127 AGCTTTGTGGTTTGCCAAATCGAAGACG * * * * * * * 2637 ACTCAAAATAGCATTAAT--G-CCCGAACGACCTAAAATAACAAGTGTTCAAAATGAACTAAAAG 1 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA * * * ** * * 2699 TTTCACAGTGGAGTAATCTCACCAAAAGGATTATAGTTAAACCATAAACAATAGAAAGAAAAGTT 66 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT * 2764 TTGTGGTTTGCTAAATCGAAGACG 131 TTGTGGTTTGCCAAATCGAAGACG * * * 2788 AGTC-AAACAGCACTAATGGGCCCCGAAAGGCCCAAAATAACAAGTGTTCAAAATGAGCTGAAAA 1 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA * * * * * 2852 CTTCAGAGTGGAATAATCTCACCAAAAGGATTATAGTTAGGCCATAAACAATAGAAAAAAAAG-T 66 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT * * * 2916 CTTGTGGTTTTCCAGATCGAAAACG 131 -TTGTGGTTTGCCAAATCGAAGACG * * 2941 ATTCAAAACAGCACTAATGGGCCCCGAAAGGCCCAAAATAACAAGTGTTCAAAATGAACTAAAAA 1 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA * * 3006 CTTCACAGTGGACGAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGGAAGAAAAGCT 66 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT 3071 TTGTGGTTTGCCAAATCGAAGACG 131 TTGTGGTTTGCCAAATCGAAGACG * * 3095 ATTCAAAACAGCACTAATGGGCCACGAAAGACCTAAAATAACAAGTGTTCAAAATGAGCTAAAAA 1 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA * * * * 3160 CTTCACACTTGACTAATCTCATCAAAATGATTATAGTTAGGCCATAAACAATAGAAAGAAAAGCT 66 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT * 3225 TTGTGGTTTCCCAAATCGAAGACG 131 TTGTGGTTTGCCAAATCGAAGACG * *** * * * 3249 ATTCAAAACAACACTAAT---ATTCAAAAGACCCAAAATAACAAGTATTCAAAACGAGCTAAAAA 1 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA * * * 3311 CTTCCCAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAACGGAAAGAAAAGAT 66 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT * 3376 TTGTGGTTTGCCAAATCGAAAACG 131 TTGTGGTTTGCCAAATCGAAGACG * ** ** * 3400 ATTTAAAACAGCACTAATTAGCCCTAAAAGGCCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA 1 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA 3465 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT 66 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT 3530 TTGTGGTTTGCCAAATCGAAGACG 131 TTGTGGTTTGCCAAATCGAAGACG * * * * * 3554 ATTCAAAACAACACTAATGGGCCCCAAAAGACCTAAAATAACAAGTATTCAGAATGAGCTAAAAA 1 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA * * * * * * 3619 CTTCAGACTGGACTAATCTCATCATAATGATTATAGTTTGGCCATAAACAATAGAAAGAAAAGCT 66 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT * * 3684 TTGTGGTTT-CATAAATCGAAAACG 131 TTGTGGTTTGC-CAAATCGAAGACG * 3708 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGTTAAAAA 1 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA * * 3773 CTTCCCAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAACGGAAAGAAAAGCT 66 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT * 3838 TTGTGGTTTGCCAAATCGAAAACG 131 TTGTGGTTTGCCAAATCGAAGACG ** * * 3862 ATTCAAAACAGCACTAATTAGCCCCGAATGGCCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA 1 ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA * ** ** 3927 CTTCACAGTGGACTAATCTCACCAAAATAATTATAGTTACACCATAAACAATTTAAAGAAAAGCT 66 CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT * * 3992 TTATGGTTTGTCAAATCGAAGACG 131 TTGTGGTTTGCCAAATCGAAGACG * 4016 ATTCAAAACAGCACTAGTGGGCCCCGA 1 ATTCAAAACAGCACTAATGGGCCCCGA 4043 TGTAGACACC Statistics Matches: 1249, Mismatches: 146, Indels: 26 0.88 0.10 0.02 Matches are distributed among these distances: 150 11 0.01 151 166 0.13 152 3 0.00 153 125 0.10 154 852 0.68 155 77 0.06 156 1 0.00 158 14 0.01 ACGTcount: A:0.43, C:0.19, G:0.16, T:0.22 Consensus pattern (154 bp): ATTCAAAACAGCACTAATGGGCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAA CTTCACAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCT TTGTGGTTTGCCAAATCGAAGACG Found at i:3987 original size:767 final size:766 Alignment explanation

Indices: 2511--4031 Score: 2251 Period size: 767 Copynumber: 2.0 Consensus size: 766 2501 CACGAAAGGT * * * 2511 CCAAAATAACAAGTGTTCAAAATGAGCGAAAAATTTCACAGTGAACTAATCTCACCAAAATGATT 1 CCAAAATAACAAGTATTCAAAACGAGCGAAAAACTTCACAGTGAACTAATCTCACCAAAATGATT * * * * * * 2576 ATATTTAGACCATAAAAACAATGGGAAAAAAAAAGCTTTGTGGTTTACTATATCGAAGACAACTC 66 ATAGTTAGACCAT-AAAACAATCGGAAAAAAAAAGATTTGTGGTTTACCAAATCGAAAACAACTC * * * * * ** 2641 AAAATAGCATTAATGCCCGAACGACCTAAAATAACAAGTGTTCAAAATGAACTAAAAGTTTCACA 130 AAAACAGCACTAATGCCCAAAAGACCCAAAATAACAAGTGTTCAAAATGAACTAAAAACTTCACA * * 2706 GTGGAGTAATCTCACCAAAAGGATTATAGTTAAACCATAAACAATAGAAAGAAAAGTTTTGTGGT 195 GTGGACTAATCTCACCAAAAGGATTATAGTTAAACCATAAACAATAGAAAGAAAAGCTTTGTGGT * * * * * 2771 TTGCTAAATCGAAGACGAGTCAAACAGCACTAATGGGCCCCGAAAGGCCCAAAATAACAAGTGTT 260 TTGCCAAATCGAAGACGAGTCAAACAACACTAATGGGCCCCAAAAGACCCAAAATAACAAGTATT * * 2836 CAAAATGAGCTGAAAACTTCAGAGTGGAATAATCTCACCAAAAGGATTATAGTTAGGCCATAAAC 325 CAAAATGAGCTAAAAACTTCAGACTGGAATAATCTCACCAAAAGGATTATAGTTAGGCCATAAAC * 2901 AATAGAAAAAAAAGTCTTGTGGTTTTCCAGATCGAAAACGATTCAAAACAGCACTAATGGGCCCC 390 AATAGAAAAAAAAGTCTTGTGGTTTTCCAAATCGAAAACGATTCAAAACAGCACTAATGGGCCCC * 2966 GAAAGGCCCAAAATAACAAGTGTTCAAAATGAACTAAAAACTTCACAGTGGACGAATCTCACCAA 455 GAAAGACCCAAAATAACAAGTGTTCAAAATGAACTAAAAACTTCACAGTGGACGAATCTCACCAA * * * 3031 AATGATTATAGTTAGGCCATAAACAATGGGAAGAAAAGCTTTGTGGTTTGCCAAATCGAAGACGA 520 AATGATTATAGTTAGGCCATAAACAACGGAAAGAAAAGCTTTGTGGTTTGCCAAATCGAAAACGA * * 3096 TTCAAAACAGCACTAATGGGCCACGAAAGACCTAAAATAACAAGTGTTCAAAATGAGCTAAAAAC 585 TTCAAAACAGCACTAATGAGCCACGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAAC * * * ** 3161 TTCACACTTGACTAATCTCATCAAAATGATTATAGTTAGGCCATAAACAATAGAAAGAAAAGCTT 650 TTCACACTGGACTAATCTCACCAAAATAATTATAGTTACACCATAAACAATAGAAAGAAAAGCTT * 3226 TGTGGTTTCCCAAATCGAAGACGATTCAAAACAACACTAATATTCAAAAGAC 715 TATGGTTTCCCAAATCGAAGACGATTCAAAACAACACTAATATTCAAAAGAC * * * 3278 CCAAAATAACAAGTATTCAAAACGAGCTAAAAACTTCCCAGTGGACTAATCTCACCAAAATGATT 1 CCAAAATAACAAGTATTCAAAACGAGCGAAAAACTTCACAGTGAACTAATCTCACCAAAATGATT * * * * * * 3343 ATAGTTAGGCCAT-AAACAA-CGG-AAAGAAAAGATTTGTGGTTTGCCAAATCGAAAACGATTTA 66 ATAGTTAGACCATAAAACAATCGGAAAAAAAAAGATTTGTGGTTTACCAAATCGAAAACAACTCA * * 3405 AAACAGCACTAATTAGCCCTAAAAGGCCCAAAATAACAAGTGTTCAAAATGAGCTAAAAACTTCA 131 AAACAGCACTAA-T-GCCC-AAAAGACCCAAAATAACAAGTGTTCAAAATGAACTAAAAACTTCA * ** * 3470 CAGTGGACTAATCTCACCAAAATGATTATAGTTAGGCCATAAACAATGGAAAGAAAAGCTTTGTG 193 CAGTGGACTAATCTCACCAAAAGGATTATAGTTAAACCATAAACAATAGAAAGAAAAGCTTTGTG * * 3535 GTTTGCCAAATCGAAGACGATTCAAAACAACACTAATGGGCCCCAAAAGACCTAAAATAACAAGT 258 GTTTGCCAAATCGAAGACGAGTC-AAACAACACTAATGGGCCCCAAAAGACCCAAAATAACAAGT * * * * * * 3600 ATTCAGAATGAGCTAAAAACTTCAGACTGGACTAATCTCATCATAATGATTATAGTTTGGCCATA 322 ATTCAAAATGAGCTAAAAACTTCAGACTGGAATAATCTCACCAAAAGGATTATAGTTAGGCCATA * * 3665 AACAATAGAAAGAAAAG-CTTTGTGG-TTTCATAAATCGAAAACGATTCAAAACAGCACTAATGG 387 AACAATAGAAAAAAAAGTC-TTGTGGTTTTC-CAAATCGAAAACGATTCAAAACAGCACTAATGG ** * * 3728 GCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAGTTAAAAACTTCCCAGTGGACTAATCTC 450 GCCCCGAAAGACCCAAAATAACAAGTGTTCAAAATGAACTAAAAACTTCACAGTGGACGAATCTC 3793 ACCAAAATGATTATAGTTAGGCCATAAACAACGGAAAGAAAAGCTTTGTGGTTTGCCAAATCGAA 515 ACCAAAATGATTATAGTTAGGCCATAAACAACGGAAAGAAAAGCTTTGTGGTTTGCCAAATCGAA * * * * 3858 AACGATTCAAAACAGCACTAATTAGCCCCGAATGGCCCAAAATAACAAGTGTTCAAAATGAGCTA 580 AACGATTCAAAACAGCACTAATGAGCCACGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTA * ** 3923 AAAACTTCACAGTGGACTAATCTCACCAAAATAATTATAGTTACACCATAAACAATTTAAAGAAA 645 AAAACTTCACACTGGACTAATCTCACCAAAATAATTATAGTTACACCATAAACAATAGAAAGAAA ** * 3988 AGCTTTATGGTTTGTCAAATCGAAGACGATTCAAAACAGCACTA 710 AGCTTTATGGTTTCCCAAATCGAAGACGATTCAAAACAACACTA 4032 GTGGGCCCCG Statistics Matches: 671, Mismatches: 77, Indels: 12 0.88 0.10 0.02 Matches are distributed among these distances: 763 41 0.06 764 3 0.00 765 10 0.01 766 123 0.18 767 494 0.74 ACGTcount: A:0.43, C:0.18, G:0.16, T:0.22 Consensus pattern (766 bp): CCAAAATAACAAGTATTCAAAACGAGCGAAAAACTTCACAGTGAACTAATCTCACCAAAATGATT ATAGTTAGACCATAAAACAATCGGAAAAAAAAAGATTTGTGGTTTACCAAATCGAAAACAACTCA AAACAGCACTAATGCCCAAAAGACCCAAAATAACAAGTGTTCAAAATGAACTAAAAACTTCACAG TGGACTAATCTCACCAAAAGGATTATAGTTAAACCATAAACAATAGAAAGAAAAGCTTTGTGGTT TGCCAAATCGAAGACGAGTCAAACAACACTAATGGGCCCCAAAAGACCCAAAATAACAAGTATTC AAAATGAGCTAAAAACTTCAGACTGGAATAATCTCACCAAAAGGATTATAGTTAGGCCATAAACA ATAGAAAAAAAAGTCTTGTGGTTTTCCAAATCGAAAACGATTCAAAACAGCACTAATGGGCCCCG AAAGACCCAAAATAACAAGTGTTCAAAATGAACTAAAAACTTCACAGTGGACGAATCTCACCAAA ATGATTATAGTTAGGCCATAAACAACGGAAAGAAAAGCTTTGTGGTTTGCCAAATCGAAAACGAT TCAAAACAGCACTAATGAGCCACGAAAGACCCAAAATAACAAGTGTTCAAAATGAGCTAAAAACT TCACACTGGACTAATCTCACCAAAATAATTATAGTTACACCATAAACAATAGAAAGAAAAGCTTT ATGGTTTCCCAAATCGAAGACGATTCAAAACAACACTAATATTCAAAAGAC Found at i:4186 original size:27 final size:27 Alignment explanation

Indices: 4094--4186 Score: 93 Period size: 27 Copynumber: 3.4 Consensus size: 27 4084 AATTTACTTC 4094 TTTTGGTCATTTG--CATTCCCAGGGGCA 1 TTTTGGTCATTTGCACATT--CAGGGGCA * * * 4121 TTTTGGTTATTGGCACA-CCTAGGGGCA 1 TTTTGGTCATTTGCACATTC-AGGGGCA * * 4148 TTTCGGTCATTTGCACATTCATGGGCA 1 TTTTGGTCATTTGCACATTCAGGGGCA 4175 TTTTGGTCATTT 1 TTTTGGTCATTT 4187 TATGTCCACT Statistics Matches: 53, Mismatches: 9, Indels: 8 0.76 0.13 0.11 Matches are distributed among these distances: 26 1 0.02 27 49 0.92 28 1 0.02 29 2 0.04 ACGTcount: A:0.16, C:0.19, G:0.25, T:0.40 Consensus pattern (27 bp): TTTTGGTCATTTGCACATTCAGGGGCA Found at i:11204 original size:19 final size:18 Alignment explanation

Indices: 11180--11215 Score: 54 Period size: 19 Copynumber: 1.9 Consensus size: 18 11170 TGAAGTCTTA 11180 TTGAAGATAATTTGAAGAT 1 TTGAAGATAA-TTGAAGAT * 11199 TTGAAGATCATTGAAGA 1 TTGAAGATAATTGAAGA 11216 ATTATTTCCA Statistics Matches: 16, Mismatches: 1, Indels: 1 0.89 0.06 0.06 Matches are distributed among these distances: 18 7 0.44 19 9 0.56 ACGTcount: A:0.42, C:0.03, G:0.22, T:0.33 Consensus pattern (18 bp): TTGAAGATAATTGAAGAT Found at i:12508 original size:34 final size:35 Alignment explanation

Indices: 12453--12519 Score: 84 Period size: 34 Copynumber: 1.9 Consensus size: 35 12443 GAGGTTTCTT 12453 TTAATTATTTTCTCAATTTATCTT-TTGCTTTTAA 1 TTAATTATTTTCTCAATTTATCTTATTGCTTTTAA * * 12487 TTAATTGTTTTCTTTAA-TTATCTTGATTGCTTT 1 TTAATTATTTTC-TCAATTTATCTT-ATTGCTTT 12520 CTTAGATAGT Statistics Matches: 28, Mismatches: 2, Indels: 4 0.82 0.06 0.12 Matches are distributed among these distances: 34 18 0.64 35 3 0.11 36 7 0.25 ACGTcount: A:0.21, C:0.10, G:0.06, T:0.63 Consensus pattern (35 bp): TTAATTATTTTCTCAATTTATCTTATTGCTTTTAA Found at i:13826 original size:17 final size:17 Alignment explanation

Indices: 13806--13839 Score: 50 Period size: 17 Copynumber: 2.0 Consensus size: 17 13796 GGAAATTCTG 13806 CTCCAAAAACAATTTGA 1 CTCCAAAAACAATTTGA * * 13823 CTCCAACAACGATTTGA 1 CTCCAAAAACAATTTGA 13840 GCATCATTCA Statistics Matches: 15, Mismatches: 2, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 17 15 1.00 ACGTcount: A:0.41, C:0.26, G:0.09, T:0.24 Consensus pattern (17 bp): CTCCAAAAACAATTTGA Done.