Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01012399.1 Corchorus olitorius cultivar O-4 contig12432, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 57918
ACGTcount: A:0.32, C:0.19, G:0.17, T:0.31


Found at i:1022 original size:27 final size:27

Alignment explanation

Indices: 982--1041 Score: 104 Period size: 27 Copynumber: 2.3 Consensus size: 27 972 TTCATAAAAT * 982 TTCAT-TTAATTACAAAAGAAATTACA 1 TTCATATTAACTACAAAAGAAATTACA 1008 TTCATATTAACTACAAAAGAAATTACA 1 TTCATATTAACTACAAAAGAAATTACA 1035 TTCATAT 1 TTCATAT 1042 GAAATATAAG Statistics Matches: 32, Mismatches: 1, Indels: 1 0.94 0.03 0.03 Matches are distributed among these distances: 26 5 0.16 27 27 0.84 ACGTcount: A:0.48, C:0.13, G:0.03, T:0.35 Consensus pattern (27 bp): TTCATATTAACTACAAAAGAAATTACA Found at i:7323 original size:12 final size:12 Alignment explanation

Indices: 7301--7340 Score: 71 Period size: 12 Copynumber: 3.2 Consensus size: 12 7291 TTCAATCCTA 7301 AGGGTAAGTGTTT 1 AGGG-AAGTGTTT 7314 AGGGAAGTGTTT 1 AGGGAAGTGTTT 7326 AGGGAAGTGTTT 1 AGGGAAGTGTTT 7338 AGG 1 AGG 7341 TGAAAACTAA Statistics Matches: 27, Mismatches: 0, Indels: 1 0.96 0.00 0.04 Matches are distributed among these distances: 12 23 0.85 13 4 0.15 ACGTcount: A:0.25, C:0.00, G:0.42, T:0.33 Consensus pattern (12 bp): AGGGAAGTGTTT Found at i:9228 original size:18 final size:18 Alignment explanation

Indices: 9196--9231 Score: 56 Period size: 18 Copynumber: 2.0 Consensus size: 18 9186 ATCAAAGTGG 9196 TTAATATAAATTTTGGCC 1 TTAATATAAATTTTGGCC 9214 TTAAT-TAAATATTTGGCC 1 TTAATATAAAT-TTTGGCC 9232 AAATTAACTA Statistics Matches: 17, Mismatches: 0, Indels: 2 0.89 0.00 0.11 Matches are distributed among these distances: 17 5 0.29 18 12 0.71 ACGTcount: A:0.33, C:0.11, G:0.11, T:0.44 Consensus pattern (18 bp): TTAATATAAATTTTGGCC Found at i:9818 original size:24 final size:24 Alignment explanation

Indices: 9783--9838 Score: 94 Period size: 24 Copynumber: 2.3 Consensus size: 24 9773 AGGAAAATCT * 9783 TTCGACCCCATGGTGGTCGTGAGA 1 TTCGACCACATGGTGGTCGTGAGA 9807 TTCGACCACATGGTGGTCGTGAGA 1 TTCGACCACATGGTGGTCGTGAGA * 9831 TACGACCA 1 TTCGACCA 9839 AGTAGTGAAT Statistics Matches: 30, Mismatches: 2, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 24 30 1.00 ACGTcount: A:0.21, C:0.25, G:0.30, T:0.23 Consensus pattern (24 bp): TTCGACCACATGGTGGTCGTGAGA Found at i:10883 original size:33 final size:33 Alignment explanation

Indices: 10846--10913 Score: 127 Period size: 33 Copynumber: 2.1 Consensus size: 33 10836 TCTTGATGAA * 10846 AGGGATTTTGTTGGAAGCCTTGGCTATATAAGC 1 AGGGATTTTGTTGGAAGCCTTGACTATATAAGC 10879 AGGGATTTTGTTGGAAGCCTTGACTATATAAGC 1 AGGGATTTTGTTGGAAGCCTTGACTATATAAGC 10912 AG 1 AG 10914 AAGCACCTCC Statistics Matches: 34, Mismatches: 1, Indels: 0 0.97 0.03 0.00 Matches are distributed among these distances: 33 34 1.00 ACGTcount: A:0.26, C:0.12, G:0.29, T:0.32 Consensus pattern (33 bp): AGGGATTTTGTTGGAAGCCTTGACTATATAAGC Found at i:11167 original size:25 final size:25 Alignment explanation

Indices: 11133--11183 Score: 102 Period size: 25 Copynumber: 2.0 Consensus size: 25 11123 ATAACAATCC 11133 ATTGTTATCCTTTTCAAACTTGTTA 1 ATTGTTATCCTTTTCAAACTTGTTA 11158 ATTGTTATCCTTTTCAAACTTGTTA 1 ATTGTTATCCTTTTCAAACTTGTTA 11183 A 1 A 11184 ACCTGATTAC Statistics Matches: 26, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 25 26 1.00 ACGTcount: A:0.25, C:0.16, G:0.08, T:0.51 Consensus pattern (25 bp): ATTGTTATCCTTTTCAAACTTGTTA Found at i:20512 original size:1 final size:1 Alignment explanation

Indices: 20468--20495 Score: 56 Period size: 1 Copynumber: 28.0 Consensus size: 1 20458 CATTCTTGTG 20468 TTTTTTTTTTTTTTTTTTTTTTTTTTTT 1 TTTTTTTTTTTTTTTTTTTTTTTTTTTT 20496 ATGGTTTAAT Statistics Matches: 27, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 1 27 1.00 ACGTcount: A:0.00, C:0.00, G:0.00, T:1.00 Consensus pattern (1 bp): T Found at i:31649 original size:27 final size:27 Alignment explanation

Indices: 31617--31695 Score: 104 Period size: 27 Copynumber: 2.9 Consensus size: 27 31607 GCATTAGGGT * * 31617 CATCCAGGGGCATTTTAGTCATTTGCA 1 CATCCAAGGGCATTTTGGTCATTTGCA * * 31644 CCTCCATGGGCATTTTGGTCATTTGCA 1 CATCCAAGGGCATTTTGGTCATTTGCA * 31671 CATTCAAGGGCATTTTTGGTCATTT 1 CATCCAAGGGCA-TTTTGGTCATTT 31696 CAAGTTCACT Statistics Matches: 45, Mismatches: 6, Indels: 1 0.87 0.12 0.02 Matches are distributed among these distances: 27 33 0.73 28 12 0.27 ACGTcount: A:0.19, C:0.22, G:0.22, T:0.38 Consensus pattern (27 bp): CATCCAAGGGCATTTTGGTCATTTGCA Found at i:37232 original size:19 final size:19 Alignment explanation

Indices: 37179--37226 Score: 69 Period size: 19 Copynumber: 2.5 Consensus size: 19 37169 TTTTTTTATC * 37179 GGACCAGGTCAAACCGGTTT 1 GGACC-GGTCAAACCGGGTT * 37199 GGATCGGTCAAACCGGGTT 1 GGACCGGTCAAACCGGGTT 37218 GGACCGGTC 1 GGACCGGTC 37227 TGACCGAACC Statistics Matches: 25, Mismatches: 3, Indels: 1 0.86 0.10 0.03 Matches are distributed among these distances: 19 21 0.84 20 4 0.16 ACGTcount: A:0.21, C:0.25, G:0.35, T:0.19 Consensus pattern (19 bp): GGACCGGTCAAACCGGGTT Found at i:43387 original size:13 final size:13 Alignment explanation

Indices: 43369--43393 Score: 50 Period size: 13 Copynumber: 1.9 Consensus size: 13 43359 ATAAAAAATA 43369 AAAAAATGAAAAG 1 AAAAAATGAAAAG 43382 AAAAAATGAAAA 1 AAAAAATGAAAA 43394 AAATTAAACG Statistics Matches: 12, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 13 12 1.00 ACGTcount: A:0.80, C:0.00, G:0.12, T:0.08 Consensus pattern (13 bp): AAAAAATGAAAAG Found at i:43801 original size:19 final size:20 Alignment explanation

Indices: 43779--43855 Score: 111 Period size: 20 Copynumber: 3.9 Consensus size: 20 43769 CAAGGAATTC 43779 AAAAAAAAAAAATCAAATCA 1 AAAAAAAAAAAATCAAATCA ** 43799 AAAAATCAAAAAT-AAATCA 1 AAAAAAAAAAAATCAAATCA * 43818 AAAAAAAAAAAATCAAAGCA 1 AAAAAAAAAAAATCAAATCA 43838 AAAAAAACAAAAATCAAA 1 AAAAAAA-AAAAATCAAA 43856 AAGAGAATTG Statistics Matches: 50, Mismatches: 5, Indels: 3 0.86 0.09 0.05 Matches are distributed among these distances: 19 17 0.34 20 23 0.46 21 10 0.20 ACGTcount: A:0.79, C:0.10, G:0.01, T:0.09 Consensus pattern (20 bp): AAAAAAAAAAAATCAAATCA Found at i:43803 original size:15 final size:15 Alignment explanation

Indices: 43800--43849 Score: 57 Period size: 15 Copynumber: 3.3 Consensus size: 15 43790 ATCAAATCAA 43800 AAAATCAAAAATAAATC 1 AAAA-CAAAAA-AAATC * 43817 AAAAAAAAAAAAATC 1 AAAACAAAAAAAATC * 43832 AAAGCAAAAAAAA-C 1 AAAACAAAAAAAATC 43846 AAAA 1 AAAA 43850 ATCAAAAAGA Statistics Matches: 29, Mismatches: 4, Indels: 3 0.81 0.11 0.08 Matches are distributed among these distances: 14 4 0.14 15 16 0.55 16 5 0.17 17 4 0.14 ACGTcount: A:0.80, C:0.10, G:0.02, T:0.08 Consensus pattern (15 bp): AAAACAAAAAAAATC Found at i:43822 original size:13 final size:12 Alignment explanation

Indices: 43781--43826 Score: 58 Period size: 12 Copynumber: 3.8 Consensus size: 12 43771 AGGAATTCAA 43781 AAAAAAAAAATC 1 AAAAAAAAAATC * 43793 AAATCAAAAAATC 1 AAA-AAAAAAATC * 43806 -AAAAATAAATC 1 AAAAAAAAAATC 43817 AAAAAAAAAA 1 AAAAAAAAAA 43827 AAATCAAAGC Statistics Matches: 28, Mismatches: 4, Indels: 4 0.78 0.11 0.11 Matches are distributed among these distances: 11 7 0.25 12 13 0.46 13 8 0.29 ACGTcount: A:0.80, C:0.09, G:0.00, T:0.11 Consensus pattern (12 bp): AAAAAAAAAATC Found at i:43825 original size:24 final size:24 Alignment explanation

Indices: 43782--43827 Score: 65 Period size: 24 Copynumber: 1.9 Consensus size: 24 43772 GGAATTCAAA ** 43782 AAAAAAAAATCAAATCAAAAAATC 1 AAAAAAAAATCAAAAAAAAAAATC * 43806 AAAAATAAATCAAAAAAAAAAA 1 AAAAAAAAATCAAAAAAAAAAA 43828 AATCAAAGCA Statistics Matches: 19, Mismatches: 3, Indels: 0 0.86 0.14 0.00 Matches are distributed among these distances: 24 19 1.00 ACGTcount: A:0.80, C:0.09, G:0.00, T:0.11 Consensus pattern (24 bp): AAAAAAAAATCAAAAAAAAAAATC Found at i:43832 original size:39 final size:36 Alignment explanation

Indices: 43779--43857 Score: 113 Period size: 39 Copynumber: 2.1 Consensus size: 36 43769 CAAGGAATTC * 43779 AAAAAAAAAAAATCAAATCAAAAAATCAAAAATAAATCA 1 AAAAAAAAAAAATCAAAGCAAAAAA--AAAAA-AAATCA * 43818 AAAAAAAAAAAATCAAAGCAAAAAAAACAAAAATCA 1 AAAAAAAAAAAATCAAAGCAAAAAAAAAAAAAATCA 43854 AAAA 1 AAAA 43858 GAGAATTGAT Statistics Matches: 38, Mismatches: 2, Indels: 3 0.88 0.05 0.07 Matches are distributed among these distances: 36 10 0.26 37 4 0.11 39 24 0.63 ACGTcount: A:0.80, C:0.10, G:0.01, T:0.09 Consensus pattern (36 bp): AAAAAAAAAAAATCAAAGCAAAAAAAAAAAAAATCA Found at i:43842 original size:13 final size:12 Alignment explanation

Indices: 43785--43825 Score: 50 Period size: 11 Copynumber: 3.5 Consensus size: 12 43775 ATTCAAAAAA 43785 AAAAAATCAAATC 1 AAAAAATCAAA-C 43798 AAAAAATCAAA- 1 AAAAAATCAAAC * 43809 AATAAATCAAA- 1 AAAAAATCAAAC 43820 AAAAAA 1 AAAAAA 43826 AAAATCAAAG Statistics Matches: 26, Mismatches: 2, Indels: 2 0.87 0.07 0.07 Matches are distributed among these distances: 11 15 0.58 13 11 0.42 ACGTcount: A:0.78, C:0.10, G:0.00, T:0.12 Consensus pattern (12 bp): AAAAAATCAAAC Found at i:44182 original size:14 final size:15 Alignment explanation

Indices: 44158--44190 Score: 50 Period size: 14 Copynumber: 2.3 Consensus size: 15 44148 AGTGCCTGTA * 44158 AAAAAATGAATGATG 1 AAAAAATGAATGAAG 44173 AAAAAA-GAATGAAG 1 AAAAAATGAATGAAG 44187 AAAA 1 AAAA 44191 GAAGCTCTAG Statistics Matches: 17, Mismatches: 1, Indels: 1 0.89 0.05 0.05 Matches are distributed among these distances: 14 11 0.65 15 6 0.35 ACGTcount: A:0.70, C:0.00, G:0.18, T:0.12 Consensus pattern (15 bp): AAAAAATGAATGAAG Found at i:44677 original size:50 final size:50 Alignment explanation

Indices: 44614--44763 Score: 221 Period size: 50 Copynumber: 3.0 Consensus size: 50 44604 AAAGCAAGAA * * 44614 TTTTATAATAAGATTGCATTCCATTCGTGAGTCCAAGGTCAAAATTCGCT 1 TTTTATAATAAGATTGCATTCCATTTGTGAGTCCAAGATCAAAATTCGCT * * 44664 TTTTATAATAAGATTGCATTCCATTTGTGAGTCCAAGACCAAAATTTGCT 1 TTTTATAATAAGATTGCATTCCATTTGTGAGTCCAAGATCAAAATTCGCT * * * 44714 TTTCA-AAGTAAGGTTGCATTCCGTTTGTGAGTCCAAGATCAAAATTCGCT 1 TTTTATAA-TAAGATTGCATTCCATTTGTGAGTCCAAGATCAAAATTCGCT 44764 CTTCGAGGGG Statistics Matches: 90, Mismatches: 9, Indels: 2 0.89 0.09 0.02 Matches are distributed among these distances: 49 2 0.02 50 88 0.98 ACGTcount: A:0.30, C:0.17, G:0.17, T:0.36 Consensus pattern (50 bp): TTTTATAATAAGATTGCATTCCATTTGTGAGTCCAAGATCAAAATTCGCT Found at i:45995 original size:69 final size:69 Alignment explanation

Indices: 45922--46076 Score: 265 Period size: 69 Copynumber: 2.2 Consensus size: 69 45912 AGCAACATAA * * 45922 GCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAGCTCAAGCCTTGGTTCCATCCAAGCAGC 1 GCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAGCTCAAGCCTTGGTTCCACCCAAGCAAC 45987 AGGG 66 AGGG * * 45991 GCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAGTTCAAGCTTTGGTTCCACCCAAGCAAC 1 GCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAGCTCAAGCCTTGGTTCCACCCAAGCAAC 46056 AGGG 66 AGGG * 46060 GCTTTTCCATAAGCCAA 1 GCTTTTCCACAAGCCAA 46077 GTTCATTTAC Statistics Matches: 81, Mismatches: 5, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 69 81 1.00 ACGTcount: A:0.26, C:0.31, G:0.19, T:0.25 Consensus pattern (69 bp): GCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAGCTCAAGCCTTGGTTCCACCCAAGCAAC AGGG Found at i:46237 original size:67 final size:68 Alignment explanation

Indices: 46118--46263 Score: 179 Period size: 67 Copynumber: 2.1 Consensus size: 68 46108 ACTAAACTCG * * * * 46118 TTTCCATACGAGTCAGTTCAAGCCTTGGTTCCATCCAAGCAGCATGTGCTTTTCCATAAGCCAAA 1 TTTCCATACGAGTCAGTTCAA-CCTTGGTTCCATCCAAACAGCAAGGGCTTTTCCACAAGCCAAA 46183 CTAT 65 CTAT * * 46187 TTTCCTTACGAGTCAG-TCTAA-CTTGGTTCCATCCAAACAGTAAGGGCTTTTCCACAAGCCAAA 1 TTTCCATACGAGTCAGTTC-AACCTTGGTTCCATCCAAACAGCAAGGGCTTTTCCACAAGCCAAA * ** 46250 TTCG 65 CTAT 46254 TTTCCATACG 1 TTTCCATACG 46264 GTGCATTACC Statistics Matches: 66, Mismatches: 10, Indels: 4 0.82 0.12 0.05 Matches are distributed among these distances: 67 47 0.71 68 2 0.03 69 17 0.26 ACGTcount: A:0.26, C:0.27, G:0.16, T:0.31 Consensus pattern (68 bp): TTTCCATACGAGTCAGTTCAACCTTGGTTCCATCCAAACAGCAAGGGCTTTTCCACAAGCCAAAC TAT Found at i:49587 original size:36 final size:36 Alignment explanation

Indices: 49547--49634 Score: 158 Period size: 36 Copynumber: 2.4 Consensus size: 36 49537 TAGTATCCAC 49547 ATTAGTAATTAGGCATCATCATTCACATTAGTAATT 1 ATTAGTAATTAGGCATCATCATTCACATTAGTAATT ** 49583 ATTAGTAATTAGGCATCATTGTTCACATTAGTAATT 1 ATTAGTAATTAGGCATCATCATTCACATTAGTAATT 49619 ATTAGTAATTAGGCAT 1 ATTAGTAATTAGGCAT 49635 TGCATTCACA Statistics Matches: 50, Mismatches: 2, Indels: 0 0.96 0.04 0.00 Matches are distributed among these distances: 36 50 1.00 ACGTcount: A:0.35, C:0.11, G:0.14, T:0.40 Consensus pattern (36 bp): ATTAGTAATTAGGCATCATCATTCACATTAGTAATT Found at i:54794 original size:44 final size:43 Alignment explanation

Indices: 54718--54803 Score: 102 Period size: 44 Copynumber: 2.0 Consensus size: 43 54708 ATCTCCGTTG * * * 54718 ATATGTGTTATACATCCTTCATGCATGGTCCATGT-CTTTGTAT 1 ATATATGTTATACATCCATCATGCATGATCCAT-TCCTTTGTAT * * 54761 ATATATGTTCATACATTCATCATGCATTATCCATTCCTTTGTA 1 ATATATGTT-ATACATCCATCATGCATGATCCATTCCTTTGTA 54804 CATAGGTTCA Statistics Matches: 36, Mismatches: 5, Indels: 3 0.82 0.11 0.07 Matches are distributed among these distances: 43 9 0.25 44 27 0.75 ACGTcount: A:0.24, C:0.20, G:0.12, T:0.44 Consensus pattern (43 bp): ATATATGTTATACATCCATCATGCATGATCCATTCCTTTGTAT Found at i:56010 original size:13 final size:13 Alignment explanation

Indices: 55981--56018 Score: 67 Period size: 14 Copynumber: 2.8 Consensus size: 13 55971 TATTTTCAAG 55981 AAAAAGGTTTTCA 1 AAAAAGGTTTTCA 55994 AAAATAGGTTTTCA 1 AAAA-AGGTTTTCA 56008 AAAAAGGTTTT 1 AAAAAGGTTTT 56019 GAGTCTTTTA Statistics Matches: 24, Mismatches: 0, Indels: 2 0.92 0.00 0.08 Matches are distributed among these distances: 13 11 0.46 14 13 0.54 ACGTcount: A:0.45, C:0.05, G:0.16, T:0.34 Consensus pattern (13 bp): AAAAAGGTTTTCA Found at i:56307 original size:12 final size:12 Alignment explanation

Indices: 56290--56328 Score: 60 Period size: 12 Copynumber: 3.2 Consensus size: 12 56280 TCACTTCATC 56290 AAATTCAAATCA 1 AAATTCAAATCA * 56302 AAATTCAAATTA 1 AAATTCAAATCA * 56314 AAGTTCAAATCA 1 AAATTCAAATCA 56326 AAA 1 AAA 56329 GTGAATCAAA Statistics Matches: 23, Mismatches: 4, Indels: 0 0.85 0.15 0.00 Matches are distributed among these distances: 12 23 1.00 ACGTcount: A:0.59, C:0.13, G:0.03, T:0.26 Consensus pattern (12 bp): AAATTCAAATCA Found at i:56703 original size:12 final size:12 Alignment explanation

Indices: 56688--56732 Score: 58 Period size: 12 Copynumber: 3.8 Consensus size: 12 56678 AAAAACTAAA 56688 AAAAAAGAAATG 1 AAAAAAGAAATG * 56700 AAAAATGAAATG 1 AAAAAAGAAATG 56712 AAAAAATG--ATG 1 AAAAAA-GAAATG 56723 AAAAAAGAAA 1 AAAAAAGAAA 56733 AATAAGAACA Statistics Matches: 28, Mismatches: 2, Indels: 6 0.78 0.06 0.17 Matches are distributed among these distances: 10 1 0.04 11 9 0.32 12 17 0.61 13 1 0.04 ACGTcount: A:0.73, C:0.00, G:0.16, T:0.11 Consensus pattern (12 bp): AAAAAAGAAATG Found at i:57917 original size:50 final size:50 Alignment explanation

Indices: 57643--57915 Score: 465 Period size: 50 Copynumber: 5.5 Consensus size: 50 57633 CCATCCGAAT * * * 57643 ACATAGGCTTTTCCACAAGCCAAGCTCGTTTCCATACGGGTCAATTATCA 1 ACATGGGCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAATTATCA * * 57693 ACGTGGGCTTTTCCACAAGCCAAACTCGTTTCCATACGGGTCAATTATCA 1 ACATGGGCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAATTATCA * 57743 ACATGGGCTTTCCCACAAGCCAAACTCGTTTCCATACGAGTCAATTATCA 1 ACATGGGCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAATTATCA 57793 ACATGGGCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAATTATCA 1 ACATGGGCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAATTATCA * * 57843 ACATGGGCTTTTCCACAAGCCAAGCTCGTTTCCATATGAGTCAATTATCA 1 ACATGGGCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAATTATCA * 57893 ACATAGGCTTTTCCACAAGCCAA 1 ACATGGGCTTTTCCACAAGCCAA 57916 GCC Statistics Matches: 213, Mismatches: 10, Indels: 0 0.96 0.04 0.00 Matches are distributed among these distances: 50 213 1.00 ACGTcount: A:0.29, C:0.28, G:0.15, T:0.27 Consensus pattern (50 bp): ACATGGGCTTTTCCACAAGCCAAACTCGTTTCCATACGAGTCAATTATCA Done.