Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01015155.1 Corchorus olitorius cultivar O-4 contig15188, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 49421
ACGTcount: A:0.33, C:0.17, G:0.18, T:0.32


Found at i:3580 original size:10 final size:10

Alignment explanation

Indices: 3565--3597 Score: 57
Period size: 10 Copynumber: 3.3 Consensus size: 10

  3555 TTAGTTCCTT

              *
  3565 TATATATATA
     1 TATATATGTA

  3575 TATATATGTA
     1 TATATATGTA

  3585 TATATATGTA
     1 TATATATGTA

  3595 TAT
     1 TAT

  3598 TGCGCATTAA


Statistics
Matches: 22, Mismatches: 1, Indels: 0
        0.96           0.04       0.00

Matches are distributed among these distances:
  10    22  1.00

ACGTcount: A:0.42, C:0.00, G:0.06, T:0.52

Consensus pattern (10 bp):
TATATATGTA


Found at i:4902 original size:17 final size:18

Alignment explanation

Indices: 4880--4916 Score: 67
Period size: 17 Copynumber: 2.1 Consensus size: 18

  4870 GCAGTGTCTA

  4880 AATTTTTGGATG-GACTG
     1 AATTTTTGGATGAGACTG

  4897 AATTTTTGGATGAGACTG
     1 AATTTTTGGATGAGACTG

  4915 AA
     1 AA

  4917 GAAACTGAAA


Statistics
Matches: 19, Mismatches: 0, Indels: 1
        0.95           0.00       0.05

Matches are distributed among these distances:
  17    12  0.63
  18     7  0.37

ACGTcount: A:0.30, C:0.05, G:0.27, T:0.38

Consensus pattern (18 bp):
AATTTTTGGATGAGACTG


Found at i:7617 original size:2 final size:2

Alignment explanation

Indices: 7568--7601 Score: 59
Period size: 2 Copynumber: 17.0 Consensus size: 2

  7558 TATTTTCAAG

                *
  7568 TA TA TA CA TA TA TA TA TA TA TA TA TA TA TA TA TA
     1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA

  7602 CACACACACA


Statistics
Matches: 30, Mismatches: 2, Indels: 0
        0.94           0.06       0.00

Matches are distributed among these distances:
   2    30  1.00

ACGTcount: A:0.50, C:0.03, G:0.00, T:0.47

Consensus pattern (2 bp):
TA


Found at i:8248 original size:24 final size:24

Alignment explanation

Indices: 8216--8281 Score: 69
Period size: 24 Copynumber: 2.8 Consensus size: 24

  8206 TGATGATAAG

                 *     *
  8216 GAAACCCTTGTAGAAATCCAAAAT
     1 GAAACCCTTGAAGAAACCCAAAAT

           *                *
  8240 GAAAGCCTTGAAGAAACCCAAGAT
     1 GAAACCCTTGAAGAAACCCAAAAT

         *    *   *
  8264 GAGACCCATGATGAAACC
     1 GAAACCCTTGAAGAAACC

  8282 GAAGCAACCC


Statistics
Matches: 34, Mismatches: 8, Indels: 0
        0.81           0.19       0.00

Matches are distributed among these distances:
  24    34  1.00

ACGTcount: A:0.44, C:0.23, G:0.18, T:0.15

Consensus pattern (24 bp):
GAAACCCTTGAAGAAACCCAAAAT


Found at i:10710 original size:2 final size:2

Alignment explanation

Indices: 10699--10734 Score: 58
Period size: 2 Copynumber: 19.0 Consensus size: 2

 10689 CCTATATTAG

 10699 TA TA TA -A TA TA TA TA TA TA TA TA TA TA TA T- TA TA TA
     1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA

 10735 ATACACATCC


Statistics
Matches: 32, Mismatches: 0, Indels: 4
        0.89           0.00       0.11

Matches are distributed among these distances:
   1     2  0.06
   2    30  0.94

ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50

Consensus pattern (2 bp):
TA


Found at i:10725 original size:23 final size:23

Alignment explanation

Indices: 10691--10734 Score: 63
Period size: 23 Copynumber: 1.9 Consensus size: 23

 10681 CGGCCCGACC

 10691 TATATTAGTATATAATATATATA
     1 TATATTAGTATATAATATATATA

                      *
 10714 TATATATA-TATATATTATATA
     1 TATAT-TAGTATATAATATATA

 10735 ATACACATCC


Statistics
Matches: 19, Mismatches: 1, Indels: 2
        0.86           0.05       0.09

Matches are distributed among these distances:
  23    17  0.89
  24     2  0.11

ACGTcount: A:0.48, C:0.00, G:0.02, T:0.50

Consensus pattern (23 bp):
TATATTAGTATATAATATATATA


Found at i:11438 original size:12 final size:12

Alignment explanation

Indices: 11420--11453 Score: 50
Period size: 12 Copynumber: 2.8 Consensus size: 12

 11410 AAGCGTTGAA

 11420 GAAACCCAAGAT
     1 GAAACCCAAGAT

         *     *
 11432 GAGACCCATGAT
     1 GAAACCCAAGAT

 11444 GAAACCCAAG
     1 GAAACCCAAG

 11454 CAACCCACGA


Statistics
Matches: 18, Mismatches: 4, Indels: 0
        0.82           0.18       0.00

Matches are distributed among these distances:
  12    18  1.00

ACGTcount: A:0.44, C:0.26, G:0.21, T:0.09

Consensus pattern (12 bp):
GAAACCCAAGAT


Found at i:11468 original size:21 final size:23

Alignment explanation

Indices: 11417--11471 Score: 60
Period size: 21 Copynumber: 2.4 Consensus size: 23

 11407 TGAAAGCGTT

                      *       *
 11417 GAAGAAACCCAAGATGAGACCCAT
     1 GAAGAAACCCAAGA-CAGACCCAC

         *
 11441 GATGAAACCCAAG-CA-ACCCAC
     1 GAAGAAACCCAAGACAGACCCAC

 11462 GAAGAAACCC
     1 GAAGAAACCC

 11472 CTACAGGTCT


Statistics
Matches: 27, Mismatches: 4, Indels: 3
        0.79           0.12       0.09

Matches are distributed among these distances:
  21    14  0.52
  22     1  0.04
  24    12  0.44

ACGTcount: A:0.45, C:0.31, G:0.18, T:0.05

Consensus pattern (23 bp):
GAAGAAACCCAAGACAGACCCAC


Found at i:13120 original size:12 final size:12

Alignment explanation

Indices: 13103--13164 Score: 72
Period size: 12 Copynumber: 5.2 Consensus size: 12

 13093 TTTTGGTCTA

             *
 13103 CCACGATGAAAC
     1 CCACGAAGAAAC

 13115 CCACGATA-AAAC
     1 CCACGA-AGAAAC

          *
 13127 CCAAGAAGAAAC
     1 CCACGAAGAAAC

        *
 13139 CTACGAAGAAAC
     1 CCACGAAGAAAC

          *
 13151 CCATGAAGAAAC
     1 CCACGAAGAAAC

 13163 CC
     1 CC

 13165 TTATAGGTAT


Statistics
Matches: 42, Mismatches: 6, Indels: 4
        0.81           0.12       0.08

Matches are distributed among these distances:
  11     1  0.02
  12    41  0.98

ACGTcount: A:0.48, C:0.31, G:0.15, T:0.06

Consensus pattern (12 bp):
CCACGAAGAAAC


Found at i:15767 original size:31 final size:31

Alignment explanation

Indices: 15732--15791 Score: 102
Period size: 31 Copynumber: 1.9 Consensus size: 31

 15722 AGGTAATTTT

               *  *
 15732 TGTTTGGCTAATTGCTCAAATAAGGGCCTAA
     1 TGTTTGGCAAAATGCTCAAATAAGGGCCTAA

 15763 TGTTTGGCAAAATGCTCAAATAAGGGCCT
     1 TGTTTGGCAAAATGCTCAAATAAGGGCCT

 15792 GATCTTTTAA


Statistics
Matches: 27, Mismatches: 2, Indels: 0
        0.93           0.07       0.00

Matches are distributed among these distances:
  31    27  1.00

ACGTcount: A:0.30, C:0.17, G:0.23, T:0.30

Consensus pattern (31 bp):
TGTTTGGCAAAATGCTCAAATAAGGGCCTAA


Found at i:15820 original size:60 final size:59

Alignment explanation

Indices: 15748--15911 Score: 238
Period size: 60 Copynumber: 2.7 Consensus size: 59

 15738 GCTAATTGCT

                      *    *
 15748 CAAATAAGGGCCTAATGTTTGGCAAAATGCTCAAATAAGGGCCTGATCTTTTAATTTGGC
     1 CAAATAAGGGCCTAACGTTTCG-AAAATGCTCAAATAAGGGCCTGATCTTTTAATTTGGC

              *        *                          * *
 15808 CAAATAAAGGCCTAACATTATCGAAAATGCTCAAATAAGGGCCCGGTCTTTTAATTTGGC
     1 CAAATAAGGGCCTAACGTT-TCGAAAATGCTCAAATAAGGGCCTGATCTTTTAATTTGGC

                          *
 15868 CAAATAAGGGCCTAACGTAATCGAAAATGCTCAAATAAGGGCCT
     1 CAAATAAGGGCCTAACGT-TTCGAAAATGCTCAAATAAGGGCCT

 15912 AGCGTCAGTT


Statistics
Matches: 92, Mismatches: 10, Indels: 4
        0.87           0.09       0.04

Matches are distributed among these distances:
  60    90  0.98
  61     2  0.02

ACGTcount: A:0.35, C:0.19, G:0.20, T:0.26

Consensus pattern (59 bp):
CAAATAAGGGCCTAACGTTTCGAAAATGCTCAAATAAGGGCCTGATCTTTTAATTTGGC


Found at i:15988 original size:31 final size:31

Alignment explanation

Indices: 15953--16053 Score: 102
Period size: 31 Copynumber: 3.3 Consensus size: 31

 15943 CAACGTCAAA

 15953 CCCTTATTTGAGCATTTTCAATAACGTTAGG
     1 CCCTTATTTGAGCATTTTCAATAACGTTAGG

                      **            *
 15984 CCCTTATTTG-GCTAAATT-AA-AA-GATCAGG
     1 CCCTTATTTGAGC-ATTTTCAATAACG-TTAGG

                          *
 16013 CCCTTATTTGAGCATTTTCGATAACGTTAGG
     1 CCCTTATTTGAGCATTTTCAATAACGTTAGG

       **
 16044 AACTTATTTG
     1 CCCTTATTTG

 16054 GCCAAATTAA


Statistics
Matches: 55, Mismatches: 9, Indels: 12
        0.72           0.12       0.16

Matches are distributed among these distances:
  28     1  0.02
  29    19  0.35
  30     7  0.13
  31    27  0.49
  32     1  0.02

ACGTcount: A:0.28, C:0.18, G:0.17, T:0.38

Consensus pattern (31 bp):
CCCTTATTTGAGCATTTTCAATAACGTTAGG


Found at i:16103 original size:60 final size:61

Alignment explanation

Indices: 15952--16115 Score: 237
Period size: 60 Copynumber: 2.7 Consensus size: 61

 15942 TCAACGTCAA

                                                     *
 15952 ACCCTTATTTGAGCA-TTTT-CAATAACGTTAGGCCCTTATTTGGCTAAATTAAAAGATCAG
     1 ACCCTTATTTGAGCATTTTTGC-ATAACGTTAGGCCCTTATTTGGCCAAATTAAAAGATCAG

       *                  *             **
 16012 GCCCTTATTTGAGCATTTTCG-ATAACGTTAGGAACTTATTTGGCCAAATTAAAAGATCAG
     1 ACCCTTATTTGAGCATTTTTGCATAACGTTAGGCCCTTATTTGGCCAAATTAAAAGATCAG

                                  *
 16072 ACCCTTATTTGAGCATTTTTGCA-AACATTAGGCCCTTATTTGGC
     1 ACCCTTATTTGAGCATTTTTGCATAACGTTAGGCCCTTATTTGGC

 16116 GCCTTTTATT


Statistics
Matches: 91, Mismatches: 10, Indels: 6
        0.85           0.09       0.06

Matches are distributed among these distances:
  60    87  0.96
  61     4  0.04

ACGTcount: A:0.29, C:0.19, G:0.16, T:0.35

Consensus pattern (61 bp):
ACCCTTATTTGAGCATTTTTGCATAACGTTAGGCCCTTATTTGGCCAAATTAAAAGATCAG


Found at i:16526 original size:16 final size:15

Alignment explanation

Indices: 16488--16529 Score: 66
Period size: 15 Copynumber: 2.7 Consensus size: 15

 16478 ACAGAGATTG

              *
 16488 ACAGAAAGCAATTAA
     1 ACAGAAAACAATTAA

 16503 ACAGAAAACAATTAA
     1 ACAGAAAACAATTAA

 16518 ACTAGAAAACAA
     1 AC-AGAAAACAA

 16530 AGCAAAGTAA


Statistics
Matches: 25, Mismatches: 1, Indels: 1
        0.93           0.04       0.04

Matches are distributed among these distances:
  15    16  0.64
  16     9  0.36

ACGTcount: A:0.64, C:0.14, G:0.10, T:0.12

Consensus pattern (15 bp):
ACAGAAAACAATTAA


Found at i:22365 original size:24 final size:26

Alignment explanation

Indices: 22338--22392 Score: 87
Period size: 26 Copynumber: 2.2 Consensus size: 26

 22328 TTCAGAAAAA

                    *
 22338 ACGCAG-AA-AAACTTTTTTTTTATG
     1 ACGCAGAAACAAAATTTTTTTTTATG

 22362 ACGCAGAAACAAAATTTTTTTTTATG
     1 ACGCAGAAACAAAATTTTTTTTTATG

 22388 ACGCA
     1 ACGCA

 22393 ATTTTTTTTT


Statistics
Matches: 28, Mismatches: 1, Indels: 2
        0.90           0.03       0.06

Matches are distributed among these distances:
  24     6  0.21
  25     2  0.07
  26    20  0.71

ACGTcount: A:0.36, C:0.15, G:0.13, T:0.36

Consensus pattern (26 bp):
ACGCAGAAACAAAATTTTTTTTTATG


Found at i:22400 original size:19 final size:19

Alignment explanation

Indices: 22374--22412 Score: 62
Period size: 18 Copynumber: 2.1 Consensus size: 19

 22364 GCAGAAACAA

                  *
 22374 AATTTTTTTTTAT-GACGC
     1 AATTTTTTTTTTTCGACGC

 22392 AATTTTTTTTTTTCGACGC
     1 AATTTTTTTTTTTCGACGC

 22411 AA
     1 AA

 22413 AACACAAAAC


Statistics
Matches: 19, Mismatches: 1, Indels: 1
        0.90           0.05       0.05

Matches are distributed among these distances:
  18    12  0.63
  19     7  0.37

ACGTcount: A:0.23, C:0.13, G:0.10, T:0.54

Consensus pattern (19 bp):
AATTTTTTTTTTTCGACGC


Found at i:33271 original size:17 final size:17

Alignment explanation

Indices: 33246--33280 Score: 52
Period size: 17 Copynumber: 2.1 Consensus size: 17

 33236 CTAAATTTTC

 33246 AATTATTATATATTATA
     1 AATTATTATATATTATA

           *    *
 33263 AATTTTTATTTATTATA
     1 AATTATTATATATTATA

 33280 A
     1 A

 33281 TAAACTTTAA


Statistics
Matches: 16, Mismatches: 2, Indels: 0
        0.89           0.11       0.00

Matches are distributed among these distances:
  17    16  1.00

ACGTcount: A:0.43, C:0.00, G:0.00, T:0.57

Consensus pattern (17 bp):
AATTATTATATATTATA


Found at i:35106 original size:17 final size:16

Alignment explanation

Indices: 35086--35133 Score: 53
Period size: 16 Copynumber: 2.9 Consensus size: 16

 35076 CCTTTCCCTT

 35086 CCTTCTTATTTTCTTTA
     1 CCTTCTTA-TTTCTTTA

           *          *
 35103 CCTTGTTATTTCTTTT
     1 CCTTCTTATTTCTTTA

 35119 CCTT-TCTATTTCTTT
     1 CCTTCT-TATTTCTTT

 35134 CCTCCCTACC


Statistics
Matches: 28, Mismatches: 2, Indels: 3
        0.85           0.06       0.09

Matches are distributed among these distances:
  15     1  0.04
  16    20  0.71
  17     7  0.25

ACGTcount: A:0.08, C:0.23, G:0.02, T:0.67

Consensus pattern (16 bp):
CCTTCTTATTTCTTTA


Found at i:37269 original size:2 final size:2

Alignment explanation

Indices: 37264--37291 Score: 56
Period size: 2 Copynumber: 14.0 Consensus size: 2

 37254 TAACTCATTA

 37264 AT AT AT AT AT AT AT AT AT AT AT AT AT AT
     1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT

 37292 TATGTGAACC


Statistics
Matches: 26, Mismatches: 0, Indels: 0
        1.00           0.00       0.00

Matches are distributed among these distances:
   2    26  1.00

ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50

Consensus pattern (2 bp):
AT


Found at i:41951 original size:22 final size:21

Alignment explanation

Indices: 41925--41972 Score: 69
Period size: 21 Copynumber: 2.3 Consensus size: 21

 41915 AGGTTACTAA

 41925 TAAAAAGGCTTATAAGCTTAC
     1 TAAAAAGGCTTATAAGCTTAC

             *      *     *
 41946 TAAAAATGCTTATGAGCTTCC
     1 TAAAAAGGCTTATAAGCTTAC

 41967 TAAAAA
     1 TAAAAA

 41973 AGTTTTTACA


Statistics
Matches: 24, Mismatches: 3, Indels: 0
        0.89           0.11       0.00

Matches are distributed among these distances:
  21    24  1.00

ACGTcount: A:0.44, C:0.15, G:0.12, T:0.29

Consensus pattern (21 bp):
TAAAAAGGCTTATAAGCTTAC


Found at i:42866 original size:29 final size:29

Alignment explanation

Indices: 42830--42928 Score: 103
Period size: 29 Copynumber: 3.3 Consensus size: 29

 42820 CCAAAATGCT

 42830 CAAATAAGGGCCCGATCTTTTAATTTGGC
     1 CAAATAAGGGCCCGATCTTTTAATTTGGC

                    * *          **
 42859 CAAATAAGGG-CCTAACGTTATTGAAAAT-GC
     1 CAAATAAGGGCCCGATC-TT-TT-AATTTGGC

                    *
 42889 TCAAATAAGGGCCTGATCTTTTAATTTGGC
     1 -CAAATAAGGGCCCGATCTTTTAATTTGGC

 42919 CAAATAAGGG
     1 CAAATAAGGG

 42929 TCTAACGTTT


Statistics
Matches: 55, Mismatches: 9, Indels: 12
        0.72           0.12       0.16

Matches are distributed among these distances:
  28     4  0.07
  29    25  0.45
  30     8  0.15
  31    15  0.27
  32     3  0.05

ACGTcount: A:0.33, C:0.17, G:0.21, T:0.28

Consensus pattern (29 bp):
CAAATAAGGGCCCGATCTTTTAATTTGGC


Found at i:42886 original size:60 final size:59

Alignment explanation

Indices: 42801--42959 Score: 239
Period size: 60 Copynumber: 2.6 Consensus size: 59

 42791 AAAAAGTATT

               *    *
 42801 AATAAGGGACTAATGTTTGCCAAAATGCTCAAATAAGGGCCCGATCTTTTAATTTGGCCA
     1 AATAAGGGCCTAACGTTTG-CAAAATGCTCAAATAAGGGCCCGATCTTTTAATTTGGCCA

                                                 *
 42861 AATAAGGGCCTAACGTTATTG-AAAATGCTCAAATAAGGGCCTGATCTTTTAATTTGGCCA
     1 AATAAGGGCCTAACG-T-TTGCAAAATGCTCAAATAAGGGCCCGATCTTTTAATTTGGCCA

               *
 42921 AATAAGGGTCTAACGTTTGCCAAAATGCTCAAATAAGGG
     1 AATAAGGGCCTAACGTTTG-CAAAATGCTCAAATAAGGG

 42960 TTTGGCATCA


Statistics
Matches: 91, Mismatches: 4, Indels: 8
        0.88           0.04       0.08

Matches are distributed among these distances:
  58     3  0.03
  59     1  0.01
  60    83  0.91
  61     1  0.01
  62     3  0.03

ACGTcount: A:0.35, C:0.17, G:0.21, T:0.28

Consensus pattern (59 bp):
AATAAGGGCCTAACGTTTGCAAAATGCTCAAATAAGGGCCCGATCTTTTAATTTGGCCA


Found at i:42897 original size:31 final size:30

Alignment explanation

Indices: 42859--42959 Score: 93
Period size: 31 Copynumber: 3.3 Consensus size: 30

 42849 TTAATTTGGC

 42859 CAAATAAGGGCCTAACGTTATTGAAAATGCT
     1 CAAATAAGGGCCTAACGTT-TTGAAAATGCT

                      *         **
 42890 CAAATAAGGGCCTGATC-TTTT-AATTTGGC-
     1 CAAATAAGGGCCT-AACGTTTTGAAAAT-GCT

                 *
 42919 CAAATAAGGGTCTAACG-TTTGCCAAAATGCT
     1 CAAATAAGGGCCTAACGTTTTG--AAAATGCT

 42950 CAAATAAGGG
     1 CAAATAAGGG

 42960 TTTGGCATCA


Statistics
Matches: 56, Mismatches: 7, Indels: 14
        0.73           0.09       0.18

Matches are distributed among these distances:
  28     5  0.09
  29    15  0.27
  30     6  0.11
  31    28  0.50
  32     2  0.04

ACGTcount: A:0.36, C:0.17, G:0.21, T:0.27

Consensus pattern (30 bp):
CAAATAAGGGCCTAACGTTTTGAAAATGCT


Found at i:43074 original size:29 final size:28

Alignment explanation

Indices: 43036--43134 Score: 76
Period size: 29 Copynumber: 3.4 Consensus size: 28

 43026 AAACGTTAGA

 43036 CCCTTATTTGGCCAAATTAAAAGACCGGG
     1 CCCTTATTTGGCCAAATTAAAAGA-CGGG

                      **   **
 43065 CCCTTATTTGAG-CATTTTGGCAA-ACGTTAGG
     1 CCCTTATTTG-GCCAAATT-AAAAGACG---GG

 43096 CCCTTATTTGGCCAAATTAAAAGATCGGG
     1 CCCTTATTTGGCCAAATTAAAAGA-CGGG

         *
 43125 CCTTTATTTG
     1 CCCTTATTTG

 43135 AACATTTTGC


Statistics
Matches: 53, Mismatches: 9, Indels: 16
        0.68           0.12       0.21

Matches are distributed among these distances:
  28     2  0.04
  29    26  0.49
  30     6  0.11
  31    17  0.32
  32     2  0.04

ACGTcount: A:0.26, C:0.21, G:0.20, T:0.32

Consensus pattern (28 bp):
CCCTTATTTGGCCAAATTAAAAGACGGG


Found at i:43100 original size:60 final size:60

Alignment explanation

Indices: 43002--43165 Score: 265
Period size: 60 Copynumber: 2.7 Consensus size: 60

 42992 TTCGATGGCA

                                        *
 43002 GGCCCTTATTTGAGCATTTTTGGCAAACGTTAGACCCTTATTTGGCCAAATTAAAAGACCG
     1 GGCCCTTATTTGAGCA-TTTTGGCAAACGTTAGGCCCTTATTTGGCCAAATTAAAAGACCG

                                                                *
 43063 GGCCCTTATTTGAGCATTTTGGCAAACGTTAGGCCCTTATTTGGCCAAATTAAAAGATCG
     1 GGCCCTTATTTGAGCATTTTGGCAAACGTTAGGCCCTTATTTGGCCAAATTAAAAGACCG

           *        *       *    *
 43123 GGCCTTTATTTGAACATTTTGCCAAATGTTAGGCCCTTATTTG
     1 GGCCCTTATTTGAGCATTTTGGCAAACGTTAGGCCCTTATTTG

 43166 AGCAATTAGA


Statistics
Matches: 97, Mismatches: 6, Indels: 1
        0.93           0.06       0.01

Matches are distributed among these distances:
  60    81  0.84
  61    16  0.16

ACGTcount: A:0.26, C:0.20, G:0.20, T:0.34

Consensus pattern (60 bp):
GGCCCTTATTTGAGCATTTTGGCAAACGTTAGGCCCTTATTTGGCCAAATTAAAAGACCG


Found at i:44795 original size:2 final size:2

Alignment explanation

Indices: 44788--44820 Score: 59
Period size: 2 Copynumber: 17.0 Consensus size: 2

 44778 CTATTAACTA

 44788 AT AT AT AT AT AT AT AT AT AT AT AT AT A- AT AT AT
     1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT

 44821 TTACCCTTTG


Statistics
Matches: 30, Mismatches: 0, Indels: 2
        0.94           0.00       0.06

Matches are distributed among these distances:
   1     1  0.03
   2    29  0.97

ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48

Consensus pattern (2 bp):
AT


Done.