Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01019947.1 Corchorus olitorius cultivar O-4 contig19980, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 24044
ACGTcount: A:0.35, C:0.18, G:0.17, T:0.30


Found at i:604 original size:3 final size:3

Alignment explanation

Indices: 589--665  Score: 57
Period size: 3  Copynumber: 25.7  Consensus size: 3

       579 CTCCTCATGG

                   *                 *   *      **            *
       589 TCA TCA CCA TCA TCA TCA TCG TCC TCA TTG TCA TC- TCCG TCA TCA
         1 TCA TCA TCA TCA TCA TCA TCA TCA TCA TCA TCA TCA T-CA TCA TCA

                         *   *           *
       634 TCA TCA TCA TCC TCG TCA TCA TCC TCA TCA TC
         1 TCA TCA TCA TCA TCA TCA TCA TCA TCA TCA TC

       666 GCCATCAACT


Statistics
Matches: 57,  Mismatches: 15, Indels: 4
        0.75            0.20        0.05

Matches are distributed among these distances:
    2        1  0.02
    3       55  0.96
    4        1  0.02

ACGTcount: A:0.22, C:0.39, G:0.05, T:0.34

Consensus pattern (3 bp):
TCA


Found at i:616 original size:30 final size:29

Alignment explanation

Indices: 580--665  Score: 91
Period size: 30  Copynumber: 2.9  Consensus size: 29

       570 GAGCTAAATC

                  *                     *
       580 TCCTCATGGTCATCACCATCATCATCATCG
         1 TCCTCATCGTCATCACC-TCATCATCATCA

                  *      *
       610 TCCTCATTGTCATCTCCGTCATCATCATCA
         1 TCCTCATCGTCATCACC-TCATCATCATCA

             *  *
       640 TCATCCTCGTCATCATCCTCATCATC
         1 TCCTCATCGTCATCA-CCTCATCATC

       666 GCCATCAACT


Statistics
Matches: 47,  Mismatches: 8, Indels: 2
        0.82            0.14        0.04

Matches are distributed among these distances:
   30       45  0.96
   31        2  0.04

ACGTcount: A:0.21, C:0.38, G:0.07, T:0.34

Consensus pattern (29 bp):
TCCTCATCGTCATCACCTCATCATCATCA


Found at i:622 original size:21 final size:20

Alignment explanation

Indices: 588--665  Score: 79
Period size: 21  Copynumber: 3.9  Consensus size: 20

       578 TCTCCTCATG

                  *
       588 GTCATCACCATCATCATCATC
         1 GTCATCATCATCATCATC-TC

              *    **
       609 GTCCTCATTGTCATC-TC-C
         1 GTCATCATCATCATCATCTC

       627 GTCATCATCATCATCATCCTC
         1 GTCATCATCATCATCAT-CTC

                    *
       648 GTCATCATCCTCATCATC
         1 GTCATCATCATCATCATC

       666 GCCATCAACT


Statistics
Matches: 46,  Mismatches: 8, Indels: 7
        0.75            0.13        0.11

Matches are distributed among these distances:
   18       13  0.28
   19        1  0.02
   20        4  0.09
   21       28  0.61

ACGTcount: A:0.22, C:0.38, G:0.06, T:0.33

Consensus pattern (20 bp):
GTCATCATCATCATCATCTC


Found at i:8818 original size:11 final size:11

Alignment explanation

Indices: 8802--8826  Score: 50
Period size: 11  Copynumber: 2.3  Consensus size: 11

      8792 ACCATCCCAC

      8802 AGAGGCTGTTT
         1 AGAGGCTGTTT

      8813 AGAGGCTGTTT
         1 AGAGGCTGTTT

      8824 AGA
         1 AGA

      8827 AGCTTGAACC


Statistics
Matches: 14,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   11       14  1.00

ACGTcount: A:0.24, C:0.08, G:0.36, T:0.32

Consensus pattern (11 bp):
AGAGGCTGTTT


Found at i:10033 original size:12 final size:12

Alignment explanation

Indices: 10012--10068  Score: 69
Period size: 12  Copynumber: 4.5  Consensus size: 12

     10002 AAAATGGGAG

     10012 TTTAAAACGGAA
         1 TTTAAAACGGAA

              *
     10024 TTTCAAACGGAAA
         1 TTTAAAACGG-AA

     10037 GTTTAAAAACGGAA
         1 -TTT-AAAACGGAA

     10051 TTTAAAACGGAA
         1 TTTAAAACGGAA

            *
     10063 TATAAA
         1 TTTAAA

     10069 TATCCATCTC


Statistics
Matches: 39,  Mismatches: 3, Indels: 6
        0.81            0.06        0.12

Matches are distributed among these distances:
   12       23  0.59
   13        5  0.13
   14        5  0.13
   15        6  0.15

ACGTcount: A:0.51, C:0.09, G:0.16, T:0.25

Consensus pattern (12 bp):
TTTAAAACGGAA


Found at i:10050 original size:27 final size:26

Alignment explanation

Indices: 10010--10062  Score: 88
Period size: 27  Copynumber: 2.0  Consensus size: 26

     10000 GGAAAATGGG

                            *
     10010 AGTTTAAAACGGAATTTCAAACGGAA
         1 AGTTTAAAACGGAATTTAAAACGGAA

     10036 AGTTTAAAAACGGAATTTAAAACGGAA
         1 AGTTT-AAAACGGAATTTAAAACGGAA

     10063 TATAAATATC


Statistics
Matches: 25,  Mismatches: 1, Indels: 1
        0.93            0.04        0.04

Matches are distributed among these distances:
   26        5  0.20
   27       20  0.80

ACGTcount: A:0.49, C:0.09, G:0.19, T:0.23

Consensus pattern (26 bp):
AGTTTAAAACGGAATTTAAAACGGAA


Found at i:19615 original size:6 final size:6

Alignment explanation

Indices: 19604--19635  Score: 64
Period size: 6  Copynumber: 5.3  Consensus size: 6

     19594 GTTACGTGCT

     19604 GAAAGA GAAAGA GAAAGA GAAAGA GAAAGA GA
         1 GAAAGA GAAAGA GAAAGA GAAAGA GAAAGA GA

     19636 TGATTTTTTT


Statistics
Matches: 26,  Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
    6       26  1.00

ACGTcount: A:0.66, C:0.00, G:0.34, T:0.00

Consensus pattern (6 bp):
GAAAGA


Done.