Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01022162.1 Corchorus olitorius cultivar O-4 contig22195, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 12114
ACGTcount: A:0.30, C:0.19, G:0.19, T:0.31


Found at i:605 original size:21 final size:21

Alignment explanation

Indices: 581--714  Score: 200
Period size: 21  Copynumber: 6.4  Consensus size: 21

       571 CTTAGGCAAT

                        *
       581 TCCAATGAGCTTGAAACCTTC
         1 TCCAATGAGCTTGGAACCTTC

                   *
       602 TCCAATGATCTTGGAACCTTC
         1 TCCAATGAGCTTGGAACCTTC

                   *
       623 TCCAATGAACTTGGAACCTTC
         1 TCCAATGAGCTTGGAACCTTC

                   *
       644 TCCAATGAACTTGGAACCTTC
         1 TCCAATGAGCTTGGAACCTTC

       665 TCCAATGAGCTTGGAA-CTTGC
         1 TCCAATGAGCTTGGAACCTT-C

       686 TCCAATGAGCTTGGAA-CTTGC
         1 TCCAATGAGCTTGGAACCTT-C

       707 TCCAATGA
         1 TCCAATGA

       715 ACTTCTAGCA

Statistics
Matches: 108,  Mismatches: 4, Indels: 2

        0.95            0.04        0.02

Matches are distributed among these distances:

  20    3  0.03
  21  105  0.97

ACGTcount: A:0.27, C:0.27, G:0.17, T:0.29

Consensus pattern (21 bp):
TCCAATGAGCTTGGAACCTTC


Found at i:4371 original size:38 final size:38

Alignment explanation

Indices: 4320--4695  Score: 268
Period size: 38  Copynumber: 9.8  Consensus size: 38

      4310 GATTCTAATG

                  *
      4320 AGAGACCGAAGCAGGTTTGATTAAACGAAACTCTAAGC
         1 AGAGACCTAAGCAGGTTTGATTAAACGAAACTCTAAGC

           *     * *                  *    *     *
      4358 CGAGACTTGAGCAGGTTT-ACTTAAATGGAAATTCTAAAC
         1 AGAGACCTAAGCAGGTTTGA-TTAAA-CGAAACTCTAAGC

                             *           *       *
      4397 A-AGAACCTAAGCAGGTTCGATTAAACGAAGCTCTAAGA
         1 AGAG-ACCTAAGCAGGTTTGATTAAACGAAACTCTAAGC

                             *             *     **
      4435 AGAGACCTAAGCAGG-TTCATTTAAACGGAAATTCTAAAT
         1 AGAGACCTAAGCAGGTTTGA-TTAAAC-GAAACTCTAAGC

           * *                  *    *
      4474 GGGGACCTAAGCAGGTTTGATCAAACAAAACTCTAAGC
         1 AGAGACCTAAGCAGGTTTGATTAAACGAAACTCTAAGC

                          *           *    *
      4512 AGAGACCTAAGCAGGCTT-ACTTAAATGGAAATTCTGAA-C
         1 AGAGACCTAAGCAGGTTTGA-TTAAA-CGAAACTCT-AAGC

                     *                   *
      4551 A-AGGACCTAGGCAGGTTTGATTAAACGAAGCTCTAAGC
         1 AGA-GACCTAAGCAGGTTTGATTAAACGAAACTCTAAGC

                                      *    *
      4589 AGAGACCTAAGCAGGTTT-ACTTAAATGGAAATTCTGAA-C
         1 AGAGACCTAAGCAGGTTTGA-TTAAA-CGAAACTCT-AAGC

                         *        *      *       *
      4628 A-AGGACCTAAGCAAGTTTGATTGAACGAAGCTCTAAGT
         1 AGA-GACCTAAGCAGGTTTGATTAAACGAAACTCTAAGC

                   *   * *
      4666 AGAGACCTGAGCCGCTTT-ACTTAAACGAAA
         1 AGAGACCTAAGCAGGTTTGA-TTAAACGAAA

      4696 ATTCTAAATG

Statistics
Matches: 257,  Mismatches: 58, Indels: 46

        0.71            0.16        0.13

Matches are distributed among these distances:

  37   10  0.04
  38  129  0.50
  39  108  0.42
  40   10  0.04

ACGTcount: A:0.38, C:0.18, G:0.22, T:0.22

Consensus pattern (38 bp):
AGAGACCTAAGCAGGTTTGATTAAACGAAACTCTAAGC


Found at i:4409 original size:77 final size:77

Alignment explanation

Indices: 4323--4703  Score: 501
Period size: 77  Copynumber: 4.9  Consensus size: 77

      4313 TCTAATGAGA

               *                     *        *     * *
      4323 GACCGAAGCAGGTTTGATTAAACGAAACTCTAAGCCGAGACTTGAGCAGGTTTACTTAAATGGAA
         1 GACCTAAGCAGGTTTGATTAAACGAAGCTCTAAGCAGAGACCTAAGCAGGTTTACTTAAATGGAA

      4388 ATTCTAAACAAG
        66 ATTCTAAACAAG

           *             *                   *                 * *     *
      4400 AACCTAAGCAGGTTCGATTAAACGAAGCTCTAAGAAGAGACCTAAGCAGGTTCATTTAAACGGAA
         1 GACCTAAGCAGGTTTGATTAAACGAAGCTCTAAGCAGAGACCTAAGCAGGTTTACTTAAATGGAA

                   ***
      4465 ATTCTAAATGGG
        66 ATTCTAAACAAG

                             *    *  *                       *
      4477 GACCTAAGCAGGTTTGATCAAACAAAACTCTAAGCAGAGACCTAAGCAGGCTTACTTAAATGGAA
         1 GACCTAAGCAGGTTTGATTAAACGAAGCTCTAAGCAGAGACCTAAGCAGGTTTACTTAAATGGAA

                *
      4542 ATTCTGAACAAG
        66 ATTCTAAACAAG

                 *
      4554 GACCTAGGCAGGTTTGATTAAACGAAGCTCTAAGCAGAGACCTAAGCAGGTTTACTTAAATGGAA
         1 GACCTAAGCAGGTTTGATTAAACGAAGCTCTAAGCAGAGACCTAAGCAGGTTTACTTAAATGGAA

                *
      4619 ATTCTGAACAAG
        66 ATTCTAAACAAG

                     *        *              *        *   * *          * *
      4631 GACCTAAGCAAGTTTGATTGAACGAAGCTCTAAGTAGAGACCTGAGCCGCTTTACTTAAACGAAA
         1 GACCTAAGCAGGTTTGATTAAACGAAGCTCTAAGCAGAGACCTAAGCAGGTTTACTTAAATGGAA

      4696 ATTCTAAA
        66 ATTCTAAA

      4704 TGGAGACCTA

Statistics
Matches: 261,  Mismatches: 43, Indels: 0

        0.86            0.14        0.00

Matches are distributed among these distances:

  77  261  1.00

ACGTcount: A:0.38, C:0.18, G:0.21, T:0.23

Consensus pattern (77 bp):
GACCTAAGCAGGTTTGATTAAACGAAGCTCTAAGCAGAGACCTAAGCAGGTTTACTTAAATGGAA
ATTCTAAACAAG


Found at i:4410 original size:39 final size:37

Alignment explanation

Indices: 4367--4645  Score: 177
Period size: 39  Copynumber: 7.2  Consensus size: 37

      4357 CCGAGACTTG

      4367 AGCAGGTTTACTTAAATGGAAATTCTAAACAAGAACCTA
         1 AGCAGGTTTA-TTAAA-GGAAATTCTAAACAAGAACCTA

                    *      *   **      *
      4406 AGCAGGTTCGATTAAACGAAGCTCT-AAGAAGAGACCTA
         1 AGCAGGTT-TATTAAAGGAAATTCTAAACAAGA-ACCTA

                   *                    *** *
      4444 AGCAGGTTCATTTAAACGGAAATTCTAAATGGGGACCTA
         1 AGCAGGTTTA-TTAAA-GGAAATTCTAAACAAGAACCTA

                       *   **   *     *
      4483 AGCAGGTTTGATCAAACAAAACTCTAAGC-AGAGACCTA
         1 AGCAGGTTT-ATTAAAGGAAATTCTAAACAAGA-ACCTA

                 *                   *      *
      4521 AGCAGGCTTACTTAAATGGAAATTCTGAACAAGGACCTA
         1 AGCAGGTTTA-TTAAA-GGAAATTCTAAACAAGAACCTA

           *               *   **     *
      4560 GGCAGGTTTGATTAAACGAAGCTCTAAGC-AGAGACCTA
         1 AGCAGGTTT-ATTAAAGGAAATTCTAAACAAGA-ACCTA

                                     *      *
      4598 AGCAGGTTTACTTAAATGGAAATTCTGAACAAGGACCTA
         1 AGCAGGTTTA-TTAAA-GGAAATTCTAAACAAGAACCTA

               *
      4637 AGCAAGTTT
         1 AGCAGGTTT

      4646 GATTGAACGA

Statistics
Matches: 179,  Mismatches: 46, Indels: 30

        0.70            0.18        0.12

Matches are distributed among these distances:

  37   12  0.07
  38   75  0.42
  39   82  0.46
  40   10  0.06

ACGTcount: A:0.39, C:0.17, G:0.21, T:0.23

Consensus pattern (37 bp):
AGCAGGTTTATTAAAGGAAATTCTAAACAAGAACCTA


Done.