Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01012983.1 Corchorus olitorius cultivar O-4 contig13016, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 34898
ACGTcount: A:0.33, C:0.17, G:0.19, T:0.31


Found at i:7998 original size:38 final size:38

Alignment explanation

Indices: 7902--8049 Score: 217 Period size: 38 Copynumber: 3.9 Consensus size: 38 7892 GGCTGTGCAT * 7902 AGTGGACCCGTACCTCAGGGGGTTAAACTGATGGTAAAG 1 AGTGGACCCGTACCTCAGGGGGTTAAACTGTTGGT-AAG * * 7941 AGTGGACCCATACCACAGGGGGTTAAACTGTTGGTAAG 1 AGTGGACCCGTACCTCAGGGGGTTAAACTGTTGGTAAG * * 7979 AGTGGACCCGTGCCTCAGGGGGTTAAATTGTTGGTAAG 1 AGTGGACCCGTACCTCAGGGGGTTAAACTGTTGGTAAG * * 8017 AGTGGACCCGTGCCTTAGGGGGTT-AACTGTTGG 1 AGTGGACCCGTACCTCAGGGGGTTAAACTGTTGG 8050 CTAGACTCGA Statistics Matches: 100, Mismatches: 9, Indels: 2 0.90 0.08 0.02 Matches are distributed among these distances: 37 8 0.08 38 60 0.60 39 32 0.32 ACGTcount: A:0.24, C:0.18, G:0.35, T:0.24 Consensus pattern (38 bp): AGTGGACCCGTACCTCAGGGGGTTAAACTGTTGGTAAG Found at i:8090 original size:6 final size:6 Alignment explanation

Indices: 8079--8110 Score: 64 Period size: 6 Copynumber: 5.3 Consensus size: 6 8069 CGTTAACGGA 8079 TGATTG TGATTG TGATTG TGATTG TGATTG TG 1 TGATTG TGATTG TGATTG TGATTG TGATTG TG 8111 GTGCAGCCTG Statistics Matches: 26, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 6 26 1.00 ACGTcount: A:0.16, C:0.00, G:0.34, T:0.50 Consensus pattern (6 bp): TGATTG Found at i:10247 original size:16 final size:16 Alignment explanation

Indices: 10228--10267 Score: 53 Period size: 16 Copynumber: 2.5 Consensus size: 16 10218 AGAATAATAA * 10228 AAAAATTAAAAAAAAG 1 AAAAATGAAAAAAAAG * * 10244 AAAAAAGAAAAAAAGG 1 AAAAATGAAAAAAAAG 10260 AAAAATGA 1 AAAAATGA 10268 TGAAGAAAAA Statistics Matches: 20, Mismatches: 4, Indels: 0 0.83 0.17 0.00 Matches are distributed among these distances: 16 20 1.00 ACGTcount: A:0.80, C:0.00, G:0.12, T:0.07 Consensus pattern (16 bp): AAAAATGAAAAAAAAG Found at i:10254 original size:24 final size:24 Alignment explanation

Indices: 10209--10257 Score: 71 Period size: 24 Copynumber: 2.0 Consensus size: 24 10199 GAGTGCATTC * 10209 TTAAAAAAAAGAATAATAAAAAAA 1 TTAAAAAAAAGAATAAAAAAAAAA * 10233 TTAAAAAAAAGAAAAAAGAAAAAAA 1 TTAAAAAAAAGAATAAA-AAAAAAA 10258 GGAAAAATGA Statistics Matches: 22, Mismatches: 2, Indels: 1 0.88 0.08 0.04 Matches are distributed among these distances: 24 15 0.68 25 7 0.32 ACGTcount: A:0.82, C:0.00, G:0.06, T:0.12 Consensus pattern (24 bp): TTAAAAAAAAGAATAAAAAAAAAA Found at i:10263 original size:25 final size:24 Alignment explanation

Indices: 10211--10264 Score: 63 Period size: 25 Copynumber: 2.2 Consensus size: 24 10201 GTGCATTCTT * * ** 10211 AAAAAAAAGAATAATAAAAAAATT 1 AAAAAAAAGAAAAAGAAAAAAAGG 10235 AAAAAAAAGAAAAAAGAAAAAAAGG 1 AAAAAAAAG-AAAAAGAAAAAAAGG 10260 AAAAA 1 AAAAA 10265 TGATGAAGAA Statistics Matches: 25, Mismatches: 4, Indels: 1 0.83 0.13 0.03 Matches are distributed among these distances: 24 9 0.36 25 16 0.64 ACGTcount: A:0.83, C:0.00, G:0.09, T:0.07 Consensus pattern (24 bp): AAAAAAAAGAAAAAGAAAAAAAGG Found at i:11449 original size:88 final size:88 Alignment explanation

Indices: 11347--11831 Score: 684 Period size: 88 Copynumber: 5.6 Consensus size: 88 11337 AGTTGACTCA * * * 11347 GGGTGGTCTTTTCTTCAATTTATGTC-GAAATGATCGGGGGTGGTTTTTCTTCAATTCTTCAATG 1 GGGTGGTC-TTTCTTCAGTTTATTTCAG-AATGATCGGGGGTGGTCTTTCTTCAATTCTTCAATG * 11411 CTTCAATTTATTTCAGAATGATTGG 64 CTTCAATTTATTTCAGAATGATCGG * * * 11436 GGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCAGGGGTGGTCTTTCTTCAATTCTTCACTTCT 1 GGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAATTCTTCAATGCT * 11501 TCAATTTATTTCAGAATAATCGG 66 TCAATTTATTTCAGAATGATCGG * * * 11524 GGGTGGTCTTTCTTCAATTTTTTTTCAGAATGATCGGGGGTGGTCTTTATTCAATTCTTCAATGC 1 GGGTGGTCTTTCTTC-AGTTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAATTCTTCAATGC * 11589 TTCAAATTATTTCAGAATGATCGG 65 TTCAATTTATTTCAGAATGATCGG * 11613 GGGTGTTCTTTCTTCAGTTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAA----T--A--CT 1 GGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAATTCTTCAATGCT * * 11670 TTAATTTATTTCAGAATGATTGG 66 TCAATTTATTTCAGAATGATCGG 11693 GGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAATTCTTCAAT-CT 1 GGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAATTCTTCAATGCT 11757 TCAATTTATTTCAGAATGATCGG 66 TCAATTTATTTCAGAATGATCGG * ** * ** 11780 GGGTGGTCTTTCTACAGTTTATTTTGGGATGATCCAGGGTGGTCTTTCTTCA 1 GGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCA 11832 CTTTAGTTCC Statistics Matches: 355, Mismatches: 32, Indels: 20 0.87 0.08 0.05 Matches are distributed among these distances: 80 74 0.21 82 1 0.00 84 2 0.01 86 1 0.00 87 69 0.19 88 120 0.34 89 88 0.25 ACGTcount: A:0.20, C:0.15, G:0.22, T:0.43 Consensus pattern (88 bp): GGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAATTCTTCAATGCT TCAATTTATTTCAGAATGATCGG Found at i:11458 original size:36 final size:36 Alignment explanation

Indices: 11411--11747 Score: 145 Period size: 36 Copynumber: 8.2 Consensus size: 36 11401 TTCTTCAATG ** 11411 CTTCAATTTATTTCAGAATGATTGGGGGTGGTCTTT 1 CTTCAATTTATTTCAGAATGATCAGGGGTGGTCTTT * 11447 CTTCAGTTTATTTCAGAATGATCAGGGGTGGTCTTT 1 CTTCAATTTATTTCAGAATGATCAGGGGTGGTCTTT * 11483 CTTCAATTCTTCACTTCTTCA-ATTTATTTCAGAATAATCGGGGGTGGTCTTT 1 CTTCAA-T-TT-A--T-TTCAGA---A--T--G----ATCAGGGGTGGTCTTT * * 11535 CTTCAATTTTTTTTCAGAATGATCGGGGGTGGTCTTT 1 CTTCAA-TTTATTTCAGAATGATCAGGGGTGGTCTTT * * * * 11572 ATTCAATTCTTCAATGCTTCAAATTATTTCAGAATGATCGGGGGTGTTCTTT 1 CTTCAA-T-TT--AT--TTCAGA--A--T--G----ATCAGGGGTGGTCTTT * * 11624 CTTCAGTTTATTTCAGAATGATCGGGGGTGGTCTTT 1 CTTCAATTTATTTCAGAATGATCAGGGGTGGTCTTT ** 11660 CTTCAATACTTTAATTTATTTCAGAATGATTGGGGGTGGTCTTT 1 C-T---T-C---AATTTATTTCAGAATGATCAGGGGTGGTCTTT * * 11704 CTTCAGTTTATTTCAGAATGATCGGGGGTGGTCTTT 1 CTTCAATTTATTTCAGAATGATCAGGGGTGGTCTTT 11740 CTTCAATT 1 CTTCAATT 11748 CTTCAATCTT Statistics Matches: 241, Mismatches: 19, Indels: 82 0.70 0.06 0.24 Matches are distributed among these distances: 36 91 0.38 37 25 0.10 38 4 0.02 39 2 0.01 40 4 0.02 41 4 0.02 42 10 0.04 43 2 0.01 44 34 0.14 45 1 0.00 46 7 0.03 47 4 0.02 48 6 0.02 50 2 0.01 51 3 0.01 52 42 0.17 ACGTcount: A:0.20, C:0.15, G:0.22, T:0.44 Consensus pattern (36 bp): CTTCAATTTATTTCAGAATGATCAGGGGTGGTCTTT Found at i:11560 original size:37 final size:37 Alignment explanation

Indices: 11497--11747 Score: 193 Period size: 36 Copynumber: 6.3 Consensus size: 37 11487 AATTCTTCAC 11497 TTCTTCAA-TTTATTTCAGAATAATCGGGGGTGGTCT 1 TTCTTCAATTTTATTTCAGAATAATCGGGGGTGGTCT * * 11533 TTCTTCAATTTTTTTTCAGAATGATCGGGGGTGGTCTTTATT 1 TTCTTCAATTTTATTTCAGAATAATCGGGGGTGGTC-----T * * 11575 CAATTCTTCAATGCTTCAAATTATTTCAGAATGATCGGGGGTGTTCT 1 ---TTCTTCAAT--TT-----TATTTCAGAATAATCGGGGGTGGTCT * * 11622 TTCTTC-AGTTTATTTCAGAATGATCGGGGGTGGTCT 1 TTCTTCAATTTTATTTCAGAATAATCGGGGGTGGTCT * * 11658 TTCTTCAATACTTTAATTTATTTCAGAATGATTGGGGGTGGTCT 1 TTCTTC-A-A---T--TTTATTTCAGAATAATCGGGGGTGGTCT * * 11702 TTCTTC-AGTTTATTTCAGAATGATCGGGGGTGGTCT 1 TTCTTCAATTTTATTTCAGAATAATCGGGGGTGGTCT 11738 TTCTTCAATT 1 TTCTTCAATT 11748 CTTCAATCTT Statistics Matches: 179, Mismatches: 11, Indels: 49 0.75 0.05 0.21 Matches are distributed among these distances: 36 72 0.40 37 27 0.15 39 1 0.01 41 3 0.02 42 1 0.01 43 1 0.01 44 39 0.22 45 9 0.05 47 3 0.02 52 23 0.13 ACGTcount: A:0.20, C:0.14, G:0.22, T:0.44 Consensus pattern (37 bp): TTCTTCAATTTTATTTCAGAATAATCGGGGGTGGTCT Found at i:11643 original size:177 final size:168 Alignment explanation

Indices: 11347--11831 Score: 706 Period size: 177 Copynumber: 2.8 Consensus size: 168 11337 AGTTGACTCA * * 11347 GGGTGGTCTTTTCTTCAATTTATGTC-GAAATGATCGGGGGTGGTTTTTCTTCAATTCTTCAATG 1 GGGTGGTC-TTTCTTCAATTTATTTCAG-AATGATCGGGGGTGGTCTTTCTTCAATTCTTCAATG * 11411 CTTCAATTTATTTCAGAATGATTGGGGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCAGGGGT 64 CTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCAGGGGT 11476 GGTCTTTCTTCAATTCTTCACTTCTTCAATTTATTTCAGAATAATCGG 129 GGTCTTTCTTCAA----T-AC-T-TT-AATTTATTTCAGAATAATCGG * * 11524 GGGTGGTCTTTCTTCAATTTTTTTTCAGAATGATCGGGGGTGGTCTTTATTCAATTCTTCAATGC 1 GGGTGGTCTTTCTTCAA-TTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAATTCTTCAATGC * * * 11589 TTCAAATTATTTCAGAATGATCGGGGGTGTTCTTTCTTCAGTTTATTTCAGAATGATCGGGGGTG 65 TTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCAGGGGTG * * 11654 GTCTTTCTTCAATACTTTAATTTATTTCAGAATGATTGG 130 GTCTTTCTTCAATACTTTAATTTATTTCAGAATAATCGG * 11693 GGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAATTCTTCAAT-CT 1 GGGTGGTCTTTCTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAATTCTTCAATGCT * ** * 11757 TCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTACAGTTTATTTTGGGATGATCCA-GGGTG 66 TCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAGTTTATTTCAGAATGAT-CAGGGGTG 11821 GTCTTTCTTCA 130 GTCTTTCTTCA 11832 CTTTAGTTCC Statistics Matches: 285, Mismatches: 20, Indels: 16 0.89 0.06 0.05 Matches are distributed among these distances: 167 68 0.24 168 44 0.15 169 35 0.12 170 2 0.01 171 1 0.00 172 2 0.01 173 1 0.00 176 9 0.03 177 122 0.43 178 1 0.00 ACGTcount: A:0.20, C:0.15, G:0.22, T:0.43 Consensus pattern (168 bp): GGGTGGTCTTTCTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAATTCTTCAATGCT TCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAGTTTATTTCAGAATGATCAGGGGTGG TCTTTCTTCAATACTTTAATTTATTTCAGAATAATCGG Found at i:11711 original size:80 final size:80 Alignment explanation

Indices: 11578--11831 Score: 346 Period size: 80 Copynumber: 3.1 Consensus size: 80 11568 CTTTATTCAA * * * 11578 TTCTTCAATGCTTCAAATTATTTCAGAATGATCGGGGGTGTTCTTTCTTCAGTTTATTTCAGAAT 1 TTCTTCAATACTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAGTTTATTTCAGAAT 11643 GATCGGGGGTGGTCT 66 GATCGGGGGTGGTCT * * 11658 TTCTTCAATACTTTAATTTATTTCAGAATGATTGGGGGTGGTCTTTCTTCAGTTTATTTCAGAAT 1 TTCTTCAATACTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAGTTTATTTCAGAAT 11723 GATCGGGGGTGGTCT 66 GATCGGGGGTGGTCT * 11738 TTCTTCAATTCTTCAATCTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTACAGTTTATT 1 TTCTTCAA----T--A-CTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAGTTTATT ** * ** 11803 TTGGGATGATCCAGGGTGGTCT 59 TCAGAATGATCGGGGGTGGTCT 11825 TTCTTCA 1 TTCTTCA 11832 CTTTAGTTCC Statistics Matches: 154, Mismatches: 13, Indels: 7 0.89 0.07 0.04 Matches are distributed among these distances: 80 83 0.54 84 1 0.01 86 1 0.01 87 69 0.45 ACGTcount: A:0.20, C:0.15, G:0.22, T:0.43 Consensus pattern (80 bp): TTCTTCAATACTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTTCAGTTTATTTCAGAAT GATCGGGGGTGGTCT Found at i:11771 original size:87 final size:88 Alignment explanation

Indices: 11658--11831 Score: 269 Period size: 87 Copynumber: 2.0 Consensus size: 88 11648 GGGGTGGTCT * * * 11658 TTCTTCAATACTTTAATTTATTTCAGAATGATTGGGGGTGGTCTTTCTTCAGTTTATTTCAGAAT 1 TTCTTCAATACTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTACAGTTTATTTCAGAAT ** 11723 GATCGGGGGTGGTCTTTCTTCAA 66 GATCCAGGGTGGTCTTTCTTCAA ** * 11746 TTCTTCAAT-CTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTACAGTTTATTTTGGGAT 1 TTCTTCAATACTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTACAGTTTATTTCAGAAT 11810 GATCCAGGGTGGTCTTTCTTCA 66 GATCCAGGGTGGTCTTTCTTCA 11832 CTTTAGTTCC Statistics Matches: 78, Mismatches: 8, Indels: 1 0.90 0.09 0.01 Matches are distributed among these distances: 87 69 0.88 88 9 0.12 ACGTcount: A:0.20, C:0.15, G:0.22, T:0.44 Consensus pattern (88 bp): TTCTTCAATACTTCAATTTATTTCAGAATGATCGGGGGTGGTCTTTCTACAGTTTATTTCAGAAT GATCCAGGGTGGTCTTTCTTCAA Found at i:11848 original size:36 final size:36 Alignment explanation

Indices: 11754--11849 Score: 102 Period size: 36 Copynumber: 2.7 Consensus size: 36 11744 AATTCTTCAA ** 11754 TCTTCAATTTATTTCAGAATGATCGGGGGTGGTCTT 1 TCTTCAATTTATTTCAGAATGATCCAGGGTGGTCTT * * ** * 11790 TCTACAGTTTATTTTGGGATGATCCAGGGTGGTCTT 1 TCTTCAATTTATTTCAGAATGATCCAGGGTGGTCTT * * * 11826 TCTTCACTTTAGTTCCGAATGATC 1 TCTTCAATTTATTTCAGAATGATC 11850 GAAAATGGTT Statistics Matches: 47, Mismatches: 13, Indels: 0 0.78 0.22 0.00 Matches are distributed among these distances: 36 47 1.00 ACGTcount: A:0.19, C:0.17, G:0.23, T:0.42 Consensus pattern (36 bp): TCTTCAATTTATTTCAGAATGATCCAGGGTGGTCTT Found at i:14475 original size:37 final size:35 Alignment explanation

Indices: 14434--14579 Score: 123 Period size: 37 Copynumber: 3.9 Consensus size: 35 14424 AAATACAAGT * 14434 GTTTTCGTTTTTTAAGAAGAAGGGAAAGATCCCGCTA 1 GTTTTAGTTTTTTAAG--GAAGGGAAAGATCCCGCTA * 14471 GTTTTAGTTTCTTTTTAGGAAGGGAAAGATCCCGCTA 1 GTTTTAG-TT-TTTTAAGGAAGGGAAAGATCCCGCTA * * * * * 14508 GTTTTGGTTTCATCTT-AGGATGAGAAGGAATCCCACTA 1 GTTTTAGTTT--T-TTAAGGAAGGGAAAG-ATCCCGCTA * 14546 GTTTTAGTTTCGTTTTAGGAAGGGAAAGATCCCG 1 GTTTTAGTTT--TTTAAGGAAGGGAAAGATCCCG 14580 TCAAAAGTTT Statistics Matches: 89, Mismatches: 13, Indels: 14 0.77 0.11 0.12 Matches are distributed among these distances: 35 1 0.01 36 2 0.02 37 48 0.54 38 32 0.36 39 6 0.07 ACGTcount: A:0.26, C:0.14, G:0.25, T:0.35 Consensus pattern (35 bp): GTTTTAGTTTTTTAAGGAAGGGAAAGATCCCGCTA Found at i:14549 original size:38 final size:37 Alignment explanation

Indices: 14452--14579 Score: 157 Period size: 37 Copynumber: 3.4 Consensus size: 37 14442 TTTTTAAGAA * * 14452 GAAGGGAAAGATCCCGCTAGTTTTAGTTTCTTTTTAG 1 GAAGAGAAAGATCCCGCTAGTTTTAGTTTCATTTTAG * * * 14489 GAAGGGAAAGATCCCGCTAGTTTTGGTTTCATCTTAG 1 GAAGAGAAAGATCCCGCTAGTTTTAGTTTCATTTTAG * * * * 14526 GATGAGAAGGAATCCCACTAGTTTTAGTTTCGTTTTAG 1 GAAGAGAAAG-ATCCCGCTAGTTTTAGTTTCATTTTAG * 14564 GAAGGGAAAGATCCCG 1 GAAGAGAAAGATCCCG 14580 TCAAAAGTTT Statistics Matches: 76, Mismatches: 14, Indels: 2 0.83 0.15 0.02 Matches are distributed among these distances: 37 46 0.61 38 30 0.39 ACGTcount: A:0.27, C:0.15, G:0.27, T:0.32 Consensus pattern (37 bp): GAAGAGAAAGATCCCGCTAGTTTTAGTTTCATTTTAG Found at i:19348 original size:17 final size:18 Alignment explanation

Indices: 19323--19356 Score: 52 Period size: 17 Copynumber: 1.9 Consensus size: 18 19313 GATGCAATGC 19323 AAAAGACCTACCAATTAT 1 AAAAGACCTACCAATTAT * 19341 AAAA-ACCTACCGATTA 1 AAAAGACCTACCAATTA 19357 AGTGAATGTG Statistics Matches: 15, Mismatches: 1, Indels: 1 0.88 0.06 0.06 Matches are distributed among these distances: 17 11 0.73 18 4 0.27 ACGTcount: A:0.50, C:0.24, G:0.06, T:0.21 Consensus pattern (18 bp): AAAAGACCTACCAATTAT Found at i:19378 original size:22 final size:23 Alignment explanation

Indices: 19347--19391 Score: 74 Period size: 23 Copynumber: 2.0 Consensus size: 23 19337 TTATAAAAAC * 19347 CTACCGATT-AAGTGAATGTGCT 1 CTACCAATTAAAGTGAATGTGCT 19369 CTACCAATTAAAGTGAATGTGCT 1 CTACCAATTAAAGTGAATGTGCT 19392 ATGCACAATG Statistics Matches: 21, Mismatches: 1, Indels: 1 0.91 0.04 0.04 Matches are distributed among these distances: 22 8 0.38 23 13 0.62 ACGTcount: A:0.31, C:0.18, G:0.20, T:0.31 Consensus pattern (23 bp): CTACCAATTAAAGTGAATGTGCT Found at i:19901 original size:39 final size:39 Alignment explanation

Indices: 19791--20107 Score: 334 Period size: 39 Copynumber: 8.1 Consensus size: 39 19781 TCTAAATTAA * * * * 19791 GATTTTGAAATT-AGCTGATAAGGTAATGATCCTAAATAG 1 GATTCTGAAATTGA-CTGATAAAGCAATGATCCTGAATAG * * * ** * 19830 GAATTTTGAAATTAACTAATATGGCAATGATCCTTAATAG 1 G-ATTCTGAAATTGACTGATAAAGCAATGATCCTGAATAG * * * 19870 GACTCTGAAATTAACTGATAGAA-CAAAGATCCTGAATAG 1 GATTCTGAAATTGACTGATA-AAGCAATGATCCTGAATAG * * * 19909 GATTTTGAAATTGACAGATAAAGCCATGATCCTGAATAG 1 GATTCTGAAATTGACTGATAAAGCAATGATCCTGAATAG * * * 19948 GATTCGGAAATTCACTGATAATGCAATGATCCTGAATAG 1 GATTCTGAAATTGACTGATAAAGCAATGATCCTGAATAG * * 19987 GATTCTGAACTTGTCTGATAAAGCAATGATCCTGAATAG 1 GATTCTGAAATTGACTGATAAAGCAATGATCCTGAATAG * * * 20026 AATTCTGAAATCGA-TAGATAAAGCGATGATCCTGAATAG 1 GATTCTGAAATTGACT-GATAAAGCAATGATCCTGAATAG * ** 20065 GACTCTGAAATTGACAAATAAAGCAATGATCCTGAATAG 1 GATTCTGAAATTGACTGATAAAGCAATGATCCTGAATAG 20104 GATT 1 GATT 20108 GATAAAGCAA Statistics Matches: 232, Mismatches: 40, Indels: 12 0.82 0.14 0.04 Matches are distributed among these distances: 38 3 0.01 39 195 0.84 40 33 0.14 41 1 0.00 ACGTcount: A:0.39, C:0.13, G:0.19, T:0.29 Consensus pattern (39 bp): GATTCTGAAATTGACTGATAAAGCAATGATCCTGAATAG Found at i:20114 original size:66 final size:66 Alignment explanation

Indices: 20042--20173 Score: 210 Period size: 66 Copynumber: 2.0 Consensus size: 66 20032 GAAATCGATA * * * 20042 GATAAAGCGATGATCCTGAATAGGACTCTGAAATTGACAAATAAAGCAATGATCCTGAATAGGAT 1 GATAAAGCAATGATCCTGAATAGAACTCTGAAATTGACAAATAAAGAAATGATCCTGAATAGGAT 20107 T 66 T * * * 20108 GATAAAGCAATTATCCTGAATAGAATTCTGAAATTGACTAATAAAGAAATGATCCTGAATAGGAT 1 GATAAAGCAATGATCCTGAATAGAACTCTGAAATTGACAAATAAAGAAATGATCCTGAATAGGAT 20173 T 66 T 20174 CAAACAAAAA Statistics Matches: 60, Mismatches: 6, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 66 60 1.00 ACGTcount: A:0.42, C:0.12, G:0.19, T:0.27 Consensus pattern (66 bp): GATAAAGCAATGATCCTGAATAGAACTCTGAAATTGACAAATAAAGAAATGATCCTGAATAGGAT T Found at i:20139 original size:39 final size:38 Alignment explanation

Indices: 20104--20329 Score: 218 Period size: 39 Copynumber: 5.8 Consensus size: 38 20094 TCCTGAATAG * * 20104 GATTGATAAAGCAATTATCCTGAATAGAATTCTGAAATT 1 GATTGATAAAGCAATGATCCTGAATAGGATTCTGAAA-T * * * ** * 20143 GACTAATAAAGAAATGATCCTGAATAGGATTCAAACAAA 1 GATTGATAAAGCAATGATCCTGAATAGGATTCTGA-AAT * * * 20182 AATTCATAAAGTAATGATCCTGAATAGGATTCTGAAATT 1 GATTGATAAAGCAATGATCCTGAATAGGATTCTGAAA-T * * * 20221 GATAGATAAAGCAATGATCCTGAATAGGACTCTGAGATT 1 GATTGATAAAGCAATGATCCTGAATAGGATTCTGA-AAT ** * * 20260 GACAGATAAAGCAATGATCCTGAACAGGAGTCTGAAAT 1 GATTGATAAAGCAATGATCCTGAATAGGATTCTGAAAT * * * 20298 CAATTCATAAAGAAATGATCCTGAATAGGATT 1 -GATTGATAAAGCAATGATCCTGAATAGGATT 20330 AAAACACATA Statistics Matches: 151, Mismatches: 32, Indels: 8 0.79 0.17 0.04 Matches are distributed among these distances: 38 4 0.03 39 144 0.95 40 3 0.02 ACGTcount: A:0.43, C:0.12, G:0.18, T:0.27 Consensus pattern (38 bp): GATTGATAAAGCAATGATCCTGAATAGGATTCTGAAAT Found at i:20191 original size:105 final size:105 Alignment explanation

Indices: 20002--20329 Score: 286 Period size: 105 Copynumber: 3.0 Consensus size: 105 19992 TGAACTTGTC ** 20002 TGATAAAGCAATGATCCTGAATAGAATTCTGAAATCGATAGATAAAGCGATGATCCTGAATAGGA 1 TGATAAAGCAATGATCCTGAATAGAATTCTGAAATCGATAGATAAAGAAATGATCCTGAATAGGA * ** * 20067 CTCTGAAATTGACAAATAAAGCAATGATCCTGAATAGGAT 66 CTCTCAAATCAAAAAATAAAGCAATGATCCTGAATAGGAT * * 20107 TGATAAAGCAATTATCCTGAATAGAATTCTGAAATTGACTA-ATAAAGAAATGATCCTGAATAGG 1 TGATAAAGCAATGATCCTGAATAGAATTCTGAAATCGA-TAGATAAAGAAATGATCCTGAATAGG * 20171 A-T-TCAAA-CAAAAATTCATAAAGTAATGATCCTGAATAGGATTCTGAAATT 65 ACTCTCAAATCAAAAA---ATAAAGCAATGATCCTGAATA-G-----G--A-T * * * * * * * 20221 GATAGATAAAGCAATGATCCTGAATAGGACTCTGAGATTGACAGATAAAGCAATGATCCTGAACA 1 --T-GATAAAGCAATGATCCTGAATAGAATTCTGAAATCGATAGATAAAGAAATGATCCTGAATA * * *** * 20286 GGAGTCTGAAATCAATTCATAAAGAAATGATCCTGAATAGGAT 63 GGACTCTCAAATCAAAAAATAAAGCAATGATCCTGAATAGGAT 20329 T 1 T 20330 AAAACACATA Statistics Matches: 182, Mismatches: 21, Indels: 39 0.75 0.09 0.16 Matches are distributed among these distances: 102 3 0.02 103 4 0.02 104 1 0.01 105 78 0.43 106 4 0.02 108 1 0.01 109 1 0.01 111 2 0.01 113 1 0.01 114 1 0.01 116 3 0.02 117 75 0.41 118 1 0.01 119 4 0.02 120 3 0.02 ACGTcount: A:0.43, C:0.13, G:0.19, T:0.26 Consensus pattern (105 bp): TGATAAAGCAATGATCCTGAATAGAATTCTGAAATCGATAGATAAAGAAATGATCCTGAATAGGA CTCTCAAATCAAAAAATAAAGCAATGATCCTGAATAGGAT Found at i:20205 original size:66 final size:66 Alignment explanation

Indices: 20069--20205 Score: 145 Period size: 66 Copynumber: 2.1 Consensus size: 66 20059 GAATAGGACT * * ** *** 20069 CTGAAATTGACAAATAAAGCAATGATCCTGAATAGGATTGATAAAGCAATTATCCTGAATAGAAT 1 CTGAAATTGACAAATAAAGAAATGATCCTGAATAGGATTCATAAAAAAATTATAAAGAATAGAAT * 20134 T 66 C * 20135 CTGAAATTGACTAATAAAGAAATGATCCTGAATAGGATTCA-AACAAAAATTCATAAAGTAAT-G 1 CTGAAATTGACAAATAAAGAAATGATCCTGAATAGGATTCATAA-AAAAATT-ATAAAG-AATAG 20198 -ATC 63 AATC 20201 CTGAA 1 CTGAA 20206 TAGGATTCTG Statistics Matches: 59, Mismatches: 9, Indels: 6 0.80 0.12 0.08 Matches are distributed among these distances: 65 2 0.03 66 50 0.85 67 4 0.07 68 3 0.05 ACGTcount: A:0.46, C:0.12, G:0.15, T:0.26 Consensus pattern (66 bp): CTGAAATTGACAAATAAAGAAATGATCCTGAATAGGATTCATAAAAAAATTATAAAGAATAGAAT C Found at i:20230 original size:144 final size:144 Alignment explanation

Indices: 19963--20249 Score: 387 Period size: 144 Copynumber: 2.0 Consensus size: 144 19953 GGAAATTCAC * * * * * * 19963 TGATAATGCAATGATCCTGAATAGGATTCTGAACTTGTCTGATAAAGCAATGATCCTGAATAGAA 1 TGATAAAGCAATGATCCTGAATAGAATTCTGAAATTGACTAATAAAGAAATGATCCTGAATAGAA * *** * * 20028 TTCTGAAATCGATAGATAAAGCGATGATCCTGAATAGGACTCTGAAATTGACAAATAAAGCAATG 66 TTCTAAAAAAAATACATAAAGCAATGATCCTGAATAGGACTCTGAAATTGACAAATAAAGCAATG 20093 ATCCTGAATAGGAT 131 ATCCTGAATAGGAT * * 20107 TGATAAAGCAATTATCCTGAATAGAATTCTGAAATTGACTAATAAAGAAATGATCCTGAATAGGA 1 TGATAAAGCAATGATCCTGAATAGAATTCTGAAATTGACTAATAAAGAAATGATCCTGAATAGAA * * * * * 20172 TTC-AAACAAAAATTCATAAAGTAATGATCCTGAATAGGATTCTGAAATTGATAGATAAAGCAAT 66 TTCTAAA-AAAAATACATAAAGCAATGATCCTGAATAGGACTCTGAAATTGACAAATAAAGCAAT 20236 GATCCTGAATAGGA 130 GATCCTGAATAGGA 20250 CTCTGAGATT Statistics Matches: 123, Mismatches: 19, Indels: 2 0.85 0.13 0.01 Matches are distributed among these distances: 143 2 0.02 144 121 0.98 ACGTcount: A:0.42, C:0.13, G:0.18, T:0.27 Consensus pattern (144 bp): TGATAAAGCAATGATCCTGAATAGAATTCTGAAATTGACTAATAAAGAAATGATCCTGAATAGAA TTCTAAAAAAAATACATAAAGCAATGATCCTGAATAGGACTCTGAAATTGACAAATAAAGCAATG ATCCTGAATAGGAT Found at i:20240 original size:183 final size:183 Alignment explanation

Indices: 19925--20288 Score: 516 Period size: 183 Copynumber: 2.0 Consensus size: 183 19915 GAAATTGACA * * * * * 19925 GATAAAGCCATGATCCTGAATAGGATTCGGAAATTCACTGATAATGCAATGATCCTGAATAGGAT 1 GATAAAGCAATGATCCTGAATAGAATTCGGAAATTCACTAATAAAGAAATGATCCTGAATAGGAT * *** * * 19990 TCTGAACTTGTCTGATAAAGCAATGATCCTGAATAGAATTCTGAAATCGATAGATAAAGCGATGA 66 TCTAAACAAATCTCATAAAGCAATGATCCTGAATAGAATTCTGAAATCGATAGATAAAGCAATGA * 20055 TCCTGAATAGGACTCTGAAATTGACAAATAAAGCAATGATCCTGAATAGGATT 131 TCCTGAATAGGACTCTGAAATTGACAAATAAAGCAATGATCCTGAACAGGATT * * * 20108 GATAAAGCAATTATCCTGAATAGAATTCTGAAATTGACTAATAAAGAAATGATCCTGAATAGGAT 1 GATAAAGCAATGATCCTGAATAGAATTCGGAAATTCACTAATAAAGAAATGATCCTGAATAGGAT * * * 20173 TC-AAACAAAAAT-TCATAAAGTAATGATCCTGAATAGGATTCTGAAATTGATAGATAAAGCAAT 66 TCTAAAC--AAATCTCATAAAGCAATGATCCTGAATAGAATTCTGAAATCGATAGATAAAGCAAT * * 20236 GATCCTGAATAGGACTCTGAGATTGACAGATAAAGCAATGATCCTGAACAGGA 129 GATCCTGAATAGGACTCTGAAATTGACAAATAAAGCAATGATCCTGAACAGGA 20289 GTCTGAAATC Statistics Matches: 159, Mismatches: 20, Indels: 4 0.87 0.11 0.02 Matches are distributed among these distances: 182 3 0.02 183 155 0.97 184 1 0.01 ACGTcount: A:0.41, C:0.14, G:0.19, T:0.26 Consensus pattern (183 bp): GATAAAGCAATGATCCTGAATAGAATTCGGAAATTCACTAATAAAGAAATGATCCTGAATAGGAT TCTAAACAAATCTCATAAAGCAATGATCCTGAATAGAATTCTGAAATCGATAGATAAAGCAATGA TCCTGAATAGGACTCTGAAATTGACAAATAAAGCAATGATCCTGAACAGGATT Found at i:20303 original size:117 final size:117 Alignment explanation

Indices: 20108--20329 Score: 313 Period size: 117 Copynumber: 1.9 Consensus size: 117 20098 GAATAGGATT * * * * 20108 GATAAAGCAATTATCCTGAATAGAATTCTGAAATTGACTAATAAAGAAATGATCCTGAATAGGAT 1 GATAAAGCAATGATCCTGAATAGAACTCTGAAATTGACTAATAAAGAAATGATCCTGAACAGGAG * 20173 TCAAACAAAAATTCATAAAGTAATGATCCTGAATAGGATTCTGAAATTGATA 66 TCAAACAAAAATTCATAAAGAAATGATCCTGAATAGGATTCTGAAATTGATA * * * 20225 GATAAAGCAATGATCCTGAATAGGACTCTGAGATTGAC-AGATAAAGCAATGATCCTGAACAGGA 1 GATAAAGCAATGATCCTGAATAGAACTCTGAAATTGACTA-ATAAAGAAATGATCCTGAACAGGA * ** 20289 GTCTGAA-ATCAATTCATAAAGAAATGATCCTGAATAGGATT 65 GTC-AAACAAAAATTCATAAAGAAATGATCCTGAATAGGATT 20330 AAAACACATA Statistics Matches: 92, Mismatches: 11, Indels: 4 0.86 0.10 0.04 Matches are distributed among these distances: 116 1 0.01 117 89 0.97 118 2 0.02 ACGTcount: A:0.43, C:0.13, G:0.18, T:0.26 Consensus pattern (117 bp): GATAAAGCAATGATCCTGAATAGAACTCTGAAATTGACTAATAAAGAAATGATCCTGAACAGGAG TCAAACAAAAATTCATAAAGAAATGATCCTGAATAGGATTCTGAAATTGATA Found at i:20346 original size:78 final size:78 Alignment explanation

Indices: 20105--20351 Score: 266 Period size: 78 Copynumber: 3.2 Consensus size: 78 20095 CCTGAATAGG * * * 20105 ATTGATAAAGCAATTATCCTGAATAGAATTCTGAAATTGACTAATAAAGAAATGATCCTGAATAG 1 ATTGATAAAGCAATGATCCTGAATAGGATTCTGAAATTGATTAATAAAGAAATGATCCTGAATAG * 20170 GATTCAAACAAAA 66 GATTAAAACAAAA * * * 20183 ATTCATAAAGTAATGATCCTGAATAGGATTCTGAAATTGA-TAGATAAAGCAATGATCCTGAATA 1 ATTGATAAAGCAATGATCCTGAATAGGATTCTGAAATTGATTA-ATAAAGAAATGATCCTGAATA * * *** * 20247 GGACTCTGAGATTGAC 65 GGA-T-TAAAACAAAA * * ** * 20263 A--GATAAAGCAATGATCCTGAACAGGAGTCTGAAATCAATTCATAAAGAAATGATCCTGAATAG 1 ATTGATAAAGCAATGATCCTGAATAGGATTCTGAAATTGATTAATAAAGAAATGATCCTGAATAG * * 20326 GATTAAAACACAT 66 GATTAAAACAAAA 20339 ATTGATAAAGCAA 1 ATTGATAAAGCAA 20352 AATAGTTCCT Statistics Matches: 138, Mismatches: 25, Indels: 12 0.79 0.14 0.07 Matches are distributed among these distances: 76 5 0.04 77 3 0.02 78 123 0.89 79 2 0.01 80 5 0.04 ACGTcount: A:0.45, C:0.13, G:0.17, T:0.26 Consensus pattern (78 bp): ATTGATAAAGCAATGATCCTGAATAGGATTCTGAAATTGATTAATAAAGAAATGATCCTGAATAG GATTAAAACAAAA Found at i:20346 original size:117 final size:117 Alignment explanation

Indices: 20104--20351 Score: 297 Period size: 117 Copynumber: 2.1 Consensus size: 117 20094 TCCTGAATAG * * * 20104 GATTGATAAAGCAATTATCCTGAATAGAATTCTGAAATTGACTAATAAAGAAATGATCCTGAATA 1 GATTGATAAAGCAATGATCCTGAATAGAACTCTGAAATTGACTAATAAAGAAATGATCCTGAACA * * ** * 20169 GGATTCAAACAAAAATTCATAAAGTAATGATCCTGAATAGGATTCTGAAATT 66 GGAGTCAAACAAAAATTCATAAAGAAATGATCCTGAATAGGATTCAAAAAAT * * * * 20221 GATAGATAAAGCAATGATCCTGAATAGGACTCTGAGATTGAC-AGATAAAGCAATGATCCTGAAC 1 GATTGATAAAGCAATGATCCTGAATAGAACTCTGAAATTGACTA-ATAAAGAAATGATCCTGAAC * ** 20285 AGGAGTCTGAA-ATCAATTCATAAAGAAATGATCCTGAATAGGATT-AAAACACAT 65 AGGAGTC-AAACAAAAATTCATAAAGAAATGATCCTGAATAGGATTCAAAA-A-AT 20339 -ATTGATAAAGCAA 1 GATTGATAAAGCAA 20352 AATAGTTCCT Statistics Matches: 111, Mismatches: 16, Indels: 8 0.82 0.12 0.06 Matches are distributed among these distances: 116 3 0.03 117 105 0.95 118 3 0.03 ACGTcount: A:0.44, C:0.12, G:0.17, T:0.26 Consensus pattern (117 bp): GATTGATAAAGCAATGATCCTGAATAGAACTCTGAAATTGACTAATAAAGAAATGATCCTGAACA GGAGTCAAACAAAAATTCATAAAGAAATGATCCTGAATAGGATTCAAAAAAT Found at i:20695 original size:147 final size:147 Alignment explanation

Indices: 20396--20976 Score: 558 Period size: 167 Copynumber: 3.7 Consensus size: 147 20386 GACTTGTCAA * * * * * 20396 AATTAATACCCGGATGTTTTTGAAATCGTGACCAGAGGTCTTACAAATGTAAAGTCTGAATAGAG 1 AATTAATACCCGGAGGTTTCTGAAATTGTGCCCAGAGGTCTTACAAATGCAAA--C----TA-AG * * * 20461 ACCTTGAGCAAGG-TTTTTTTTTTTGAAATTTAAATGCAACTTTGATTAACAACTTGATGCATTG 59 ACCTTGAGCAAGGTTTTTTTTTTTTGAAA-TTAAACGCAACTTTGATTAACAACTTGATGAAATG * * 20525 AGGTGATACTCAGAGGATTTATCAG 123 AAGTGATACTCGGAGGATTTATCAG * * * 20550 AATTAATACCCGGAGGTTTTTGAAATTGTGCCCGGAGGTCTTACAAATGCAAACT-CGACCTTGA 1 AATTAATACCCGGAGGTTTCTGAAATTGTGCCCAGAGGTCTTACAAATGCAAACTAAGACCTTGA 20614 GCAAGGTTTTTTTTTCTTTGAAATTCAAACGCAAC-TTGATTAACAACTTGATGAAATGAAGTGA 66 GCAAGGTTTTTTTTT-TTTGAAATT-AAACGCAACTTTGATTAACAACTTGATGAAATGAAGTGA 20678 TACTCGGAGGATTTATCAG 129 TACTCGGAGGATTTATCAG * *** * 20697 AATTAATACTCGGAGGTTTCTGAAATTGTGCTTGGAGGTCTTACAAATGCTAACTTTGAATAAAG 1 AATTAATACCCGGAGGTTTCTGAAATTGTGCCCAGAGGTCTTACAAATGC-AA-----ACT-AAG * 20762 ACCTTGAGCAAGGTTTTTTTTTTTTTTTTTTTTTTTTGAAACTTAAACACAACTTTGATTAACAA 59 ACCTTGAGCAAGG------------TTTTTTTTTTTTGAAA-TTAAACGCAACTTTGATTAACAA * 20827 CTTGATGAAGTGAAGTGATAC-CTGGAGGATTTATCAG 111 CTTGATGAAATGAAGTGATACTC-GGAGGATTTATCAG * * 20864 AGTTAATACCCGGAGGTTTCTGAAATTGTGCCCAGAGGTCTTACAAATGCTAACTTTGAATAAAG 1 AATTAATACCCGGAGGTTTCTGAAATTGTGCCCAGAGGTCTTACAAATGC-AA-----ACT-AAG 20929 ACCTTGAGCAAGGGTTCTTTTTTTTTTTGAAACTTAAACGCAACTTTG 59 ACCTTGAGCAA-GG-T-TTTTTTTTTTTGAAA-TTAAACGCAACTTTG 20977 CTGAAAAACT Statistics Matches: 374, Mismatches: 25, Indels: 52 0.83 0.06 0.12 Matches are distributed among these distances: 146 14 0.04 147 100 0.27 148 18 0.05 152 1 0.00 153 2 0.01 154 48 0.13 155 14 0.04 157 2 0.01 158 30 0.08 166 16 0.04 167 127 0.34 168 2 0.01 ACGTcount: A:0.31, C:0.14, G:0.20, T:0.35 Consensus pattern (147 bp): AATTAATACCCGGAGGTTTCTGAAATTGTGCCCAGAGGTCTTACAAATGCAAACTAAGACCTTGA GCAAGGTTTTTTTTTTTTGAAATTAAACGCAACTTTGATTAACAACTTGATGAAATGAAGTGATA CTCGGAGGATTTATCAG Found at i:21996 original size:31 final size:34 Alignment explanation

Indices: 21961--22027 Score: 95 Period size: 31 Copynumber: 2.1 Consensus size: 34 21951 CCTTTTCAAA 21961 TTTCTCTGTTTT-TTTATTTCATT-TCCA-TTCT 1 TTTCTCTGTTTTCTTTATTTCATTCTCCACTTCT * * 21992 TTTCTCTTTTTTCTTTTTTTCATTCTCCACTTCT 1 TTTCTCTGTTTTCTTTATTTCATTCTCCACTTCT 22026 TT 1 TT 22028 GTTTACTTCT Statistics Matches: 31, Mismatches: 2, Indels: 3 0.86 0.06 0.08 Matches are distributed among these distances: 31 11 0.35 32 10 0.32 33 4 0.13 34 6 0.19 ACGTcount: A:0.07, C:0.22, G:0.01, T:0.69 Consensus pattern (34 bp): TTTCTCTGTTTTCTTTATTTCATTCTCCACTTCT Found at i:31505 original size:19 final size:18 Alignment explanation

Indices: 31472--31507 Score: 54 Period size: 19 Copynumber: 1.9 Consensus size: 18 31462 TTGAGATAAT 31472 TCTTCAATGATCTTCAAA 1 TCTTCAATGATCTTCAAA * 31490 TCTTCAAATTATCTTCAA 1 TCTTC-AATGATCTTCAA 31508 TAAGTCTTCA Statistics Matches: 16, Mismatches: 1, Indels: 1 0.89 0.06 0.06 Matches are distributed among these distances: 18 5 0.31 19 11 0.69 ACGTcount: A:0.33, C:0.22, G:0.03, T:0.42 Consensus pattern (18 bp): TCTTCAATGATCTTCAAA Done.