Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01012844.1 Corchorus capsularis cultivar CVL-1 contig12865, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 25257
ACGTcount: A:0.35, C:0.16, G:0.15, T:0.34


Found at i:1377 original size:2 final size:2

Alignment explanation

Indices: 1372--1400 Score: 58 Period size: 2 Copynumber: 14.5 Consensus size: 2 1362 TTTGTGCAAA 1372 AT AT AT AT AT AT AT AT AT AT AT AT AT AT A 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT A 1401 CTAGTTTTAG Statistics Matches: 27, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 27 1.00 ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48 Consensus pattern (2 bp): AT Found at i:1572 original size:22 final size:21 Alignment explanation

Indices: 1521--2037 Score: 153 Period size: 22 Copynumber: 23.7 Consensus size: 21 1511 ATGATCCCAT * 1521 TATGAAATTTTGATAACCTCC 1 TATGAAATTTTGATAACCTAC * 1542 TATGAAATTTTGATAACGGTAC 1 TATGAAATTTTGATAAC-CTAC ** * ** 1564 TAT-AGAATTTCAAGAATCCTTT 1 TATGA-AATTTTGATAA-CCTAC ** * 1586 TAT-AAATTTTTTTTAACTTAC 1 TATGAAA-TTTTGATAACCTAC * * 1607 TTATGAAATTTTGTTAACCTCCC 1 -TATGAAATTTTGATAACCT-AC * * * * 1630 TAAGGAATTTTGA-AGATCTCAA 1 TATGAAATTTTGATA-ACCT-AC * 1652 TATGAAATTTTGATAACCAACAC 1 TATGAAATTTTGATAACC--TAC * * * 1675 TATGAGATGTTGATAACCTCC 1 TATGAAATTTTGATAACCTAC * * 1696 ATATGATATATTGATAACC-AC 1 -TATGAAATTTTGATAACCTAC * * * * * 1717 TTTATAAAAATTTAAAAACCTCC 1 --TATGAAATTTTGATAACCTAC * 1740 ATATG-AATGGTTAGTAATAA-C-AC 1 -TATGAAAT--TT--TGATAACCTAC * * 1763 TTTAAAATTTTGATAATCAC-AC 1 TATGAAATTTTGATAA-C-CTAC * * 1785 TATGAAATTGTGATAACCTCGC 1 TATGAAATTTTGATAACCT-AC * 1807 TATGAAATTTTGATAACTCTTTC 1 TATGAAATTTTGATAAC-C-TAC * * * * 1830 AATAAAATTTTAATAAACCTCCC 1 TATGAAATTTTGAT-AACCT-AC * * * 1853 TATAAAATTTTGATAACTTTC 1 TATGAAATTTTGATAACCTAC * 1874 TTATGAAATCTTGATAA-CTAC 1 -TATGAAATTTTGATAACCTAC * * 1895 ----AAATTTTGATAAGCTCC 1 TATGAAATTTTGATAACCTAC ** * 1912 TTATGATTTTTTGATAACCTCAT 1 -TATGAAATTTTGATAACCT-AC * * * 1935 TATGAAATTTTGTTAATCTCCC 1 TATGAAATTTTGATAACCT-AC * * 1957 TATGAAATTTTGATCTACATAC 1 TATGAAATTTTGAT-AACCTAC * 1979 TATGAAATTTTGA-AAACTAAAC 1 TATGAAATTTTGATAACCT--AC * * 2001 TATGAAAATTTGATAACCTTC 1 TATGAAATTTTGATAACCTAC 2022 ATATGAAATTTTGATA 1 -TATGAAATTTTGATA 2038 TCCTCCCTGA Statistics Matches: 360, Mismatches: 95, Indels: 81 0.67 0.18 0.15 Matches are distributed among these distances: 16 11 0.03 17 3 0.01 19 5 0.01 20 3 0.01 21 34 0.09 22 231 0.64 23 63 0.17 24 5 0.01 25 5 0.01 ACGTcount: A:0.37, C:0.14, G:0.10, T:0.39 Consensus pattern (21 bp): TATGAAATTTTGATAACCTAC Found at i:1687 original size:23 final size:22 Alignment explanation

Indices: 1651--1715 Score: 67 Period size: 23 Copynumber: 2.9 Consensus size: 22 1641 GAAGATCTCA * 1651 ATATGAAATTTTGATAACCAAC 1 ATATGAAATATTGATAACCAAC * * ** 1673 ACTATGAGATGTTGATAACCTCC 1 A-TATGAAATATTGATAACCAAC * 1696 ATATGATATATTGATAACCA 1 ATATGAAATATTGATAACCA 1716 CTTTATAAAA Statistics Matches: 35, Mismatches: 7, Indels: 2 0.80 0.16 0.05 Matches are distributed among these distances: 22 17 0.49 23 18 0.51 ACGTcount: A:0.40, C:0.15, G:0.12, T:0.32 Consensus pattern (22 bp): ATATGAAATATTGATAACCAAC Found at i:2052 original size:19 final size:20 Alignment explanation

Indices: 2027--2075 Score: 73 Period size: 19 Copynumber: 2.5 Consensus size: 20 2017 CCTTCATATG 2027 AAATTTTGATATCCTCCCT- 1 AAATTTTGATATCCTCCCTA * * 2046 GAATTTTGGTATCCTCCCTA 1 AAATTTTGATATCCTCCCTA 2066 AAATTTTGAT 1 AAATTTTGAT 2076 TACTCCATCA Statistics Matches: 25, Mismatches: 4, Indels: 1 0.83 0.13 0.03 Matches are distributed among these distances: 19 17 0.68 20 8 0.32 ACGTcount: A:0.27, C:0.20, G:0.10, T:0.43 Consensus pattern (20 bp): AAATTTTGATATCCTCCCTA Found at i:2309 original size:18 final size:19 Alignment explanation

Indices: 2271--2309 Score: 53 Period size: 21 Copynumber: 2.0 Consensus size: 19 2261 CAACACTATG 2271 AAATTTTGATAATCTTCATAT 1 AAATTTTGATAA--TTCATAT 2292 AAATTTTGATAA-TCATAT 1 AAATTTTGATAATTCATAT 2310 CTTTATGAAA Statistics Matches: 18, Mismatches: 0, Indels: 3 0.86 0.00 0.14 Matches are distributed among these distances: 18 6 0.33 21 12 0.67 ACGTcount: A:0.41, C:0.08, G:0.05, T:0.46 Consensus pattern (19 bp): AAATTTTGATAATTCATAT Found at i:2418 original size:22 final size:22 Alignment explanation

Indices: 2181--2499 Score: 101 Period size: 22 Copynumber: 14.3 Consensus size: 22 2171 AATCAGATTT * ** 2181 TGAAAATTTGATAACCTTTTTA 1 TGAAATTTTGATAACCTTCATA 2203 TGAAATTTTGATAACATCTT--TA 1 TGAAATTTTGATAAC--CTTCATA * * * * * 2225 TAAAATTTCGTTGACCCCTC-TA 1 TGAAATTTTGAT-AACCTTCATA ** 2247 TGAAATTTTGATAA-CAACACTA 1 TGAAATTTTGATAACCTTCA-TA * 2269 TGAAATTTTGATAATCTTCATA 1 TGAAATTTTGATAACCTTCATA * * 2291 T-AAATTTTGATAATCATATCTTTA 1 TGAAATTTTGATAA-CCT-TC-ATA ** 2315 TGAAATTTCAATAATCAC-TC-TA 1 TGAAATTTTGATAA-C-CTTCATA * 2337 TGAGA-TTTGATAACCTTC-TA 1 TGAAATTTTGATAACCTTCATA * * * 2357 TCAAATTTTGGTACTCCTT-ATAAAA 1 TGAAATTTTGATA-ACCTTCAT---A * 2382 TTGAGACTTTT-ATAACCTTCATA 1 -TGA-AATTTTGATAACCTTCATA * 2405 TGAAATTTTGATAACC-ACACTA 1 TGAAATTTTGATAACCTTCA-TA * * ** 2427 TAAAATTTTGATAACCTCCCGA 1 TGAAATTTTGATAACCTTCATA * * 2449 TGAAGTATT-AGTAACCTTC-TAA 1 TGAAATTTTGA-TAACCTTCAT-A * * 2471 TGAAATTTTGTTAACC-ACACTA 1 TGAAATTTTGATAACCTTCA-TA 2493 TGAAATT 1 TGAAATT 2500 CGTATAACCT Statistics Matches: 221, Mismatches: 46, Indels: 60 0.68 0.14 0.18 Matches are distributed among these distances: 19 1 0.00 20 10 0.05 21 36 0.16 22 128 0.58 23 10 0.05 24 8 0.04 25 17 0.08 26 6 0.03 27 5 0.02 ACGTcount: A:0.36, C:0.15, G:0.09, T:0.39 Consensus pattern (22 bp): TGAAATTTTGATAACCTTCATA Found at i:2646 original size:22 final size:22 Alignment explanation

Indices: 2617--2694 Score: 129 Period size: 22 Copynumber: 3.5 Consensus size: 22 2607 TAACTTGATC * 2617 CTATGAAATTTTGGTAACCACT 1 CTATGAAATTTTGGTAACCACA * * 2639 TTATGAAATTTTGGTAACTACA 1 CTATGAAATTTTGGTAACCACA 2661 CTATGAAATTTTGGTAACCACA 1 CTATGAAATTTTGGTAACCACA 2683 CTATGAAATTTT 1 CTATGAAATTTT 2695 AATAACCTTC Statistics Matches: 51, Mismatches: 5, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 22 51 1.00 ACGTcount: A:0.35, C:0.14, G:0.13, T:0.38 Consensus pattern (22 bp): CTATGAAATTTTGGTAACCACA Found at i:2700 original size:22 final size:22 Alignment explanation

Indices: 2563--2701 Score: 111 Period size: 22 Copynumber: 6.2 Consensus size: 22 2553 TTGTGATAAT * * 2563 TAACCACCCTATGAAATTTCAA 1 TAACCACACTATGAAATTTTAA * 2585 TAACCA-ACCTAAGAAATTTTAA 1 TAACCACA-CTATGAAATTTTAA * ** 2607 TAACTTGATC-CTATGAAATTTTGG 1 TAAC--CA-CACTATGAAATTTTAA ** ** 2631 TAACCACTTTATGAAATTTTGG 1 TAACCACACTATGAAATTTTAA * ** 2653 TAACTACACTATGAAATTTTGG 1 TAACCACACTATGAAATTTTAA 2675 TAACCACACTATGAAATTTTAA 1 TAACCACACTATGAAATTTTAA 2697 TAACC 1 TAACC 2702 TTCTCATGGA Statistics Matches: 96, Mismatches: 15, Indels: 12 0.78 0.12 0.10 Matches are distributed among these distances: 21 1 0.01 22 79 0.82 24 16 0.17 ACGTcount: A:0.39, C:0.18, G:0.09, T:0.34 Consensus pattern (22 bp): TAACCACACTATGAAATTTTAA Found at i:4301 original size:23 final size:23 Alignment explanation

Indices: 4270--4354 Score: 91 Period size: 23 Copynumber: 3.7 Consensus size: 23 4260 GATAACCTCG * 4270 CTATGAAATTTTGATAAATCTTC 1 CTATAAAATTTTGATAAATCTTC * * * 4293 CTATAAAATTTTAATAAACCTCC 1 CTATAAAATTTTGATAAATCTTC * 4316 CTATAAAATTTTGATAACT-TTC 1 CTATAAAATTTTGATAAATCTTC * * * 4338 TTATGAAATCTTGATAA 1 CTATAAAATTTTGATAA 4355 CTACAAATTT Statistics Matches: 51, Mismatches: 11, Indels: 1 0.81 0.17 0.02 Matches are distributed among these distances: 22 16 0.31 23 35 0.69 ACGTcount: A:0.39, C:0.14, G:0.06, T:0.41 Consensus pattern (23 bp): CTATAAAATTTTGATAAATCTTC Found at i:4410 original size:23 final size:22 Alignment explanation

Indices: 4087--4530 Score: 114 Period size: 22 Copynumber: 20.3 Consensus size: 22 4077 AACTTTGAAG * 4087 ACCTCAATATGAAATTTTGATA 1 ACCTCATTATGAAATTTTGATA * *** 4109 ACTTCCCAATGAAATTTTGATA 1 ACCTCATTATGAAATTTTGATA * * ** * 4131 ACCAACACTATGGGATGTTGATA 1 ACC-TCATTATGAAATTTTGATA * * 4154 ACCTCCA-TATGATATATTGATA 1 ACCT-CATTATGAAATTTTGATA * * * * * * 4176 ACCACTTTATAAAAATTTAAAA 1 ACCTCATTATGAAATTTTGATA * * 4198 ATCTCCA-TATG-AATTGTTAATA 1 ACCT-CATTATGAAATT-TTGATA * * * * 4220 ATCACACTT-TAAAAATTTGATA 1 ACCTCA-TTATGAAATTTTGATA * * * * 4242 ATCACACTATGAAATTGTGATA 1 ACCTCATTATGAAATTTTGATA ** 4264 ACCTCGCTATGAAATTTTGATAA 1 ACCTCATTATGAAATTTTGAT-A * * * * 4287 ATCTTC-CTATAAAATTTTAATAA 1 A-CCTCATTATGAAATTTTGAT-A ** * 4310 ACCTCCCTATAAAATTTTGATA 1 ACCTCATTATGAAATTTTGATA * * 4332 ACTTTC-TTATGAAATCTTGATA 1 AC-CTCATTATGAAATTTTGATA * 4354 A---C--TA-CAAATTTTGATA 1 ACCTCATTATGAAATTTTGATA * * ** 4370 AGCTCCTTATGATTTTTTTGATA 1 ACCTCATTATGA-AATTTTGATA * 4393 ACCTCATTATGAAATTTTGTTA 1 ACCTCATTATGAAATTTTGATA * ** 4415 ATCTCCCTATGAAATTTTGAT- 1 ACCTCATTATGAAATTTTGATA * 4436 --CTACATGCTATAAAATTTTGATA 1 ACCT-CAT--TATGAAATTTTGATA 4459 ACCCTC-TTATGAAATTTTGA-A 1 A-CCTCATTATGAAATTTTGATA * * * * * 4480 AGCTAAACTATGAAAATTAGATA 1 ACCT-CATTATGAAATTTTGATA * 4503 ACCTTCA-TACGAAATTTTGATA 1 ACC-TCATTATGAAATTTTGATA * 4525 TCCTCA 1 ACCTCA 4531 CTGAATTTTG Statistics Matches: 312, Mismatches: 79, Indels: 63 0.69 0.17 0.14 Matches are distributed among these distances: 16 11 0.04 17 2 0.01 18 1 0.00 19 3 0.01 20 3 0.01 21 14 0.04 22 191 0.61 23 79 0.25 24 5 0.02 25 1 0.00 26 2 0.01 ACGTcount: A:0.38, C:0.16, G:0.09, T:0.38 Consensus pattern (22 bp): ACCTCATTATGAAATTTTGATA Found at i:4473 original size:22 final size:21 Alignment explanation

Indices: 4249--4478 Score: 121 Period size: 22 Copynumber: 10.6 Consensus size: 21 4239 ATAATCACAC * * 4249 TATGAAATTGTGATAACCTCGC 1 TATGAAATTTTGATAACCTC-T * * 4271 TATGAAATTTTGATAAATCTTCC 1 TATGAAATTTTGAT-AA-CCTCT * * * 4294 TATAAAATTTTAATAAACCTCCC 1 TATGAAATTTTGAT-AACCT-CT * * 4317 TATAAAATTTTGATAACTTTCT 1 TATGAAATTTTGATAAC-CTCT * 4339 TATGAAATCTTGATAA---C- 1 TATGAAATTTTGATAACCTCT * * 4356 TA-CAAATTTTGATAAGCTCCT 1 TATGAAATTTTGATAACCT-CT ** 4377 TATGATTTTTTTGATAACCTCAT 1 TATGA-AATTTTGATAACCTC-T * * * 4400 TATGAAATTTTGTTAATCTCCC 1 TATGAAATTTTGATAACCT-CT * * 4422 TATGAAATTTTGATCTACATGC- 1 TATGAAATTTTGAT-AACCT-CT * 4444 TATAAAATTTTGATAACCCTCT 1 TATGAAATTTTGATAA-CCTCT 4466 TATGAAATTTTGA 1 TATGAAATTTTGA 4479 AAGCTAAACT Statistics Matches: 162, Mismatches: 30, Indels: 32 0.72 0.13 0.14 Matches are distributed among these distances: 16 11 0.07 17 2 0.01 18 1 0.01 20 1 0.01 21 4 0.02 22 85 0.52 23 55 0.34 24 3 0.02 ACGTcount: A:0.34, C:0.15, G:0.09, T:0.42 Consensus pattern (21 bp): TATGAAATTTTGATAACCTCT Found at i:4568 original size:19 final size:20 Alignment explanation

Indices: 4513--4562 Score: 75 Period size: 19 Copynumber: 2.5 Consensus size: 20 4503 ACCTTCATAC * 4513 GAAATTTTGATATCCTCACT 1 GAAATTTTGATATCCTCCCT * 4533 G-AATTTTGGTATCCTCCCT 1 GAAATTTTGATATCCTCCCT 4552 GAAATTTTGAT 1 GAAATTTTGAT 4563 TACTCCATCA Statistics Matches: 26, Mismatches: 3, Indels: 2 0.84 0.10 0.06 Matches are distributed among these distances: 19 17 0.65 20 9 0.35 ACGTcount: A:0.26, C:0.18, G:0.14, T:0.42 Consensus pattern (20 bp): GAAATTTTGATATCCTCCCT Found at i:4693 original size:22 final size:22 Alignment explanation

Indices: 4668--4914 Score: 159 Period size: 22 Copynumber: 11.3 Consensus size: 22 4658 AATCACATTT * 4668 TGAAAATTTGATAACCTCTTTA 1 TGAAATTTTGATAACCTCTTTA * 4690 TGAAATTTTGATAACATCTTTA 1 TGAAATTTTGATAACCTCTTTA * * * * * 4712 TAAAATTTTGTTGACCCCTCTA 1 TGAAATTTTGATAACCTCTTTA * * * 4734 TGAAATTTTGATAATCACATTA 1 TGAAATTTTGATAACCTCTTTA * 4756 TGTAATTTTGATAACCTCGTTT- 1 TGAAATTTTGATAACCTC-TTTA ** ** 4778 TGAAATTTTGATAACAACACTA 1 TGAAATTTTGATAACCTCTTTA 4800 TGAAATTTTGATAA--TCTTCATA 1 TGAAATTTTGATAACCTCTT--TA * 4822 T-AAATTTTGATAATCATATCTTTA 1 TGAAATTTTGATAA-C--CTCTTTA * * * * 4846 TGAAATTTCGATAATCACTCTA 1 TGAAATTTTGATAACCTCTTTA * * 4868 TGAGA-TTTGATAACCT-TCTA 1 TGAAATTTTGATAACCTCTTTA * * * 4888 TCAAATTTTGGT-A-CTCCTTA 1 TGAAATTTTGATAACCTCTTTA 4908 TGAAATT 1 TGAAATT 4915 GAGACTTTTA Statistics Matches: 171, Mismatches: 42, Indels: 26 0.72 0.18 0.11 Matches are distributed among these distances: 19 2 0.01 20 17 0.10 21 26 0.15 22 106 0.62 23 2 0.01 24 3 0.02 25 11 0.06 26 4 0.02 ACGTcount: A:0.34, C:0.13, G:0.10, T:0.43 Consensus pattern (22 bp): TGAAATTTTGATAACCTCTTTA Found at i:4694 original size:44 final size:44 Alignment explanation

Indices: 4644--4837 Score: 163 Period size: 44 Copynumber: 4.4 Consensus size: 44 4634 GAAATACCAC 4644 TATGAAATTTTTG-TAATCACATTTTGAAAATTTGATAACCTCTT 1 TATGAAA-TTTTGATAATCACATTTTGAAAATTTGATAACCTCTT * * * * 4688 TATGAAATTTTGATAA-CATC-TTTAT-AAAATTTTGTTGACCCCTC 1 TATGAAATTTTGATAATCA-CATTT-TGAAAA-TTTGATAACCTCTT * * * 4732 TATGAAATTTTGATAATCACATTATGTAATTTTGATAACCTCGTT 1 TATGAAATTTTGATAATCACATTTTGAAAATTTGATAACCTC-TT * * * 4777 T-TGAAATTTTGATAA-CAACACTATGAAATTTTGATAA--TCTT 1 TATGAAATTTTGATAATC-ACATTTTGAAAATTTGATAACCTCTT 4818 CATAT-AAATTTTGATAATCA 1 --TATGAAATTTTGATAATCA 4838 TATCTTTATG Statistics Matches: 124, Mismatches: 13, Indels: 27 0.76 0.08 0.16 Matches are distributed among these distances: 41 2 0.02 42 2 0.02 43 29 0.23 44 83 0.67 45 8 0.06 ACGTcount: A:0.36, C:0.12, G:0.09, T:0.43 Consensus pattern (44 bp): TATGAAATTTTGATAATCACATTTTGAAAATTTGATAACCTCTT Found at i:4757 original size:66 final size:66 Alignment explanation

Indices: 4668--4880 Score: 202 Period size: 66 Copynumber: 3.2 Consensus size: 66 4658 AATCACATTT * * * * * * ** * 4668 TGAAAATTTGATAACCTCTTTATGAAATTTTGATAACATCTTTATAAAATTTTGTTGACCCCTCT 1 TGAAATTTTGATAATCTCATTATGAAATTTTGATAACATCTTTATGAAATTTTGATAACAACACT 4733 A 66 A * * * 4734 TGAAATTTTGATAATCACATTATGTAATTTTGATAACCTCGTTT-TGAAATTTTGATAACAACAC 1 TGAAATTTTGATAATCTCATTATGAAATTTTGATAACATC-TTTATGAAATTTTGATAACAACAC 4798 TA 65 TA * 4800 TGAAATTTTGATAATCTTCA-TAT-AAATTTTGATAATCATATCTTTATGAAATTTCGATAATC- 1 TGAAATTTTGATAATC-TCATTATGAAATTTTGATAA-C--ATCTTTATGAAATTTTGATAA-CA * 4862 ACTCTA 61 ACACTA * 4868 TGAGA-TTTGATAA 1 TGAAATTTTGATAA 4881 CCTTCTATCA Statistics Matches: 122, Mismatches: 18, Indels: 13 0.80 0.12 0.08 Matches are distributed among these distances: 65 11 0.09 66 70 0.57 67 16 0.13 68 24 0.20 69 1 0.01 ACGTcount: A:0.36, C:0.12, G:0.10, T:0.42 Consensus pattern (66 bp): TGAAATTTTGATAATCTCATTATGAAATTTTGATAACATCTTTATGAAATTTTGATAACAACACT A Found at i:4806 original size:88 final size:88 Alignment explanation

Indices: 4643--4809 Score: 239 Period size: 88 Copynumber: 1.9 Consensus size: 88 4633 AGAAATACCA * * 4643 CTATGAAATTTTTGTAATCACATTTTGAAAATTTGATAACCTCTTTATGAAATTTTGATAACATC 1 CTATGAAATTTTTGTAATCACATTATGAAAATTTGATAACCTCTTTATGAAATTTTGATAACAAC ** 4708 TTTATAAAATTTTGTTGACCCCT 66 ACTATAAAATTTTGTTGACCCCT * * 4731 CTATGAAA-TTTTGATAATCACATTATGTAATTTTGATAACCTCGTTT-TGAAATTTTGATAACA 1 CTATGAAATTTTTG-TAATCACATTATGAAAATTTGATAACCTC-TTTATGAAATTTTGATAACA * 4794 ACACTATGAAATTTTG 64 ACACTATAAAATTTTG 4810 ATAATCTTCA Statistics Matches: 70, Mismatches: 7, Indels: 4 0.86 0.09 0.05 Matches are distributed among these distances: 87 5 0.07 88 62 0.89 89 3 0.04 ACGTcount: A:0.34, C:0.13, G:0.10, T:0.43 Consensus pattern (88 bp): CTATGAAATTTTTGTAATCACATTATGAAAATTTGATAACCTCTTTATGAAATTTTGATAACAAC ACTATAAAATTTTGTTGACCCCT Found at i:4840 original size:18 final size:19 Alignment explanation

Indices: 4802--4840 Score: 53 Period size: 21 Copynumber: 2.0 Consensus size: 19 4792 CAACACTATG 4802 AAATTTTGATAATCTTCATAT 1 AAATTTTGATAA--TTCATAT 4823 AAATTTTGATAA-TCATAT 1 AAATTTTGATAATTCATAT 4841 CTTTATGAAA Statistics Matches: 18, Mismatches: 0, Indels: 3 0.86 0.00 0.14 Matches are distributed among these distances: 18 6 0.33 21 12 0.67 ACGTcount: A:0.41, C:0.08, G:0.05, T:0.46 Consensus pattern (19 bp): AAATTTTGATAATTCATAT Found at i:4949 original size:22 final size:22 Alignment explanation

Indices: 4668--4974 Score: 122 Period size: 22 Copynumber: 13.8 Consensus size: 22 4658 AATCACATTT * * 4668 TGAAAATTTGATAACC-TCTTTA 1 TGAAATTTTGATAACCTTC-ATA 4690 TGAAATTTTGATAACATCTT--TA 1 TGAAATTTTGATAAC--CTTCATA * * * * 4712 TAAAATTTTGTTGACCCCTC-TA 1 TGAAATTTTGAT-AACCTTCATA * * 4734 TGAAATTTTGATAATC-ACATTA 1 TGAAATTTTGATAACCTTCA-TA * * * 4756 TGTAATTTTGATAACC-TCGTTT 1 TGAAATTTTGATAACCTTC-ATA ** 4778 TGAAATTTTGATAA-CAACACTA 1 TGAAATTTTGATAACCTTCA-TA * 4800 TGAAATTTTGATAATCTTCATA 1 TGAAATTTTGATAACCTTCATA * * 4822 T-AAATTTTGATAATCATATCTTTA 1 TGAAATTTTGATAA-CCT-TC-ATA * 4846 TGAAATTTCGATAATCAC-TC-TA 1 TGAAATTTTGATAA-C-CTTCATA * 4868 TGAGA-TTTGATAACCTTC-TA 1 TGAAATTTTGATAACCTTCATA * * * 4888 TCAAATTTTGGTACTCCTT-ATGAAA 1 TGAAATTTTGATA-ACCTTCAT---A * 4913 TTGAGACTTTT-ATAACCTTCATA 1 -TGA-AATTTTGATAACCTTCATA * 4936 TGAAATTTTGATAACC-ACACTA 1 TGAAATTTTGATAACCTTCA-TA * 4958 TAAAATTTTGATAACCT 1 TGAAATTTTGATAACCT 4975 CCCGATGAAG Statistics Matches: 217, Mismatches: 39, Indels: 57 0.69 0.12 0.18 Matches are distributed among these distances: 19 1 0.00 20 9 0.04 21 37 0.17 22 126 0.58 23 8 0.04 24 6 0.03 25 19 0.09 26 6 0.03 27 5 0.02 ACGTcount: A:0.35, C:0.14, G:0.09, T:0.41 Consensus pattern (22 bp): TGAAATTTTGATAACCTTCATA Found at i:5126 original size:22 final size:22 Alignment explanation

Indices: 5094--5230 Score: 125 Period size: 22 Copynumber: 6.1 Consensus size: 22 5084 TTGTGATAAT * * 5094 TAACCACCCTATGAAATTTCAA 1 TAACCACACTATGAAATTTTAA * 5116 TAACCA-ACCTAAGAAATTTTAA 1 TAACCACA-CTATGAAATTTTAA * * 5138 TAAACTGATC-CTATGAAATTTTAG 1 T-AAC-CA-CACTATGAAATTTTAA * ** 5162 TAACCACTCTATGAAATTTTGG 1 TAACCACACTATGAAATTTTAA * * 5184 TAACCACACTATGGAATTTTGA 1 TAACCACACTATGAAATTTTAA * 5206 TAACCACACTATGAAATTTTGA 1 TAACCACACTATGAAATTTTAA 5228 TAA 1 TAA 5231 ACTCCTCATG Statistics Matches: 97, Mismatches: 12, Indels: 12 0.80 0.10 0.10 Matches are distributed among these distances: 21 1 0.01 22 76 0.78 23 6 0.06 24 14 0.14 ACGTcount: A:0.40, C:0.18, G:0.09, T:0.32 Consensus pattern (22 bp): TAACCACACTATGAAATTTTAA Found at i:5242 original size:44 final size:42 Alignment explanation

Indices: 5101--5255 Score: 123 Period size: 44 Copynumber: 3.5 Consensus size: 42 5091 AATTAACCAC * * * * 5101 CCTATGAAATTTCAATAACCA-ACCTAAGAAATTTTAATAAACTGAT 1 CCTATGGAATTT-TATAACCACA-CTATGAAATTTTGATAAAC---T * * * * * 5147 CCTATGAAATTTTAGTAACCACTCTATGAAATTTTGGTAACCA 1 CCTATGGAATTTTA-TAACCACACTATGAAATTTTGATAAACT 5190 CACTATGGAATTTTGATAACCACACTATGAAATTTTGATAAACT 1 C-CTATGGAATTTT-ATAACCACACTATGAAATTTTGATAAACT * 5234 CCTCATGGAATTATAATAACCA 1 CCT-ATGGAATT-TTATAACCA 5256 TCTTATGAAT Statistics Matches: 90, Mismatches: 13, Indels: 14 0.77 0.11 0.12 Matches are distributed among these distances: 43 3 0.03 44 51 0.57 45 3 0.03 46 33 0.37 ACGTcount: A:0.40, C:0.18, G:0.10, T:0.32 Consensus pattern (42 bp): CCTATGGAATTTTATAACCACACTATGAAATTTTGATAAACT Found at i:5245 original size:22 final size:22 Alignment explanation

Indices: 5148--5245 Score: 83 Period size: 22 Copynumber: 4.5 Consensus size: 22 5138 TAAACTGATC * * * 5148 CTATGAAATTTT-AGTAACCACT 1 CTATGGAATTTTGA-TAAACACA * * * 5170 CTATGAAATTTTGGTAACCACA 1 CTATGGAATTTTGATAAACACA * 5192 CTATGGAATTTTGATAACCACA 1 CTATGGAATTTTGATAAACACA * * 5214 CTATGAAATTTTGATAAACTC- 1 CTATGGAATTTTGATAAACACA 5235 CTCATGGAATT 1 CT-ATGGAATT 5246 ATAATAACCA Statistics Matches: 66, Mismatches: 8, Indels: 4 0.85 0.10 0.05 Matches are distributed among these distances: 21 2 0.03 22 64 0.97 ACGTcount: A:0.36, C:0.17, G:0.12, T:0.35 Consensus pattern (22 bp): CTATGGAATTTTGATAAACACA Found at i:5580 original size:30 final size:31 Alignment explanation

Indices: 5546--5610 Score: 89 Period size: 30 Copynumber: 2.1 Consensus size: 31 5536 TGGCAATTTA * * 5546 GAAATATGTTTTAAAAGA-AA-GGTACAATTG 1 GAAATATATTTTAAAA-ATAAGGGTACAATAG 5576 GAAATATATTTTAAAAATAAGGGTACAATAG 1 GAAATATATTTTAAAAATAAGGGTACAATAG 5607 GAAA 1 GAAA 5611 ACATAAAGTT Statistics Matches: 31, Mismatches: 2, Indels: 3 0.86 0.06 0.08 Matches are distributed among these distances: 29 1 0.03 30 17 0.55 31 13 0.42 ACGTcount: A:0.51, C:0.03, G:0.18, T:0.28 Consensus pattern (31 bp): GAAATATATTTTAAAAATAAGGGTACAATAG Found at i:18608 original size:319 final size:320 Alignment explanation

Indices: 14834--22775 Score: 6000 Period size: 319 Copynumber: 24.4 Consensus size: 320 14824 ATTTATTTCT * * * * ** * * 14834 AATCTAATGTGGCTGAGATTTGGTTAGATGATTACAGATATTTCAAGAAGTCTTTTAGACAAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * ** * * * * 14899 CCATGCAAAACTAAGTCAAGACCCCGAAACGCGTTTTTAGCAAAAAACCATGATGGTTATTAGTA 66 TCATGCAAAATTGAG-CCGGGCTCCGGAACGCGTTTTTAGCAAAAAACCGTGATGG---TTAGTA * * 14964 CACGATTTCGGCT-AAAACTAAC-CTG-AAAATTTTTTCCTCAATTCTTTGCCACAATACTCAGA 127 CACGATTTCGGCTAAAAACTGACTC-GAAAAATTTTTT-CTCAATTTTTTGCCACAATACTCAGA * * * * * * * 15026 AAAAATATATAATTCAACACCAAAAAAATTGAAAGGTTTTTCATGCTT-TTAACATCGTTTTTCC 190 AAAAATATATAATTCAACGCCAAAAAGATTG-ACGGGTTTTCACGCTTCTAAATATCGTTTTTCC * * * * * 15090 A-TTTTTCCGGAATTTATTTCTAATTAAATCGAAATAA-ATTCAGATGCTCATAAAAATAAATTC 254 ATTTTTTCC-GAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCC 15153 TTA 318 TTA * * ** * 15156 AATCCAATGTGGCTGAGATTTGATTAGAT-AATTATAGATATTTCGAGGAGTCTTTATGCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAA-TATAGATATTTCAAGGAGTCTTTGCGCCCAAA * * * * * * * * * 15220 ACCATGCAGAACTAAGTCGAGGC-CCGAAAACGCATTTTTAGCCAAAAACCGTGATGG-TGGTAC 65 ATCATGCAAAATTGAGCCG-GGCTCCG-GAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTAC * * * * * * 15283 ACGATTTCAGCTAAAAACTGACAC-AAATATTTTTTCT-AATTTTTTTGCCATAATAATCATAAA 128 ACGATTTCGGCTAAAAACTGACTCGAAAAATTTTTTCTCAA-TTTTTTGCCACAATACTCAGAAA ** * ** * * * * * 15346 AAATATATAATTCAA-TACTAAAA-ATAAATGGTTTTTAACCCTTCT-AATACCGTTTTTCCATT 192 AAATATATAATTCAACGCCAAAAAGATTGACGGGTTTTCACGCTTCTAAATATCGTTTTTCCA-T * * * * * 15408 TTTTTCCGAATTAATTTCTAATTAAACCGAAAAAAGATTCAGATACTCGTAAAAGCAAATTCTTA 256 TTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA * * * * ** * * * 15473 TATCCAATGTAGCAGAGATTTGATTCCATGAATATAGATATTTTACGGAGTCTTTGCACCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * ** * 15538 TCATGCAAAACTGATCCGGGCTCTC-GAACGCGTTTTTAGCAAAAAAAAAAAAAAAGTGATGTTT 66 TCATGCAAAATTGAGCCGGGCTC-CGGAACGCGTTTTTAGC-------AAAAAACCGTGATGGTT * * * * * * 15602 AGTACACAATTTGGGCTAAAATTTTGCAAAAATTGACTCGAAAAGTTTTTCCTCAATTTTTTTGA 123 AGTACACGATTTCGGC--------T--AAAAACTGACTCGAAAAATTTTTTCTCAA-TTTTTTGC * ** * * 15667 CACAATACTCAGGAAAAATATATAATTCAACGTAAAAAAGATTAAAGGGTTTTTCACGCTTCT-A 177 CACAATACTCAGAAAAAATATATAATTCAACGCCAAAAAGATTGACGGG-TTTTCACGCTTCTAA * * * * 15731 ACATCGTTTTTCCAATTTTTTTTCGAATTAATTTCTAATTAAATCGAAACAAGATCCAAATGCTC 241 ATATCGTTTTTCC-A-TTTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTC * 15796 GTAAAAACAAAT-CTTT 304 GTAAAAACAAATCCTTA ** * * * * ** 15812 AATCCAATGTCCCTAAAATTTGGTTTGATGAATATAGATATTTCAAGAAGTCTTTGCGCTTAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * * 15877 TCATGCAAAATTGAGTCGGGCTCCGGAATGCGTTTTTTGGCCAAAAACCGTGATGGTTA-TCACA 66 TCATGCAAAATTGAGCCGGGCTCCGGAACGCG-TTTTTAGCAAAAAACCGTGATGGTTAGT-ACA * * * * * * 15941 TGATTTTGGCTAAACACTAACCCGAAAAGA-TTTTTCTCAATTTTTTTCCACAATACTCAG-AAA 129 CGATTTCGGCTAAAAACTGACTCGAAAA-ATTTTTTCTCAATTTTTTGCCACAATACTCAGAAAA * * * * * * ** * 16004 AGTGTATAATTAAACGACAAAAAGATTGACAAGATTTTCACGCTTCTAAATATTATTTTCCCATT 193 AATATATAATTCAACGCCAAAAAGATTGAC-GGGTTTTCACGCTTCTAAATATCGTTTTTCCA-- * 16069 TTTTTTCCGAATTAATTTCTAATTAAATCGAAA-AAGATTCAGATACTCGTAAAAACAAATCCTT 255 TTTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTT 16133 A 320 A ** * * 16134 TGTCCAATGTGGCTGAGATTT-GTTCGATGAATATAGATATTTCAAGGAGTCTTTGCACCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * *** *** 16198 TCATGCAAAATTGAGCCGGGCTCCGGAACGCATTTTTAGCAAAAAAAAAAATGATAAATAGTACA 66 TCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAGC--AAAAAACCGTGATGGTTAGTACA * ** * * * ** 16263 CGATTTCGACTAAAATTTTGCAAAAATTGACAAGAAAGGTTTTTCCTCAATTTTTTTGAAACAAT 129 CGATTTCGGCTAAAA-ACTG----ACTCG--AA-AAA--TTTTTTCTCAA-TTTTTTGCCACAAT * * 16328 ACAT-A-AAAGAATATATAATTCAACGCCAAAAAGATT-ACAGAGTTTTTCACGCTTCTAAATAT 183 AC-TCAGAAAAAATATATAATTCAACGCCAAAAAGATTGAC-G-GGTTTTCACGCTTCTAAATAT * 16390 CGTTTTTCCATTTTTTTTCCGAATTAATTTCTAATTAAATCGAAATAAGATTCAGATGCTCGTAA 245 CGTTTTTCCA--TTTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAA 16455 AAACAAATCCTTA 308 AAACAAATCCTTA * * * * * * 16468 AATCCAATGTGGCTTAGATTTGGTTAGATGAATATACATATTTCAAGAAGTCTTTGTGCTCAAAT 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * * * * * * 16533 TCATGCAAAATTGAGCAGGACACCGGAACGCGTTTTTAGCCAAAAACTGTGATAGTTATTACATG 66 TCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTACACG * * * * 16598 ATTTCGGTTAAAAACTGAC-CCAAAATTTATTTTTTCTCAATTTTTTACCACAATACTCATAAAA 131 ATTTCGGCTAAAAACTGACTCGAAAA---ATTTTTTCTCAATTTTTTGCCACAATACTCAGAAAA * * ** * 16662 AATATATAATTCAACGCCAGAATGATTGACGAAATTTAT-ACGCTTCTAAATATCGTTTTCCCAT 193 AATATATAATTCAACGCCAAAAAGATTGACG-GGTTT-TCACGCTTCTAAATATCGTTTTTCCA- ** * * * * * * 16726 TTTTTTTCTTAATTAATTTATAATTAGATCGAAACAAAATTTAGATACTCGTAAAAACAAATTCT 255 -TTTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCT 16791 TA 319 TA * * * * * * * 16793 TATCCAATGTGGCTAAGATTTGATTCGATGAATATAAATATTTTAAGGAGTCTTTGGGCCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * ** * 16858 TCATGCGAAACTT-ATCCGGGCTCTC-GAACGCGTTTTTAGCAAAAAAAAAAAATGTGATGTTTA 66 TCATGC-AAAATTGAGCCGGGCTC-CGGAACGCGTTTTTAGC-----AAAAAACCGTGATGGTTA ** * * ** 16921 GTACATAATTTGGGCTAAAATTTTTCCAAAAATTGACTC-AAAAAGTTTTCCCTCAATTTTTTTA 124 GTACACGATTTCGGC---------T--AAAAACTGACTCGAAAAA-TTTTTTCTCAA--TTTTT- * * 16985 TGCCACAATACTCAGGAAAAATATATAATTCAACGCCAAAAAAGATT-ACAGAGTTTTTCACGCT 174 TGCCACAATACTCAGAAAAAATATATAATTCAACGCC-AAAAAGATTGAC-G-GGTTTTCACGCT 17049 TCTAAATATCGTTTTTCCA-TTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATG 236 TCTAAATATCGTTTTTCCATTTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATG * 17113 CTCGTAAAAACAATTCCTTA 301 CTCGTAAAAACAAATCCTTA * * * ** 17133 AATCCAATGTGACTGATATTTGGTTAGATGAATATACATATTTCAAGGAGTCTTTGCGGTCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * * * * 17198 TCATGAAAAATTGAGCCGGACTCCGGAACGCGTTTTTAGCCAAAAACTGTGATAGTTATTACACG 66 TCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTACACG * * * * 17263 ATTTCGGCTAAAAATTTAC-CAGAAAATTTTTTTTCTCAA-TTTTTACCACAATACTCA-ACAAA 131 ATTTCGGCTAAAAACTGACTC-GAAAA-ATTTTTTCTCAATTTTTTGCCACAATACTCAGA-AAA * ** * * 17325 AATATATAATTCAACGCCAGAAAGATTGACAGAATTTTCACACTTCTAAATATCGTTTTCCCA-T 193 AATATATAATTCAACGCCAAAAAGATTGAC-GGGTTTTCACGCTTCTAAATATCGTTTTTCCATT * ** * 17389 TTTTCCGAATTAATTTCTAATTAAATCGAAACAAAATTCAGATAATCGTAAAAACAAATTCTTA 257 TTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA * * * * * * * * 17453 TATCCAATGTGACTAAGATTTGATTCGGTGAATATAGATATTTCAAGGAGTCTTTGGGTCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * ** * 17518 TCATGCGAAACTGATCCGGGCTCTC-GAACGCGTTTTTAGCCAAAAAAAAAAAAAGAAGTGATGT 66 TCATGCAAAATTGAGCCGGGCTC-CGGAACGCGTTTTTAG-C-------AAAAAA-CCGTGATGG * * * * * 17582 TTAGTACACAATTTGGGCTAAAATTTTGCAAAAATTGACTCGAAAAGTTTTTCCTCAATTTTTTT 121 TTAGTACACGATTTCGGC--------T--AAAAACTGACTCGAAAAATTTTTTCTCAA-TTTTTT * * * 17647 GCCACAATACTCA-AGAAAAATATATAATTCAACGCAAAAAAGATTAAAGGGGTTTTCACGCTTC 175 GCCACAATACTCAGA-AAAAATATATAATTCAACGCCAAAAAGATT-GACGGGTTTTCACGCTTC * * * * * * 17711 T-AACATCATTTTTCCATTTTTTTTTTCAAATTAATTTCTAATTAAATCGAAACAAGATCCAAAT 238 TAAATATCGTTTTTCCA---TTTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGAT * * 17775 GCTCATAAAAACAAAT-CTTT 300 GCTCGTAAAAACAAATCCTTA * * * * * * ** 17795 AATCCAATGTCGTTTA-A-AT--TT-GATGAATATAGATATTTCAAGGTGTCGTTGCGCTTAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * * * 17855 TCATGCAAAATTGAGCCGGGCTCTGAAATGCGTTTTTAGCCAAAAACCATGATGGTTA-TCACAC 66 TCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGT-ACAC * * * * 17919 GATTTTGGCTAAAAACTAACCCGAAAAGA-TTTTTCTCAATTTTTTTCCACAATACTCAGAAAAG 130 GATTTCGGCTAAAAACTGACTCGAAAA-ATTTTTTCTCAATTTTTTGCCACAATACTC-----AG * * * 17983 TATATATATATATATATAATTAAACGCCAAAAAGATTGACAGGATTTTCACGCTTCTAAATATTG 189 -A-A-A-A-A-ATATATAATTCAACGCCAAAAAGATTGAC-GGGTTTTCACGCTTCTAAATATCG * * * * * 18048 -TTTT-CGTATTTTTCCGGATTAAATTCTAATTAAATCGAAACAAGATTCAAATACTCGTAAAAA 247 TTTTTCCAT-TTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAA * 18111 -AAATCATTA 311 CAAATCCTTA * * * * * * * 18120 TATCCAATGTGGCTAAGATTT-GTTCGATGAATGTAGAAATTTAAAGGAGTCTTGGCGCCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA ** ** * * 18184 TCATGCAAAATTGAG-CGGGCTCCAAAACGCGTTTTTAGTC-AAAAACTATGATGATTATTACAC 66 TCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAG-CAAAAAACCGTGATGGTTAGTACAC * * * 18247 GATTTTCGGCTAAAAACTGACTTGAAATATATTTTCTCAATTTTTTTGCCACAATACTCAGAAAA 130 GA-TTTCGGCTAAAAACTGACTCGAAAAATTTTTTCTCAA-TTTTTTGCCACAATACTCAGAAAA * * * * * 18312 AATATATAATTCAACACCAAAAAGATTGACGGGATTTGACACTTCTAAATAACGTTTTTCCA-TT 193 AATATATAATTCAACGCCAAAAAGATTGACGGGTTTTCACGCTTCTAAATATCGTTTTTCCATTT * * * 18376 TTCCCAAATTAATTTCTAATTAAATCAAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA 258 TTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA * * * * * * * * 18439 AATTCGATGTGGCAGAGATTTGGTTAGTTTAATATAGATATTTTAAGGAGCCATTGCGCCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA ** * * * * 18504 TCATGCAAAATTGAGCATGTCTCTGGAATGCGTTTTTAGCAAAAAACCGTGATGGTTATTACACG 66 TCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTACACG * * * * 18569 ATTTTGGCTAAAAACTGACTCGAAAAACTTTTT-TCAATTTTTTGCCGCAATACTTAGAAAAAAT 131 ATTTCGGCTAAAAACTGACTCGAAAAATTTTTTCTCAATTTTTTGCCACAATACTCAGAAAAAAT * ** ** * * 18633 ATATAATTCAACACCAAAAAGATTGACAGGAATTTCACGCTTCTAAATATCACTCTCCCA--TTT 196 ATATAATTCAACGCCAAAAAGATTGAC-GGGTTTTCACGCTTCTAAATATCGTTTTTCCATTTTT ** * * * * 18696 TCCTGTTTTAATTTCTGATTAAATCGAAACAAAATTCAGATACTCGTAAAAGCAAATCCTTA 260 TCC-GAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA * * * * * 18758 TATCCAATGTGGCTGAGATTTGGTTCGATGAATATAGATATTTGAAGGAGTCTTTCCACCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * * ** * 18823 TCATGCAAAACTGATCCGGGCTCTGGAACGCGTCTTTAGCTAAAAAAAGGTGTTGGTTGGCTAGT 66 TCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAGC-AAAAAACCGTGATGG-T---TAGT * * * 18888 ACACGATTTCGGCTAAAATTTTCCAAAAATTGACCCG-AAAAGTTTTTCTCAATTTTTTTGCCAC 126 ACACGATTTCGGC--------T--AAAAACTGACTCGAAAAATTTTTTCTCAA-TTTTTTGCCAC * ** * * * 18952 AATACTC-GGAAAAATATATGGTTCAACGCCAAAAAGA-TGAAATGGCTTTTCACGCTT-TTAAT 180 AATACTCAGAAAAAATATATAATTCAACGCCAAAAAGATTG--ACGGGTTTTCACGCTTCTAAAT ** * * * 19014 ATAATTTTTCCATTTTTTTCCAAATTAATTTCTAATTAAATCGAAAAAAGATTCAGATGCTCATA 243 ATCGTTTTTCCA-TTTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTA * 19079 AAAACAAATTCTTA 307 AAAACAAATCCTTA * * ** * 19093 AATCCAAAGTGGCTGAGATTTGGTTAGATGATTATAGATATTTCAAGGAGTCTTTTAGCCTAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * ** * * * * * * * * 19158 CCATAAAAAACTGA-CTCGAGAC-CGCGAAACACGTTTTTAACCAAAACCCGTGATGGTTAGAAC 66 TCATGCAAAATTGAGC-CG-GGCTC-CGGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTAC * * * * * * 19221 AAGATATCGTCT-AAAACTGACTTGAGAATTTTTTTCCTCAATTTTTTGCCACAATACTC--AAA 128 ACGATTTCGGCTAAAAACTGACTCGAAAAATTTTTT-CTCAATTTTTTGCCACAATACTCAGAAA * * * * * * * * 19283 AAATATATAATTCAACACTAAAAA-AGTTCAAGGATTTTTCACGCTTTTGACAT-T-ATTTTTCC 192 AAATATATAATTCAACGCCAAAAAGA-TTGACGG-GTTTTCACGCTTCT-AAATATCGTTTTTCC * * ** 19345 A-TTTTTCCGAATTTATTTCTAATTAAATCGAAACAAGATTAAGATGCTTATAAAAACAAATCCT 254 ATTTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCT 19409 TA 319 TA * * * ** ** * * 19411 AATCTAA-GTGGCTGAGATTTGGTTAGATGATTAAAGATATTTTGAGGAGTCTTTATGGCAAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * ** * * * ** 19475 CCATACAGAACCGAGTCGTGGC-CCCGAATCGCGTTTTTAGCCAAAAA-C-T-A-GGACGAGTAC 66 TCATGCAAAATTGAGCCG-GGCTCCGGAA-CGCGTTTTTAGCAAAAAACCGTGATGG-TTAGTAC * * * * * * 19535 ACGATTTCGGCTAAAAATTGACCCG-TAAATTTTTTCTCAATTTTTTGCCAGAATACTTATAAAA 128 ACGATTTCGGCTAAAAACTGACTCGAAAAATTTTTTCTCAATTTTTTGCCACAATACTCAGAAAA * * * * * * * 19599 AATATATAATTGAACGCCAAAAATATTGAAGAGTTTCTCACACTTGT-AATATCATTTTTCCATT 193 AATATATAATTCAACGCCAAAAAGATTGACGGGTTT-TCACGCTTCTAAATATCGTTTTTCCATT * * * 19663 TTTGCCAAATTTATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA 257 TTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA * * * * * * * * 19727 AATTCAATGTGGTTGTGATTTGGTTAGTTAAATATAGATATTTTAAGGAATCTTTGCACCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * ** * * * * 19792 TCATGCAAAATTGAGTCGGGCTCTAGAACGCGCTTTTAACCAAAAACCGTGATGGTTATTACACG 66 TCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTACACG * * * 19857 ATTTCGGCTAAAAACTGACTTGAAAAATATTTTCTGAATTTTTTGCCACAATACTCAGAAAAAAT 131 ATTTCGGCTAAAAACTGACTCGAAAAATTTTTTCTCAATTTTTTGCCACAATACTCAGAAAAAAT * * * * * * * 19922 ATATAATTCAACGCCAAAATGATTAATGGGTTTTTCATGCTT-TTAATATCGTATTTCCATTTGT 196 ATATAATTCAACGCCAAAAAGATTGACGGG-TTTTCACGCTTCTAAATATCGTTTTTCCATTTTT * 19986 TCCGATTTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA 260 TCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA * ** * 20047 AATCCAATGTGGCTGAGATTTGATTAGATGAATATAGATATTTCAAGGAGTCTTTTTTGCCCGAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTC-TTTGCGCCCAAA * * * 20112 TTCATGCAAAATTGAGCCGGGCT------C-CGTTTTTGGCAAAAAAAAAACCGTGAT-ATTAAG 65 ATCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAGC----AAAAAACCGTGATGGTT-AG * * * 20169 CACACGGTTTCGGCTAAAAACTTAAC-C-AAAAGA-TTTTTCTCAATTTTTTTTGCCACAATACT 125 TACACGATTTCGGCTAAAAAC-TGACTCGAAAA-ATTTTTTCTCAA--TTTTTTGCCACAATACT * **** * * * 20231 C-GTAAAAAATAAATAATTCAATAAAAAAAATATTGAAGGGTTTTCCACACTTCT-AATATCGTT 186 CAG-AAAAAATATATAATTCAACGCCAAAAAGATTGACGGGTTTT-CACGCTTCTAAATATCGTT * 20294 TTCCCATTATTTT-CGAATTAATTTCTAATTAAATCGAAACAAGATTCATATTTCAGATGCTCGT 249 TTTCCATT-TTTTCCGAATTAATTTCTAATTAAATCGAAACAAG------A-TTCAGATGCTCGT 20358 AAAAACAAATCCTTA 306 AAAAACAAATCCTTA ** * * * * * * * * 20373 AATCCAATGAAGTTG-GTATTTGCTTCGATGAATATAGAAATTTCAGGGAGTATTTACGCCAAAA 1 AATCCAATGTGGCTGAG-ATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAA * * * * * * * * * 20437 ATTATGCAAAACTAAGCTGGGGC-CCTAGAACGCGTTTTTAGCAAAAAATCGT--T-ATAAGTAA 65 ATCATGCAAAATTGAGC-CGGGCTCC-GGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTAC * * * * 20498 ACGATTTCGGCTAAAATTTTGCAAAAACTGACCCGAAAATTTTTTTCCTCAATTTTTTGCAATAA 128 ACGATTTCGGC--------T--AAAAACTGACTCGAAAAATTTTTT-CTCAATTTTTTGCCACAA * * * * 20563 TACTCAGAAAAAATATATAAGTCAATGCC-AAAAGAATTGACGAGCTTTTCACACTTCT-AATAT 182 TACTCAGAAAAAATATATAATTCAACGCCAAAAAG-ATTGACG-GGTTTTCACGCTTCTAAATAT * * * * * * 20626 CGTTTTCCCA-TTTTTCCAAATTTATTTCTAATTAAATCGAAACAACATTTAAATGCTCGTAAAA 245 CGTTTTTCCATTTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAA 20690 ACAAATCCTTA 310 ACAAATCCTTA * * 20701 AATCCAATGTGGCTGAGATTTGGTTAGATGGATATAGATATTTCAAGGAGTCTTT-CTGCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGC-GCCCAAA * * * * * * * 20765 ATCATGC-AAATCTAAGTCGGGGCCCCGAAATGTGTTTTTAGCAAAAAAACG-G-T-G--A-T-- 65 ATCATGCAAAAT-TGAG-CCGGGCTCCGGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTAC * * * * 20821 --GGTTTCGGCTAAAAACTGACCCGAAAAGTTTATTT-TCAATTTTTTGCCACAACACTCAGAAA 128 ACGATTTCGGCTAAAAACTGACTCGAAAAATTT-TTTCTCAATTTTTTGCCACAATACTCAGAAA * * * * * * * * * 20883 AAATATTTAATTCAACGCTAAAACGATTTAAGGGTTTTTCACG-TTTTTAATAT-TTTTTTCTAT 192 AAATATATAATTCAACGCCAAAAAGATTGACGGG-TTTTCACGCTTCTAAATATCGTTTTTCCAT * * * 20946 TTTTTCCCAATTAATTTCTAATTAAATCGAAACAAGATTAAGATGCTCGTAAAAACAAGTCCTTA 256 TTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA * * * 21011 AATCCAATGTGG-TCGAGATTTGGTTAGATGAATATAGATATTTCAAGAAGTTTTTGCACCCAAA 1 AATCCAATGTGGCT-GAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAA * * * * 21075 ATCATGCAAAATTGAGTCGGGCTCCGAAACGCATTTTTAGTC-AAAAACCGTGATGATTAGTACA 65 ATCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAG-CAAAAAACCGTGATGGTTAGTACA * ** * * 21139 CAATTTCGGCTAAAAACTGATCTAAAAAAATTTTTTCTCAATTTTTTGTCATAATACTCAG-AAA 129 CGATTTCGGCTAAAAACTGA-CTCGAAAAATTTTTTCTCAATTTTTTGCCACAATACTCAGAAAA * * * 21203 AATATATAATTCAACGCCAAAAAGATTGGCGAGATTTTTACGCTTCTAAATATCGTTTTTCCA-- 193 AATATATAATTCAACGCCAAAAAGATTGACG-GGTTTTCACGCTTCTAAATATCGTTTTTCCATT * * * 21266 TTTTCCGAATTAATTTCTAATTAAATCGAAACAATATTCAGATACTCGTAAAAAAAAATCCTTA 257 TTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA * * * * * 21330 TATCCAATGTGACTGATATTTGGTTCGATGAATATAGATATTTCAAGGACTCTTTGCGCCCAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * * * * * ** 21395 TTATGCAAAACTAATCCGAGCTACGGAACGTGTTTTTAGCAAAAAAAAAAAAAAAAATGTGATGG 66 TCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAGC----------AAAAAACCGTGATGG * * * * 21460 TTAGTACACGATTTCGGAT-AAAATTGAC-CAGAAAAGTTTTTTCTCAATTTCTTTACCACAATA 121 TTAGTACACGATTTCGGCTAAAAACTGACTC-GAAAAATTTTTTCTCAATTT-TTTGCCACAATA * ** * ** * * 21523 CT-AAATATATATATATATATAGATAGAAAAAGGATTAAAGATTAAAGATTAAAGGGTTTTTCAC 184 CTCAGA-A-A-A-A-ATATATA-AT---TCAACG---CCA-A--AAAGATTGACGGG-TTTTCAC * * * * * 21587 GCTTCT-AATATCATTTTTCCATTTTTTTCCCCGAATTTATTACTAATTAAATAGAAAAAAGATT 233 GCTTCTAAATATCGTTTTTCCA-TTTTTT--CCGAATTAATTTCTAATTAAATCGAAACAAGATT * * * 21651 AAAATTCTCG-AAAAACAAATCCTTA 295 CAGATGCTCGTAAAAACAAATCCTTA * * * *** * 21676 AATCCAATGTGGCTGA-AATTGGTTAGATGATTATAGATATTTTAAGGAGTCTTT-ATTCTAATA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAA-A * * * * * * * ** * * * 21739 ACCAAGCAAAACTGAGTCGAGGCCCCGAAACGTGTTTTTAGCCTAAAACTGTGATAGTTAGTACC 65 ATCATGCAAAATTGAGCCG-GGCTCCGGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTACA * ** * * 21804 CGATTTCGGGTAAAAACTGACTCAGAGAATTTTTTTTCTCAATTTTTTTGCCAAAATACTTA-AA 129 CGATTTCGGCTAAAAACTGACTC-GA-AAAATTTTTTCTCAA-TTTTTTGCCACAATACTCAGAA * * * * * 21868 ATATATATATAATCCAATGTCAAAAAGATTGAAGTGGTTTTCACGCTTCTAACAT-T-GTTTTTC 191 A-AAATATATAATTCAACGCCAAAAAGATTGACG-GGTTTTCACGCTTCTAA-ATATCGTTTTTC * * * 21931 CATTATTTTGTTTCCGAATTAATATCTAATTAAATCGAAACAAGATTCAGATACTTGTAAAAACA 253 C---A-TTT-TTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACA * 21996 AATACTTA 313 AATCCTTA * * * ** * * * 22004 AATCGAATGAGGCTGGGATTTGGTTCCATGAATATAGATATTTCAAGAAGTCTTTACGCCAAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * * ** * * 22069 TAATGTAAAACTAAGCCGAGG-TCCCAAAACGCGTTTTTAGCAAAAAACTGTGAT-G--AGTAAA 66 TCATGCAAAATTGAGCCG-GGCT-CCGGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTACA * * * 22130 CGAATTTTGGTTAAAATTTTACAAAAACTGAACTC-AAAAATTTTTCCTCAATTATTTT-CCACA 129 CG-ATTTCGG--------CT--AAAAACTG-ACTCGAAAAATTTTTTCTCAATT-TTTTGCCACA * * * * 22193 ATACTCAGAAAAGATATATAATTCAACGCCAAAACA-ATTAAAGTGTTTTTCACGCTTCTAAA-A 181 ATACTCAGAAAAAATATATAATTCAACGCCAAAA-AGATTGACG-GGTTTTCACGCTTCTAAATA * * * * * 22256 TCATTTTTCC---TTTTCCGAATTTATTTCTAATTAAATCGAAATAAGATTCAGATACTCGTTAA 244 TCGTTTTTCCATTTTTTCCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAA 22318 AACAAATCCTTA 309 AACAAATCCTTA * ** * * * * * * * ** * 22330 GATTTAATGTGGCTGGGATTTGGTTTGGTTATTTTAGATATTTCAAGGAGTTTTTATGCCAAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * ** * * ** * * ** 22395 CCATGCAAAACAGAGTCGGGGTCCTTAAACGCGTTTTTTTGGC-AAAAACCG-CA---ACA-TAC 66 TCATGCAAAATTGAGCCGGGCTCC-GGAACGCG--TTTTTAGCAAAAAACCGTGATGGTTAGTAC * * * ** * 22454 ACGAATTCGGCTAAAAACAGACTCGAAAAATATTTTCTCAATTTTTTGCGTCAATACACAGAAAA 128 ACGATTTCGGCTAAAAACTGACTCGAAAAATTTTTTCTCAATTTTTTGCCACAATACTCAG-AAA * * * * * * 22519 AATATATATAATTTAATGCCAAAAAGATTGAAGTGCTTTTCACCCTTCT-AATTTCGTTTTTCCA 192 AA-ATATATAATTCAACGCCAAAAAGATTGACG-GGTTTTCACGCTTCTAAATATCGTTTTTCCA * * 22583 TTTTTTTCGAATTAATATTC-AATTAAATCGAAACAAGATTC-GAATGCTCGTAAAAACAAATCA 255 TTTTTTCCGAATTAAT-TTCTAATTAAATCGAAACAAGATTCAG-ATGCTCGTAAAAACAAATCC 22646 TTA 318 TTA * * * * * 22649 AATCCAATGTGGCTGAGATTTGGTTCGATGAATATAGATATTTCAGGGGGTCTTTACGCCGAAAA 1 AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA * * * * 22714 TCATGCATAATTGAGCCGTGGCT-CGGAAACGCTTTTTTAGCCAAAAACCGTGAAGGTTAGTA 66 TCATGCAAAATTGAGCCG-GGCTCCGG-AACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTA 22776 GTTATCATAG Statistics Matches: 6126, Mismatches: 1098, Indels: 791 0.76 0.14 0.10 Matches are distributed among these distances: 309 36 0.01 310 182 0.03 311 30 0.00 312 3 0.00 313 34 0.01 314 67 0.01 315 90 0.01 316 182 0.03 317 284 0.05 318 315 0.05 319 686 0.11 320 568 0.09 321 290 0.05 322 216 0.04 323 41 0.01 324 66 0.01 325 351 0.06 326 241 0.04 327 87 0.01 328 284 0.05 329 231 0.04 330 48 0.01 331 17 0.00 332 19 0.00 333 194 0.03 334 185 0.03 335 315 0.05 336 156 0.03 337 106 0.02 338 34 0.01 339 170 0.03 340 267 0.04 341 21 0.00 342 67 0.01 343 107 0.02 344 3 0.00 345 51 0.01 346 46 0.01 347 36 0.01 ACGTcount: A:0.37, C:0.16, G:0.14, T:0.33 Consensus pattern (320 bp): AATCCAATGTGGCTGAGATTTGGTTAGATGAATATAGATATTTCAAGGAGTCTTTGCGCCCAAAA TCATGCAAAATTGAGCCGGGCTCCGGAACGCGTTTTTAGCAAAAAACCGTGATGGTTAGTACACG ATTTCGGCTAAAAACTGACTCGAAAAATTTTTTCTCAATTTTTTGCCACAATACTCAGAAAAAAT ATATAATTCAACGCCAAAAAGATTGACGGGTTTTCACGCTTCTAAATATCGTTTTTCCATTTTTT CCGAATTAATTTCTAATTAAATCGAAACAAGATTCAGATGCTCGTAAAAACAAATCCTTA Found at i:22856 original size:21 final size:20 Alignment explanation

Indices: 22832--22877 Score: 65 Period size: 21 Copynumber: 2.2 Consensus size: 20 22822 CTCTGGGTCA * 22832 CGGGTCAATTGGTTCAACCGT 1 CGGGTCAATTGGGTCAA-CGT * 22853 CGGGTCAATTGGGTCAATGT 1 CGGGTCAATTGGGTCAACGT 22873 CGGGT 1 CGGGT 22878 TAATAAAATA Statistics Matches: 23, Mismatches: 2, Indels: 1 0.88 0.08 0.04 Matches are distributed among these distances: 20 7 0.30 21 16 0.70 ACGTcount: A:0.17, C:0.20, G:0.35, T:0.28 Consensus pattern (20 bp): CGGGTCAATTGGGTCAACGT Found at i:25065 original size:122 final size:121 Alignment explanation

Indices: 24828--25070 Score: 262 Period size: 122 Copynumber: 2.0 Consensus size: 121 24818 TTTTTTAAAT * 24828 TAAAATGGCAAAAATAAAATAATTATAAAATATTAAATTTAATTAAATAAAATAGAGTTTTTAAT 1 TAAAATGGCAAAAATAAAATAATTATAAAATATTAAATTTAATTAAATAAAATAGAGCTTTTAAT ** * * * * 24893 ATAATAAAACTGTATATTAAAAGATTTTAATATATCCAAATTTTTATTGAAAAATAG 66 ATAATAAAACTAAATATTAAAAGA-TTGAATATATACAAATATGTATTGAAAAATAG * * * * 24950 TAAAATGGTAAAAATAAAGTAATTATAAAGATATTAGATTTCATTAAATAAAAATAGAGCTTTTA 1 TAAAATGGCAAAAATAAAATAATTATAAA-ATATTAAATTTAATTAAAT-AAAATAGAGCTTTTA * * * 25015 GTA-AATAAAACTAAAATAGTTAAACA-A-TGACATTTA-AGAAATATGT-TTGAAAAATA 64 ATATAATAAAACT-AAATA-TTAAA-AGATTGA-ATATATACAAATATGTATTGAAAAATA 25071 AGAGTAAATG Statistics Matches: 101, Mismatches: 14, Indels: 12 0.80 0.11 0.09 Matches are distributed among these distances: 122 37 0.37 123 34 0.34 124 23 0.23 125 6 0.06 126 1 0.01 ACGTcount: A:0.53, C:0.04, G:0.09, T:0.34 Consensus pattern (121 bp): TAAAATGGCAAAAATAAAATAATTATAAAATATTAAATTTAATTAAATAAAATAGAGCTTTTAAT ATAATAAAACTAAATATTAAAAGATTGAATATATACAAATATGTATTGAAAAATAG Found at i:25132 original size:13 final size:14 Alignment explanation

Indices: 25113--25149 Score: 60 Period size: 13 Copynumber: 2.8 Consensus size: 14 25103 ACTCGTACTT 25113 TTATATATATAATA 1 TTATATATATAATA 25127 -TATATATAT-ATA 1 TTATATATATAATA 25139 TTATATATATA 1 TTATATATATA 25150 GTAAGATATA Statistics Matches: 21, Mismatches: 0, Indels: 4 0.84 0.00 0.16 Matches are distributed among these distances: 12 3 0.14 13 18 0.86 ACGTcount: A:0.49, C:0.00, G:0.00, T:0.51 Consensus pattern (14 bp): TTATATATATAATA Found at i:25134 original size:15 final size:15 Alignment explanation

Indices: 25114--25149 Score: 63 Period size: 15 Copynumber: 2.4 Consensus size: 15 25104 CTCGTACTTT 25114 TATATATATAATATA 1 TATATATATAATATA * 25129 TATATATATATTATA 1 TATATATATAATATA 25144 TATATA 1 TATATA 25150 GTAAGATATA Statistics Matches: 20, Mismatches: 1, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 15 20 1.00 ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50 Consensus pattern (15 bp): TATATATATAATATA Found at i:25167 original size:2 final size:2 Alignment explanation

Indices: 25114--25149 Score: 58 Period size: 2 Copynumber: 19.0 Consensus size: 2 25104 CTCGTACTTT 25114 TA TA TA TA TA -A TA TA TA TA TA TA TA T- TA TA TA TA TA 1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA 25150 GTAAGATATA Statistics Matches: 32, Mismatches: 0, Indels: 4 0.89 0.00 0.11 Matches are distributed among these distances: 1 2 0.06 2 30 0.94 ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50 Consensus pattern (2 bp): TA Done.