Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01010609.1 Corchorus capsularis cultivar CVL-1 contig10630, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 71990
ACGTcount: A:0.32, C:0.19, G:0.18, T:0.31


Found at i:7 original size:2 final size:2

Alignment explanation

Indices: 1--48 Score: 96 Period size: 2 Copynumber: 24.0 Consensus size: 2 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT 43 AT AT AT 1 AT AT AT 49 CAAGAGGTCA Statistics Matches: 46, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 46 1.00 ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50 Consensus pattern (2 bp): AT Found at i:4730 original size:83 final size:86 Alignment explanation

Indices: 4605--4769 Score: 255 Period size: 83 Copynumber: 2.0 Consensus size: 86 4595 TAAAAATATA * * 4605 AAGGCACGAGATCAAAGAGTGGATTCACCATTAAAATCGTAATCTCCAATTGGCAACTTTTAACC 1 AAGGCACGAGATCAAAGAGTGGATACACCATTAAAATCGTAATCTCCAATTGACAACTTTTAACC * 4670 AAATTATAGTTAGGAAACATT 66 AAATTATAGTTAAGAAACATT * * * 4691 AAGGCACGAGATC-AAG-G-GGATACACCATTAAAATTGTAATCTCCAGTTGACCACTTTTAACC 1 AAGGCACGAGATCAAAGAGTGGATACACCATTAAAATCGTAATCTCCAATTGACAACTTTTAACC 4753 AAATTATAGTTAAGAAA 66 AAATTATAGTTAAGAAA 4770 TCCATACATT Statistics Matches: 73, Mismatches: 6, Indels: 3 0.89 0.07 0.04 Matches are distributed among these distances: 83 56 0.77 84 1 0.01 85 3 0.04 86 13 0.18 ACGTcount: A:0.40, C:0.18, G:0.16, T:0.26 Consensus pattern (86 bp): AAGGCACGAGATCAAAGAGTGGATACACCATTAAAATCGTAATCTCCAATTGACAACTTTTAACC AAATTATAGTTAAGAAACATT Found at i:5045 original size:14 final size:14 Alignment explanation

Indices: 5026--5052 Score: 54 Period size: 14 Copynumber: 1.9 Consensus size: 14 5016 AGCTTATAAA 5026 CCCCCCCCCGCCCC 1 CCCCCCCCCGCCCC 5040 CCCCCCCCCGCCC 1 CCCCCCCCCGCCC 5053 TACCCCAAGT Statistics Matches: 13, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 14 13 1.00 ACGTcount: A:0.00, C:0.93, G:0.07, T:0.00 Consensus pattern (14 bp): CCCCCCCCCGCCCC Found at i:7325 original size:32 final size:32 Alignment explanation

Indices: 7275--7336 Score: 106 Period size: 32 Copynumber: 1.9 Consensus size: 32 7265 GAGTTTATGA * 7275 ATTCCATATACCTAGAAACAGACGCCACAAGG 1 ATTCCATACACCTAGAAACAGACGCCACAAGG * 7307 ATTCGATACACCTAGAAACAGACGCCACAA 1 ATTCCATACACCTAGAAACAGACGCCACAA 7337 TCCTCCTAAC Statistics Matches: 28, Mismatches: 2, Indels: 0 0.93 0.07 0.00 Matches are distributed among these distances: 32 28 1.00 ACGTcount: A:0.42, C:0.29, G:0.15, T:0.15 Consensus pattern (32 bp): ATTCCATACACCTAGAAACAGACGCCACAAGG Found at i:12310 original size:7 final size:7 Alignment explanation

Indices: 12298--12327 Score: 60 Period size: 7 Copynumber: 4.3 Consensus size: 7 12288 CATCTCCATT 12298 ACTTCAA 1 ACTTCAA 12305 ACTTCAA 1 ACTTCAA 12312 ACTTCAA 1 ACTTCAA 12319 ACTTCAA 1 ACTTCAA 12326 AC 1 AC 12328 ACTGCTATTA Statistics Matches: 23, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 7 23 1.00 ACGTcount: A:0.43, C:0.30, G:0.00, T:0.27 Consensus pattern (7 bp): ACTTCAA Found at i:16136 original size:28 final size:28 Alignment explanation

Indices: 16104--16159 Score: 85 Period size: 28 Copynumber: 2.0 Consensus size: 28 16094 TTGTCGGTAT * * 16104 AAACTCAAGTTCATTTTGATGCCAAAAA 1 AAACTCAAGTACATTTTGATCCCAAAAA * 16132 AAACTCGAGTACATTTTGATCCCAAAAA 1 AAACTCAAGTACATTTTGATCCCAAAAA 16160 GAAAGAAAAA Statistics Matches: 25, Mismatches: 3, Indels: 0 0.89 0.11 0.00 Matches are distributed among these distances: 28 25 1.00 ACGTcount: A:0.43, C:0.20, G:0.11, T:0.27 Consensus pattern (28 bp): AAACTCAAGTACATTTTGATCCCAAAAA Found at i:16307 original size:20 final size:20 Alignment explanation

Indices: 16282--16340 Score: 72 Period size: 20 Copynumber: 3.0 Consensus size: 20 16272 ATGTAACGGG 16282 ATATCCGTCGATATATCCGT 1 ATATCCGTCGATATATCCGT 16302 ATATCCGTCGATATTTAT-CG- 1 ATATCCGTCGATA--TATCCGT 16322 ATAT-C-TCGATATATCCGT 1 ATATCCGTCGATATATCCGT 16340 A 1 A 16341 AATATCCGTA Statistics Matches: 35, Mismatches: 0, Indels: 10 0.78 0.00 0.22 Matches are distributed among these distances: 16 3 0.09 17 2 0.06 18 7 0.20 19 1 0.03 20 17 0.49 21 2 0.06 22 3 0.09 ACGTcount: A:0.27, C:0.22, G:0.14, T:0.37 Consensus pattern (20 bp): ATATCCGTCGATATATCCGT Found at i:16345 original size:10 final size:10 Alignment explanation

Indices: 16332--16357 Score: 52 Period size: 10 Copynumber: 2.6 Consensus size: 10 16322 ATATCTCGAT 16332 ATATCCGTAA 1 ATATCCGTAA 16342 ATATCCGTAA 1 ATATCCGTAA 16352 ATATCC 1 ATATCC 16358 ATATTAAATT Statistics Matches: 16, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 10 16 1.00 ACGTcount: A:0.38, C:0.23, G:0.08, T:0.31 Consensus pattern (10 bp): ATATCCGTAA Found at i:16702 original size:3 final size:3 Alignment explanation

Indices: 16690--16728 Score: 71 Period size: 3 Copynumber: 13.3 Consensus size: 3 16680 GCTCACGGAA 16690 GAT G-T GAT GAT GAT GAT GAT GAT GAT GAT GAT GAT GAT G 1 GAT GAT GAT GAT GAT GAT GAT GAT GAT GAT GAT GAT GAT G 16729 GGGGAAATGA Statistics Matches: 35, Mismatches: 0, Indels: 2 0.95 0.00 0.05 Matches are distributed among these distances: 2 2 0.06 3 33 0.94 ACGTcount: A:0.31, C:0.00, G:0.36, T:0.33 Consensus pattern (3 bp): GAT Found at i:17834 original size:13 final size:12 Alignment explanation

Indices: 17804--17846 Score: 77 Period size: 12 Copynumber: 3.5 Consensus size: 12 17794 CATCGATACC 17804 TCGATATATCCG 1 TCGATATATCCG 17816 TCGATATATCCG 1 TCGATATATCCG 17828 TTCGATATATCCG 1 -TCGATATATCCG 17841 TCGATA 1 TCGATA 17847 CCTGTATTAA Statistics Matches: 30, Mismatches: 0, Indels: 2 0.94 0.00 0.06 Matches are distributed among these distances: 12 18 0.60 13 12 0.40 ACGTcount: A:0.26, C:0.23, G:0.16, T:0.35 Consensus pattern (12 bp): TCGATATATCCG Found at i:18297 original size:220 final size:219 Alignment explanation

Indices: 17915--18345 Score: 711 Period size: 220 Copynumber: 2.0 Consensus size: 219 17905 ATTATACAAA 17915 ATACAAAACAATAAAAGGAGGATATGAAAGTACCAAAAAACCAAACCAACCAACTCTTCTCATGA 1 ATACAAAACAATAAAAGGAGGATATGAAAGTACCAAAAAACCAAACCAACCAACTCTTCTCATGA * * ** 17980 CTTGAAGAATAAATGGGGAGGGGTATTTTGATACTTTCATGCTCGGTTTTTTTGTTATTTGGATT 66 CTTGAAGAATAAATGGAGAGGGGTATTTTGATACTTTCATGCTCGATTTTTTCATTATTTGGATT * 18045 GATGATTGAAAGTATTTTGAAAATTTTGTGAAAAAGTTACTATTTT-TGTGGGACCTATTCTCCA 131 GATGATTGAAAGTATTTTGAAAACTTTGTGAAAAAGTTACTATTTTGTG-GGGACC-ATTCTCCA * 18109 GGAACTAATACTTTTATTGGATTAAT 194 GAAACTAATACTTTTATTGGATTAAT * 18135 ATACAAAACAATAAAAGGAGGATATGAAAGTACCAAAAAACCAACCCAACCAACTCTTCTCATGA 1 ATACAAAACAATAAAAGGAGGATATGAAAGTACCAAAAAACCAAACCAACCAACTCTTCTCATGA * 18200 CTTGAAGAATAAATGGAGAGGGGTATTTTGGTACTTTCATGCTCGATTTTTTCATTATTTGGATT 66 CTTGAAGAATAAATGGAGAGGGGTATTTTGATACTTTCATGCTCGATTTTTTCATTATTTGGATT ** ** * * 18265 GATGATTGAGGGTATTTTGGGAACTTTGTGAAAAAGTTACTATTTTGTGGGGTCCATTCTCCATA 131 GATGATTGAAAGTATTTTGAAAACTTTGTGAAAAAGTTACTATTTTGTGGGGACCATTCTCCAGA 18330 AACTAATACTTTTATT 196 AACTAATACTTTTATT 18346 AGATCATTTA Statistics Matches: 196, Mismatches: 14, Indels: 3 0.92 0.07 0.01 Matches are distributed among these distances: 219 24 0.12 220 170 0.87 221 2 0.01 ACGTcount: A:0.34, C:0.13, G:0.18, T:0.34 Consensus pattern (219 bp): ATACAAAACAATAAAAGGAGGATATGAAAGTACCAAAAAACCAAACCAACCAACTCTTCTCATGA CTTGAAGAATAAATGGAGAGGGGTATTTTGATACTTTCATGCTCGATTTTTTCATTATTTGGATT GATGATTGAAAGTATTTTGAAAACTTTGTGAAAAAGTTACTATTTTGTGGGGACCATTCTCCAGA AACTAATACTTTTATTGGATTAAT Found at i:19149 original size:437 final size:438 Alignment explanation

Indices: 18435--19224 Score: 1047 Period size: 438 Copynumber: 1.8 Consensus size: 438 18425 CGCGTTGACT * * * * 18435 TTTATTTTTGTATTTTTTTTTCTATTTTTCCGATTAAGGTGATTCAAGCGTCTATTAAGAGATAA 1 TTTATTTTTGTATTCTTTGTTCTATTTGTCCGATTAAGGTGATTCAAGCGTCTATTAAAAGATAA * ** * * 18500 TTTCATGATCTACAATTTTCATTAAGAACTCAAAAACCAATTTTAATGTGTTGATTCTAAAAAAT 66 TTTCATGATCTACAACTTTCAGGAAGAACTCAAAAACCAATTTTAATGTGTTAATTCAAAAAAAT * * * * * * 18565 GGTTCCGAAATTTTGTGGTTTTGATTGCCGGTTAATTTAATATCGTATAATTTTTTGTCCACATG 131 GGTTCCGAAATTTTGTGGTTTCGATTGCCGGTTAATTCAATACCATATAATCTTTCGTCCACATG * * * * * * 18630 TCCGATTGAAGTTATTGAAGTGTCGATTAAAAGGTTATTGCATGATTTACGACTTTCATAAAGGA 196 TCCAATTAAAGTTATTCAAGTGTCGATTAAAAGGTTACTGCATGATCTACGACTTTCATAAAGAA * * 18695 CCCGAAAGCTAAATTTGATATACGAGTTTCGTTAAGGGTT-AAAAGAGAATTTTTATGTTTCAAG 261 CCCGAAAGCTAAATTTGATATACGAGTTTCATGAAGGGTTCAAAAGAGAATTTTTATGTTTCAAG 18759 ATCTCCATTAACAAAC-ATTTTCTTATTTGGATTATTTATCAAATGACCCTCATACTTTTCTACA 326 ATCTCCATTAAC-AACTATTTTCTTATTTGGATTATTTATCAAATGACCCTCATACTTTTCTACA 18823 TTATACTACTTAGTCCTTTACAAATTCTATCTTAATCTGACGTTTAAGC 390 TTATACTACTTAGTCCTTTACAAATTCTATCTTAATCTGACGTTTAAGC * * * * * 18872 TTTATTTTTTTATTCTTTGTTCTATTTGTCCGATTAAGTTGATTCATGTGTCTATTAAAAGGTAA 1 TTTATTTTTGTATTCTTTGTTCTATTTGTCCGATTAAGGTGATTCAAGCGTCTATTAAAAGATAA * * ** * 18937 TTTCATGATTTACAACTTTCAGGAAGGACTC-AAAAGTAATTTTTTATGT-TTCAATTCAAAAAA 66 TTTCATGATCTACAACTTTCAGGAAGAACTCAAAAACCAA-TTTTAATGTGTT-AATTCAAAAAA * * * ** 19000 ATTGTTTCCTAAA-TTTGATTGTTTCGATTGTTGGTCT-ATTCAATACCATATAA-CTTTCGATC 129 A-TGGTTCCGAAATTTTG-TGGTTTCGATTGCCGGT-TAATTCAATACCATATAATCTTTCG-TC * * * * 19062 CACATGTCCAATTAAAGTTATTCAAGTGTCGGTTAAAAGGTTACTGTATGGTCTACTACTTTCAT 190 CACATGTCCAATTAAAGTTATTCAAGTGTCGATTAAAAGGTTACTGCATGATCTACGACTTTCAT * * * * 19127 GAAGAACCCGAAAG-TTAATTTGATCTACGAGTTTCATGGAGGGTTCAAAAGAGAATTTTTATGT 255 AAAGAACCCGAAAGCTAAATTTGATATACGAGTTTCATGAAGGGTTCAAAAGAGAATTTTTATGT 19191 TTCAAGATCTCCATTAACAACTATTTTCTTATTT 320 TTCAAGATCTCCATTAACAACTATTTTCTTATTT 19225 TTTTTACTCG Statistics Matches: 299, Mismatches: 46, Indels: 15 0.83 0.13 0.04 Matches are distributed among these distances: 436 8 0.03 437 137 0.46 438 153 0.51 439 1 0.00 ACGTcount: A:0.30, C:0.14, G:0.14, T:0.42 Consensus pattern (438 bp): TTTATTTTTGTATTCTTTGTTCTATTTGTCCGATTAAGGTGATTCAAGCGTCTATTAAAAGATAA TTTCATGATCTACAACTTTCAGGAAGAACTCAAAAACCAATTTTAATGTGTTAATTCAAAAAAAT GGTTCCGAAATTTTGTGGTTTCGATTGCCGGTTAATTCAATACCATATAATCTTTCGTCCACATG TCCAATTAAAGTTATTCAAGTGTCGATTAAAAGGTTACTGCATGATCTACGACTTTCATAAAGAA CCCGAAAGCTAAATTTGATATACGAGTTTCATGAAGGGTTCAAAAGAGAATTTTTATGTTTCAAG ATCTCCATTAACAACTATTTTCTTATTTGGATTATTTATCAAATGACCCTCATACTTTTCTACAT TATACTACTTAGTCCTTTACAAATTCTATCTTAATCTGACGTTTAAGC Found at i:23436 original size:2 final size:2 Alignment explanation

Indices: 23429--23461 Score: 57 Period size: 2 Copynumber: 16.0 Consensus size: 2 23419 ACCCAACGTG 23429 AT AT AT AT AT AT AT AT AT AT AT AT AT AT GAT AT 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT -AT AT 23462 GTGTCTTTGC Statistics Matches: 30, Mismatches: 0, Indels: 2 0.94 0.00 0.06 Matches are distributed among these distances: 2 28 0.93 3 2 0.07 ACGTcount: A:0.48, C:0.00, G:0.03, T:0.48 Consensus pattern (2 bp): AT Found at i:23621 original size:41 final size:41 Alignment explanation

Indices: 23562--23745 Score: 142 Period size: 41 Copynumber: 4.4 Consensus size: 41 23552 CTTGTGTTAC * * 23562 ATGTGTTT-AGGGATTTTGATATAGATGCCTCTGTGTTAAGA 1 ATGTGTTTGA-GGACTTTGATAGAGATGCCTCTGTGTTAAGA * * 23603 ATGTGCTTGAGGACTTTGAGAGAGAGTTG-CTCCTGTGTTATA-A 1 ATGTGTTTGAGGACTTTGATAGAGA--TGCCT-CTGTGTTA-AGA * * * * * 23646 TTGTGTTTGGGGACTTTGATATG-GATGCTTTTGTGTTATGA 1 ATGTGTTTGAGGACTTTGATA-GAGATGCCTCTGTGTTAAGA * * * * * 23687 ATGTGTTTGAGGACTTTAAAAGAGTTGCCCCTGTGTTATGA 1 ATGTGTTTGAGGACTTTGATAGAGATGCCTCTGTGTTAAGA * * 23728 TTGTGTTTGGGGACTTTG 1 ATGTGTTTGAGGACTTTG 23746 GTTATTGGGT Statistics Matches: 112, Mismatches: 22, Indels: 18 0.74 0.14 0.12 Matches are distributed among these distances: 40 1 0.01 41 75 0.67 42 4 0.04 43 30 0.27 44 2 0.02 ACGTcount: A:0.20, C:0.09, G:0.30, T:0.41 Consensus pattern (41 bp): ATGTGTTTGAGGACTTTGATAGAGATGCCTCTGTGTTAAGA Found at i:23653 original size:84 final size:82 Alignment explanation

Indices: 23547--23745 Score: 258 Period size: 82 Copynumber: 2.4 Consensus size: 82 23537 GTAGAGAAAA * * * * 23547 TTGCCCTTGTGTTA-CA-TGTGTTTAGGGATTTTGATATAGATGCCTCTGTGTTAAGAATGTGCT 1 TTGCCCCTGTGTTATAATTGTGTTTGGGGACTTTGATATAGATGCCTCTGTGTTAAGAATGTGCT * 23610 TGAGGACTTTGAGAGAGAG 66 TGAGGACTTT-A-AAAGAG * * * * * * 23629 TTGCTCCTGTGTTATAATTGTGTTTGGGGACTTTGATATGGATGCTTTTGTGTTATGAATGTGTT 1 TTGCCCCTGTGTTATAATTGTGTTTGGGGACTTTGATATAGATGCCTCTGTGTTAAGAATGTGCT 23694 TGAGGACTTTAAAAGAG 66 TGAGGACTTTAAAAGAG * 23711 TTGCCCCTGTGTTATGATTGTGTTTGGGGACTTTG 1 TTGCCCCTGTGTTATAATTGTGTTTGGGGACTTTG 23746 GTTATTGGGT Statistics Matches: 102, Mismatches: 13, Indels: 4 0.86 0.11 0.03 Matches are distributed among these distances: 82 50 0.49 83 2 0.02 84 50 0.49 ACGTcount: A:0.19, C:0.10, G:0.29, T:0.42 Consensus pattern (82 bp): TTGCCCCTGTGTTATAATTGTGTTTGGGGACTTTGATATAGATGCCTCTGTGTTAAGAATGTGCT TGAGGACTTTAAAAGAG Found at i:43203 original size:35 final size:35 Alignment explanation

Indices: 43160--43641 Score: 509 Period size: 35 Copynumber: 13.8 Consensus size: 35 43150 ATTACCCTTT * * 43160 CTTAATTACCCTGAATTAAGTTACTTATTGACTTG 1 CTTAATTACCCTGAATTAAGTTACTTACTGACTTA * * * * 43195 CTTGATTACCCTGAATCAAGTTGCTAACTGACTTA 1 CTTAATTACCCTGAATTAAGTTACTTACTGACTTA * 43230 CTTAATTACCCTGAATTAAGCTACTTACTG----A 1 CTTAATTACCCTGAATTAAGTTACTTACTGACTTA * * * * 43261 CTTAATTACCCTGGATTAAGTTACCT-GTTACTTA 1 CTTAATTACCCTGAATTAAGTTACTTACTGACTTA * * 43295 CTTAATTACCCTGAATTAAGTTGA-TTACTAAATTA 1 CTTAATTACCCTGAATTAAGTT-ACTTACTGACTTA * * * * 43330 CTTAATTACCCTGAATTAAATTAATAACTGGA-TTT 1 CTTAATTACCCTGAATTAAGTTACTTACT-GACTTA * 43365 CTTAATTACCCTGAATTAAGTTACTGCTTACTAACTTA 1 CTTAATTACCCTGAATTAAGTTA---CTTACTGACTTA * * * * 43403 CTTAATTACCCTGAATTAAGTTTCTTATTAACTCA 1 CTTAATTACCCTGAATTAAGTTACTTACTGACTTA * 43438 CTTAATTACCCTGAATTAAATTA-TTCACTGACTTA 1 CTTAATTACCCTGAATTAAGTTACTT-ACTGACTTA * * 43473 CTTAACTACCCTGAATTAAATTA-TTCACTGACTTA 1 CTTAATTACCCTGAATTAAGTTACTT-ACTGACTTA * * * * 43508 CATAATTATCCTGAATTAAGTTACTTATTAACTTA 1 CTTAATTACCCTGAATTAAGTTACTTACTGACTTA * 43543 CTTAATTTACCCTGAATTAAGTTA-TTCACTGACCTA 1 CTTAA-TTACCCTGAATTAAGTTACTT-ACTGACTTA * 43579 CTTAATTTACCCTGAATTAAGTTA-TTCACTGACCTA 1 CTTAA-TTACCCTGAATTAAGTTACTT-ACTGACTTA 43615 CTTAATTACCCTGAATTAAGTTACTTA 1 CTTAATTACCCTGAATTAAGTTACTTA 43642 TTACTGATTT Statistics Matches: 380, Mismatches: 50, Indels: 34 0.82 0.11 0.07 Matches are distributed among these distances: 30 1 0.00 31 24 0.06 34 26 0.07 35 231 0.61 36 69 0.18 37 1 0.00 38 28 0.07 ACGTcount: A:0.32, C:0.20, G:0.09, T:0.40 Consensus pattern (35 bp): CTTAATTACCCTGAATTAAGTTACTTACTGACTTA Found at i:43446 original size:108 final size:106 Alignment explanation

Indices: 43160--43644 Score: 546 Period size: 108 Copynumber: 4.6 Consensus size: 106 43150 ATTACCCTTT * * * * * * * 43160 CTTAATTACCCTGAATTAAGTTA-CTTATTGACTTGCTTGATTACCCTGAATCAAGTTGCTAACT 1 CTTAATTACCCTGAATTAAGTTATCTTACTGACTTACTTAATTACCCTGAATTAAGTTACTTATT * ** 43224 GACTTACTTAATTACCCTGAATTAAGCTACTT-ACTG----A 66 AACTTACTTAATTACCCTGAATTAAATTA-TTCACTGACTTA * * 43261 CTTAATTACCCTGGATTAAGTTACCTGT--T-ACTTACTTAATTACCCTGAATTAAGTTGA-TTA 1 CTTAATTACCCTGAATTAAGTTATCT-TACTGACTTACTTAATTACCCTGAATTAAGTT-ACTTA * * * * * 43322 CTAAATTACTTAATTACCCTGAATTAAATTAATAACTGGA-TTT 64 TTAACTTACTTAATTACCCTGAATTAAATTATTCACT-GACTTA * * 43365 CTTAATTACCCTGAATTAAGTTACTGCTTACTAACTTACTTAATTACCCTGAATTAAGTTTCTTA 1 CTTAATTACCCTGAATTAAGTTA-T-CTTACTGACTTACTTAATTACCCTGAATTAAGTTACTTA * 43430 TTAACTCACTTAATTACCCTGAATTAAATTATTCACTGACTTA 64 TTAACTTACTTAATTACCCTGAATTAAATTATTCACTGACTTA * * * * * 43473 CTTAACTACCCTGAATTAAATTAT-TCACTGACTTACATAATTATCCTGAATTAAGTTACTTATT 1 CTTAATTACCCTGAATTAAGTTATCTTACTGACTTACTTAATTACCCTGAATTAAGTTACTTATT * * 43537 AACTTACTTAATTTACCCTGAATTAAGTTATTCACTGACCTA 66 AACTTACTTAA-TTACCCTGAATTAAATTATTCACTGACTTA * * 43579 CTTAATTTACCCTGAATTAAGTTAT-TCACTGACCTACTTAATTACCCTGAATTAAGTTACTTAT 1 CTTAA-TTACCCTGAATTAAGTTATCTTACTGACTTACTTAATTACCCTGAATTAAGTTACTTAT 43643 TA 65 TA 43645 CTGATTTACC Statistics Matches: 331, Mismatches: 36, Indels: 28 0.84 0.09 0.07 Matches are distributed among these distances: 99 1 0.00 100 56 0.17 101 24 0.07 102 2 0.01 103 1 0.00 104 22 0.07 105 46 0.14 106 35 0.11 107 59 0.18 108 85 0.26 ACGTcount: A:0.32, C:0.19, G:0.08, T:0.40 Consensus pattern (106 bp): CTTAATTACCCTGAATTAAGTTATCTTACTGACTTACTTAATTACCCTGAATTAAGTTACTTATT AACTTACTTAATTACCCTGAATTAAATTATTCACTGACTTA Found at i:44184 original size:49 final size:52 Alignment explanation

Indices: 44131--44238 Score: 132 Period size: 49 Copynumber: 2.1 Consensus size: 52 44121 AAAAATCTCA * * 44131 TTTTTACTCCAAACTTTACCAAGAT-TCATTTTTT-CT-AACTAAAGATAAT 1 TTTTTACTCAAAACTTTACCAAGATCTAATTTTTTACTAAACTAAAGATAAT * * * 44180 TTTTTATTTAAAACTTTACCAAGATCTAATTTTTTAACTAAACTAAAGATCAT 1 TTTTTACTCAAAACTTTACCAAGATCTAATTTTTT-ACTAAACTAAAGATAAT * 44233 ATTTTA 1 TTTTTA 44239 TTTAAAAAAT Statistics Matches: 49, Mismatches: 6, Indels: 4 0.83 0.10 0.07 Matches are distributed among these distances: 49 22 0.45 50 8 0.16 52 2 0.04 53 17 0.35 ACGTcount: A:0.37, C:0.15, G:0.04, T:0.44 Consensus pattern (52 bp): TTTTTACTCAAAACTTTACCAAGATCTAATTTTTTACTAAACTAAAGATAAT Found at i:44243 original size:53 final size:49 Alignment explanation

Indices: 44141--44245 Score: 147 Period size: 53 Copynumber: 2.1 Consensus size: 49 44131 TTTTTACTCC * * 44141 AAACTTTACCAAGATTCATTTTTTCTAACTAAAGATAATTTTTTATTTA 1 AAACTTTACCAAGATTAATTTTTTCTAACTAAAGATAATATTTTATTTA * 44190 AAACTTTACCAAGATCTAATTTTTTAACTAAACTAAAGATCATATTTTATTTA 1 AAACTTTACCAAGAT-TAATTTTTT--CT-AACTAAAGATAATATTTTATTTA 44243 AAA 1 AAA 44246 AATTAAATTG Statistics Matches: 49, Mismatches: 3, Indels: 4 0.88 0.05 0.07 Matches are distributed among these distances: 49 15 0.31 50 8 0.16 52 2 0.04 53 24 0.49 ACGTcount: A:0.41, C:0.12, G:0.04, T:0.43 Consensus pattern (49 bp): AAACTTTACCAAGATTAATTTTTTCTAACTAAAGATAATATTTTATTTA Found at i:46807 original size:37 final size:36 Alignment explanation

Indices: 46764--46849 Score: 93 Period size: 37 Copynumber: 2.4 Consensus size: 36 46754 ACTTTAAACG * * 46764 AAGACCACCCTGGATCATTTCGA-ACTGAACTAAAAA 1 AAGACCACCCTGGATCATTCCGACA-TAAACTAAAAA * * * * 46800 AACGACCACCTTTGATCGTTCCGACATAAACTAAAGA 1 AA-GACCACCCTGGATCATTCCGACATAAACTAAAAA 46837 AAGACCACCCTGG 1 AAGACCACCCTGG 46850 GTCAGCTAAA Statistics Matches: 40, Mismatches: 8, Indels: 4 0.77 0.15 0.08 Matches are distributed among these distances: 36 11 0.28 37 28 0.70 38 1 0.03 ACGTcount: A:0.38, C:0.28, G:0.15, T:0.19 Consensus pattern (36 bp): AAGACCACCCTGGATCATTCCGACATAAACTAAAAA Found at i:46908 original size:36 final size:35 Alignment explanation

Indices: 46861--46954 Score: 116 Period size: 36 Copynumber: 2.6 Consensus size: 35 46851 TCAGCTAAAA * ** 46861 TAAATTGAAGAACGTCCACCCTCAATCATCCCGGAC 1 TAAACTGAAGAACAACCACCCTCAATCATCCC-GAC * * 46897 TAAACTGAAGAACAACCACCCTCGATCATTCCGAC 1 TAAACTGAAGAACAACCACCCTCAATCATCCCGAC * 46932 TCAAACTGAAGAAAAACCACCCT 1 T-AAACTGAAGAACAACCACCCT 46955 GAGTCATTGA Statistics Matches: 51, Mismatches: 6, Indels: 2 0.86 0.10 0.03 Matches are distributed among these distances: 35 4 0.08 36 47 0.92 ACGTcount: A:0.38, C:0.33, G:0.12, T:0.17 Consensus pattern (35 bp): TAAACTGAAGAACAACCACCCTCAATCATCCCGAC Found at i:46982 original size:8 final size:8 Alignment explanation

Indices: 46969--47010 Score: 50 Period size: 8 Copynumber: 5.2 Consensus size: 8 46959 CATTGAAGTA 46969 AATTGAAG 1 AATTGAAG * 46977 AATTGAAT 1 AATTGAAG * 46985 CATTG-AG 1 AATTGAAG 46992 TAATTGAAG 1 -AATTGAAG 47001 AATTGAAG 1 AATTGAAG 47009 AA 1 AA 47011 AGACCACCCT Statistics Matches: 28, Mismatches: 4, Indels: 4 0.78 0.11 0.11 Matches are distributed among these distances: 7 1 0.04 8 25 0.89 9 2 0.07 ACGTcount: A:0.48, C:0.02, G:0.21, T:0.29 Consensus pattern (8 bp): AATTGAAG Found at i:46999 original size:24 final size:26 Alignment explanation

Indices: 46958--47007 Score: 86 Period size: 24 Copynumber: 2.0 Consensus size: 26 46948 CCACCCTGAG 46958 TCATTGAAGTAAATTGAAGAATTGAA 1 TCATTGAAGTAAATTGAAGAATTGAA 46984 TCATTG-AGT-AATTGAAGAATTGAA 1 TCATTGAAGTAAATTGAAGAATTGAA 47008 GAAAGACCAC Statistics Matches: 24, Mismatches: 0, Indels: 2 0.92 0.00 0.08 Matches are distributed among these distances: 24 15 0.62 25 3 0.12 26 6 0.25 ACGTcount: A:0.44, C:0.04, G:0.20, T:0.32 Consensus pattern (26 bp): TCATTGAAGTAAATTGAAGAATTGAA Found at i:47258 original size:29 final size:29 Alignment explanation

Indices: 47226--47284 Score: 73 Period size: 29 Copynumber: 2.0 Consensus size: 29 47216 TTGGGTCATG * * 47226 TAACTGAGGAAAGATCACCCTGGATCGAT 1 TAACTGAAGAAAGACCACCCTGGATCGAT ** * 47255 TAACTGAAGATGGACCACCCTGGGTCGAT 1 TAACTGAAGAAAGACCACCCTGGATCGAT 47284 T 1 T 47285 GAAAATCACT Statistics Matches: 25, Mismatches: 5, Indels: 0 0.83 0.17 0.00 Matches are distributed among these distances: 29 25 1.00 ACGTcount: A:0.31, C:0.22, G:0.25, T:0.22 Consensus pattern (29 bp): TAACTGAAGAAAGACCACCCTGGATCGAT Found at i:47415 original size:35 final size:34 Alignment explanation

Indices: 47329--47424 Score: 115 Period size: 34 Copynumber: 2.8 Consensus size: 34 47319 TTTGAAAGCA * * 47329 ACTGAAGAAAGACCGCCCTGAGTTAATTGAAATT 1 ACTGAAGAGAGACCGCCCTGAGTCAATTGAAATT * 47363 ATTGAAGAGAGACCGCCCT-AGATCAATTGAAATTT 1 ACTGAAGAGAGACCGCCCTGAG-TCAATTGAAA-TT * 47398 ACTGAATG-GAGACCGCCCTGGGTCAAT 1 ACTGAA-GAGAGACCGCCCTGAGTCAAT 47425 GAACTGAATG Statistics Matches: 53, Mismatches: 5, Indels: 7 0.82 0.08 0.11 Matches are distributed among these distances: 33 2 0.04 34 26 0.49 35 23 0.43 36 2 0.04 ACGTcount: A:0.34, C:0.20, G:0.23, T:0.23 Consensus pattern (34 bp): ACTGAAGAGAGACCGCCCTGAGTCAATTGAAATT Found at i:48404 original size:20 final size:18 Alignment explanation

Indices: 48381--48417 Score: 56 Period size: 18 Copynumber: 1.9 Consensus size: 18 48371 TAGGGCTTCT 48381 TTTTTCTTCTTCTTTTTTTC 1 TTTTTC-TC-TCTTTTTTTC 48401 TTTTTCTCTCTTTTTTT 1 TTTTTCTCTCTTTTTTT 48418 ATGCACTTGA Statistics Matches: 17, Mismatches: 0, Indels: 2 0.89 0.00 0.11 Matches are distributed among these distances: 18 9 0.53 19 2 0.12 20 6 0.35 ACGTcount: A:0.00, C:0.19, G:0.00, T:0.81 Consensus pattern (18 bp): TTTTTCTCTCTTTTTTTC Found at i:48711 original size:18 final size:17 Alignment explanation

Indices: 48690--48724 Score: 52 Period size: 17 Copynumber: 2.0 Consensus size: 17 48680 CTTTGCTCCA 48690 TCTTATACCTTCTTTTTT 1 TCTT-TACCTTCTTTTTT * 48708 TCTTTACTTTCTTTTTT 1 TCTTTACCTTCTTTTTT 48725 CAATTTTCAT Statistics Matches: 16, Mismatches: 1, Indels: 1 0.89 0.06 0.06 Matches are distributed among these distances: 17 12 0.75 18 4 0.25 ACGTcount: A:0.09, C:0.20, G:0.00, T:0.71 Consensus pattern (17 bp): TCTTTACCTTCTTTTTT Found at i:48742 original size:6 final size:6 Alignment explanation

Indices: 48721--48774 Score: 72 Period size: 6 Copynumber: 8.7 Consensus size: 6 48711 TTACTTTCTT * * 48721 TTTTCAA TTTTCA TTTTTA TTTTCA CTTTCA TTTTTCA TTTTCA TTTTCA 1 TTTTC-A TTTTCA TTTTCA TTTTCA TTTTCA -TTTTCA TTTTCA TTTTCA 48771 TTTT 1 TTTT 48775 TTTTCCGCTC Statistics Matches: 42, Mismatches: 4, Indels: 3 0.86 0.08 0.06 Matches are distributed among these distances: 6 32 0.76 7 10 0.24 ACGTcount: A:0.17, C:0.15, G:0.00, T:0.69 Consensus pattern (6 bp): TTTTCA Found at i:48743 original size:19 final size:19 Alignment explanation

Indices: 48721--48775 Score: 85 Period size: 19 Copynumber: 2.9 Consensus size: 19 48711 TTACTTTCTT 48721 TTTTCAATTTTCATTTTT-A 1 TTTTC-ATTTTCATTTTTCA * 48740 TTTTCACTTTCATTTTTCA 1 TTTTCATTTTCATTTTTCA 48759 TTTTCATTTTCATTTTT 1 TTTTCATTTTCATTTTT 48776 TTTCCGCTCT Statistics Matches: 33, Mismatches: 2, Indels: 2 0.89 0.05 0.05 Matches are distributed among these distances: 18 11 0.33 19 22 0.67 ACGTcount: A:0.16, C:0.15, G:0.00, T:0.69 Consensus pattern (19 bp): TTTTCATTTTCATTTTTCA Found at i:48750 original size:25 final size:25 Alignment explanation

Indices: 48722--48770 Score: 80 Period size: 25 Copynumber: 2.0 Consensus size: 25 48712 TACTTTCTTT * 48722 TTTCAATTTTCATTTTTATTTTCAC 1 TTTCAATTTTCATTTTCATTTTCAC * 48747 TTTCATTTTTCATTTTCATTTTCA 1 TTTCAATTTTCATTTTCATTTTCA 48771 TTTTTTTTCC Statistics Matches: 22, Mismatches: 2, Indels: 0 0.92 0.08 0.00 Matches are distributed among these distances: 25 22 1.00 ACGTcount: A:0.18, C:0.16, G:0.00, T:0.65 Consensus pattern (25 bp): TTTCAATTTTCATTTTCATTTTCAC Found at i:49133 original size:16 final size:15 Alignment explanation

Indices: 49112--49145 Score: 50 Period size: 16 Copynumber: 2.2 Consensus size: 15 49102 TTCAAAACCA * 49112 TTTTTGAGAAATCATT 1 TTTTTGAAAAATC-TT 49128 TTTTTGAAAAATCTT 1 TTTTTGAAAAATCTT 49143 TTT 1 TTT 49146 AAAATGATAT Statistics Matches: 17, Mismatches: 1, Indels: 1 0.89 0.05 0.05 Matches are distributed among these distances: 15 5 0.29 16 12 0.71 ACGTcount: A:0.29, C:0.06, G:0.09, T:0.56 Consensus pattern (15 bp): TTTTTGAAAAATCTT Found at i:49487 original size:2 final size:2 Alignment explanation

Indices: 49480--49504 Score: 50 Period size: 2 Copynumber: 12.5 Consensus size: 2 49470 GAACAGCAGA 49480 AT AT AT AT AT AT AT AT AT AT AT AT A 1 AT AT AT AT AT AT AT AT AT AT AT AT A 49505 CAAAAAGCAG Statistics Matches: 23, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 23 1.00 ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48 Consensus pattern (2 bp): AT Found at i:53768 original size:14 final size:14 Alignment explanation

Indices: 53749--53775 Score: 54 Period size: 14 Copynumber: 1.9 Consensus size: 14 53739 TTCTTACTAA 53749 ACTTAATTACCCTT 1 ACTTAATTACCCTT 53763 ACTTAATTACCCT 1 ACTTAATTACCCT 53776 GAATTAAGTT Statistics Matches: 13, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 14 13 1.00 ACGTcount: A:0.30, C:0.30, G:0.00, T:0.41 Consensus pattern (14 bp): ACTTAATTACCCTT Found at i:53807 original size:35 final size:35 Alignment explanation

Indices: 53760--54281 Score: 530 Period size: 35 Copynumber: 14.9 Consensus size: 35 53750 CTTAATTACC * 53760 CTTACTTAATTACCCTGAATTAAGTTACTTATTGA 1 CTTACTTAATTACCCTGAATTAAGTTACTTACTGA * * * * * * 53795 CTTGCTTGATTACCCTGAATCACGTTGCTAACTGA 1 CTTACTTAATTACCCTGAATTAAGTTACTTACTGA * * 53830 CTTACTTAATTACCCTGAACTAAGCTA-TT--T-A 1 CTTACTTAATTACCCTGAATTAAGTTACTTACTGA * * * * * 53861 CTGACTTAATTACCCTGGATTAAGTTACCT-GTTA 1 CTTACTTAATTACCCTGAATTAAGTTACTTACTGA * * 53895 CTTACTTAATCACCCTGAATTAAGTTGA-TTACTAA 1 CTTACTTAATTACCCTGAATTAAGTT-ACTTACTGA * * * * 53930 ATTACTTAATTACCCTGAATTAAATTAATAACTGGA 1 CTTACTTAATTACCCTGAATTAAGTTACTTACT-GA * * 53966 -TTTCTTAATTACCCTGAATTAAGTTATTGACTTACTAA 1 CTTACTTAATTACCCTGAATTAAG---TT-ACTTACTGA 54004 CTTACTTAATTACCCTGAATTAAGTTA-TTCACTGA 1 CTTACTTAATTACCCTGAATTAAGTTACTT-ACTGA * * * 54039 CTTGCTTAATTACCCTGAATTAAGTTTCTTA-TTA 1 CTTACTTAATTACCCTGAATTAAGTTACTTACTGA * * 54073 TCTCACTTAATTACCCTGAATTAAATTA-TTCACTGA 1 -CTTACTTAATTACCCTGAATTAAGTTACTT-ACTGA * * * 54109 CTCACTTAACTACCCTGAATTAAATTA-TTCACTGA 1 CTTACTTAATTACCCTGAATTAAGTTACTT-ACTGA * 54144 CTTACTTAATTACCCTGAATTAAGTTACTTATTGA 1 CTTACTTAATTACCCTGAATTAAGTTACTTACTGA 54179 CTTACTTAATTTACCCTGAATTAAGTTA-TTCACTGA 1 CTTACTTAA-TTACCCTGAATTAAGTTACTT-ACTGA * 54215 CCTACTTAATTTACCCTGAATTAAGTTA-TTCACTGA 1 CTTACTTAA-TTACCCTGAATTAAGTTACTT-ACTGA * 54251 CCTACTTAATTACCCTGAATTAAGTTACTTA 1 CTTACTTAATTACCCTGAATTAAGTTACTTA 54282 TTACTGATTC Statistics Matches: 413, Mismatches: 53, Indels: 42 0.81 0.10 0.08 Matches are distributed among these distances: 31 24 0.06 32 2 0.00 33 1 0.00 34 33 0.08 35 246 0.60 36 77 0.19 38 3 0.01 39 27 0.07 ACGTcount: A:0.31, C:0.20, G:0.09, T:0.40 Consensus pattern (35 bp): CTTACTTAATTACCCTGAATTAAGTTACTTACTGA Found at i:54081 original size:144 final size:138 Alignment explanation

Indices: 53760--54277 Score: 514 Period size: 144 Copynumber: 3.7 Consensus size: 138 53750 CTTAATTACC * * * * * ** ** * 53760 CTTACTTAATTACCCTGAATTAAGTTACTTATTGACTTGCTTGATTACCCTGAATCACGTTGCTA 1 CTTACTTAATTACCCTGAATTAAGTT-TTTATTAAC-TACTTAATTACCCTGAATTAAATTATTC * * * * 53825 ACTGACTTACTTAATTACCCTGAACTAAGCTATT---T-ACTGACTTAATTACCCTGGATTAAGT 64 ACTGACTTACTTAATTACCCTGAATTAAGTTATTCACTAACTTACTTAATTACCCTGAATTAAGT 53886 TA--C-CTGTTA 129 TATTCACTG--A * * * * * * 53895 CTTACTTAATCACCCTGAATTAAGTTGATTACTAAATTACTTAATTACCCTGAATTAAATTAATA 1 CTTACTTAATTACCCTGAATTAAGTT-TTTA-TTAACTACTTAATTACCCTGAATTAAATTATTC * 53960 ACTGGA-TTTCTTAATTACCCTGAATTAAGTTATTGACTTACTAACTTACTTAATTACCCTGAAT 64 ACT-GACTTACTTAATTACCCTGAATTAAGTTATT--C--ACTAACTTACTTAATTACCCTGAAT 54024 TAAGTTATTCACTGA 124 TAAGTTATTCACTGA * * 54039 CTTGCTTAATTACCCTGAATTAAGTTTCTTATTATCTCACTTAATTACCCTGAATTAAATTATTC 1 CTTACTTAATTACCCTGAATTAAGTTT-TTATTAACT-ACTTAATTACCCTGAATTAAATTATTC * * * * 54104 ACTGACTCACTTAACTACCCTGAATTAAATTATTCACTGACTTACTTAATTACCCTGAATTAAGT 64 ACTGACTTACTTAATTACCCTGAATTAAGTTATTCACTAACTTACTTAATTACCCTGAATTAAGT * 54169 TACTT-ATTGA 129 TA-TTCACTGA * * * * 54179 CTTACTTAATTTACCCTGAATTAAGTTATTCACTGACCTACTTAATTTACCCTGAATTAAGTTAT 1 CTTACTTAA-TTACCCTGAATTAAGTT-TTTA-TTAACTACTTAA-TTACCCTGAATTAAATTAT * 54244 TCACTGACCTACTTAATTACCCTGAATTAAGTTA 62 TCACTGACTTACTTAATTACCCTGAATTAAGTTA 54278 CTTATTACTG Statistics Matches: 321, Mismatches: 41, Indels: 35 0.81 0.10 0.09 Matches are distributed among these distances: 135 77 0.24 136 4 0.01 140 43 0.13 141 27 0.08 142 55 0.17 143 31 0.10 144 80 0.25 145 1 0.00 146 3 0.01 ACGTcount: A:0.31, C:0.20, G:0.09, T:0.40 Consensus pattern (138 bp): CTTACTTAATTACCCTGAATTAAGTTTTTATTAACTACTTAATTACCCTGAATTAAATTATTCAC TGACTTACTTAATTACCCTGAATTAAGTTATTCACTAACTTACTTAATTACCCTGAATTAAGTTA TTCACTGA Found at i:62331 original size:6 final size:6 Alignment explanation

Indices: 62322--62348 Score: 54 Period size: 6 Copynumber: 4.5 Consensus size: 6 62312 AAACCAAAGC 62322 AAATCT AAATCT AAATCT AAATCT AAA 1 AAATCT AAATCT AAATCT AAATCT AAA 62349 GAAAATTATA Statistics Matches: 21, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 6 21 1.00 ACGTcount: A:0.56, C:0.15, G:0.00, T:0.30 Consensus pattern (6 bp): AAATCT Found at i:68989 original size:39 final size:39 Alignment explanation

Indices: 68944--69028 Score: 100 Period size: 39 Copynumber: 2.2 Consensus size: 39 68934 TAAATAAAAA * * * 68944 TTAAAAAGCAGAAACAGAAAATAAAAA-TATTTTTTTATT 1 TTAAAAAGCAAAAACAGAAAAGAAAAATTAATTTTTT-TT * * 68983 TTAAAAAGGAAAAACGGAAAAGAAAAATTAATTTTTTTT 1 TTAAAAAGCAAAAACAGAAAAGAAAAATTAATTTTTTTT * 69022 TCAAAAA 1 TTAAAAA 69029 AAAATCGGAA Statistics Matches: 39, Mismatches: 6, Indels: 2 0.83 0.13 0.04 Matches are distributed among these distances: 39 31 0.79 40 8 0.21 ACGTcount: A:0.55, C:0.05, G:0.09, T:0.31 Consensus pattern (39 bp): TTAAAAAGCAAAAACAGAAAAGAAAAATTAATTTTTTTT Done.