Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01010612.1 Corchorus capsularis cultivar CVL-1 contig10633, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 88398
ACGTcount: A:0.33, C:0.18, G:0.17, T:0.33


Found at i:380 original size:56 final size:56

Alignment explanation

Indices: 294--407  Score: 228
Period size: 56  Copynumber: 2.0  Consensus size: 56

       284 CGTTCTACTG

       294 AATTGCTTTTTAATTTAATACATGATTAAAGATTAGCAACGATATATGCAAGAAGT
         1 AATTGCTTTTTAATTTAATACATGATTAAAGATTAGCAACGATATATGCAAGAAGT

       350 AATTGCTTTTTAATTTAATACATGATTAAAGATTAGCAACGATATATGCAAGAAGT
         1 AATTGCTTTTTAATTTAATACATGATTAAAGATTAGCAACGATATATGCAAGAAGT

       406 AA
         1 AA

       408 ACTTCAATTT

Statistics
Matches: 58, Mismatches: 0, Indels: 0
   1.00         0.00          0.00

Matches are distributed among these distances:
   56   58  1.00

ACGTcount: A:0.42, C:0.09, G:0.14, T:0.35

Consensus pattern (56 bp):
AATTGCTTTTTAATTTAATACATGATTAAAGATTAGCAACGATATATGCAAGAAGT


Found at i:20023 original size:13 final size:13

Alignment explanation

Indices: 20005--20033  Score: 58
Period size: 13  Copynumber: 2.2  Consensus size: 13

     19995 AATAGTGTAA

     20005 TAAAAAGAAATTC
         1 TAAAAAGAAATTC

     20018 TAAAAAGAAATTC
         1 TAAAAAGAAATTC

     20031 TAA
         1 TAA

     20034 GGTGATAAAA

Statistics
Matches: 16, Mismatches: 0, Indels: 0
   1.00         0.00          0.00

Matches are distributed among these distances:
   13   16  1.00

ACGTcount: A:0.62, C:0.07, G:0.07, T:0.24

Consensus pattern (13 bp):
TAAAAAGAAATTC


Found at i:20044 original size:21 final size:21

Alignment explanation

Indices: 20018--20066  Score: 80
Period size: 21  Copynumber: 2.3  Consensus size: 21

     20008 AAAGAAATTC

                             **
     20018 TAAAAAGAAATTCTAAGGTGA
         1 TAAAAAGAAATTCTAAGGAAA

     20039 TAAAAAGAAATTCTAAGGAAA
         1 TAAAAAGAAATTCTAAGGAAA

     20060 TAAAAAG
         1 TAAAAAG

     20067 TCATCATTTG

Statistics
Matches: 26, Mismatches: 2, Indels: 0
   0.93         0.07          0.00

Matches are distributed among these distances:
   21   26  1.00

ACGTcount: A:0.59, C:0.04, G:0.16, T:0.20

Consensus pattern (21 bp):
TAAAAAGAAATTCTAAGGAAA


Found at i:26005 original size:11 final size:11

Alignment explanation

Indices: 25989--26014  Score: 52
Period size: 11  Copynumber: 2.4  Consensus size: 11

     25979 TTTTCGCAGT

     25989 AAAAAAAATAA
         1 AAAAAAAATAA

     26000 AAAAAAAATAA
         1 AAAAAAAATAA

     26011 AAAA
         1 AAAA

     26015 TAATGTCTAC

Statistics
Matches: 15, Mismatches: 0, Indels: 0
   1.00         0.00          0.00

Matches are distributed among these distances:
   11   15  1.00

ACGTcount: A:0.92, C:0.00, G:0.00, T:0.08

Consensus pattern (11 bp):
AAAAAAAATAA


Found at i:26208 original size:13 final size:12

Alignment explanation

Indices: 26178--26220  Score: 68
Period size: 12  Copynumber: 3.5  Consensus size: 12

     26168 CATCGATACC

     26178 TCGATATATCCG
         1 TCGATATATCCG

     26190 TCGATATATCCG
         1 TCGATATATCCG

                       *
     26202 TTCGATATATCCA
         1 -TCGATATATCCG

     26215 TCGATA
         1 TCGATA

     26221 CCTGTATTAA

Statistics
Matches: 29, Mismatches: 1, Indels: 2
   0.91         0.03          0.06

Matches are distributed among these distances:
   12   18  0.62
   13   11  0.38

ACGTcount: A:0.28, C:0.23, G:0.14, T:0.35

Consensus pattern (12 bp):
TCGATATATCCG


Found at i:26290 original size:12 final size:12

Alignment explanation

Indices: 26258--26296  Score: 50
Period size: 12  Copynumber: 3.6  Consensus size: 12

     26248 ATGGAATTAA

     26258 ATATCCGTCG--
         1 ATATCCGTCGAT

     26268 ATA-CC-TCGAT
         1 ATATCCGTCGAT

     26278 ATATCCGTCGAT
         1 ATATCCGTCGAT

     26290 ATATCCG
         1 ATATCCG

     26297 ATATCTGTAC

Statistics
Matches: 25, Mismatches: 0, Indels: 6
   0.81         0.00          0.19

Matches are distributed among these distances:
    8    3  0.12
    9    2  0.08
   10    6  0.24
   11    2  0.08
   12   12  0.48

ACGTcount: A:0.26, C:0.28, G:0.15, T:0.31

Consensus pattern (12 bp):
ATATCCGTCGAT


Found at i:26420 original size:10 final size:10

Alignment explanation

Indices: 26407--26432  Score: 52
Period size: 10  Copynumber: 2.6  Consensus size: 10

     26397 AAATCTCGAT

     26407 ATATCCGTAA
         1 ATATCCGTAA

     26417 ATATCCGTAA
         1 ATATCCGTAA

     26427 ATATCC
         1 ATATCC

     26433 ATATTAAATT

Statistics
Matches: 16, Mismatches: 0, Indels: 0
   1.00         0.00          0.00

Matches are distributed among these distances:
   10   16  1.00

ACGTcount: A:0.38, C:0.23, G:0.08, T:0.31

Consensus pattern (10 bp):
ATATCCGTAA


Found at i:31592 original size:2 final size:2

Alignment explanation

Indices: 31585--31614  Score: 60
Period size: 2  Copynumber: 15.0  Consensus size: 2

     31575 ACAATAAGAA

     31585 CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT
         1 CT CT CT CT CT CT CT CT CT CT CT CT CT CT CT

     31615 TTTTTCCTTT

Statistics
Matches: 28, Mismatches: 0, Indels: 0
   1.00         0.00          0.00

Matches are distributed among these distances:
    2   28  1.00

ACGTcount: A:0.00, C:0.50, G:0.00, T:0.50

Consensus pattern (2 bp):
CT


Found at i:34771 original size:32 final size:32

Alignment explanation

Indices: 34734--34841 Score: 84 Period size: 32 Copynumber: 3.4 Consensus size: 32 34724 TTCTCATGTG 34734 AAATCAAATTTGATCTTTGCTTCTAAGAGTTC 1 AAATCAAATTTGATCTTTGCTTCTAAGAGTTC * * * 34766 AAATCATAATTGAAGACTTCTTTG--AC-AAAAG--C 1 AAATCA-AATT--TGA--TCTTTGCTTCTAAGAGTTC * 34798 -AATCAAATTTGAACTTTGCTTCTAAGAGTTC 1 AAATCAAATTTGATCTTTGCTTCTAAGAGTTC * 34829 AAATCAAAGTTGA 1 AAATCAAATTTGA 34842 AGACCTCTTG Statistics Matches: 57, Mismatches: 8, Indels: 22 0.66 0.09 0.25 Matches are distributed among these distances: 26 5 0.09 28 3 0.05 29 4 0.07 30 4 0.07 31 6 0.11 32 18 0.32 33 4 0.07 34 4 0.07 35 3 0.05 37 6 0.11 ACGTcount: A:0.38, C:0.15, G:0.13, T:0.34 Consensus pattern (32 bp): AAATCAAATTTGATCTTTGCTTCTAAGAGTTC Found at i:39246 original size:2 final size:2 Alignment explanation

Indices: 39241--39269  Score: 58
Period size: 2  Copynumber: 14.5  Consensus size: 2

     39231 ATATAGTAAG

     39241 TA TA TA TA TA TA TA TA TA TA TA TA TA TA T
         1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA T

     39270 GCTCAAAGCC

Statistics
Matches: 27, Mismatches: 0, Indels: 0
   1.00         0.00          0.00

Matches are distributed among these distances:
    2   27  1.00

ACGTcount: A:0.48, C:0.00, G:0.00, T:0.52

Consensus pattern (2 bp):
TA


Found at i:41998 original size:2 final size:2

Alignment explanation

Indices: 41991--42074  Score: 168
Period size: 2  Copynumber: 42.0  Consensus size: 2

     41981 CGTTTCATAC

     41991 GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT
         1 GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT

     42033 GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT
         1 GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT GT

     42075 ATACTACGAC

Statistics
Matches: 82, Mismatches: 0, Indels: 0
   1.00         0.00          0.00

Matches are distributed among these distances:
    2   82  1.00

ACGTcount: A:0.00, C:0.00, G:0.50, T:0.50

Consensus pattern (2 bp):
GT


Found at i:50913 original size:20 final size:20

Alignment explanation

Indices: 50888--50928  Score: 73
Period size: 20  Copynumber: 2.0  Consensus size: 20

     50878 TTAATTATTG

     50888 ATATGTTAAGTGGATTTTTA
         1 ATATGTTAAGTGGATTTTTA

                        *
     50908 ATATGTTAAGTGGGTTTTTA
         1 ATATGTTAAGTGGATTTTTA

     50928 A
         1 A

     50929 AACATCTCAT

Statistics
Matches: 20, Mismatches: 1, Indels: 0
   0.95         0.05          0.00

Matches are distributed among these distances:
   20   20  1.00

ACGTcount: A:0.29, C:0.00, G:0.22, T:0.49

Consensus pattern (20 bp):
ATATGTTAAGTGGATTTTTA


Found at i:60981 original size:22 final size:22

Alignment explanation

Indices: 60956--61502 Score: 169 Period size: 22 Copynumber: 24.6 Consensus size: 22 60946 ATGATTTTAT 60956 TATGAAATTTTGATAACCTTCC 1 TATGAAATTTTGATAACCTTCC * ** * 60978 TATGAAATTTTAATAATGATACAC 1 TATGAAATTTTGATAA-CCTTC-C * * * ** 61002 TATGGAATTTCGAGAACCTTTT 1 TATGAAATTTTGATAACCTTCC * ** * * 61024 TAT-AATTTTTTTTTAACTTTCT 1 TATGAA-ATTTTGATAACCTTCC * * * 61046 TATGAAGTTTTGTTAACCTCCC 1 TATGAAATTTTGATAACCTTCC * * * 61068 TAAGGAATTTTGA-AGACC-TCAA 1 TATGAAATTTTGATA-ACCTTC-C * * 61090 TATGAATTTTTAATAA-CTTCCC 1 TATGAAATTTTGATAACCTT-CC * ** 61112 AATGAAATTTTGATAACCAACAC 1 TATGAAATTTTGATAACCTTC-C * * 61135 TATGAGATGTTGATAACC-TCC 1 TATGAAATTTTGATAACCTTCC * * * * 61156 ATATATGATATATTGATAACC-ACGT 1 ---TATGAAATTTTGATAACCTTC-C * * * 61181 TATGAAAATTTAAAAACC-TCC 1 TATGAAATTTTGATAACCTTCC * * 61202 ATATG-AATTGTT-AGTAATCATACTC 1 -TATGAAATT-TTGA-TAA-CCTTC-C * * 61227 TAT--AATTTTGATAATC-ACAC 1 TATGAAATTTTGATAACCTTC-C * * * 61247 TATGAAGTTGTGATAACC-ACGC 1 TATGAAATTTTGATAACCTTC-C * 61269 TATGAAATTTTGATAAATCTTCC 1 TATGAAATTTTGAT-AACCTTCC * 61292 TAT-AATATTTTGATAAACCTCCC 1 TATGAA-ATTTTGAT-AACCTTCC * ** 61315 TATAAAATTTTGATAACCTTTT 1 TATGAAATTTTGATAACCTTCC * * 61337 TATGAAATCTTGATAACCTCCC 1 TATGAAATTTTGATAACCTTCC ** * 61359 TATGATTTTTTGATAACC-TCAT 1 TATGAAATTTTGATAACCTTC-C * * * 61381 TATGAAATTTTGTTAATCTCCC 1 TATGAAATTTTGATAACCTTCC * * * 61403 TATGAAATTTTGATCTA-CATAC 1 TATGAAATTTTGAT-AACCTTCC * * 61425 TATGAAATTTTGATAACCCTCT 1 TATGAAATTTTGATAACCTTCC * ** 61447 TATGAAATTTTGA-AAACTAAAC 1 TATGAAATTTTGATAACCT-TCC * * 61469 TATGAAATTTTGATATCCTGCC 1 TATGAAATTTTGATAACCTTCC 61491 --TGAAATTTTGAT 1 TATGAAATTTTGAT 61503 TACTCCATAA Statistics Matches: 380, Mismatches: 111, Indels: 70 0.68 0.20 0.12 Matches are distributed among these distances: 20 18 0.05 21 14 0.04 22 242 0.64 23 68 0.18 24 37 0.10 25 1 0.00 ACGTcount: A:0.35, C:0.15, G:0.10, T:0.40 Consensus pattern (22 bp): TATGAAATTTTGATAACCTTCC Found at i:61407 original size:66 final size:65 Alignment explanation

Indices: 61043--61483 Score: 209 Period size: 66 Copynumber: 6.6 Consensus size: 65 61033 TTTTTAACTT * * * * * * * 61043 TCTTATGAAGTTTTGTTAACCTCCCTAAGGAATTTTGA-AGACCTCAATATGA-ATTTTTAATAA 1 TCTTATGAAATTTTGATAACCTCCCTATGAAATTTTGATAAACCTC-CTATGATA-TTTTGATAA * 61106 CT 64 CC ** * * * * * 61108 TCCCAATGAAATTTTGATAACCAACACTATGAGATGTTGAT-AACCTCCATATATGATATATTGA 1 T-CTTATGAAATTTTGATAACC-TCCCTATGAAATTTTGATAAACCTCC---TATGATATTTTGA 61172 TAACC 61 TAACC * * * * * * * 61177 ACGTTATGAAAATTTAAAAACCTCCATATG-AATTGTT-AGTAATCATACTCTAT-A-ATTTTGA 1 TC-TTATGAAATTTTGATAACCTCCCTATGAAATT-TTGA-TAAACCT-C-CTATGATATTTTGA 61238 TAATCAC 61 TAA-C-C * * * * * * * 61245 AC-TATGAAGTTGTGATAACCACGCTATGAAATTTTGATAAATCTTCCTATAATATTTTGATAAA 1 TCTTATGAAATTTTGATAACCTCCCTATGAAATTTTGATAAA-CCTCCTATGATATTTTGAT-AA 61309 CC 64 CC * * *** * * 61311 TCCCTATAAAATTTTGATAACCTTTTTATGAAATCTTGAT-AACCTCCCTATGATTTTTTGATAA 1 T-CTTATGAAATTTTGATAACCTCCCTATGAAATTTTGATAAACCT-CCTATGATATTTTGATAA 61375 CC 64 CC * * ** * * * 61377 TCATTATGAAATTTTGTTAATCTCCCTATGAAATTTTGATCTACATACTATGAAATTTTGATAAC 1 TC-TTATGAAATTTTGATAACCTCCCTATGAAATTTTGATAAACCTCCTATGATATTTTGATAA- 61442 CC 64 CC * ** 61444 TCTTATGAAATTTTGA-AAACTAAACTATGAAATTTTGATA 1 TCTTATGAAATTTTGATAACCT-CCCTATGAAATTTTGATA 61484 TCCTGCCTGA Statistics Matches: 279, Mismatches: 69, Indels: 55 0.69 0.17 0.14 Matches are distributed among these distances: 65 10 0.04 66 130 0.47 67 61 0.22 68 45 0.16 69 30 0.11 70 2 0.01 71 1 0.00 ACGTcount: A:0.36, C:0.16, G:0.10, T:0.38 Consensus pattern (65 bp): TCTTATGAAATTTTGATAACCTCCCTATGAAATTTTGATAAACCTCCTATGATATTTTGATAACC Found at i:61650 original size:22 final size:22 Alignment explanation

Indices: 61593--61667 Score: 62 Period size: 22 Copynumber: 3.4 Consensus size: 22 61583 CTATGAAATT ** * 61593 TGAAATTTTTGTAATCACATTT- 1 TGAAATTTTAATAATCTC-TTTA * * 61615 TGAAAATTTGATAATCTCTTTA 1 TGAAATTTTAATAATCTCTTTA * * 61637 TGAAATTTTAATAACCTCTTCA 1 TGAAATTTTAATAATCTCTTTA * 61659 TAAAATTTT 1 TGAAATTTT 61668 GTTGACCCTT Statistics Matches: 43, Mismatches: 9, Indels: 2 0.80 0.17 0.04 Matches are distributed among these distances: 21 3 0.07 22 40 0.93 ACGTcount: A:0.36, C:0.11, G:0.07, T:0.47 Consensus pattern (22 bp): TGAAATTTTAATAATCTCTTTA Found at i:61686 original size:22 final size:23 Alignment explanation

Indices: 61615--61779 Score: 63 Period size: 22 Copynumber: 7.3 Consensus size: 23 61605 AATCACATTT * * * 61615 TGAAAATTTGATAATCTCTT-TA 1 TGAAATTTTAATAACCTCTTCTA 61637 TGAAATTTTAATAACCTCTTC-A 1 TGAAATTTTAATAACCTCTTCTA * ** * 61659 TAAAATTTTGTTGACC-CTTCTA 1 TGAAATTTTAATAACCTCTTCTA * * * * * 61681 TGAAATTCTGATAATCACAT-TA 1 TGAAATTTTAATAACCTCTTCTA * * * * 61703 TGTAATTTTGATAACCTC-GCTT 1 TGAAATTTTAATAACCTCTTCTA * ** * 61725 TGAAATTTTGATAACAAC-ACTA 1 TGAAATTTTAATAACCTCTTCTA * * * 61747 TGAAATTTTGATAATCTAATCTCTA 1 TGAAATTTTAATAACCT-CT-TCTA 61772 TGAAATTT 1 TGAAATTT 61780 CGTTTATAAC Statistics Matches: 107, Mismatches: 29, Indels: 11 0.73 0.20 0.07 Matches are distributed among these distances: 21 4 0.04 22 90 0.84 23 2 0.02 25 11 0.10 ACGTcount: A:0.35, C:0.14, G:0.09, T:0.42 Consensus pattern (23 bp): TGAAATTTTAATAACCTCTTCTA Found at i:61778 original size:25 final size:22 Alignment explanation

Indices: 61678--61779 Score: 89 Period size: 22 Copynumber: 4.5 Consensus size: 22 61668 GTTGACCCTT * 61678 CTATGAAATTCTGATAATC-ACA 1 CTATGAAATTTTGATAA-CAACA * * ** * 61700 TTATGTAATTTTGATAACCTCG 1 CTATGAAATTTTGATAACAACA * 61722 CTTTGAAATTTTGATAACAACA 1 CTATGAAATTTTGATAACAACA * 61744 CTATGAAATTTTGATAATCTAATCT 1 CTATGAAATTTTGATAA-C-AA-CA 61769 CTATGAAATTT 1 CTATGAAATTT 61780 CGTTTATAAC Statistics Matches: 63, Mismatches: 13, Indels: 5 0.78 0.16 0.06 Matches are distributed among these distances: 21 1 0.02 22 47 0.75 23 1 0.02 24 2 0.03 25 12 0.19 ACGTcount: A:0.36, C:0.14, G:0.10, T:0.40 Consensus pattern (22 bp): CTATGAAATTTTGATAACAACA Found at i:61875 original size:22 final size:22 Alignment explanation

Indices: 61635--61898 Score: 70 Period size: 22 Copynumber: 11.8 Consensus size: 22 61625 ATAATCTCTT * 61635 TATGAAATTTTAATAACCTCTTCA 1 TATGAAATTTTGATAA-C-CTTCA * * 61659 TA--AAATTTTGTTGACCCTTC- 1 TATGAAATTTTGAT-AACCTTCA * * * 61679 TATGAAATTCTGATAATC-ACA 1 TATGAAATTTTGATAACCTTCA * * 61700 TTATGTAATTTTGATAACC-TCGC 1 -TATGAAATTTTGATAACCTTC-A * ** 61723 TTTGAAATTTTGATAA-CAACA 1 TATGAAATTTTGATAACCTTCA * * 61744 CTATGAAATTTTGATAATCTAATCTC 1 -TATGAAATTTTGATAACCT--TC-A * 61770 TATGAAATTTCGTTTATAA-C-TC- 1 TATGAAA-TT--TTGATAACCTTCA * 61792 TATGAGA-TTTGATAACCTTC- 1 TATGAAATTTTGATAACCTTCA * * * 61812 TATCAAATTTTGGTACTCCTT-A 1 TATGAAATTTTGATA-ACCTTCA * 61834 TGAAATTGAGACTTTT-ATAACCTTCA 1 T---A-TGA-AATTTTGATAACCTTCA * 61860 TATGAAATTTTGATAACC-ACA 1 TATGAAATTTTGATAACCTTCA * 61881 CTATAAAATTTTGATAAC 1 -TATGAAATTTTGATAAC 61899 TTCCCCAGAA Statistics Matches: 177, Mismatches: 35, Indels: 58 0.66 0.13 0.21 Matches are distributed among these distances: 18 6 0.03 19 1 0.01 20 11 0.06 21 20 0.11 22 99 0.56 23 3 0.02 24 4 0.02 25 13 0.07 26 8 0.05 27 6 0.03 28 6 0.03 ACGTcount: A:0.35, C:0.15, G:0.09, T:0.41 Consensus pattern (22 bp): TATGAAATTTTGATAACCTTCA Found at i:61937 original size:332 final size:332 Alignment explanation

Indices: 61548--62209 Score: 1306 Period size: 332 Copynumber: 2.0 Consensus size: 332 61538 TGGTAATCAT 61548 ACTATAAAATTTTGATAACTTCCCCAGAAATACCACTATGAAATTTGAAATTTTTGTAATCACAT 1 ACTATAAAATTTTGATAACTTCCCCAGAAATACCACTATGAAATTTGAAATTTTTGTAATCACAT * 61613 TTTGAAAATTTGATAATCTCTTTATGAAATTTTAATAACCTCTTCATAAAATTTTGTTGACCCTT 66 TTTGAAAATTTGATAACCTCTTTATGAAATTTTAATAACCTCTTCATAAAATTTTGTTGACCCTT 61678 CTATGAAATTCTGATAATCACATTATGTAATTTTGATAACCTCGCTTTGAAATTTTGATAACAAC 131 CTATGAAATTCTGATAATCACATTATGTAATTTTGATAACCTCGCTTTGAAATTTTGATAACAAC 61743 ACTATGAAATTTTGATAATCTAATCTCTATGAAATTTCGTTTATAACTCTATGAGATTTGATAAC 196 ACTATGAAATTTTGATAATCTAATCTCTATGAAATTTCGTTTATAACTCTATGAGATTTGATAAC 61808 CTTCTATCAAATTTTGGTACTCCTTATGAAATTGAGACTTTTATAACCTTCATATGAAATTTTGA 261 CTTCTATCAAATTTTGGTACTCCTTATGAAATTGAGACTTTTATAACCTTCATATGAAATTTTGA 61873 TAACCAC 326 TAACCAC 61880 ACTATAAAATTTTGATAACTTCCCCAGAAATACCACTATGAAATTTGAAATTTTTGTAATCACAT 1 ACTATAAAATTTTGATAACTTCCCCAGAAATACCACTATGAAATTTGAAATTTTTGTAATCACAT * 61945 TTTGAAAATTTGATAACCTCTTTATGAAATTTTAATAACCTCTTTATAAAATTTTGTTGACCCTT 66 TTTGAAAATTTGATAACCTCTTTATGAAATTTTAATAACCTCTTCATAAAATTTTGTTGACCCTT 62010 CTATGAAATTCTGATAATCACATTATGTAATTTTGATAACCTCGCTTTGAAATTTTGATAACAAC 131 CTATGAAATTCTGATAATCACATTATGTAATTTTGATAACCTCGCTTTGAAATTTTGATAACAAC 62075 ACTATGAAATTTTGATAATCTAATCTCTATGAAATTTCGTTTATAACTCTATGAGATTTGATAAC 196 ACTATGAAATTTTGATAATCTAATCTCTATGAAATTTCGTTTATAACTCTATGAGATTTGATAAC 62140 CTTCTATCAAATTTTGGTACTCCTTATGAAATTGAGACTTTTATAACCTTCATATGAAATTTTGA 261 CTTCTATCAAATTTTGGTACTCCTTATGAAATTGAGACTTTTATAACCTTCATATGAAATTTTGA 62205 TAACC 326 TAACC 62210 TCCCCATGAA Statistics Matches: 328, Mismatches: 2, Indels: 0 0.99 0.01 0.00 Matches are distributed among these distances: 332 328 1.00 ACGTcount: A:0.35, C:0.15, G:0.09, T:0.40 Consensus pattern (332 bp): ACTATAAAATTTTGATAACTTCCCCAGAAATACCACTATGAAATTTGAAATTTTTGTAATCACAT TTTGAAAATTTGATAACCTCTTTATGAAATTTTAATAACCTCTTCATAAAATTTTGTTGACCCTT CTATGAAATTCTGATAATCACATTATGTAATTTTGATAACCTCGCTTTGAAATTTTGATAACAAC 
ACTATGAAATTTTGATAATCTAATCTCTATGAAATTTCGTTTATAACTCTATGAGATTTGATAAC CTTCTATCAAATTTTGGTACTCCTTATGAAATTGAGACTTTTATAACCTTCATATGAAATTTTGA TAACCAC Found at i:61972 original size:22 final size:22 Alignment explanation

Indices: 61947--62071 Score: 87 Period size: 22 Copynumber: 5.7 Consensus size: 22 61937 AATCACATTT 61947 TGAAAATTTGATAACCTCTTTA 1 TGAAAATTTGATAACCTCTTTA * * 61969 TGAAATTTTAATAACCTCTTTA 1 TGAAAATTTGATAACCTCTTTA * * 61991 T-AAAATTTTGTTGACC-CTTCTA 1 TGAAAA-TTTGATAACCTCTT-TA * * * 62013 TG-AAATTCTGATAATCACATTA 1 TGAAAATT-TGATAACCTCTTTA * * * 62035 TGTAATTTTGATAACCTCGCTT- 1 TGAAAATTTGATAACCTC-TTTA * 62057 TGAAATTTTGATAAC 1 TGAAAATTTGATAAC 62072 AACACTATGA Statistics Matches: 81, Mismatches: 15, Indels: 14 0.74 0.14 0.13 Matches are distributed among these distances: 21 8 0.10 22 65 0.80 23 8 0.10 ACGTcount: A:0.34, C:0.14, G:0.10, T:0.42 Consensus pattern (22 bp): TGAAAATTTGATAACCTCTTTA Found at i:62082 original size:22 final size:23 Alignment explanation

Indices: 62010--62111 Score: 93 Period size: 22 Copynumber: 4.5 Consensus size: 23 62000 GTTGACCCTT * 62010 CTATGAAATTCTGATAATC-ACA 1 CTATGAAATTTTGATAATCAACA * * ** * 62032 TTATGTAATTTTGATAA-CCTCG 1 CTATGAAATTTTGATAATCAACA * 62054 CTTTGAAATTTTGATAA-CAACA 1 CTATGAAATTTTGATAATCAACA * 62076 CTATGAAATTTTGATAATCTAATCT 1 CTATGAAATTTTGATAATC-AA-CA 62101 CTATGAAATTT 1 CTATGAAATTT 62112 CGTTTATAAC Statistics Matches: 63, Mismatches: 13, Indels: 5 0.78 0.16 0.06 Matches are distributed among these distances: 21 1 0.02 22 47 0.75 23 1 0.02 24 2 0.03 25 12 0.19 ACGTcount: A:0.36, C:0.14, G:0.10, T:0.40 Consensus pattern (23 bp): CTATGAAATTTTGATAATCAACA Found at i:62207 original size:22 final size:22 Alignment explanation

Indices: 61947--62210 Score: 68 Period size: 22 Copynumber: 11.8 Consensus size: 22 61937 AATCACATTT * * 61947 TGAAAATTTGATAACC-TCTTTA 1 TGAAATTTTGATAACCTTC-ATA * * 61969 TGAAATTTTAATAACC-TCTTTA 1 TGAAATTTTGATAACCTTC-ATA * * * 61991 TAAAATTTTGTTGACCCTTC-TA 1 TGAAATTTTGAT-AACCTTCATA * * * 62013 TGAAATTCTGATAATC-ACATTA 1 TGAAATTTTGATAACCTTCA-TA * * * 62035 TGTAATTTTGATAACC-TCGCTT 1 TGAAATTTTGATAACCTTC-ATA ** 62057 TGAAATTTTGATAA-CAACACTA 1 TGAAATTTTGATAACCTTCA-TA * * 62079 TGAAATTTTGATAATCTAATCTCTA 1 TGAAATTTTGATAACCT--TC-ATA * 62104 TGAAATTTCGTTTATAA-C-TC-TA 1 TGAAA-TT--TTGATAACCTTCATA * 62126 TGAGA-TTTGATAACCTTC-TA 1 TGAAATTTTGATAACCTTCATA * * * 62146 TCAAATTTTGGTACTCCTT-ATGAAA 1 TGAAATTTTGATA-ACCTTCAT---A * 62171 TTGAGACTTTT-ATAACCTTCATA 1 -TGA-AATTTTGATAACCTTCATA 62194 TGAAATTTTGATAACCT 1 TGAAATTTTGATAACCT 62211 CCCCATGAAG Statistics Matches: 179, Mismatches: 37, Indels: 52 0.67 0.14 0.19 Matches are distributed among these distances: 18 6 0.03 19 1 0.01 20 9 0.05 21 14 0.08 22 107 0.60 23 5 0.03 24 4 0.02 25 13 0.07 26 8 0.04 27 6 0.03 28 6 0.03 ACGTcount: A:0.34, C:0.15, G:0.10, T:0.42 Consensus pattern (22 bp): TGAAATTTTGATAACCTTCATA Found at i:62218 original size:22 final size:22 Alignment explanation

Indices: 62178--62294 Score: 80 Period size: 22 Copynumber: 5.4 Consensus size: 22 62168 AAATTGAGAC * * 62178 TTTT-ATAACCTTCATATGAAA 1 TTTTGATAACCTCCCTATGAAA * * 62199 TTTTGATAACCTCCCCATGAAG 1 TTTTGATAACCTCCCTATGAAA * 62221 TATT-AGTAACCT-CCTAATGAAA 1 TTTTGA-TAACCTCCCT-ATGAAA * * * * 62243 TTTTGTTAACCACACTATAAAA 1 TTTTGATAACCTCCCTATGAAA * * 62265 TTCTT-ATAACCTCGCTATGACA 1 TT-TTGATAACCTCCCTATGAAA 62287 TTTTGATA 1 TTTTGATA 62295 TTCTCTTTGA Statistics Matches: 72, Mismatches: 17, Indels: 13 0.71 0.17 0.13 Matches are distributed among these distances: 21 9 0.12 22 59 0.82 23 4 0.06 ACGTcount: A:0.34, C:0.20, G:0.09, T:0.38 Consensus pattern (22 bp): TTTTGATAACCTCCCTATGAAA Found at i:62363 original size:22 final size:22 Alignment explanation

Indices: 62331--62379 Score: 73 Period size: 22 Copynumber: 2.2 Consensus size: 22 62321 TTGTGATAAT * 62331 TAACCACCCTAAGAAATT-TCAA 1 TAACCAACCTAAGAAATTCT-AA 62353 TAACCAACCTAAGAAATTCTAA 1 TAACCAACCTAAGAAATTCTAA 62375 TAACC 1 TAACC 62380 TGATCCTATG Statistics Matches: 25, Mismatches: 1, Indels: 2 0.89 0.04 0.07 Matches are distributed among these distances: 22 24 0.96 23 1 0.04 ACGTcount: A:0.47, C:0.27, G:0.04, T:0.22 Consensus pattern (22 bp): TAACCAACCTAAGAAATTCTAA Found at i:62441 original size:44 final size:44 Alignment explanation

Indices: 62385--62504 Score: 154 Period size: 44 Copynumber: 2.7 Consensus size: 44 62375 TAACCTGATC * * 62385 CTATGAAATTTTGGTAATCATGC-TATGAAATTTTGGTAACCACA 1 CTATGAAATTTTGATAA-CATCCATATGAAATTTTGGTAACCACA * * 62429 CTATGAAATTTTGATAACTTCCATATGAAATTTTGGTAAGCACA 1 CTATGAAATTTTGATAACATCCATATGAAATTTTGGTAACCACA * * 62473 CTATGGAATTTTGATAACCT-CATCATGAAATT 1 CTATGAAATTTTGATAACATCCAT-ATGAAATT 62505 ATAATAACCA Statistics Matches: 68, Mismatches: 6, Indels: 4 0.87 0.08 0.05 Matches are distributed among these distances: 43 6 0.09 44 62 0.91 ACGTcount: A:0.35, C:0.14, G:0.14, T:0.37 Consensus pattern (44 bp): CTATGAAATTTTGATAACATCCATATGAAATTTTGGTAACCACA Found at i:62504 original size:22 final size:23 Alignment explanation

Indices: 62365--62535 Score: 108 Period size: 22 Copynumber: 7.7 Consensus size: 23 62355 ACCAACCTAA * * * 62365 GAAATTCTAATAACCTGATCCTAT 1 GAAATTTTGATAACCTCAT-CTAT * 62389 GAAATTTTGGTAA--TCATGCTAT 1 GAAATTTTGATAACCTCAT-CTAT * * 62411 GAAATTTTGGTAACCACA-CTAT 1 GAAATTTTGATAACCTCATCTAT * 62433 GAAATTTTGATAACTTC--CATAT 1 GAAATTTTGATAACCTCATC-TAT * * * 62455 GAAATTTTGGTAAGCACA-CTAT 1 GAAATTTTGATAACCTCATCTAT * 62477 GGAATTTTGATAACCTCATC-AT 1 GAAATTTTGATAACCTCATCTAT * * * 62499 GAAATTATAATAA-C-CATCTTGT 1 GAAATTTTGATAACCTCATC-TAT * 62521 GGAATTTTGATAACC 1 GAAATTTTGATAACC 62536 ACATAGAGAC Statistics Matches: 115, Mismatches: 24, Indels: 17 0.74 0.15 0.11 Matches are distributed among these distances: 20 4 0.03 21 2 0.02 22 94 0.82 23 3 0.03 24 12 0.10 ACGTcount: A:0.36, C:0.15, G:0.13, T:0.36 Consensus pattern (23 bp): GAAATTTTGATAACCTCATCTAT Found at i:62512 original size:66 final size:66 Alignment explanation

Indices: 62343--62512 Score: 166 Period size: 66 Copynumber: 2.5 Consensus size: 66 62333 ACCACCCTAA ** * * * * 62343 GAAATTTCAATAACCA-ACCTAAGAAATTCTAATAACCTGATCC-TATGAAATTTTGGTAATCAT 1 GAAATTTTGATAACCACA-CTATGAAATTATAATAA-CT--TCCATATGAAATTTTGGTAAGCAC * 62406 GCTAT 62 ACTAT * * * 62411 GAAATTTTGGTAACCACACTATGAAATTTTGATAACTTCCATATGAAATTTTGGTAAGCACACTA 1 GAAATTTTGATAACCACACTATGAAATTATAATAACTTCCATATGAAATTTTGGTAAGCACACTA 62476 T 66 T * * 62477 GGAATTTTGATAACCTCA-TCATGAAATTATAATAAC 1 GAAATTTTGATAACCACACT-ATGAAATTATAATAAC 62513 CATCTTGTGG Statistics Matches: 85, Mismatches: 14, Indels: 8 0.79 0.13 0.07 Matches are distributed among these distances: 65 4 0.05 66 51 0.60 67 2 0.02 68 27 0.32 69 1 0.01 ACGTcount: A:0.39, C:0.16, G:0.12, T:0.34 Consensus pattern (66 bp): GAAATTTTGATAACCACACTATGAAATTATAATAACTTCCATATGAAATTTTGGTAAGCACACTA T Found at i:62733 original size:19 final size:20 Alignment explanation

Indices: 62702--62739 Score: 53 Period size: 19 Copynumber: 1.9 Consensus size: 20 62692 TATTGACATT 62702 TAAAAATTGAAATT-AAAAG 1 TAAAAATTGAAATTCAAAAG 62721 TAAAATATT-AAATTCAAAA 1 TAAAA-ATTGAAATTCAAAA 62740 AATAATAGTA Statistics Matches: 17, Mismatches: 0, Indels: 3 0.85 0.00 0.15 Matches are distributed among these distances: 19 10 0.59 20 7 0.41 ACGTcount: A:0.63, C:0.03, G:0.05, T:0.29 Consensus pattern (20 bp): TAAAAATTGAAATTCAAAAG Found at i:62842 original size:25 final size:25 Alignment explanation

Indices: 62811--62881 Score: 72 Period size: 25 Copynumber: 2.8 Consensus size: 25 62801 CGGTCTAAAT 62811 TGAAAATTTTAATATATTTTAAATAA 1 TGAAAATTTTAATATATTTT-AATAA * * * 62837 T-AAAATTATACTAAATTTTAATAA 1 TGAAAATTTTAATATATTTTAATAA * 62861 TGGAAATTTATAAATATATTT 1 TGAAAATTT-T-AATATATTT 62882 GGAAAAAGGG Statistics Matches: 35, Mismatches: 7, Indels: 5 0.74 0.15 0.11 Matches are distributed among these distances: 24 6 0.17 25 20 0.57 26 2 0.06 27 7 0.20 ACGTcount: A:0.49, C:0.01, G:0.04, T:0.45 Consensus pattern (25 bp): TGAAAATTTTAATATATTTTAATAA Found at i:63057 original size:167 final size:174 Alignment explanation

Indices: 62752--63077 Score: 418 Period size: 167 Copynumber: 1.9 Consensus size: 174 62742 TAATAGTAAA * 62752 GGAAATTTACATGTTCATCAACTAAAATTAATTTGACAAACTTACAACTCGGTCTAAATTGAAAA 1 GGAAATTTACATGTTCATCAACTAAAATCAATTTGACAAACTTACAACTCGGTCTAAATTGAAAA * 62817 TTTTAATATATTTTAAATAATAAAATTATACTAAATTTTAATAATGGAAATTTATAAATATATTT 66 TTTTAATATA-TTTAAATAATAAAATTATACTAAATTTTAATAATGGAAATTTAGAAATATATTT * * ** 62882 GGAAAAAGGGTGTAATCGGAAAACATAAAATTTCCCATTATTCGT 130 GAAAAAAGGATACAATCGGAAAACATAAAATTTCCCATTATTCGT * * * * 62927 GGAAATTTGCATGTTCATCAA-TGAAAATCAATTTTACAAAACTTATAATTCGGTCTAAATTG-A 1 GGAAATTTACATGTTCATCAACT-AAAATCAATTTGAC-AAACTTACAACTCGGTCTAAATTGAA ** * ** 62990 AATTTT-ATA-A-TT-AATTTTTAAA-TA-A-TAAATTTTAATAATGTCAATTTAGAAATATATT 64 AATTTTAATATATTTAAATAATAAAATTATACTAAATTTTAATAATGGAAATTTAGAAATATATT * 63048 TGAAAAAAGGATACAATCGGAAGACATAAA 129 TGAAAAAAGGATACAATCGGAAAACATAAA 63078 GTTTTTCATT Statistics Matches: 133, Mismatches: 16, Indels: 12 0.83 0.10 0.07 Matches are distributed among these distances: 167 55 0.41 168 1 0.01 169 2 0.02 170 7 0.05 171 2 0.02 173 1 0.01 174 4 0.03 175 39 0.29 176 22 0.17 ACGTcount: A:0.45, C:0.09, G:0.10, T:0.36 Consensus pattern (174 bp): GGAAATTTACATGTTCATCAACTAAAATCAATTTGACAAACTTACAACTCGGTCTAAATTGAAAA TTTTAATATATTTAAATAATAAAATTATACTAAATTTTAATAATGGAAATTTAGAAATATATTTG AAAAAAGGATACAATCGGAAAACATAAAATTTCCCATTATTCGT Found at i:63443 original size:30 final size:31 Alignment explanation

Indices: 63407--63474 Score: 111 Period size: 30 Copynumber: 2.2 Consensus size: 31 63397 TAGTGGCAGT * 63407 TTGGAAATATGTTTTAAAAA-AAGGGTACAA 1 TTGGAAATATATTTTAAAAATAAGGGTACAA 63437 TTGGAAATATATTTTAAAAATAAGGGTACAA 1 TTGGAAATATATTTTAAAAATAAGGGTACAA * 63468 TCGGAAA 1 TTGGAAA 63475 ACATAAAATT Statistics Matches: 35, Mismatches: 2, Indels: 1 0.92 0.05 0.03 Matches are distributed among these distances: 30 19 0.54 31 16 0.46 ACGTcount: A:0.47, C:0.04, G:0.19, T:0.29 Consensus pattern (31 bp): TTGGAAATATATTTTAAAAATAAGGGTACAA Found at i:67838 original size:57 final size:58 Alignment explanation

Indices: 67765--67879 Score: 205 Period size: 58 Copynumber: 2.0 Consensus size: 58 67755 CAACATGCAA * 67765 TTCCAACCAATTTCCTATCTCAT-TTCTTTCTCTCTTACTCTCTTAGCTATTACTCTC 1 TTCCAACCAATTTCCTATCCCATCTTCTTTCTCTCTTACTCTCTTAGCTATTACTCTC * 67822 TTCCTACCAATTTCCTATCCCATCTTCTTTCTCTCTTACTCTCTTAGCTATTACTCTC 1 TTCCAACCAATTTCCTATCCCATCTTCTTTCTCTCTTACTCTCTTAGCTATTACTCTC 67880 AAAGCTTCTT Statistics Matches: 55, Mismatches: 2, Indels: 1 0.95 0.03 0.02 Matches are distributed among these distances: 57 21 0.38 58 34 0.62 ACGTcount: A:0.17, C:0.35, G:0.02, T:0.47 Consensus pattern (58 bp): TTCCAACCAATTTCCTATCCCATCTTCTTTCTCTCTTACTCTCTTAGCTATTACTCTC Found at i:69238 original size:15 final size:15 Alignment explanation

Indices: 69218--69259  Score: 75
Period size: 15  Copynumber: 2.8  Consensus size: 15

     69208 TAAAAATGAC

     69218 ATGAATGAAACAGAG
         1 ATGAATGAAACAGAG

                        *
     69233 ATGAATGAAACAGTG
         1 ATGAATGAAACAGAG

     69248 ATGAATGAAACA
         1 ATGAATGAAACA

     69260 TCTGTTCAAG

Statistics
Matches: 26, Mismatches: 1, Indels: 0
   0.96         0.04          0.00

Matches are distributed among these distances:
   15   26  1.00

ACGTcount: A:0.52, C:0.07, G:0.24, T:0.17

Consensus pattern (15 bp):
ATGAATGAAACAGAG


Found at i:69356 original size:15 final size:15

Alignment explanation

Indices: 69336--69368  Score: 57
Period size: 15  Copynumber: 2.2  Consensus size: 15

     69326 GCTAAGTCTA

     69336 AAATTGACATGAATG
         1 AAATTGACATGAATG

                  *
     69351 AAATTGAGATGAATG
         1 AAATTGACATGAATG

     69366 AAA
         1 AAA

     69369 CATCTGTTCA

Statistics
Matches: 17, Mismatches: 1, Indels: 0
   0.94         0.06          0.00

Matches are distributed among these distances:
   15   17  1.00

ACGTcount: A:0.52, C:0.03, G:0.21, T:0.24

Consensus pattern (15 bp):
AAATTGACATGAATG


Found at i:69375 original size:111 final size:111

Alignment explanation

Indices: 69233--69440 Score: 326 Period size: 111 Copynumber: 1.9 Consensus size: 111 69223 TGAAACAGAG * * * 69233 ATGAATGAAACAGTGATGAATGAAACATCTGTTCAAGTTGTCCACAACTCTCAAAGCCATGAAAA 1 ATGAATGAAACAGAGATGAATGAAACATCTGTTCAAGTTATCCAAAACTCTCAAAGCCATGAAAA * * 69298 ACGTAAGGAGAATGACACCAGGAAAGAAGCTAAGTCTAAAATTGAC 66 ACGTAAGGAGAATAACACCAGAAAAGAAGCTAAGTCTAAAATTGAC ** * 69344 ATGAATGAAATTGAGATGAATGAAACATCTGTTCAAGTTATCCAAAACTCTCAAAGCCGTGAAAA 1 ATGAATGAAACAGAGATGAATGAAACATCTGTTCAAGTTATCCAAAACTCTCAAAGCCATGAAAA ** 69409 ACGTACTGAGAATAACACCAGAAAAGAAGCTA 66 ACGTAAGGAGAATAACACCAGAAAAGAAGCTA 69441 CCACCCCTGC Statistics Matches: 87, Mismatches: 10, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 111 87 1.00 ACGTcount: A:0.44, C:0.17, G:0.19, T:0.20 Consensus pattern (111 bp): ATGAATGAAACAGAGATGAATGAAACATCTGTTCAAGTTATCCAAAACTCTCAAAGCCATGAAAA ACGTAAGGAGAATAACACCAGAAAAGAAGCTAAGTCTAAAATTGAC Found at i:69778 original size:18 final size:18 Alignment explanation

Indices: 69748--69790 Score: 61 Period size: 18 Copynumber: 2.4 Consensus size: 18 69738 ATTTTCCCCT 69748 TTTTAATAACCTAATTAA 1 TTTTAATAACCTAATTAA * 69766 TATTTAA-AACCTACTTAA 1 T-TTTAATAACCTAATTAA 69784 TTTTAAT 1 TTTTAAT 69791 TTTCTTTTAA Statistics Matches: 22, Mismatches: 1, Indels: 4 0.81 0.04 0.15 Matches are distributed among these distances: 17 5 0.23 18 12 0.55 19 5 0.23 ACGTcount: A:0.42, C:0.12, G:0.00, T:0.47 Consensus pattern (18 bp): TTTTAATAACCTAATTAA Found at i:77280 original size:22 final size:23 Alignment explanation

Indices: 77238--77286 Score: 64 Period size: 22 Copynumber: 2.2 Consensus size: 23 77228 TTACTTAATA * 77238 AAAAAGGTGCCTATTTACTTC-C 1 AAAAAGGTGCCTATTGACTTCAC * * 77260 AAAAAGGTGCTTATTGGCTTCAC 1 AAAAAGGTGCCTATTGACTTCAC 77283 AAAA 1 AAAA 77287 GGTTATCCCA Statistics Matches: 23, Mismatches: 3, Indels: 1 0.85 0.11 0.04 Matches are distributed among these distances: 22 18 0.78 23 5 0.22 ACGTcount: A:0.37, C:0.18, G:0.16, T:0.29 Consensus pattern (23 bp): AAAAAGGTGCCTATTGACTTCAC Found at i:81225 original size:30 final size:30 Alignment explanation

Indices: 81191--81260 Score: 79 Period size: 30 Copynumber: 2.3 Consensus size: 30 81181 GCCATGAATA 81191 AACAAGAAATAAATAAG-AAATTACATGAGG 1 AACAAGAAATAAA-AAGTAAATTACATGAGG * * * * 81221 AACAAGAATTGAAGAGTAAATTACATGGGG 1 AACAAGAAATAAAAAGTAAATTACATGAGG * 81251 AACAATAAAT 1 AACAAGAAAT 81261 TTGATGGAAG Statistics Matches: 33, Mismatches: 6, Indels: 2 0.80 0.15 0.05 Matches are distributed among these distances: 29 2 0.06 30 31 0.94 ACGTcount: A:0.56, C:0.07, G:0.19, T:0.19 Consensus pattern (30 bp): AACAAGAAATAAAAAGTAAATTACATGAGG Found at i:86103 original size:2 final size:2 Alignment explanation

Indices: 86096--86134  Score: 62
Period size: 2  Copynumber: 20.0  Consensus size: 2

     86086 TTACTATAAG

                                                               *
     86096 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT -T AT AC AT AT
         1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT

     86135 CTTTGTTATT

Statistics
Matches: 34, Mismatches: 2, Indels: 2
   0.89         0.05          0.05

Matches are distributed among these distances:
    1    1  0.03
    2   33  0.97

ACGTcount: A:0.49, C:0.03, G:0.00, T:0.49

Consensus pattern (2 bp):
AT


Done.