Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: scaffold_153 ID=scaffold_153-JGI_221_v2.0

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 10685
ACGTcount: A:0.32, C:0.19, G:0.20, T:0.29


Found at i:4278 original size:15 final size:16

Alignment explanation

Indices: 4258--4293  Score: 65
Period size: 15  Copynumber: 2.3  Consensus size: 16

    4248 AAAAAGAATA

    4258 AAAAAGAATAAAAA-G
       1 AAAAAGAATAAAAAGG

    4273 AAAAAGAATAAAAAGG
       1 AAAAAGAATAAAAAGG

    4289 AAAAA
       1 AAAAA

    4294 AAGAAAAAAG

Statistics
Matches: 20, Mismatches: 0, Indels: 1
        0.95            0.00        0.05

Matches are distributed among these distances:
   15       14  0.70
   16        6  0.30

ACGTcount: A:0.81, C:0.00, G:0.14, T:0.06

Consensus pattern (16 bp):
AAAAAGAATAAAAAGG

Found at i:4286 original size:10 final size:10

Alignment explanation

Indices: 4247--4271  Score: 50
Period size: 10  Copynumber: 2.5  Consensus size: 10

    4237 ATTTAGAAAG

    4247 AAAAAAGAAT
       1 AAAAAAGAAT

    4257 AAAAAAGAAT
       1 AAAAAAGAAT

    4267 AAAAA
       1 AAAAA

    4272 GAAAAAGAAT

Statistics
Matches: 15, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   10       15  1.00

ACGTcount: A:0.84, C:0.00, G:0.08, T:0.08

Consensus pattern (10 bp):
AAAAAAGAAT

Found at i:4295 original size:10 final size:9

Alignment explanation

Indices: 4247--4302  Score: 55
Period size: 9  Copynumber: 6.4  Consensus size: 9

    4237 ATTTAGAAAG

    4247 AAAAAAGAA
       1 AAAAAAGAA

    4256 TAAAAAAGAA
       1 -AAAAAAGAA

         *
    4266 TAAAAAG--
       1 AAAAAAGAA

    4273 -AAAAAGAA
       1 AAAAAAGAA

         *      *
    4281 TAAAAAGGA
       1 AAAAAAGAA

    4290 AAAAAAGAA
       1 AAAAAAGAA

    4299 AAAA
       1 AAAA

    4303 GAGATGCCAA

Statistics
Matches: 39, Mismatches: 4, Indels: 7
        0.78            0.08        0.14

Matches are distributed among these distances:
    6        6  0.15
    9       24  0.62
   10        9  0.23

ACGTcount: A:0.82, C:0.00, G:0.12, T:0.05

Consensus pattern (9 bp):
AAAAAAGAA

Found at i:4300 original size:15 final size:16

Alignment explanation

Indices: 4243--4303  Score: 61
Period size: 16  Copynumber: 3.7  Consensus size: 16

    4233 TTAGATTTAG

                    *
    4243 AAAGAAAAAAGAATAAA
       1 AAAGAAAAAAGGA-AAA

                    *
    4260 AAAGAATAAAAAGAAAA
       1 AAAGAA-AAAAGGAAAA

              *
    4277 AGAA-TAAAAAGGAAAA
       1 A-AAGAAAAAAGGAAAA

    4293 AAAGAAAAAAG
       1 AAAGAAAAAAG

    4304 AGATGCCAAG

Statistics
Matches: 36, Mismatches: 5, Indels: 7
        0.75            0.10        0.15

Matches are distributed among these distances:
   15        2  0.06
   16       16  0.44
   17       11  0.31
   18        7  0.19

ACGTcount: A:0.80, C:0.00, G:0.15, T:0.05

Consensus pattern (16 bp):
AAAGAAAAAAGGAAAA

Found at i:4302 original size:25 final size:26

Alignment explanation

Indices: 4243--4304  Score: 101
Period size: 25  Copynumber: 2.5  Consensus size: 26

    4233 TTAGATTTAG

    4243 AAAGAAAAAAGAATAAAAAAGAATAA
       1 AAAGAAAAAAGAATAAAAAAGAATAA

                            *
    4269 AAAG-AAAAAGAATAAAAAGGAA-AA
       1 AAAGAAAAAAGAATAAAAAAGAATAA

    4293 AAAGAAAAAAGA
       1 AAAGAAAAAAGA

    4305 GATGCCAAGG

Statistics
Matches: 34, Mismatches: 1, Indels: 3
        0.89            0.03        0.08

Matches are distributed among these distances:
   24        6  0.18
   25       24  0.71
   26        4  0.12

ACGTcount: A:0.81, C:0.00, G:0.15, T:0.05

Consensus pattern (26 bp):
AAAGAAAAAAGAATAAAAAAGAATAA

Found at i:5119 original size:8 final size:8

Alignment explanation

Indices: 5106--5136  Score: 55
Period size: 8  Copynumber: 4.0  Consensus size: 8

    5096 AGTCGAACAA

    5106 AAAAAATG
       1 AAAAAATG

    5114 AAAAAATG
       1 AAAAAATG

    5122 -AAAAATG
       1 AAAAAATG

    5129 AAAAAATG
       1 AAAAAATG

    5137 GAAAGTAAAA

Statistics
Matches: 22, Mismatches: 0, Indels: 2
        0.92            0.00        0.08

Matches are distributed among these distances:
    7        7  0.32
    8       15  0.68

ACGTcount: A:0.74, C:0.00, G:0.13, T:0.13

Consensus pattern (8 bp):
AAAAAATG

Found at i:5127 original size:15 final size:15

Alignment explanation

Indices: 5107--5136  Score: 60
Period size: 15  Copynumber: 2.0  Consensus size: 15

    5097 GTCGAACAAA

    5107 AAAAATGAAAAAATG
       1 AAAAATGAAAAAATG

    5122 AAAAATGAAAAAATG
       1 AAAAATGAAAAAATG

    5137 GAAAGTAAAA

Statistics
Matches: 15, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   15       15  1.00

ACGTcount: A:0.73, C:0.00, G:0.13, T:0.13

Consensus pattern (15 bp):
AAAAATGAAAAAATG

Found at i:5322 original size:44 final size:42

Alignment explanation

Indices: 5225--5338  Score: 106
Period size: 43  Copynumber: 2.7  Consensus size: 42

    5215 AAAAAGGGCG

                       *                       *
    5225 GCTCAAATTTTGGAACAAAATGGGGCATGAAGTGATCATAGC
       1 GCTCAAATTTTGGATCAAAATGGGGCATGAAGTGATCAGAGC

                              *          **     *
    5267 GACAT-AAATTTTGGATCAAATTGGGGCATGGTGGTGATTAGAGC
       1 G-C-TCAAATTTTGGATCAAAATGGGGCAT-GAAGTGATCAGAGC

                           * *
    5311 AGCTCAAATTTT-GATCAGACTGGGGCAT
       1 -GCTCAAATTTTGGATCAAAATGGGGCAT

    5339 ATGATGATCT

Statistics
Matches: 59, Mismatches: 8, Indels: 9
        0.78            0.11        0.12

Matches are distributed among these distances:
   42        1  0.02
   43       38  0.64
   44       19  0.32
   45        1  0.02

ACGTcount: A:0.32, C:0.13, G:0.28, T:0.27

Consensus pattern (42 bp):
GCTCAAATTTTGGATCAAAATGGGGCATGAAGTGATCAGAGC

Found at i:6588 original size:50 final size:51

Alignment explanation

Indices: 6462--6802 Score: 173 Period size: 51 Copynumber: 6.8 Consensus size: 51 6452 AAACCATAGT * * * * 6462 TTGC-AATTCAAAGATTGAGGCCACAACGGTAAATCTTACTTTCTCTGGCA 1 TTGCAAATTAAAAGATTGAGGCCACAACGGTAAATCTTACTTTCCCCGACA * **** * * * 6512 GTGCAGCGGAAAAGATTGAAGCCACAATGGTAAATCTTGC-TTCCCCGACA 1 TTGCAAATTAAAAGATTGAGGCCACAACGGTAAATCTTACTTTCCCCGACA * * * * * 6562 TTGCAAATTAAAAGATTGAGGCCACAACAGCAAATCTTACTTTCCCTGGCG 1 TTGCAAATTAAAAGATTGAGGCCACAACGGTAAATCTTACTTTCCCCGACA * * * * * * * * 6613 GTG-TAGTGAAACAGATTGAAGCTACGACGGCT-AATCTCAC--TCCCCTGACA 1 TTGCAAATTAAA-AGATTGAGGCCACAACGG-TAAATCTTACTTTCCCC-GACA ** * * ** 6663 TTGC-AATTAAAAGATTGAGGCCACAACGGCGAACCTTACTTTTCCATACGA 1 TTGCAAATTAAAAGATTGAGGCCACAACGGTAAATCTTACTTTCCCCGAC-A ** * * * * * ** ** 6714 -TGC-CGTGGAACAGATTGAAGCTACAATGGCGAATCTCGC-TTCCCCGACA 1 TTGCAAAT-TAAAAGATTGAGGCCACAACGGTAAATCTTACTTTCCCCGACA * 6763 TTGC-AATTAAAAGATTGAGGCCACAACGGTGAATCTTACT 1 TTGCAAATTAAAAGATTGAGGCCACAACGGTAAATCTTACT 6803 CACAGGCTGT Statistics Matches: 205, Mismatches: 73, Indels: 26 0.67 0.24 0.09 Matches are distributed among these distances: 49 50 0.24 50 70 0.34 51 85 0.41 ACGTcount: A:0.31, C:0.24, G:0.21, T:0.24 Consensus pattern (51 bp): TTGCAAATTAAAAGATTGAGGCCACAACGGTAAATCTTACTTTCCCCGACA Found at i:6726 original size:100 final size:98 Alignment explanation

Indices: 6462--8095 Score: 941 Period size: 95 Copynumber: 16.7 Consensus size: 98 6452 AAACCATAGT * ** * * 6462 TTGCAATTCAAAGATTGAGGCCACAACGGTAAATCTTACTTTCTC-TGGCAGTGCAGCGGAAAAG 1 TTGCAATTAAAAGATTGAGGCCACAACGGCGAATCTTACTTTC-CAT-GC-GTGCAGTGGAACAG * * ** ** 6526 ATTGAAGCCACAATGGTAAATCTTGCTTCCCCGACA 63 ATTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * * * 6562 TTGCAAATTAAAAGATTGAGGCCACAACAGCAAATCTTACTTTCCCTGGCGGTGTAGTGAAACAG 1 TTGC-AATTAAAAGATTGAGGCCACAACGGCGAATCTTACTTTCCAT-GC-GTGCAGTGGAACAG * * 6627 ATTGAAGCTACGACGGCTAATCTCAC-TCCCCTGACA 63 ATTGAAGCTACAACGGCGAATCTCACTTCCCC-GACA * * * 6663 TTGCAATTAAAAGATTGAGGCCACAACGGCGAACCTTACTTTTCCATACGATGCCGTGGAACAGA 1 TTGCAATTAAAAGATTGAGGCCACAACGGCGAATCTTAC-TTTCCATGCG-TGCAGTGGAACAGA * * 6728 TTGAAGCTACAATGGCGAATCTCGCTTCCCCGACA 64 TTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * 6763 TTGCAATTAAAAGATTGAGGCCACAACGGTGAATCTTAC--TCACAGGCTGTGCAGTGGAACAAA 1 TTGCAATTAAAAGATTGAGGCCACAACGGCGAATCTTACTTTC-CATGC-GTGCAGTGGAACAGA * 6826 TTAAAGCTACAACGGCGAATCTCACTTCCCCGACA 64 TTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * * * * * 6861 TTGCAATTAAAAGATTGAAGCTACAACAGTGGATCTTACTTTCCTAGTG-TTACAGTGGAACAGA 1 TTGCAATTAAAAGATTGAGGCCACAACGGCGAATCTTACTTTCC-A-TGCGTGCAGTGGAACAGA * * ** * 6925 TTGAAGCTACAACAGCGGATCTTGCTTCCTCG--- 64 TTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * ** * * * 6957 --GCAGTTAAAAAGATTGAAGCTACAACAACGGATCTTACTTTCCTAGTG-TTGCAGTTGAACAG 1 TTGCAATT-AAAAGATTGAGGCCACAACGGCGAATCTTACTTTCC-A-TGCGTGCAGTGGAACAG * ** * 7019 ATTGAAGCTACAACAGC-AGATCTTGCTT--CC-TCA 63 ATTGAAGCTACAACGGCGA-ATCTCACTTCCCCGACA * * ** * * * * * 7052 --GCAGTTAAGAAGATTGAAGCTGCAACAGTGGATCTTACTTTCTTAGTG-TTGCAGTGGAACAG 1 TTGCAATTAA-AAGATTGAGGCCACAACGGCGAATCTTACTTTC-CA-TGCGTGCAGTGGAACAG ** * ** 7114 ATTGAAGCTACAATAGCGGATCTTGCTT---C--C- 63 ATTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * * * * * 7144 TTGGAAGTTAAAAAGATTGAAGCTATAACAGCGGATCTTACTTTCCTAGTG-TTGCAGTGGAACA 1 TTGCAA-TT-AAAAGATTGAGGCCACAACGGCGAATCTTACTTTCC-A-TGCGTGCAGTGGAACA * ** * 7208 GATTGAAGCTACAACAGC-AGATCTTGCTT--CC-TCA 62 
GATTGAAGCTACAACGGCGA-ATCTCACTTCCCCGACA * * * ** * * * * 7242 --GCAGTTAAGAAGATTGAAGCTACAATAGTGGATCTTACTTTCTTAGTG-TTGCAGTGGAACAG 1 TTGCAATTAA-AAGATTGAGGCCACAACGGCGAATCTTACTTTC-CA-TGCGTGCAGTGGAACAG * ** * ** * 7304 GTTGAAGCTACAATAGCGGATCTTGCTTCCTCG--- 63 ATTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * * * * 7337 --GCAGTTAAAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTG-TTGCAGTGGAACAG 1 TTGCAATT-AAAAGATTGAGGCCACAACGGCGAATCTTACTTTCC-A-TGCGTGCAGTGGAACAG * * ** 7399 ATTGAAGCTACAACAGCGGATCTTGCTTCCCCGACA 63 ATTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * ** * * * 7435 --G---TTAAAAAGATTGAAGCTACAACAACGGATCTTACTTTCCTAATG-TTGCAGTGGAACAT 1 TTGCAATT-AAAAGATTGAGGCCACAACGGCGAATCTTACTTTCC--ATGCGTGCAGTGGAACAG ** * ** * 7494 ATTGAAGCTACAACAACGGATCTTGCTTCCTCG--- 63 ATTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * * * * * ** * * 7527 --GTAGTTAAAAAGATTGAAGCTACAACAGCGAATCTTACTTCCCAAGAAATGTAGCGGAACAGA 1 TTGCAATT-AAAAGATTGAGGCCACAACGGCGAATCTTACTTTCCATG-CGTGCAGTGGAACAGA * 7590 TTGAAGCTACGACGGCGAATCTCACTTCCCCGACA 64 TTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * * * * 7625 TTGTAATTTAAAAGATTGAGGCCACAACGGTGGATCTTAC-TTCCAAGCGATGCAGCGGAACAAA 1 TTGCAA-TTAAAAGATTGAGGCCACAACGGCGAATCTTACTTTCCATGCG-TGCAGTGGAACAGA * ** 7689 TTGAAGCTACAATGGCGGGTCTCACTT-CCCGACA 64 TTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * * ** 7723 TTGCAATTTAAAAGATTGAGGGCACAACGCCGGATCTTAC-TTCCAAGCGATGCAGCAGAACAGA 1 TTGCAA-TTAAAAGATTGAGGCCACAACGGCGAATCTTACTTTCCATGCG-TGCAGTGGAACAGA * * * * * 7787 TTGAAGATACGACAGCGAATCTCACTTCCCCAAAA 64 TTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * * 7822 CTGCAATTTAAAAGATTGAGGCCGCAACGGCGGATCTTAC-TTCCAAGCGATGCAGTGGAACAGA 1 TTGCAA-TTAAAAGATTGAGGCCACAACGGCGAATCTTACTTTCCATGCG-TGCAGTGGAACAGA * * 7886 TTGAAGCTACAACGGGGAATCTCACTTCCCCGATA 64 TTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * 7921 TTGCAAGTTAAAAGATTAAAGTCC-CAACGGCGAATCTTA-TTTCCCA-GCGATGCAGAGGAACA 1 TTGCAA-TTAAAAGATT-GAGGCCACAACGGCGAATCTTACTTT-CCATGCG-TGCAGTGGAACA * * * 7983 TATTGAAGCTACGACGGTGAATCTCACTTCCCCGACA 62 GATTGAAGCTACAACGGCGAATCTCACTTCCCCGACA * * * * * * * 8020 
TGGCAATTAAAATAGATTGAAGTCACGACGGCGAATCTTACTTTTCAGGCGATGCAGTAGAACAG 1 TTGCAATT-AAA-AGATTGAGGCCACAACGGCGAATCTTACTTTCCATGCG-TGCAGTGGAACAG 8085 ATTGAAGCTAC 63 ATTGAAGCTAC 8096 GGTGGTGAAT Statistics Matches: 1303, Mismatches: 169, Indels: 123 0.82 0.11 0.08 Matches are distributed among these distances: 92 1 0.00 93 4 0.00 94 12 0.01 95 545 0.42 96 8 0.01 97 4 0.00 98 171 0.13 99 263 0.20 100 176 0.14 101 119 0.09 ACGTcount: A:0.32, C:0.22, G:0.22, T:0.25 Consensus pattern (98 bp): TTGCAATTAAAAGATTGAGGCCACAACGGCGAATCTTACTTTCCATGCGTGCAGTGGAACAGATT GAAGCTACAACGGCGAATCTCACTTCCCCGACA Found at i:6990 original size:95 final size:95 Alignment explanation

Indices: 6869--7599 Score: 1129 Period size: 95 Copynumber: 7.7 Consensus size: 95 6859 CATTGCAATT * * 6869 AAAAGATTGAAGCTACAACAGTGGATCTTACTTTCCTAGTGTTACAGTGGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA 6934 CAACAGCGGATCTTGCTTCCTCGGCAGTTA 66 CAACAGCGGATCTTGCTTCCTCGGCAGTTA * * 6964 AAAAGATTGAAGCTACAACAACGGATCTTACTTTCCTAGTGTTGCAGTTGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * * 7029 CAACAGCAGATCTTGCTTCCTCAGCAGTTA 66 CAACAGCGGATCTTGCTTCCTCGGCAGTTA * * * * 7059 AGAAGATTGAAGCTGCAACAGTGGATCTTACTTTCTTAGTGTTGCAGTGGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * * * 7124 CAATAGCGGATCTTGCTTCCTTGGAAGTTA 66 CAACAGCGGATCTTGCTTCCTCGGCAGTTA * 7154 AAAAGATTGAAGCTATAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * * 7219 CAACAGCAGATCTTGCTTCCTCAGCAGTTA 66 CAACAGCGGATCTTGCTTCCTCGGCAGTTA * * * * * 7249 AGAAGATTGAAGCTACAATAGTGGATCTTACTTTCTTAGTGTTGCAGTGGAACAGGTTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * 7314 CAATAGCGGATCTTGCTTCCTCGGCAGTTA 66 CAACAGCGGATCTTGCTTCCTCGGCAGTTA 7344 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * * 7409 CAACAGCGGATCTTGCTTCCCCGACAGTTA 66 CAACAGCGGATCTTGCTTCCTCGGCAGTTA * * * 7439 AAAAGATTGAAGCTACAACAACGGATCTTACTTTCCTAATGTTGCAGTGGAACATATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * * 7504 CAACAACGGATCTTGCTTCCTCGGTAGTTA 66 CAACAGCGGATCTTGCTTCCTCGGCAGTTA * * * *** * * 7534 AAAAGATTGAAGCTACAACAGCGAATCTTACTTCCCAAGAAATGTAGCGGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA 7599 C 66 C 7600 GACGGCGAAT Statistics Matches: 574, Mismatches: 62, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 95 574 1.00 ACGTcount: A:0.31, C:0.19, G:0.22, T:0.27 Consensus pattern (95 bp): 
AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA CAACAGCGGATCTTGCTTCCTCGGCAGTTA Found at i:7618 original size:95 final size:95 Alignment explanation

Indices: 6869--7700 Score: 1076 Period size: 95 Copynumber: 8.7 Consensus size: 95 6859 CATTGCAATT * * 6869 AAAAGATTGAAGCTACAACAGTGGATCTTACTTTCCTAGTGTTACAGTGGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * 6934 CAACAGCGGATCTTGCTTCCTCGGCAGTTA 66 CAACAGCGAATCTTGCTTCCTCGGCAGTTA * * 6964 AAAAGATTGAAGCTACAACAACGGATCTTACTTTCCTAGTGTTGCAGTTGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * 7029 CAACAGC-AGATCTTGCTTCCTCAGCAGTTA 66 CAACAGCGA-ATCTTGCTTCCTCGGCAGTTA * * * * 7059 AGAAGATTGAAGCTGCAACAGTGGATCTTACTTTCTTAGTGTTGCAGTGGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * * * * 7124 CAATAGCGGATCTTGCTTCCTTGGAAGTTA 66 CAACAGCGAATCTTGCTTCCTCGGCAGTTA * 7154 AAAAGATTGAAGCTATAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * 7219 CAACAGC-AGATCTTGCTTCCTCAGCAGTTA 66 CAACAGCGA-ATCTTGCTTCCTCGGCAGTTA * * * * * 7249 AGAAGATTGAAGCTACAATAGTGGATCTTACTTTCTTAGTGTTGCAGTGGAACAGGTTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * * 7314 CAATAGCGGATCTTGCTTCCTCGGCAGTTA 66 CAACAGCGAATCTTGCTTCCTCGGCAGTTA 7344 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * * * 7409 CAACAGCGGATCTTGCTTCCCCGACAGTTA 66 CAACAGCGAATCTTGCTTCCTCGGCAGTTA * * * 7439 AAAAGATTGAAGCTACAACAACGGATCTTACTTTCCTAATGTTGCAGTGGAACATATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * * * 7504 CAACAACGGATCTTGCTTCCTCGGTAGTTA 66 CAACAGCGAATCTTGCTTCCTCGGCAGTTA * * * *** * * 7534 AAAAGATTGAAGCTACAACAGCGAATCTTACTTCCCAAGAAATGTAGCGGAACAGATTGAAGCTA 1 AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA * * ** * * * 7599 CGACGGCGAATCTCACTTCCCCGACATTGTAATTT 66 CAACAGCGAATCTTGCTTCCTCGGCA--G---TTA * * * * * * * * * 7634 AAAAGATTGAGGCCACAACGGTGGATCTTAC-TTCCAAGCGATGCAGCGGAACAAATTGAAGCTA 1 
AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA 7698 CAA 66 CAA 7701 TGGCGGGTCT Statistics Matches: 645, Mismatches: 83, Indels: 14 0.87 0.11 0.02 Matches are distributed among these distances: 95 586 0.91 97 1 0.00 99 30 0.05 100 28 0.04 ACGTcount: A:0.32, C:0.20, G:0.22, T:0.27 Consensus pattern (95 bp): AAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTA CAACAGCGAATCTTGCTTCCTCGGCAGTTA Found at i:7620 original size:190 final size:190 Alignment explanation

Indices: 6812--7898 Score: 1161 Period size: 190 Copynumber: 5.6 Consensus size: 190 6802 TCACAGGCTG * * * ** 6812 TGCAGTGGAACAAATTAAAGCTACAACGGCGAATCTCACTTCCCCGACA-TTGCAATTAAAAGAT 1 TGCAGTGGAACAGATTGAAGCTACAACAGCGAATCTTGCTTCCCCGACAGTT---A--AAAAGAT * 6876 TGAAGCTACAACAGTGGATCTTACTTTCCTAGTGTTACAGTGGAACAGATTGAAGCTACAACAGC 61 TGAAGCTACAACAGTGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTACAACAGC * 6941 GGATCTTGCTTCCTCGGCAGTTAAAAAGATTGAAGCTACAACAACGGATCTTACTTTCCTAGTGT 126 GGATCTTGCTTCCTCGGCAGTTAAAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGT * * * 7006 TGCAGTTGAACAGATTGAAGCTACAACAGC-AGATCTTGCTTCCTC-AGCAGTTAAGAAGATTGA 1 TGCAGTGGAACAGATTGAAGCTACAACAGCGA-ATCTTGCTTCCCCGA-CAGTTAAAAAGATTGA * * * 7069 AGCTGCAACAGTGGATCTTACTTTCTTAGTGTTGCAGTGGAACAGATTGAAGCTACAATAGCGGA 64 AGCTACAACAGTGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTACAACAGCGGA * * * 7134 TCTTGCTTCCTTGGAAGTTAAAAAGATTGAAGCTATAACAGCGGATCTTACTTTCCTAGTGT 129 TCTTGCTTCCTCGGCAGTTAAAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGT * * 7196 TGCAGTGGAACAGATTGAAGCTACAACAGC-AGATCTTGCTTCCTC-AGCAGTTAAGAAGATTGA 1 TGCAGTGGAACAGATTGAAGCTACAACAGCGA-ATCTTGCTTCCCCGA-CAGTTAAAAAGATTGA * * * * 7259 AGCTACAATAGTGGATCTTACTTTCTTAGTGTTGCAGTGGAACAGGTTGAAGCTACAATAGCGGA 64 AGCTACAACAGTGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTACAACAGCGGA 7324 TCTTGCTTCCTCGGCAGTTAAAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGT 129 TCTTGCTTCCTCGGCAGTTAAAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGT * 7386 TGCAGTGGAACAGATTGAAGCTACAACAGCGGATCTTGCTTCCCCGACAGTTAAAAAGATTGAAG 1 TGCAGTGGAACAGATTGAAGCTACAACAGCGAATCTTGCTTCCCCGACAGTTAAAAAGATTGAAG ** * * * 7451 CTACAACAACGGATCTTACTTTCCTAATGTTGCAGTGGAACATATTGAAGCTACAACAACGGATC 66 CTACAACAGTGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTACAACAGCGGATC * * * * *** 7516 TTGCTTCCTCGGTAGTTAAAAAGATTGAAGCTACAACAGCGAATCTTACTTCCCAAGAAA 131 TTGCTTCCTCGGCAGTTAAAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGT * * * * ** * 7576 TGTAGCGGAACAGATTGAAGCTACGACGGCGAATCTCACTTCCCCGACATTGTAATTTAAAAGAT 1 TGCAGTGGAACAGATTGAAGCTACAACAGCGAATCTTGCTTCCCCGACA--G---TTAAAAAGAT * * * * * * * * ** 
7641 TGAGGCCACAACGGTGGATCTTAC-TTCCAAGCGATGCAGCGGAACAAATTGAAGCTACAATGGC 61 TGAAGCTACAACAGTGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTACAACAGC * ** * * * 7705 GGGTCTCACTTCC-CGACATTGCAATTTAAAAGATTGAGGGC-ACAAC-GCCGGATCTTAC-TTC 126 GGATCTTGCTTCCTCG-----GCAGTTAAAAAGATTGA-AGCTACAACAG-CGGATCTTACTTTC * * * 7766 CAAGCGA 184 CTAGTGT ** * * ** * * * 7773 TGCAGCAGAACAGATTGAAGATACGACAGCGAATCTCACTTCCCCAAAACTGCAATTTAAAAGAT 1 TGCAGTGGAACAGATTGAAGCTACAACAGCGAATCTTGCTTCCCC--GA---CAGTTAAAAAGAT * ** * * * * * 7838 TGAGGCCGCAACGGCGGATCTTAC-TTCCAAGCGATGCAGTGGAACAGATTGAAGCTACAAC 61 TGAAGCTACAACAGTGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTACAAC 7899 GGGGAATCTC Statistics Matches: 785, Mismatches: 86, Indels: 41 0.86 0.09 0.04 Matches are distributed among these distances: 190 521 0.66 191 1 0.00 192 2 0.00 193 4 0.01 194 79 0.10 195 30 0.04 197 115 0.15 198 28 0.04 199 3 0.00 202 2 0.00 ACGTcount: A:0.32, C:0.21, G:0.22, T:0.25 Consensus pattern (190 bp): TGCAGTGGAACAGATTGAAGCTACAACAGCGAATCTTGCTTCCCCGACAGTTAAAAAGATTGAAG CTACAACAGTGGATCTTACTTTCCTAGTGTTGCAGTGGAACAGATTGAAGCTACAACAGCGGATC TTGCTTCCTCGGCAGTTAAAAAGATTGAAGCTACAACAGCGGATCTTACTTTCCTAGTGT Found at i:7765 original size:49 final size:49 Alignment explanation

Indices: 7601--7766 Score: 124 Period size: 49 Copynumber: 3.4 Consensus size: 49 7591 TGAAGCTACG * * * 7601 ACGGCGAATCTCACTTCCCCGACATTGTAATTTAAAAGATTGAGGCCACA 1 ACGGCGGATCTCACTT-CCCGACATTGCAATTTAAAAGATTGAGGGCACA * * * **** * 7651 ACGGTGGATCTTACTT-CCAAGCGA-TGCAGCGGAACAA-ATTGA-AGCTACA 1 ACGGCGGATCTCACTTCCCGA-C-ATTGCAATTTAA-AAGATTGAGGGC-ACA * * 7700 ATGGCGGGTCTCACTTCCCGACATTGCAATTTAAAAGATTGAGGGCACA 1 ACGGCGGATCTCACTTCCCGACATTGCAATTTAAAAGATTGAGGGCACA * * 7749 ACGCCGGATCTTACTTCC 1 ACGGCGGATCTCACTTCC 7767 AAGCGATGCA Statistics Matches: 83, Mismatches: 25, Indels: 17 0.66 0.20 0.14 Matches are distributed among these distances: 48 7 0.08 49 55 0.66 50 21 0.25 ACGTcount: A:0.30, C:0.25, G:0.22, T:0.23 Consensus pattern (49 bp): ACGGCGGATCTCACTTCCCGACATTGCAATTTAAAAGATTGAGGGCACA Found at i:7788 original size:49 final size:49 Alignment explanation

Indices: 7637--7914 Score: 129 Period size: 49 Copynumber: 5.7 Consensus size: 49 7627 GTAATTTAAA * * * 7637 AGATTGAGGCCACAACGGTGGATCTTACTTCCAAGCGATGCAGCGGAAC 1 AGATTGAGGGCACAACGGCGGATCTTACTTCCAAGCGATGCAGCAGAAC * * * * * * **** * 7686 AAATTGA-AGCTACAATGGCGGGTCTCACTTCC-CGAC-ATTGCAATTTAAA 1 AGATTGAGGGC-ACAACGGCGGATCTTACTTCCAAG-CGA-TGCAGCAGAAC * 7735 AGATTGAGGGCACAACGCCGGATCTTACTTCCAAGCGATGCAGCAGAAC 1 AGATTGAGGGCACAACGGCGGATCTTACTTCCAAGCGATGCAGCAGAAC * ** * * * * * **** * 7784 AGATTGAAGATACGACAGCGAATCTCACTTCCCCAA--AACTGCAATTTAAA 1 AGATTGAGGGCACAACGGCGGATCTTACTT--CCAAGCGA-TGCAGCAGAAC * * ** 7834 AGATTGAGGCCGCAACGGCGGATCTTACTTCCAAGCGATGCAGTGGAAC 1 AGATTGAGGGCACAACGGCGGATCTTACTTCCAAGCGATGCAGCAGAAC * * 7883 AGATTGA-AGCTACAACGG-GGAATCTCACTTCC 1 AGATTGAGGGC-ACAACGGCGG-ATCTTACTTCC 7915 CCGATATTGC Statistics Matches: 160, Mismatches: 56, Indels: 26 0.66 0.23 0.11 Matches are distributed among these distances: 48 10 0.06 49 113 0.71 50 33 0.21 51 4 0.03 ACGTcount: A:0.32, C:0.24, G:0.23, T:0.21 Consensus pattern (49 bp): AGATTGAGGGCACAACGGCGGATCTTACTTCCAAGCGATGCAGCAGAAC Found at i:8250 original size:44 final size:43 Alignment explanation

Indices: 8149--8655 Score: 340 Period size: 44 Copynumber: 11.7 Consensus size: 43 8139 AAAATCGCAG * * * 8149 ATCTTATCTCTCTAAAGTTGCAGTAGAGCAGATC---GTATCTA 1 ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGA-CAAAGAATCCA * * * * * 8190 GTCTTATCTCCCTGAAGTTGTAGTGGAGCAGACAAACGAAACCA 1 ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGACAAA-GAATCCA * * ** ** 8234 ATCCTATCTCTCTGAAGTTACAGTAGAGTGGATTAA-AATCACA 1 ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGACAAAGAATC-CA * 8277 GATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATC---GCATCCA 1 -ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGA-CAAAGAATCCA * * * * 8319 GTCTTATCTCCCTGAAGTTGCAGTGGAGCAGACAGAAGAAACCA 1 ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGACA-AAGAATCCA * * * * * ** 8363 ATCCTATCTCTCTAAAGTTACAGTAAAGCGGATTAA-AATCACA 1 ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGACAAAGAATC-CA * * * 8406 GATCTTATCTCTCTGAAGTTACAGTGGAGCAGACAGAAGAAACCA 1 -ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGACA-AAGAATCCA * * * ** * 8451 ATCCTATCTCTCTAAAGTTGCAGTAGAGCGGATTAA-AATCATA 1 ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGACAAAGAATC-CA * 8494 GATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATC--A-CATCCA 1 -ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGA-CAAAGAATCCA * * * * 8536 GTCTTATCTCCCTGAAGTTGCAGTGGAGCAGATAGAAGAAGT-CA 1 ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGACA-AAGAA-TCCA * * * * ** * 8580 ATCCTATCTCCCTGAAGTTGCAGTGGAGCTGATTAA-AACCGCAA 1 ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGACAAAGAATC-C-A * 8624 ATCTTATTTCTCTGAAGTTGCAGTAGAGCAGA 1 ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGA 8656 TCGCATCAGA Statistics Matches: 360, Mismatches: 79, Indels: 51 0.73 0.16 0.10 Matches are distributed among these distances: 40 2 0.01 41 85 0.24 42 14 0.04 43 20 0.06 44 231 0.64 45 5 0.01 46 3 0.01 ACGTcount: A:0.32, C:0.21, G:0.20, T:0.27 Consensus pattern (43 bp): ATCTTATCTCTCTGAAGTTGCAGTAGAGCAGACAAAGAATCCA Found at i:8369 original size:129 final size:129 Alignment explanation

Indices: 8134--8438 Score: 495 Period size: 129 Copynumber: 2.4 Consensus size: 129 8124 CAGTTAAACA * * * * 8134 GATTGAAAATCGCAGATCTTATCTCTCTAAAGTTGCAGTAGAGCAGATCGTATCTAGTCTTATCT 1 GATT-AAAATCACAGATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCGCATCCAGTCTTATCT * * * * 8199 CCCTGAAGTTGTAGTGGAGCAGACAAACGAAACCAATCCTATCTCTCTGAAGTTACAGTAGAGTG 65 CCCTGAAGTTGCAGTGGAGCAGACAAACGAAACCAATCCTATCTCTCTAAAGTTACAGTAAAGCG 8264 GATTAAAATCACAGATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCGCATCCAGTCTTATCTC 1 GATTAAAATCACAGATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCGCATCCAGTCTTATCTC 8329 CCTGAAGTTGCAGTGGAGCAGACAGAA-GAAACCAATCCTATCTCTCTAAAGTTACAGTAAAGCG 66 CCTGAAGTTGCAGTGGAGCAGACA-AACGAAACCAATCCTATCTCTCTAAAGTTACAGTAAAGCG * * 8393 GATTAAAATCACAGATCTTATCTCTCTGAAGTTACAGTGGAGCAGA 1 GATTAAAATCACAGATCTTATCTCTCTGAAGTTGCAGTAGAGCAGA 8439 CAGAAGAAAC Statistics Matches: 164, Mismatches: 10, Indels: 3 0.93 0.06 0.02 Matches are distributed among these distances: 129 158 0.96 130 6 0.04 ACGTcount: A:0.32, C:0.21, G:0.20, T:0.27 Consensus pattern (129 bp): GATTAAAATCACAGATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCGCATCCAGTCTTATCTC CCTGAAGTTGCAGTGGAGCAGACAAACGAAACCAATCCTATCTCTCTAAAGTTACAGTAAAGCG Found at i:8541 original size:129 final size:131 Alignment explanation

Indices: 8233--8663 Score: 392 Period size: 129 Copynumber: 3.3 Consensus size: 131 8223 AAACGAAACC * * ** * * * * 8233 AATCCTATCTCTCTGAAGTTACAGTAGAGTGGATTAAAATCACAGATCTTATCTCTCTGAAGTTG 1 AATCTTATCTCTCTGAAGTTGCAGTAGAGCAGA-TCACATCACAGATCTTATCTCCCTGAAGTTA * * * * * * * * * 8298 CAGT--AG-AG-CAGATCGCATCCAGTCTTATCTCCCTGAAGTTGCAGTGGAGCAGA-CAGAAGA 65 CAGTGGAGCAGACAGA-AGAAACCAATCCTATCTCCCTAAAGTTGCAGTAGAGCGGATTA-AA-A * * 8358 -AACC 127 CCACA * * * * * * * * 8362 AATCCTATCTCTCTAAAGTTACAGTAAAGCGGATTAAAATCACAGATCTTATCTCTCTGAAGTTA 1 AATCTTATCTCTCTGAAGTTGCAGTAGAGCAGA-TCACATCACAGATCTTATCTCCCTGAAGTTA * * 8427 CAGTGGAGCAGACAGAAGAAACCAATCCTATCTCTCTAAAGTTGCAGTAGAGCGGATTAAAATCA 65 CAGTGGAGCAGACAGAAGAAACCAATCCTATCTCCCTAAAGTTGCAGTAGAGCGGATTAAAACCA * 8492 TA 130 CA * * 8494 GATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCACATC-CAG-TCTTATCTCCCTGAAGTTGC 1 AATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCACATCACAGATCTTATCTCCCTGAAGTTAC * ** * * * * 8557 AGTGGAGCAGATAGAAGAAGTCAATCCTATCTCCCTGAAGTTGCAGTGGAGCTGATTAAAACCGC 66 AGTGGAGCAGACAGAAGAAACCAATCCTATCTCCCTAAAGTTGCAGTAGAGCGGATTAAAACCAC 8622 A 131 A * * 8623 AATCTTATTTCTCTGAAGTTGCAGTAGAGCAGATCGCATCA 1 AATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCACATCA 8664 GATTTTATTT Statistics Matches: 255, Mismatches: 40, Indels: 13 0.83 0.13 0.04 Matches are distributed among these distances: 129 176 0.69 130 3 0.01 131 8 0.03 132 63 0.25 133 5 0.02 ACGTcount: A:0.32, C:0.21, G:0.20, T:0.27 Consensus pattern (131 bp): AATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCACATCACAGATCTTATCTCCCTGAAGTTAC AGTGGAGCAGACAGAAGAAACCAATCCTATCTCCCTAAAGTTGCAGTAGAGCGGATTAAAACCAC A Found at i:8566 original size:217 final size:217 Alignment explanation

Indices: 8191--8655 Score: 716 Period size: 217 Copynumber: 2.1 Consensus size: 217 8181 TCGTATCTAG * * * 8191 TCTTATCTCCCTGAAGTTGTAGTGGAGCAGACAAACGAAACCAATCCTATCTCTCTGAAGTTACA 1 TCTTATCTCTCTGAAGTTGCAGTGGAGCAGACAAACGAAACCAATCCTATCTCTCTAAAGTTACA * * 8256 GTAGAGTGGATTAAAATCACAGATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCGCATCCAGT 66 GTAGAGCGGATTAAAATCACAGATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCACATCCAGT * 8321 CTTATCTCCCTGAAGTTGCAGTGGAGCAGACAGAAGAAACCAATCCTATCTCTCTAAAGTTACAG 131 CTTATCTCCCTGAAGTTGCAGTGGAGCAGACAGAAGAAACCAATCCTATCTCCCTAAAGTTACAG * * 8386 TAAAGCGGATTAAAATCACAGA 196 TAAAGCGGATTAAAACCACAAA * * 8408 TCTTATCTCTCTGAAGTTACAGTGGAGCAGACAGAA-GAAACCAATCCTATCTCTCTAAAGTTGC 1 TCTTATCTCTCTGAAGTTGCAGTGGAGCAGACA-AACGAAACCAATCCTATCTCTCTAAAGTTAC * 8472 AGTAGAGCGGATTAAAATCATAGATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCACATCCAG 65 AGTAGAGCGGATTAAAATCACAGATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCACATCCAG * ** * * 8537 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGATAGAAGAAGTCAATCCTATCTCCCTGAAGTTGCA 130 TCTTATCTCCCTGAAGTTGCAGTGGAGCAGACAGAAGAAACCAATCCTATCTCCCTAAAGTTACA ** * * 8602 GTGGAGCTGATTAAAACCGCAAA 195 GTAAAGCGGATTAAAACCACAAA * * 8625 TCTTATTTCTCTGAAGTTGCAGTAGAGCAGA 1 TCTTATCTCTCTGAAGTTGCAGTGGAGCAGA 8656 TCGCATCAGA Statistics Matches: 224, Mismatches: 23, Indels: 2 0.90 0.09 0.01 Matches are distributed among these distances: 217 222 0.99 218 2 0.01 ACGTcount: A:0.32, C:0.21, G:0.20, T:0.27 Consensus pattern (217 bp): TCTTATCTCTCTGAAGTTGCAGTGGAGCAGACAAACGAAACCAATCCTATCTCTCTAAAGTTACA GTAGAGCGGATTAAAATCACAGATCTTATCTCTCTGAAGTTGCAGTAGAGCAGATCACATCCAGT CTTATCTCCCTGAAGTTGCAGTGGAGCAGACAGAAGAAACCAATCCTATCTCCCTAAAGTTACAG TAAAGCGGATTAAAACCACAAA Done.