Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWWV01015542.1 Corchorus capsularis cultivar CVL-1 contig15563, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 41582
ACGTcount: A:0.31, C:0.19, G:0.17, T:0.33


Found at i:9058 original size:3 final size:3

Alignment explanation

Indices: 9050--9092  Score: 68
Period size: 3  Copynumber: 14.3  Consensus size: 3

      9040 GGGATGAATT

                                           *                *
      9050 TAA TAA TAA TAA TAA TAA TAA TAA CAA TAA TAA TAA TTA TAA T
         1 TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA TAA T

      9093 TATTACTATT

Statistics
Matches: 36, Mismatches: 4, Indels: 0
        0.90            0.10        0.00

Matches are distributed among these distances:
  3  36  1.00

ACGTcount: A:0.63, C:0.02, G:0.00, T:0.35

Consensus pattern (3 bp):
TAA


Found at i:11703 original size:30 final size:30

Alignment explanation

Indices: 11667--11729  Score: 92
Period size: 30  Copynumber: 2.1  Consensus size: 30

     11657 TCTTCAAGGG

                          *           *
     11667 GGAGGGAATGATGCGCCCAAGG-CTTATCAT
         1 GGAGGGAATGATGC-ACCAAGGACTTACCAT

     11697 GGAGGGAATGATGCACCAAGGACTTACCAT
         1 GGAGGGAATGATGCACCAAGGACTTACCAT

     11727 GGA
         1 GGA

     11730 CTTGAAGACA

Statistics
Matches: 30, Mismatches: 2, Indels: 2
        0.88            0.06        0.06

Matches are distributed among these distances:
  29  6  0.20
  30  24  0.80

ACGTcount: A:0.30, C:0.19, G:0.33, T:0.17

Consensus pattern (30 bp):
GGAGGGAATGATGCACCAAGGACTTACCAT


Found at i:13001 original size:30 final size:30

Alignment explanation

Indices: 12960--13017  Score: 107
Period size: 30  Copynumber: 1.9  Consensus size: 30

     12950 AGGATCAAAT

     12960 GGCATCCTTGGTGCGATTCCTCCATCCAAC
         1 GGCATCCTTGGTGCGATTCCTCCATCCAAC

                 *
     12990 GGCATCTTTGGTGCGATTCCTCCATCCA
         1 GGCATCCTTGGTGCGATTCCTCCATCCA

     13018 TTGATGTCTT

Statistics
Matches: 27, Mismatches: 1, Indels: 0
        0.96            0.04        0.00

Matches are distributed among these distances:
  30  27  1.00

ACGTcount: A:0.16, C:0.34, G:0.21, T:0.29

Consensus pattern (30 bp):
GGCATCCTTGGTGCGATTCCTCCATCCAAC


Found at i:17710 original size:35 final size:35

Alignment explanation

Indices: 17647--18097  Score: 534
Period size: 35  Copynumber: 12.3  Consensus size: 35

     17637 AGCCTGTGCC

     17647 TCAGTCTTTCTTTACCTAATTTCCTTCCTTGAAATTAAG
         1 TCAGTCTTTCTTTACCTAA--T--TTCCTTGAAATTAAG

     17686 TCAGTCTTTCTTTACCTAATTTCCTTCCTTGAAATTAAG
         1 TCAGTCTTTCTTTACCTAA--T--TTCCTTGAAATTAAG

                          *
     17725 TCAGTCTTTCTTTACTTAATTTCCTTGAAATTAAG
         1 TCAGTCTTTCTTTACCTAATTTCCTTGAAATTAAG

                           *
     17760 TCAGTC-TTCTTTTACTTAATTTCCTTGAAATTAAG
         1 TCAGTCTTTC-TTTACCTAATTTCCTTGAAATTAAG

                 *      *
     17795 TCAGT-ATTCTCTCACCTAATTTCCTTGAAATTAAG
         1 TCAGTCTTTCT-TTACCTAATTTCCTTGAAATTAAG

     17830 TCAGTC-TTCTTTTACCTAATTTCCTTGAAATTAAG
         1 TCAGTCTTTC-TTTACCTAATTTCCTTGAAATTAAG

     17865 TCAGTCTTTCTTTACCTAATTTCCTTGAAATTAAG
         1 TCAGTCTTTCTTTACCTAATTTCCTTGAAATTAAG

                           *
     17900 TCAGTC-TTCTTTTACTTAATTTCCTTGAAATTAAG
         1 TCAGTCTTTC-TTTACCTAATTTCCTTGAAATTAAG

                               * *
     17935 TCAGTC-TTCTTTTACCTAACTCCCTTGAAATTAAG
         1 TCAGTCTTTC-TTTACCTAATTTCCTTGAAATTAAG

                  *                           *
     17970 TCAGTCTATCTTTACCTAATTTCCTTCCTTGAAATAAAG
         1 TCAGTCTTTCTTTACCTAA--T--TTCCTTGAAATTAAG

                                                *
     18009 TCAGTCTTTCTTTACCTAATTTCCTTCTTTGAAATTAGG
         1 TCAGTCTTTCTTTACCTAATTT-C--C-TTGAAATTAAG

               *
     18048 TCAGACTTTCTTTACCTAATTTCCTTCCTTGAAATTAAG
         1 TCAGTCTTTCTTTACCTAA--T--TTCCTTGAAATTAAG

             *
     18087 TCCGTCTTTCT
         1 TCAGTCTTTCT

     18098 AATGTTTTTA

Statistics
Matches: 373, Mismatches: 19, Indels: 40
        0.86            0.04        0.09

Matches are distributed among these distances:
  34  7  0.02
  35  217  0.58
  36  7  0.02
  37  2  0.01
  38  1  0.00
  39  134  0.36
  40  1  0.00
  41  1  0.00
  42  1  0.00
  43  2  0.01

ACGTcount: A:0.25, C:0.22, G:0.08, T:0.45

Consensus pattern (35 bp):
TCAGTCTTTCTTTACCTAATTTCCTTGAAATTAAG


Found at i:18141 original size:34 final size:34

Alignment explanation

Indices: 18103--18377  Score: 261
Period size: 34  Copynumber: 7.7  Consensus size: 34

     18093 TTTCTAATGT

     18103 TTTTACTTAA-TTACTATGAATTAAGCCTTTGTGA
         1 TTTTACTTAATTTACT-TGAATTAAGCCTTTGTGA

                *       *            *         *
     18137 TTTTATTTAATTTCCTTGAATTAAGTACTTTGACTGC
         1 TTTTACTTAATTTACTTGAATTAAG-CCTTTG--TGA

            *
     18174 TGTTACTTAA-TTACTTGGAATTAAGCCTTTGTGA
         1 TTTTACTTAATTTACTT-GAATTAAGCCTTTGTGA

                        *              *          *
     18208 TTTTACTTAATTTCCTTGAATTAAGTACTTTGACTGCTGT
         1 TTTTACTTAATTTACTTGAATTAAG--CCTT---TG-TGA

     18248 TTTTACTTAA-TTACTTTGAATTAAGCCTTTGTGA
         1 TTTTACTTAATTTAC-TTGAATTAAGCCTTTGTGA

                        *            *         *
     18282 TTTTACTTAATTTCCTTGAATTAAGTACTTTGACTGC
         1 TTTTACTTAATTTACTTGAATTAAG-CCTTTG--TGA

            *
     18319 TGTTACTTAA-TTACTTGGAATTAAGCCTTTGTGA
         1 TTTTACTTAATTTACTT-GAATTAAGCCTTTGTGA

                        *
     18353 TTTTACTTAATTTCCTTGAATTAAG
         1 TTTTACTTAATTTACTTGAATTAAG

     18378 TACTTTGACT

Statistics
Matches: 197, Mismatches: 25, Indels: 38
        0.76            0.10        0.15

Matches are distributed among these distances:
  34  78  0.40
  35  29  0.15
  36  23  0.12
  37  37  0.19
  38  3  0.02
  39  5  0.03
  40  22  0.11

ACGTcount: A:0.26, C:0.13, G:0.13, T:0.48

Consensus pattern (34 bp):
TTTTACTTAATTTACTTGAATTAAGCCTTTGTGA


Found at i:18184 original size:71 final size:71

Alignment explanation

Indices: 18105--18393  Score: 508
Period size: 71  Copynumber: 4.0  Consensus size: 71

     18095 TCTAATGTTT

                                                 *
     18105 TTACTTAATTACTAT-GAATTAAGCCTTTGTGATTTTATTTAATTTCCTTGAATTAAGTACTTTG
         1 TTACTTAATTACT-TGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTACTTTG

     18169 ACTGCTG
        65 ACTGCTG

     18176 TTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTACTTTGA
         1 TTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTACTTTGA

     18241 CTGCTG
        66 CTGCTG

                            *
     18247 TTTTTACTTAATTACTTTGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTACTT
         1 ---TTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTACTT

     18312 TGACTGCTG
        63 TGACTGCTG

     18321 TTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTACTTTGA
         1 TTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTACTTTGA

              *
     18386 CTGTTG
        66 CTGCTG

     18392 TT
         1 TT

     18394 TGCTTCTCTT

Statistics
Matches: 210, Mismatches: 4, Indels: 8
        0.95            0.02        0.04

Matches are distributed among these distances:
  70  1  0.00
  71  139  0.66
  74  70  0.33

ACGTcount: A:0.25, C:0.13, G:0.13, T:0.48

Consensus pattern (71 bp):
TTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTACTTTGA
CTGCTG


Found at i:18363 original size:145 final size:145

Alignment explanation

Indices: 18100--18394  Score: 563
Period size: 145  Copynumber: 2.0  Consensus size: 145

     18090 GTCTTTCTAA

                                                     *
     18100 TGTTTTTACTTAATTACTATGAATTAAGCCTTTGTGATTTTATTTAATTTCCTTGAATTAAGTAC
         1 TGTTTTTACTTAATTACTATGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTAC

     18165 TTTGACTGCTGTTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATT
        66 TTTGACTGCTGTTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATT

     18230 AAGTACTTTGACTGC
       131 AAGTACTTTGACTGC

                             *
     18245 TGTTTTTACTTAATTACTTTGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTAC
         1 TGTTTTTACTTAATTACTATGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTAC

     18310 TTTGACTGCTGTTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATT
        66 TTTGACTGCTGTTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATT

                         *
     18375 AAGTACTTTGACTGT
       131 AAGTACTTTGACTGC

     18390 TGTTT
         1 TGTTT

     18395 GCTTCTCTTA

Statistics
Matches: 147, Mismatches: 3, Indels: 0
        0.98            0.02        0.00

Matches are distributed among these distances:
  145  147  1.00

ACGTcount: A:0.25, C:0.13, G:0.13, T:0.49

Consensus pattern (145 bp):
TGTTTTTACTTAATTACTATGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATTAAGTAC
TTTGACTGCTGTTACTTAATTACTTGGAATTAAGCCTTTGTGATTTTACTTAATTTCCTTGAATT
AAGTACTTTGACTGC


Found at i:18460 original size:41 final size:41

Alignment explanation

Indices: 18353--18553  Score: 141
Period size: 41  Copynumber: 4.7  Consensus size: 41

     18343 GCCTTTGTGA

                       *  *             * *       *  *
     18353 TTTTACTTAATTTCCTTGAATTAAGTACTTTGACTGTTGTTTGC
         1 TTTT-CTTAATTACCCTGAATTAAG-ACTCTAACTG-TGCTTAC

             *                                     *
     18397 TTCTCTTAATTACCCTGAATTAAGACTCTAACTGTGCTTAT
         1 TTTTCTTAATTACCCTGAATTAAGACTCTAACTGTGCTTAC

                   *   *    *         * *      *
     18438 TTTTCTTACTTATCCTGGATTAAGACTTTGACTGTGTTTAC
         1 TTTTCTTAATTACCCTGAATTAAGACTCTAACTGTGCTTAC

                    *    *                  *          **
     18479 TTTTCTTAAGTACCATGAATTAAGATTTTGACTTTAACTGTGCTGGC
         1 TTTTCTTAATTACCCTGAATT-A-A----GACTCTAACTGTGCTTAC

             *
     18526 TTATCTTAATTACCCTGAATTAAGACTC
         1 TTTTCTTAATTACCCTGAATTAAGACTC

     18554 CGACTGTGTT

Statistics
Matches: 122, Mismatches: 29, Indels: 15
        0.73            0.17        0.09

Matches are distributed among these distances:
  41  57  0.47
  42  9  0.07
  43  19  0.16
  44  3  0.02
  45  1  0.01
  46  1  0.01
  47  32  0.26

ACGTcount: A:0.24, C:0.17, G:0.13, T:0.45

Consensus pattern (41 bp):
TTTTCTTAATTACCCTGAATTAAGACTCTAACTGTGCTTAC


Found at i:21711 original size:23 final size:23

Alignment explanation

Indices: 21685--21728  Score: 79
Period size: 23  Copynumber: 1.9  Consensus size: 23

     21675 TTTGCATATT

     21685 TGCATTTAGTAACTTGGTATTAC
         1 TGCATTTAGTAACTTGGTATTAC

                       *
     21708 TGCATTTAGTAATTTGGTATT
         1 TGCATTTAGTAACTTGGTATT

     21729 GTTGCATCTC

Statistics
Matches: 20, Mismatches: 1, Indels: 0
        0.95            0.05        0.00

Matches are distributed among these distances:
  23  20  1.00

ACGTcount: A:0.25, C:0.09, G:0.18, T:0.48

Consensus pattern (23 bp):
TGCATTTAGTAACTTGGTATTAC


Found at i:21734 original size:23 final size:23

Alignment explanation

Indices: 21684--21735  Score: 77
Period size: 23  Copynumber: 2.3  Consensus size: 23

     21674 TTTTGCATAT

     21684 TTGCATTTAGTAACTTGGTATTA
         1 TTGCATTTAGTAACTTGGTATTA

           *            *        *
     21707 CTGCATTTAGTAATTTGGTATTG
         1 TTGCATTTAGTAACTTGGTATTA

     21730 TTGCAT
         1 TTGCAT

     21736 CTCCAATTAG

Statistics
Matches: 25, Mismatches: 4, Indels: 0
        0.86            0.14        0.00

Matches are distributed among these distances:
  23  25  1.00

ACGTcount: A:0.23, C:0.10, G:0.19, T:0.48

Consensus pattern (23 bp):
TTGCATTTAGTAACTTGGTATTA


Found at i:25669 original size:36 final size:34

Alignment explanation

Indices: 25614--25680  Score: 91
Period size: 34  Copynumber: 1.9  Consensus size: 34

     25604 GATCAGTCAG

                         *
     25614 AAAAGTGAAAAAGGTAATCAGAGTAATTAAGTTCCA
         1 AAAAGTGAAAAAGGCAATC--AGTAATTAAGTTCCA

     25650 AAAAGT-AAAAAGGGCAATCAGTAATTAAGTT
         1 AAAAGTGAAAAA-GGCAATCAGTAATTAAGTT

     25681 TAGTAAGGAA

Statistics
Matches: 29, Mismatches: 1, Indels: 4
        0.85            0.03        0.12

Matches are distributed among these distances:
  34  12  0.41
  35  5  0.17
  36  12  0.41

ACGTcount: A:0.51, C:0.07, G:0.19, T:0.22

Consensus pattern (34 bp):
AAAAGTGAAAAAGGCAATCAGTAATTAAGTTCCA


Found at i:25699 original size:30 final size:30

Alignment explanation

Indices: 25635--25733  Score: 90
Period size: 30  Copynumber: 3.2  Consensus size: 30

     25625 AGGTAATCAG

                *          *
     25635 AGTAATTAAGTTCCAAAAAGTAAAAAGGGCAATC
         1 AGTAACTAAGTT-CAATAAG-AAAAA--GCAATC

                *      * *    *     *
     25669 AGTAATTAAGTTTAGTAAGGAAAAGTAATC
         1 AGTAACTAAGTTCAATAAGAAAAAGCAATC

              *
     25699 AGTGACTAAGTTCAATAAGAAAAAGCAATC
         1 AGTAACTAAGTTCAATAAGAAAAAGCAATC

     25729 AGTAA
         1 AGTAA

     25734 ATAGTAAAGT

Statistics
Matches: 53, Mismatches: 12, Indels: 4
        0.77            0.17        0.06

Matches are distributed among these distances:
  30  33  0.62
  32  4  0.08
  33  4  0.08
  34  12  0.23

ACGTcount: A:0.49, C:0.09, G:0.18, T:0.23

Consensus pattern (30 bp):
AGTAACTAAGTTCAATAAGAAAAAGCAATC


Found at i:25792 original size:22 final size:22

Alignment explanation

Indices: 25753--25813  Score: 58
Period size: 20  Copynumber: 2.9  Consensus size: 22

     25743 TGATAATCAG

                 *
     25753 AGTAATTAG-AAAAGAGTAAAAT
         1 AGTAATCAGTAAAA-AGTAAAAT

     25775 AGTAATCAGTAAAAAGT--AAT
         1 AGTAATCAGTAAAAAGTAAAAT

               *
     25795 -GATGATCAGTAAAAAGTAA
         1 AG-TAATCAGTAAAAAGTAA

     25814 TCAGTAAAGA

Statistics
Matches: 33, Mismatches: 2, Indels: 8
        0.77            0.05        0.19

Matches are distributed among these distances:
  19  1  0.03
  20  17  0.52
  22  11  0.33
  23  4  0.12

ACGTcount: A:0.56, C:0.03, G:0.18, T:0.23

Consensus pattern (22 bp):
AGTAATCAGTAAAAAGTAAAAT


Found at i:25804 original size:20 final size:20

Alignment explanation

Indices: 25776--26038  Score: 225
Period size: 20  Copynumber: 12.9  Consensus size: 20

     25766 GAGTAAAATA

     25776 GTAATCAGTAAAAAGTAATG
         1 GTAATCAGTAAAAAGTAATG

           * *                 *
     25796 ATGATCAGTAAAAAGTAATCA
         1 GTAATCAGTAAAAAGTAAT-G

     25817 GTAA--AG--AAAA--AATG
         1 GTAATCAGTAAAAAGTAATG

                       *      *
     25831 GTAATCAGTAAAGAGTAATA
         1 GTAATCAGTAAAAAGTAATG

                       *      *
     25851 GTAATCAGTAAAGAGTAATA
         1 GTAATCAGTAAAAAGTAATG

                      *
     25871 GTAATCAGTAAGAAGTAATG
         1 GTAATCAGTAAAAAGTAATG

                               *
     25891 GTAATCAGTAAAAAGTAAAAGG
         1 GTAATCAGTAAAAAGT--AATG

                  *
     25913 GTAATCAATAAAAAGTAAAGTG
         1 GTAATCAGTAAAAAGT-AA-TG

                             *
     25935 GTAATCAGTAAAAAGTAAAG
         1 GTAATCAGTAAAAAGTAATG

                    *
     25955 GTAATCAGTGAAAAGTAAAATG
         1 GTAATCAGTAAAAAGT--AATG

                *           *  *
     25977 GTAATTAG-AAAAGAGAAAAAG
         1 GTAATCAGTAAAA-AG-TAATG

     25998 AGTAATCAGTAAAAAGTAAAATG
         1 -GTAATCAGTAAAAAGT--AATG

                *
     26021 GTAATTAGTAAAAAGTAA
         1 GTAATCAGTAAAAAGTAA

     26039 AAGAAAAAAT

Statistics
Matches: 198, Mismatches: 27, Indels: 36
        0.76            0.10        0.14

Matches are distributed among these distances:
  14  4  0.02
  15  3  0.02
  16  2  0.01
  17  4  0.02
  18  3  0.02
  19  2  0.01
  20  90  0.45
  21  13  0.07
  22  70  0.35
  23  7  0.04

ACGTcount: A:0.54, C:0.04, G:0.20, T:0.22

Consensus pattern (20 bp):
GTAATCAGTAAAAAGTAATG


Found at i:25838 original size:35 final size:34

Alignment explanation

Indices: 25767--25854  Score: 106
Period size: 35  Copynumber: 2.6  Consensus size: 34

     25757 ATTAGAAAAG

                                  **      *
     25767 AGTAAAATAGTAATCAGTAAAAAGTAATGATGATC
         1 AGTAAAA-AGTAATCAGTAAAAAAAAATGATAATC

                                        *
     25802 AGTAAAAAGTAATCAGTAAAGAAAAAATGGTAATC
         1 AGTAAAAAGTAATCAGTAAA-AAAAAATGATAATC

                 *
     25837 AGTAAAGAGTAAT-AGTAA
         1 AGTAAAAAGTAATCAGTAA

     25855 TCAGTAAAGA

Statistics
Matches: 47, Mismatches: 5, Indels: 3
        0.85            0.09        0.05

Matches are distributed among these distances:
  34  18  0.38
  35  29  0.62

ACGTcount: A:0.55, C:0.05, G:0.18, T:0.23

Consensus pattern (34 bp):
AGTAAAAAGTAATCAGTAAAAAAAAATGATAATC


Found at i:25860 original size:13 final size:13

Alignment explanation

Indices: 25844--25901  Score: 55
Period size: 13  Copynumber: 4.4  Consensus size: 13

     25834 ATCAGTAAAG

     25844 AGTAATAGTAATC
         1 AGTAATAGTAATC

                 *
     25857 AGTAAAGAGTAAT-
         1 AGT-AATAGTAATC

                       **
     25870 AGTAATCAGTAAGA
         1 AGTAAT-AGTAATC

                 *
     25884 AGTAATGGTAATC
         1 AGTAATAGTAATC

     25897 AGTAA
         1 AGTAA

     25902 AAAGTAAAAG

Statistics
Matches: 36, Mismatches: 6, Indels: 6
        0.75            0.12        0.12

Matches are distributed among these distances:
  12  2  0.06
  13  20  0.56
  14  14  0.39

ACGTcount: A:0.48, C:0.05, G:0.21, T:0.26

Consensus pattern (13 bp):
AGTAATAGTAATC


Found at i:25880 original size:7 final size:7

Alignment explanation

Indices: 25831--25901  Score: 58
Period size: 7  Copynumber: 10.6  Consensus size: 7

     25821 AGAAAAAATG

     25831 GTAATCA
         1 GTAATCA

               **
     25838 GTAAAGA
         1 GTAATCA

     25845 GTAAT-A
         1 GTAATCA

     25851 GTAATCA
         1 GTAATCA

               **
     25858 GTAAAGA
         1 GTAATCA

     25865 GTAAT-A
         1 GTAATCA

     25871 GTAATCA
         1 GTAATCA

               **
     25878 GTAAGAA
         1 GTAATCA

                 *
     25885 GTAAT-G
         1 GTAATCA

     25891 GTAATCA
         1 GTAATCA

     25898 GTAA
         1 GTAA

     25902 AAAGTAAAAG

Statistics
Matches: 50, Mismatches: 11, Indels: 6
        0.75            0.16        0.09

Matches are distributed among these distances:
  6  17  0.34
  7  33  0.66

ACGTcount: A:0.48, C:0.06, G:0.21, T:0.25

Consensus pattern (7 bp):
GTAATCA


Found at i:25918 original size:22 final size:21

Alignment explanation

Indices: 25802--26041  Score: 242
Period size: 22  Copynumber: 11.4  Consensus size: 21

     25792 AATGATGATC

                  *
     25802 AGTAAAAAGTAATCAGTAAAGA
         1 AGTAAAAGGTAATCAGTAAA-A

                                *
     25824 A--AAAATGGTAATCAGTAAAG
         1 AGTAAAA-GGTAATCAGTAAAA

                *              *
     25844 AGTAATA-GTAATCAGTAAAG
         1 AGTAAAAGGTAATCAGTAAAA

                *             *
     25864 AGTAATA-GTAATCAGTAAGA
         1 AGTAAAAGGTAATCAGTAAAA

                 *
     25884 AGT-AATGGTAATCAGTAAAA
         1 AGTAAAAGGTAATCAGTAAAA

                           *
     25904 AGTAAAAGGGTAATCAATAAAA
         1 AGTAAAA-GGTAATCAGTAAAA

                  *
     25926 AGTAAAGTGGTAATCAGTAAAA
         1 AGTAAA-AGGTAATCAGTAAAA

                            *
     25948 AGT-AAAGGTAATCAGTGAAA
         1 AGTAAAAGGTAATCAGTAAAA

                         *
     25968 AGTAAAATGGTAATTAG-AAAA
         1 AGTAAAA-GGTAATCAGTAAAA

              *
     25989 GAGAAAAAGAGTAATCAGTAAAA
         1 -AGTAAAAG-GTAATCAGTAAAA

                         *
     26012 AGTAAAATGGTAATTAGTAAAA
         1 AGTAAAA-GGTAATCAGTAAAA

     26034 AGTAAAAG
         1 AGTAAAAG

     26042 AAAAAATATT

Statistics
Matches: 185, Mismatches: 20, Indels: 27
        0.80            0.09        0.12

Matches are distributed among these distances:
  19  1  0.01
  20  70  0.38
  21  24  0.13
  22  85  0.46
  23  5  0.03

ACGTcount: A:0.55, C:0.04, G:0.20, T:0.21

Consensus pattern (21 bp):
AGTAAAAGGTAATCAGTAAAA


Found at i:26015 original size:7 final size:7

Alignment explanation

Indices: 26005--26095  Score: 51
Period size: 7  Copynumber: 12.7  Consensus size: 7

     25995 AAGAGTAATC

     26005 AGTAAAA
         1 AGTAAAA

     26012 AGTAAAA
         1 AGTAAAA

            *    **
     26019 TGGTAATT
         1 -AGTAAAA

     26027 AGTAAAA
         1 AGTAAAA

     26034 AGT-AAA
         1 AGTAAAA

     26040 AG-AAAA
         1 AGTAAAA

            *
     26046 AATATTAAA
         1 AGTA--AAA

                 **
     26055 GAGTAATC
         1 -AGTAAAA

                 *
     26063 AGTAAAG
         1 AGTAAAA

                 *
     26070 AGTAAAT
         1 AGTAAAA

                 *
     26077 AGTAAAG
         1 AGTAAAA

     26084 AGTAAAA
         1 AGTAAAA

     26091 AGTAA
         1 AGTAA

     26096 TCAGTAAAGA

Statistics
Matches: 63, Mismatches: 15, Indels: 12
        0.70            0.17        0.13

Matches are distributed among these distances:
  6  9  0.14
  7  43  0.68
  8  5  0.08
  9  3  0.05
  10  3  0.05

ACGTcount: A:0.60, C:0.01, G:0.18, T:0.21

Consensus pattern (7 bp):
AGTAAAA


Found at i:26030 original size:29 final size:29

Alignment explanation

Indices: 25994--26088  Score: 95
Period size: 29  Copynumber: 3.3  Consensus size: 29

     25984 GAAAAGAGAA

                                     *
     25994 AAAGAGTAATCAGTAAAAAGTAAAATGGT
         1 AAAGAGTAATCAGTAAAAAGTAAAATAGT

             **     **          *      *
     26023 AATTAGTAAAAAGT-AAAAGAAAAAATATT
         1 AAAGAGTAATCAGTAAAAAG-TAAAATAGT

                            *
     26052 AAAGAGTAATCAGTAAAGAGT-AAATAGT
         1 AAAGAGTAATCAGTAAAAAGTAAAATAGT

     26080 AAAGAGTAA
         1 AAAGAGTAA

     26089 AAAGTAATCA

Statistics
Matches: 50, Mismatches: 14, Indels: 5
        0.72            0.20        0.07

Matches are distributed among these distances:
  28  20  0.40
  29  26  0.52
  30  4  0.08

ACGTcount: A:0.59, C:0.02, G:0.18, T:0.21

Consensus pattern (29 bp):
AAAGAGTAATCAGTAAAAAGTAAAATAGT


Found at i:26089 original size:14 final size:14

Alignment explanation

Indices: 25994--26095  Score: 73
Period size: 14  Copynumber: 7.1  Consensus size: 14

     25984 GAAAAGAGAA

     25994 AAAGAGT-AATCAGT
         1 AAAGAGTAAAT-AGT

              *        *
     26008 AAAAAGTAAAATGGT
         1 AAAGAGT-AAATAGT

             **      *
     26023 AATTAGTAAAAAGT
         1 AAAGAGTAAATAGT

                 **     *
     26037 AAAAGAAAAAATATT
         1 -AAAGAGTAAATAGT

     26052 AAAGAGT-AATCAGT
         1 AAAGAGTAAAT-AGT

     26066 AAAGAGTAAATAGT
         1 AAAGAGTAAATAGT

                     *
     26080 AAAGAGTAAAAAGT
         1 AAAGAGTAAATAGT

     26094 AA
         1 AA

     26096 TCAGTAAAGA

Statistics
Matches: 67, Mismatches: 16, Indels: 10
        0.72            0.17        0.11

Matches are distributed among these distances:
  13  3  0.04
  14  43  0.64
  15  18  0.27
  16  3  0.04

ACGTcount: A:0.60, C:0.02, G:0.18, T:0.21

Consensus pattern (14 bp):
AAAGAGTAAATAGT


Found at i:26101 original size:21 final size:21

Alignment explanation

Indices: 26063--26105  Score: 61
Period size: 21  Copynumber: 2.0  Consensus size: 21

     26053 AAGAGTAATC

                 *
     26063 AGTAAAGAGTAAATAGTAAAG
         1 AGTAAAAAGTAAATAGTAAAG

     26084 AGTAAAAAGT-AATCAGTAAAG
         1 AGTAAAAAGTAAAT-AGTAAAG

     26105 A
         1 A

     26106 AAAAATAGTA

Statistics
Matches: 20, Mismatches: 1, Indels: 2
        0.87            0.04        0.09

Matches are distributed among these distances:
  20  3  0.15
  21  17  0.85

ACGTcount: A:0.58, C:0.02, G:0.21, T:0.19

Consensus pattern (21 bp):
AGTAAAAAGTAAATAGTAAAG


Found at i:26101 original size:28 final size:28

Alignment explanation

Indices: 25994--26103  Score: 78
Period size: 29  Copynumber: 3.9  Consensus size: 28

     25984 GAAAAGAGAA

                    **      *        *
     25994 AAAGAGTAATCAGTAAAAAGTAAAATGGT
         1 AAAGAGTAAAAAGTAAACAGT-AAATAGT

             **                  *     *
     26023 AATTAGTAAAAAGTAAA-AGAAAAAATATT
         1 AAAGAGTAAAAAGTAAACAG--TAAATAGT

                    **      *
     26052 AAAGAGTAATCAGTAAAGAGTAAATAGT
         1 AAAGAGTAAAAAGTAAACAGTAAATAGT

                           *
     26080 AAAGAGTAAAAAGTAATCAGTAAA
         1 AAAGAGTAAAAAGTAAACAGTAAA

     26104 GAAAAAATAG

Statistics
Matches: 61, Mismatches: 17, Indels: 7
        0.72            0.20        0.08

Matches are distributed among these distances:
  28  28  0.46
  29  31  0.51
  30  2  0.03

ACGTcount: A:0.59, C:0.03, G:0.17, T:0.21

Consensus pattern (28 bp):
AAAGAGTAAAAAGTAAACAGTAAATAGT


Found at i:26102 original size:35 final size:34

Alignment explanation

Indices: 26051--26205  Score: 126
Period size: 35  Copynumber: 4.6  Consensus size: 34

     26041 GAAAAAATAT

               *                *
     26051 TAAAGAGTAATCAGTAAAGAGTAAATAGTAAAGAG
         1 TAAAAAGTAATCAGTAAAGA-AAAATAGTAAAGAG

     26086 TAAAAAGTAATCAGTAAAGAAAAAATAGT-AAGAAG
         1 TAAAAAGTAATCAGTAAAG-AAAAATAGTAAAG-AG

            * *   *                 *
     26121 TGAGAAGAAATCAGT-----AAAATGGTAAACGA-
         1 TAAAAAGTAATCAGTAAAGAAAAATAGTAAA-GAG

               *                    *
     26150 TAAAGAGTAATCAGTAAAGAAAAATGGTAAAGAG
         1 TAAAAAGTAATCAGTAAAGAAAAATAGTAAAGAG

                * *
     26184 TAGAATATTAATCAGTAAAGAA
         1 TA-AAAAGTAATCAGTAAAGAA

     26206 GTAATGGCAA

Statistics
Matches: 97, Mismatches: 12, Indels: 22
        0.74            0.09        0.17

Matches are distributed among these distances:
  29  18  0.19
  30  3  0.03
  31  1  0.01
  33  2  0.02
  34  16  0.16
  35  56  0.58
  36  1  0.01

ACGTcount: A:0.56, C:0.04, G:0.21, T:0.19

Consensus pattern (34 bp):
TAAAAAGTAATCAGTAAAGAAAAATAGTAAAGAG


Found at i:26115 original size:21 final size:21

Alignment explanation

Indices: 26077--26116  Score: 55
Period size: 21  Copynumber: 1.9  Consensus size: 21

     26067 AAGAGTAAAT

                    *
     26077 AGTAAAGAGTAAAAAGTAATC
         1 AGTAAAGAGAAAAAAGTAATC

     26098 AGTAAAGA-AAAAATAGTAA
         1 AGTAAAGAGAAAAA-AGTAA

     26117 GAAGTGAGAA

Statistics
Matches: 17, Mismatches: 1, Indels: 2
        0.85            0.05        0.10

Matches are distributed among these distances:
  20  4  0.24
  21  13  0.76

ACGTcount: A:0.62, C:0.03, G:0.17, T:0.17

Consensus pattern (21 bp):
AGTAAAGAGAAAAAAGTAATC


Found at i:29959 original size:30 final size:30

Alignment explanation

Indices: 29923--29983  Score: 104
Period size: 30  Copynumber: 2.0  Consensus size: 30

     29913 CAAAGGATCA

     29923 AATGGCATCCTTGGTGCGATTCCTCCATCC
         1 AATGGCATCCTTGGTGCGATTCCTCCATCC

                    *           *
     29953 AATGGCATCTTTGGTGCGATTGCTCCATCC
         1 AATGGCATCCTTGGTGCGATTCCTCCATCC

     29983 A
         1 A

     29984 TTGATGTCTT

Statistics
Matches: 29, Mismatches: 2, Indels: 0
        0.94            0.06        0.00

Matches are distributed among these distances:
  30  29  1.00

ACGTcount: A:0.18, C:0.30, G:0.21, T:0.31

Consensus pattern (30 bp):
AATGGCATCCTTGGTGCGATTCCTCCATCC


Found at i:29994 original size:30 final size:30

Alignment explanation

Indices: 29923--30002  Score: 88
Period size: 30  Copynumber: 2.7  Consensus size: 30

     29913 CAAAGGATCA

               *    *
     29923 AATGGCATCCTTGGTGCGATTCCTCCATCC
         1 AATGACATCTTTGGTGCGATTCCTCCATCC

               *                *
     29953 AATGGCATCTTTGGTGCGATTGCTCCATCC
         1 AATGACATCTTTGGTGCGATTCCTCCATCC

            *   **     *
     29983 ATTGATGTCTTTTGTGCGAT
         1 AATGACATCTTTGGTGCGAT

     30003 CACATCTCCT

Statistics
Matches: 43, Mismatches: 7, Indels: 0
        0.86            0.14        0.00

Matches are distributed among these distances:
  30  43  1.00

ACGTcount: A:0.16, C:0.25, G:0.23, T:0.36

Consensus pattern (30 bp):
AATGACATCTTTGGTGCGATTCCTCCATCC


Found at i:31509 original size:16 final size:16

Alignment explanation

Indices: 31488--31521  Score: 68
Period size: 16  Copynumber: 2.1  Consensus size: 16

     31478 TCAATGAATT

     31488 CAAACTTTTTCATATA
         1 CAAACTTTTTCATATA

     31504 CAAACTTTTTCATATA
         1 CAAACTTTTTCATATA

     31520 CA
         1 CA

     31522 TGGGTAAAAC

Statistics
Matches: 18, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  16  18  1.00

ACGTcount: A:0.38, C:0.21, G:0.00, T:0.41

Consensus pattern (16 bp):
CAAACTTTTTCATATA


Found at i:35852 original size:9 final size:9

Alignment explanation

Indices: 35838--35862  Score: 50
Period size: 9  Copynumber: 2.8  Consensus size: 9

     35828 TGATGAGGGT

     35838 ACTTGGGGC
         1 ACTTGGGGC

     35847 ACTTGGGGC
         1 ACTTGGGGC

     35856 ACTTGGG
         1 ACTTGGG

     35863 CTTTGATGAC

Statistics
Matches: 16, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  9  16  1.00

ACGTcount: A:0.12, C:0.20, G:0.44, T:0.24

Consensus pattern (9 bp):
ACTTGGGGC


Found at i:37936 original size:15 final size:15

Alignment explanation

Indices: 37874--37929  Score: 55
Period size: 15  Copynumber: 3.9  Consensus size: 15

     37864 GATTACCATT

     37874 TTACTCTTTTACTGA
         1 TTACTCTTTTACTGA

                *
     37889 TTACTATTTT-CT--
         1 TTACTCTTTTACTGA

              *  *
     37901 TCTCCTTTTTTACTGA
         1 T-TACTCTTTTACTGA

     37917 TTACTCTTTTACT
         1 TTACTCTTTTACT

     37930 TCTTACTGAT

Statistics
Matches: 32, Mismatches: 5, Indels: 8
        0.71            0.11        0.18

Matches are distributed among these distances:
  12  1  0.03
  13  7  0.22
  14  4  0.12
  15  19  0.59
  16  1  0.03

ACGTcount: A:0.16, C:0.21, G:0.04, T:0.59

Consensus pattern (15 bp):
TTACTCTTTTACTGA


Found at i:37956 original size:21 final size:21

Alignment explanation

Indices: 37910--37994  Score: 73
Period size: 21  Copynumber: 4.0  Consensus size: 21

     37900 TTCTCCTTTT

                       *   *
     37910 TTACTGATTACTCTTTTACTTC
         1 TTACTGATTACTATTTGAC-TC

     37932 TTACTGATTACTATTTGACTC
         1 TTACTGATTACTATTTGACTC

                *     *
     37953 TTACTAATTACCACTTTG-CTC
         1 TTACTGATTACTA-TTTGACTC

            *    *     *   *
     37974 TCACTGGTTACTGTTTTACTC
         1 TTACTGATTACTATTTGACTC

     37995 CTAATGACTA

Statistics
Matches: 51, Mismatches: 10, Indels: 5
        0.77            0.15        0.08

Matches are distributed among these distances:
  20  3  0.06
  21  27  0.53
  22  21  0.41

ACGTcount: A:0.20, C:0.24, G:0.08, T:0.48

Consensus pattern (21 bp):
TTACTGATTACTATTTGACTC


Found at i:38026 original size:35 final size:35

Alignment explanation

Indices: 37987--38064  Score: 93
Period size: 35  Copynumber: 2.2  Consensus size: 35

     37977 CTGGTTACTG

     37987 TTTTACTCCTAATGACTACCTTCTGCTGATCACTA
         1 TTTTACTCCTAATGACTACCTTCTGCTGATCACTA

                   *     ** **   *       *
     38022 TTTTACTCTTAATGGTTGTCTTTTGCTGATTACTA
         1 TTTTACTCCTAATGACTACCTTCTGCTGATCACTA

     38057 TTTTACTC
         1 TTTTACTC

     38065 TTTGCTGATT

Statistics
Matches: 36, Mismatches: 7, Indels: 0
        0.84            0.16        0.00

Matches are distributed among these distances:
  35  36  1.00

ACGTcount: A:0.19, C:0.22, G:0.10, T:0.49

Consensus pattern (35 bp):
TTTTACTCCTAATGACTACCTTCTGCTGATCACTA


Found at i:38069 original size:22 final size:22

Alignment explanation

Indices: 38043--38110  Score: 104
Period size: 22  Copynumber: 3.1  Consensus size: 22

     38033 ATGGTTGTCT

     38043 TTTGCTGATTACTATTTTACTC
         1 TTTGCTGATTACTATTTTACTC

     38065 TTTGCTGATTACTATTTTACTC
         1 TTTGCTGATTACTATTTTACTC

              *
     38087 TTTACTGATTA-T-TCTTTACTC
         1 TTTGCTGATTACTAT-TTTACTC

     38108 TTT
         1 TTT

     38111 ACCATTTTTC

Statistics
Matches: 44, Mismatches: 1, Indels: 3
        0.92            0.02        0.06

Matches are distributed among these distances:
  20  1  0.02
  21  11  0.25
  22  32  0.73

ACGTcount: A:0.18, C:0.18, G:0.07, T:0.57

Consensus pattern (22 bp):
TTTGCTGATTACTATTTTACTC


Found at i:38125 original size:21 final size:22

Alignment explanation

Indices: 38057--38127  Score: 74
Period size: 22  Copynumber: 3.3  Consensus size: 22

     38047 CTGATTACTA

                      *  *    * *
     38057 TTTTACTCTTTGCTGATTACTA
         1 TTTTACTCTTTACTCATTATTC

                         *
     38079 TTTTACTCTTTACTGATTATTC
         1 TTTTACTCTTTACTCATTATTC

                             *
     38101 -TTTACTCTTTAC-CATTTTTC
         1 TTTTACTCTTTACTCATTATTC

     38121 TTTTACT
         1 TTTTACT

     38128 AATTACTCTC

Statistics
Matches: 43, Mismatches: 5, Indels: 3
        0.84            0.10        0.06

Matches are distributed among these distances:
  20  6  0.14
  21  18  0.42
  22  19  0.44

ACGTcount: A:0.17, C:0.20, G:0.04, T:0.59

Consensus pattern (22 bp):
TTTTACTCTTTACTCATTATTC


Found at i:38301 original size:31 final size:31

Alignment explanation

Indices: 38245--38310  Score: 89
Period size: 31  Copynumber: 2.1  Consensus size: 31

     38235 AATTACTGAT

                                      *
     38245 TTACTGATTACTATTTTCACCTTGACTTTTAA
         1 TTACTGATTAC-ATTTTCACCTTGACTCTTAA

                            *
     38277 TTACTGATTA-ATTTCTTACCTTGACTCTTAA
         1 TTACTGATTACATTT-TCACCTTGACTCTTAA

     38308 TTA
         1 TTA

     38311 TCAATTTACT

Statistics
Matches: 31, Mismatches: 2, Indels: 3
        0.86            0.06        0.08

Matches are distributed among these distances:
  30  4  0.13
  31  17  0.55
  32  10  0.32

ACGTcount: A:0.26, C:0.18, G:0.06, T:0.50

Consensus pattern (31 bp):
TTACTGATTACATTTTCACCTTGACTCTTAA


Found at i:38443 original size:47 final size:47

Alignment explanation

Indices: 38342--38549  Score: 328
Period size: 47  Copynumber: 4.4  Consensus size: 47

     38332 TTTTACTTGA

                              *  *                     *
     38342 TTACTGATTTACTGATTACTATTACCTTGACTTTTGATTAATCTTTTT
         1 TTACTGATTTACTGATTACCATCACCTTGAC-TTTGATTAATCTCTTT

                                    *
     38390 TTACTGATTTACTGATTACCATCACTTTGACTTTGATTAATCTCTTT
         1 TTACTGATTTACTGATTACCATCACCTTGACTTTGATTAATCTCTTT

                                    *
     38437 TTACTGATTTACTGATTACCATCACTTTGACTTTGATTAATCTCTTT
         1 TTACTGATTTACTGATTACCATCACCTTGACTTTGATTAATCTCTTT

                                           *  *    *
     38484 TTACTGATTTACTGATTACCATCACCTTGACTCTGTTTAAGCTCTTT
         1 TTACTGATTTACTGATTACCATCACCTTGACTTTGATTAATCTCTTT

     38531 TTACTGA-TTACTGATTACC
         1 TTACTGATTTACTGATTACC

     38550 CCTTTTTACT

Statistics
Matches: 152, Mismatches: 8, Indels: 2
        0.94            0.05        0.01

Matches are distributed among these distances:
  46  12  0.08
  47  112  0.74
  48  28  0.18

ACGTcount: A:0.23, C:0.19, G:0.09, T:0.49

Consensus pattern (47 bp):
TTACTGATTTACTGATTACCATCACCTTGACTTTGATTAATCTCTTT


Found at i:38453 original size:26 final size:26

Alignment explanation

Indices: 38423--38501  Score: 73
Period size: 26  Copynumber: 3.2  Consensus size: 26

     38413 ACTTTGACTT

     38423 TGATTAATCTCTTTTTACTGATTTAC
         1 TGATTAATCTCTTTTTACTGATTTAC

                      *           *
     38449 TGATTACCATCAC---TT--TGACTT--
         1 TGATTA--ATCTCTTTTTACTGATTTAC

     38470 TGATTAATCTCTTTTTACTGATTTAC
         1 TGATTAATCTCTTTTTACTGATTTAC

     38496 TGATTA
         1 TGATTA

     38502 CCATCACCTT

Statistics
Matches: 40, Mismatches: 4, Indels: 18
        0.65            0.06        0.29

Matches are distributed among these distances:
  19  4  0.10
  21  6  0.15
  22  2  0.05
  23  5  0.12
  24  5  0.12
  25  2  0.05
  26  12  0.30
  28  4  0.10

ACGTcount: A:0.24, C:0.16, G:0.09, T:0.51

Consensus pattern (26 bp):
TGATTAATCTCTTTTTACTGATTTAC


Found at i:39259 original size:50 final size:51

Alignment explanation

Indices: 39202--39456  Score: 218
Period size: 50  Copynumber: 5.1  Consensus size: 51

     39192 AAGGTAACAT

                    *                         *       *      *
     39202 TTTATTTACTAATTACT-TAAA-AGTTCAATCTTTCATTCAAAGGTTAAAGC
         1 TTTATTTACCAATTACTCTAAAGA-TTCAATCTTTTATTCAAAAGTTAAATC

                        *                      **    *   * *  *
     39252 TTTATTTACCAATCACTCTAAAGATTCAATCTTTTACCCGAACA-TGACATT
         1 TTTATTTACCAATTACTCTAAAGATTCAATCTTTTATTC-AAAAGTTAAATC

               *                 *
     39303 TTTACTTACCAATTACT-TAAAAATTCAATCTTTTATTCAAAAGTTAAATC
         1 TTTATTTACCAATTACTCTAAAGATTCAATCTTTTATTCAAAAGTTAAATC

                    *                          **        ***  *
     39353 TTTATTTACTAATTACTCTAAAGATTCAATCTTTT-CCCAAACA-TGCCATT
         1 TTTATTTACCAATTACTCTAAAGATTCAATCTTTTATTCAAA-AGTTAAATC

              *                   *                   *
     39403 TTTGTTTACCAATTTAC-CTAAAAATTCAATCTTTTATTCAAAGGTTAAATC
         1 TTTATTTACCAA-TTACTCTAAAGATTCAATCTTTTATTCAAAAGTTAAATC

     39454 TTT
         1 TTT

     39457 TAGCAAAAGG

Statistics
Matches: 157, Mismatches: 39, Indels: 17
        0.74            0.18        0.08

Matches are distributed among these distances:
  49  3  0.02
  50  86  0.55
  51  65  0.41
  52  3  0.02

ACGTcount: A:0.35, C:0.18, G:0.05, T:0.42

Consensus pattern (51 bp):
TTTATTTACCAATTACTCTAAAGATTCAATCTTTTATTCAAAAGTTAAATC


Found at i:39306 original size:101 final size:100

Alignment explanation

Indices: 39201--39456  Score: 386
Period size: 101  Copynumber: 2.5  Consensus size: 100

     39191 AAAGGTAACA

                     *            *          *              *
     39201 TTTTATTTACTAATTACTTAAAAGTTCAATCTTTCATTCAAAGGTTAAAGCTTTATTTACCAATC
         1 TTTTATTTACCAATTACTTAAAAATTCAATCTTTTATTCAAAGGTTAAATCTTTATTTACCAATC

                                    *
     39266 ACTCTAAAGATTCAATCTTTTACCCGAACATGACAT
        66 ACTCTAAAGATTCAATCTTTT-CCCAAACATGACAT

                *                                    *                 *   *
     39302 TTTTACTTACCAATTACTTAAAAATTCAATCTTTTATTCAAAAGTTAAATCTTTATTTACTAATT
         1 TTTTATTTACCAATTACTTAAAAATTCAATCTTTTATTCAAAGGTTAAATCTTTATTTACCAATC

                                          *
     39367 ACTCTAAAGATTCAATCTTTTCCCAAACATGCCAT
        66 ACTCTAAAGATTCAATCTTTTCCCAAACATGACAT

               *             *
     39402 TTTTGTTTACCAATTTACCTAAAAATTCAATCTTTTATTCAAAGGTTAAATCTTT
         1 TTTTATTTACCAA-TTACTTAAAAATTCAATCTTTTATTCAAAGGTTAAATCTTT

     39457 TAGCAAAAGG

Statistics
Matches: 140, Mismatches: 14, Indels: 2
        0.90            0.09        0.01

Matches are distributed among these distances:
  100  23  0.16
  101  117  0.84

ACGTcount: A:0.35, C:0.18, G:0.05, T:0.42

Consensus pattern (100 bp):
TTTTATTTACCAATTACTTAAAAATTCAATCTTTTATTCAAAGGTTAAATCTTTATTTACCAATC
ACTCTAAAGATTCAATCTTTTCCCAAACATGACAT


Found at i:39465 original size:20 final size:20

Alignment explanation

Indices: 39442--39480  Score: 60
Period size: 20  Copynumber: 1.9  Consensus size: 20

     39432 TCTTTTATTC

                     *
     39442 AAAGGTTAAATCTTTTAGCA
         1 AAAGGTTAAACCTTTTAGCA

                   *
     39462 AAAGGTTACACCTTTTAGC
         1 AAAGGTTAAACCTTTTAGC

     39481 CAAATATCCC

Statistics
Matches: 17, Mismatches: 2, Indels: 0
        0.89            0.11        0.00

Matches are distributed among these distances:
  20  17  1.00

ACGTcount: A:0.36, C:0.15, G:0.15, T:0.33

Consensus pattern (20 bp):
AAAGGTTAAACCTTTTAGCA

Done.