Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01018788.1 Corchorus olitorius cultivar O-4 contig18821, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 153335
ACGTcount: A:0.32, C:0.18, G:0.18, T:0.32


Found at i:341 original size:16 final size:15

Alignment explanation

Indices: 311--358  Score: 51
Period size: 16  Copynumber: 3.1  Consensus size: 15

   301 GTCAAAGTTG

   311 AAGAAAAAATGAAAA
     1 AAGAAAAAATGAAAA

                *
   326 AAGAAGAAAGTGAAAA
     1 AAGAA-AAAATGAAAA

        *    *
   342 ATGAAAGAATGGAAAA
     1 AAGAAAAAAT-GAAAA

   358 A
     1 A

   359 TCAGAAAATT

Statistics
Matches: 27, Mismatches: 4, Indels: 3
        0.79            0.12        0.09

Matches are distributed among these distances:
 15     8  0.30
 16    19  0.70

ACGTcount: A:0.71, C:0.00, G:0.21, T:0.08

Consensus pattern (15 bp):
AAGAAAAAATGAAAA

Found at i:2464 original size:37 final size:37

Alignment explanation

Indices: 2414--2486  Score: 128
Period size: 37  Copynumber: 2.0  Consensus size: 37

  2404 AATTTTTCCT

            *         *
  2414 TTTAGTAATTTCCCTGGTAACTAAAAATAATATATAC
     1 TTTAGGAATTTCCCTAGTAACTAAAAATAATATATAC

  2451 TTTAGGAATTTCCCTAGTAACTAAAAATAATATATA
     1 TTTAGGAATTTCCCTAGTAACTAAAAATAATATATA

  2487 GTATATATAT

Statistics
Matches: 34, Mismatches: 2, Indels: 0
        0.94            0.06        0.00

Matches are distributed among these distances:
 37    34  1.00

ACGTcount: A:0.42, C:0.12, G:0.08, T:0.37

Consensus pattern (37 bp):
TTTAGGAATTTCCCTAGTAACTAAAAATAATATATAC

Found at i:3288 original size:2 final size:2

Alignment explanation

Indices: 3281--3311  Score: 55
Period size: 2  Copynumber: 16.0  Consensus size: 2

  3271 TACTATTATT

  3281 TA TA TA TA TA -A TA TA TA TA TA TA TA TA TA TA
     1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA

  3312 AATTGTATAT

Statistics
Matches: 28, Mismatches: 0, Indels: 2
        0.93            0.00        0.07

Matches are distributed among these distances:
  1     1  0.04
  2    27  0.96

ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48

Consensus pattern (2 bp):
TA

Found at i:4826 original size:29 final size:29

Alignment explanation

Indices: 4792--4871  Score: 133
Period size: 29  Copynumber: 2.8  Consensus size: 29

  4782 ACATAAAACG

                 *     *   *
  4792 GCCAAATAAGTCCCTGGACTTTAATTATA
     1 GCCAAATAAGCCCCTGAACTCTAATTATA

  4821 GCCAAATAAGCCCCTGAACTCTAATTATA
     1 GCCAAATAAGCCCCTGAACTCTAATTATA

  4850 GCCAAATAAGCCCCTGAACTCT
     1 GCCAAATAAGCCCCTGAACTCT

  4872 TTAAAAAGGC

Statistics
Matches: 48, Mismatches: 3, Indels: 0
        0.94            0.06        0.00

Matches are distributed among these distances:
 29    48  1.00

ACGTcount: A:0.35, C:0.28, G:0.12, T:0.25

Consensus pattern (29 bp):
GCCAAATAAGCCCCTGAACTCTAATTATA

Found at i:4886 original size:30 final size:28

Alignment explanation

Indices: 4792--4891  Score: 103
Period size: 29  Copynumber: 3.4  Consensus size: 28

  4782 ACATAAAACG

                 *     *   *      *
  4792 GCCAAATAAGTCCCTGGACTTTAATTATA
     1 GCCAAATAAGCCCCTGAACTCTAA-TAAA

                                  *
  4821 GCCAAATAAGCCCCTGAACTCTAATTATA
     1 GCCAAATAAGCCCCTGAACTCTAA-TAAA

  4850 GCCAAATAAGCCCCTGAACTCTTTAA-AAA
     1 GCCAAATAAGCCCCTGAACTC--TAATAAA

  4879 GGCCAAATAAGCC
     1 -GCCAAATAAGCC

  4892 ATTTTCTGAT

Statistics
Matches: 64, Mismatches: 4, Indels: 5
        0.88            0.05        0.07

Matches are distributed among these distances:
 29    49  0.77
 30    12  0.19
 31     3  0.05

ACGTcount: A:0.38, C:0.26, G:0.13, T:0.23

Consensus pattern (28 bp):
GCCAAATAAGCCCCTGAACTCTAATAAA

Found at i:12840 original size:16 final size:15

Alignment explanation

Indices: 12802--12843  Score: 66
Period size: 15  Copynumber: 2.7  Consensus size: 15

 12792 ACAGAGATTG

              *
 12802 ACAGAAAGCAATTAA
     1 ACAGAAAACAATTAA

 12817 ACAGAAAACAATTAA
     1 ACAGAAAACAATTAA

 12832 ACTAGAAAACAA
     1 AC-AGAAAACAA

 12844 AGCAAAGTAA

Statistics
Matches: 25, Mismatches: 1, Indels: 1
        0.93            0.04        0.04

Matches are distributed among these distances:
 15    16  0.64
 16     9  0.36

ACGTcount: A:0.64, C:0.14, G:0.10, T:0.12

Consensus pattern (15 bp):
ACAGAAAACAATTAA

Found at i:19988 original size:18 final size:18

Alignment explanation

Indices: 19965--20001  Score: 56
Period size: 18  Copynumber: 2.1  Consensus size: 18

 19955 CTCCTCTTGC

                *
 19965 ATGAAAACACTTCTTTTT
     1 ATGAAAACAATTCTTTTT

                   *
 19983 ATGAAAACAATTTTTTTT
     1 ATGAAAACAATTCTTTTT

 20001 A
     1 A

 20002 AACTACCCTT

Statistics
Matches: 17, Mismatches: 2, Indels: 0
        0.89            0.11        0.00

Matches are distributed among these distances:
 18    17  1.00

ACGTcount: A:0.38, C:0.11, G:0.05, T:0.46

Consensus pattern (18 bp):
ATGAAAACAATTCTTTTT

Found at i:37184 original size:17 final size:17

Alignment explanation

Indices: 37158--37212  Score: 74
Period size: 17  Copynumber: 3.2  Consensus size: 17

 37148 GAACAAAAAA

                 *  *
 37158 TATTATTTTACAGTGAT
     1 TATTATTTTATAGAGAT

            *
 37175 TATTAATTTATAGAGTAT
     1 TATTATTTTATAGAG-AT

 37193 TATTATTTTATAGAGAT
     1 TATTATTTTATAGAGAT

 37210 TAT
     1 TAT

 37213 CACTGCTCTT

Statistics
Matches: 33, Mismatches: 4, Indels: 2
        0.85            0.10        0.05

Matches are distributed among these distances:
 17    17  0.52
 18    16  0.48

ACGTcount: A:0.35, C:0.02, G:0.11, T:0.53

Consensus pattern (17 bp):
TATTATTTTATAGAGAT

Found at i:41130 original size:11 final size:11

Alignment explanation

Indices: 41116--41154  Score: 51
Period size: 11  Copynumber: 3.4  Consensus size: 11

 41106 TCTTTATTTT

 41116 TTTTTTTTGTA
     1 TTTTTTTTGTA

 41127 TTTTTTTCTGGTA
     1 TTTTTTT-T-GTA

             *
 41140 TTTTTTCTGTA
     1 TTTTTTTTGTA

 41151 TTTT
     1 TTTT

 41155 CTTGGTGGCT

Statistics
Matches: 25, Mismatches: 1, Indels: 4
        0.83            0.03        0.13

Matches are distributed among these distances:
 11    14  0.56
 12     2  0.08
 13     9  0.36

ACGTcount: A:0.08, C:0.05, G:0.10, T:0.77

Consensus pattern (11 bp):
TTTTTTTTGTA

Found at i:41130 original size:13 final size:13

Alignment explanation

Indices: 41114--41154  Score: 50
Period size: 12  Copynumber: 3.3  Consensus size: 13

 41104 TTTCTTTATT

              * *
 41114 TTTTTTTTTTGTA
     1 TTTTTTTCTGGTA

 41127 TTTTTTTCTGGTA
     1 TTTTTTTCTGGTA

 41140 -TTTTTTCT-GTA
     1 TTTTTTTCTGGTA

 41151 TTTT
     1 TTTT

 41155 CTTGGTGGCT

Statistics
Matches: 25, Mismatches: 2, Indels: 3
        0.83            0.07        0.10

Matches are distributed among these distances:
 11     3  0.12
 12    11  0.44
 13    11  0.44

ACGTcount: A:0.07, C:0.05, G:0.10, T:0.78

Consensus pattern (13 bp):
TTTTTTTCTGGTA

Found at i:41286 original size:2 final size:2

Alignment explanation

Indices: 41281--41315  Score: 70
Period size: 2  Copynumber: 17.5  Consensus size: 2

 41271 TTGGTGTGTG

 41281 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T
     1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T

 41316 CCCCAAAATG

Statistics
Matches: 33, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  2    33  1.00

ACGTcount: A:0.49, C:0.00, G:0.00, T:0.51

Consensus pattern (2 bp):
TA

Found at i:45491 original size:24 final size:25

Alignment explanation

Indices: 45445--45492  Score: 62
Period size: 24  Copynumber: 2.0  Consensus size: 25

 45435 ATTGGAGTAT

                      *
 45445 TTATTTATCTTGTTGCTTAATTTTA
     1 TTATTTATCTTGTTGATTAATTTTA

                     *   *
 45470 TTATTT-TCTTGTTTATTTATTTT
     1 TTATTTATCTTGTTGATTAATTTT

 45493 TATTGTTACT

Statistics
Matches: 20, Mismatches: 3, Indels: 1
        0.83            0.12        0.04

Matches are distributed among these distances:
 24    14  0.70
 25     6  0.30

ACGTcount: A:0.17, C:0.06, G:0.06, T:0.71

Consensus pattern (25 bp):
TTATTTATCTTGTTGATTAATTTTA

Found at i:47129 original size:21 final size:22

Alignment explanation

Indices: 47103--47145  Score: 79
Period size: 21  Copynumber: 2.0  Consensus size: 22

 47093 AAATCTGAGG

 47103 CTACCCAGCCCCGGGT-ACCCC
     1 CTACCCAGCCCCGGGTGACCCC

 47124 CTACCCAGCCCCGGGTGACCCC
     1 CTACCCAGCCCCGGGTGACCCC

 47146 AGAAGCTTAA

Statistics
Matches: 21, Mismatches: 0, Indels: 1
        0.95            0.00        0.05

Matches are distributed among these distances:
 21    16  0.76
 22     5  0.24

ACGTcount: A:0.14, C:0.56, G:0.21, T:0.09

Consensus pattern (22 bp):
CTACCCAGCCCCGGGTGACCCC

Found at i:48908 original size:21 final size:22

Alignment explanation

Indices: 48866--48916  Score: 77
Period size: 22  Copynumber: 2.4  Consensus size: 22

 48856 AGTTCTGAGG

             *
 48866 CTACCCCGCCCCGGGTACCCCC
     1 CTACCCGGCCCCGGGTACCCCC

         *
 48888 CTGCCCGGCCCCGGGTA-CCCC
     1 CTACCCGGCCCCGGGTACCCCC

 48909 CTACCCGG
     1 CTACCCGG

 48917 GAGCGGGTGA

Statistics
Matches: 26, Mismatches: 3, Indels: 1
        0.87            0.10        0.03

Matches are distributed among these distances:
 21    11  0.42
 22    15  0.58

ACGTcount: A:0.08, C:0.59, G:0.24, T:0.10

Consensus pattern (22 bp):
CTACCCGGCCCCGGGTACCCCC

Found at i:49798 original size:24 final size:25

Alignment explanation

Indices: 49752--49799  Score: 62
Period size: 24  Copynumber: 2.0  Consensus size: 25

 49742 ATTGGAGTAT

                      *
 49752 TTATTTATCTTGTTGCTTAATTTTA
     1 TTATTTATCTTGTTGATTAATTTTA

                     *   *
 49777 TTATTT-TCTTGTTTATTTATTTT
     1 TTATTTATCTTGTTGATTAATTTT

 49800 TATTGTTACT

Statistics
Matches: 20, Mismatches: 3, Indels: 1
        0.83            0.12        0.04

Matches are distributed among these distances:
 24    14  0.70
 25     6  0.30

ACGTcount: A:0.17, C:0.06, G:0.06, T:0.71

Consensus pattern (25 bp):
TTATTTATCTTGTTGATTAATTTTA

Found at i:52849 original size:21 final size:21

Alignment explanation

Indices: 52804--52850  Score: 60
Period size: 21  Copynumber: 2.2  Consensus size: 21

 52794 AATTCCTTTT

                           *
 52804 TTTTCATTTTATTTAATTTCT
     1 TTTTCATTTTATTTAATTTCC

                       *
 52825 TTTTCATTTT-TCTTATTTTCC
     1 TTTTCATTTTAT-TTAATTTCC

 52846 TTTTC
     1 TTTTC

 52851 TTATTTGAAA

Statistics
Matches: 23, Mismatches: 2, Indels: 2
        0.85            0.07        0.07

Matches are distributed among these distances:
 20     1  0.04
 21    22  0.96

ACGTcount: A:0.13, C:0.15, G:0.00, T:0.72

Consensus pattern (21 bp):
TTTTCATTTTATTTAATTTCC

Found at i:64857 original size:16 final size:16

Alignment explanation

Indices: 64836--64899  Score: 67
Period size: 16  Copynumber: 4.1  Consensus size: 16

 64826 ATGAAAGAAA

 64836 TAAATTTTAATTAAAT
     1 TAAATTTTAATTAAAT

                    *
 64852 TAAATTTTAATT-GAT
     1 TAAATTTTAATTAAAT

        * *     *  *
 64867 TTATTTTTAGTTTAAT
     1 TAAATTTTAATTAAAT

                 *
 64883 TAAATTTTAAATAAAT
     1 TAAATTTTAATTAAAT

 64899 T
     1 T

 64900 TTTAACTTAA

Statistics
Matches: 37, Mismatches: 10, Indels: 2
        0.76            0.20        0.04

Matches are distributed among these distances:
 15    11  0.30
 16    26  0.70

ACGTcount: A:0.42, C:0.00, G:0.03, T:0.55

Consensus pattern (16 bp):
TAAATTTTAATTAAAT

Found at i:69876 original size:77 final size:78

Alignment explanation

Indices: 69759--69919  Score: 270
Period size: 77  Copynumber: 2.1  Consensus size: 78

 69749 TTTTTTTAAT

                                         **
 69759 TAAAACAGTAAAATGGTAAAATAAAATAGTTATATGGATATTAGATTTAATTAAATAAAAATAGA
     1 TAAAACAGTAAAATGGTAAAATAAAATAGTTATAAAGATATTAGATTTAATTAAATAAAAATAGA

 69824 GTTTTTAGTTGAG
    66 GTTTTTAGTTGAG

            *                *                                        *
 69837 TAAAATAGTAAAATGGT-AAATCAAATAGTTATAAAGATATTAGATTTAATTAAATAAAAATATA
     1 TAAAACAGTAAAATGGTAAAATAAAATAGTTATAAAGATATTAGATTTAATTAAATAAAAATAGA

 69901 GTTTTTAGTTGAG
    66 GTTTTTAGTTGAG

 69914 TAAAAC
     1 TAAAAC

 69920 TATTATATTT

Statistics
Matches: 77, Mismatches: 6, Indels: 1
        0.92            0.07        0.01

Matches are distributed among these distances:
 77    61  0.79
 78    16  0.21

ACGTcount: A:0.50, C:0.02, G:0.14, T:0.35

Consensus pattern (78 bp):
TAAAACAGTAAAATGGTAAAATAAAATAGTTATAAAGATATTAGATTTAATTAAATAAAAATAGA
GTTTTTAGTTGAG

Found at i:70714 original size:53 final size:53

Alignment explanation

Indices: 70643--70803  Score: 313
Period size: 53  Copynumber: 3.0  Consensus size: 53

 70633 TGTCGGGTCA

                                  *
 70643 TTTGGGTTTGGGTCAATTTTGGTTCGTTTCTTTTTCGGTTTCAAGTCATATGG
     1 TTTGGGTTTGGGTCAATTTTGGTTCGTGTCTTTTTCGGTTTCAAGTCATATGG

 70696 TTTGGGTTTGGGTCAATTTTGGTTCGTGTCTTTTTCGGTTTCAAGTCATATGG
     1 TTTGGGTTTGGGTCAATTTTGGTTCGTGTCTTTTTCGGTTTCAAGTCATATGG

 70749 TTTGGGTTTGGGTCAATTTTGGTTCGTGTCTTTTTCGGTTTCAAGTCATATGG
     1 TTTGGGTTTGGGTCAATTTTGGTTCGTGTCTTTTTCGGTTTCAAGTCATATGG

 70802 TT
     1 TT

 70804 CCAATAATTT

Statistics
Matches: 107, Mismatches: 1, Indels: 0
        0.99            0.01        0.00

Matches are distributed among these distances:
 53   107  1.00

ACGTcount: A:0.11, C:0.11, G:0.27, T:0.50

Consensus pattern (53 bp):
TTTGGGTTTGGGTCAATTTTGGTTCGTGTCTTTTTCGGTTTCAAGTCATATGG

Found at i:81044 original size:26 final size:26

Alignment explanation

Indices: 81005--81058  Score: 92
Period size: 26  Copynumber: 2.1  Consensus size: 26

 80995 GTTTTCCATC

 81005 TTAGTTTTGCTTTATTAAAATTGCAT
     1 TTAGTTTTGCTTTATTAAAATTGCAT

 81031 TTAGTTTTTG-TTTATTAAAATTGCAT
     1 TTAG-TTTTGCTTTATTAAAATTGCAT

 81057 TT
     1 TT

 81059 TTGCATATGT

Statistics
Matches: 27, Mismatches: 0, Indels: 2
        0.93            0.00        0.07

Matches are distributed among these distances:
 26    22  0.81
 27     5  0.19

ACGTcount: A:0.26, C:0.06, G:0.11, T:0.57

Consensus pattern (26 bp):
TTAGTTTTGCTTTATTAAAATTGCAT

Found at i:95756 original size:80 final size:79

Alignment explanation

Indices: 95661--95822  Score: 256
Period size: 80  Copynumber: 2.0  Consensus size: 79

 95651 TAGAGACGCC

 95661 AATATAATCTCTAATTATTGATTAATGAAAGTGCA-TATTTGATAAAAAAAAATTCATATTACTA
     1 AATATAATCTCTAATTATTGATTAATGAAAGTG-ATTATTTGAT-AAAAAAAATT-ATATTACTA

              *
 95725 AAT-ACATGTCTCAAAT
    63 AATAACACGTCTCAAAT

                                                                 *
 95741 AATATAATCTCTAATTATTGATTAATGAAAGTGATTATTTGATAAAAAAAATTATATTGCTAAAT
     1 AATATAATCTCTAATTATTGATTAATGAAAGTGATTATTTGATAAAAAAAATTATATTACTAAAT

 95806 ACACACGTCTCAAAT
    66 A-ACACGTCTCAAAT

 95821 AA
     1 AA

 95823 GAAAAATGGT

Statistics
Matches: 77, Mismatches: 2, Indels: 6
        0.91            0.02        0.07

Matches are distributed among these distances:
 78    11  0.14
 79    11  0.14
 80    55  0.71

ACGTcount: A:0.46, C:0.10, G:0.08, T:0.36

Consensus pattern (79 bp):
AATATAATCTCTAATTATTGATTAATGAAAGTGATTATTTGATAAAAAAAATTATATTACTAAAT
AACACGTCTCAAAT

Found at i:97271 original size:11 final size:11

Alignment explanation

Indices: 97247--97281  Score: 52
Period size: 11  Copynumber: 3.2  Consensus size: 11

 97237 TTGACAGCGC

 97247 AACAAAAACAA
     1 AACAAAAACAA

          *     *
 97258 AACGAAAACGA
     1 AACAAAAACAA

 97269 AACAAAAACAA
     1 AACAAAAACAA

 97280 AA
     1 AA

 97282 AACAGAAAAA

Statistics
Matches: 20, Mismatches: 4, Indels: 0
        0.83            0.17        0.00

Matches are distributed among these distances:
 11    20  1.00

ACGTcount: A:0.77, C:0.17, G:0.06, T:0.00

Consensus pattern (11 bp):
AACAAAAACAA

Found at i:99407 original size:3 final size:3

Alignment explanation

Indices: 99399--99423  Score: 50
Period size: 3  Copynumber: 8.3  Consensus size: 3

 99389 ACGCACACAC

 99399 AGA AGA AGA AGA AGA AGA AGA AGA A
     1 AGA AGA AGA AGA AGA AGA AGA AGA A

 99424 AGGGGGAAAA

Statistics
Matches: 22, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  3    22  1.00

ACGTcount: A:0.68, C:0.00, G:0.32, T:0.00

Consensus pattern (3 bp):
AGA

Found at i:103771 original size:42 final size:42

Alignment explanation

Indices: 103712--103798  Score: 156
Period size: 42  Copynumber: 2.1  Consensus size: 42

103702 AGCAACAACT

                                 *
103712 AATATTAACTTTATTTTGATAAATTATCTAGAGATGGAGTAG
     1 AATATTAACTTTATTTTGATAAATTACCTAGAGATGGAGTAG

                           *
103754 AATATTAACTTTATTTTGATGAATTACCTAGAGATGGAGTAG
     1 AATATTAACTTTATTTTGATAAATTACCTAGAGATGGAGTAG

103796 AAT
     1 AAT

103799 TTAGCTAATG

Statistics
Matches: 43, Mismatches: 2, Indels: 0
        0.96            0.04        0.00

Matches are distributed among these distances:
 42    43  1.00

ACGTcount: A:0.38, C:0.06, G:0.17, T:0.39

Consensus pattern (42 bp):
AATATTAACTTTATTTTGATAAATTACCTAGAGATGGAGTAG

Found at i:105795 original size:22 final size:21

Alignment explanation

Indices: 105765--105838  Score: 69
Period size: 22  Copynumber: 3.4  Consensus size: 21

105755 TATTATGAAT

           *             *
105765 TTTTTATAACTACCCTATTAAA
     1 TTTTGATAA-TACCCTATAAAA

                    *
105787 TTTTGATAATCACGCTATAAAA
     1 TTTTGATAAT-ACCCTATAAAA

                         *
105809 TTTTGATAATTA-CCTATGAAA
     1 TTTTGATAA-TACCCTATAAAA

         *
105830 TTGTGATAA
     1 TTTTGATAA

105839 ACTCCATATG

Statistics
Matches: 44, Mismatches: 6, Indels: 5
        0.80            0.11        0.09

Matches are distributed among these distances:
 21    16  0.36
 22    27  0.61
 23     1  0.02

ACGTcount: A:0.38, C:0.12, G:0.08, T:0.42

Consensus pattern (21 bp):
TTTTGATAATACCCTATAAAA

Found at i:105807 original size:21 final size:21

Alignment explanation

Indices: 105778--105838  Score: 77
Period size: 22  Copynumber: 2.9  Consensus size: 21

105768 TTATAACTAC

            *
105778 CCTATTAAATTTTGATAATCA
     1 CCTATAAAATTTTGATAATCA

                           *
105799 CGCTATAAAATTTTGATAATTA
     1 C-CTATAAAATTTTGATAATCA

            *     *
105821 CCTATGAAATTGTGATAA
     1 CCTATAAAATTTTGATAA

105839 ACTCCATATG

Statistics
Matches: 35, Mismatches: 4, Indels: 2
        0.85            0.10        0.05

Matches are distributed among these distances:
 21    16  0.46
 22    19  0.54

ACGTcount: A:0.39, C:0.11, G:0.10, T:0.39

Consensus pattern (21 bp):
CCTATAAAATTTTGATAATCA

Found at i:105921 original size:21 final size:21

Alignment explanation

Indices: 105866--105926  Score: 68
Period size: 21  Copynumber: 2.8  Consensus size: 21

105856 GATAACCAAA

                             *
105866 CTATGAAATTTTAATAAACCTTT
     1 CTATGAAATTTT-AT-AACCTTC

                   *
105889 CTATGAAATTTTGTAACCTTC
     1 CTATGAAATTTTATAACCTTC

             **
105910 CTATGATTTTTTATAAC
     1 CTATGAAATTTTATAAC

105927 GTCCTTGTGA

Statistics
Matches: 33, Mismatches: 5, Indels: 2
        0.82            0.12        0.05

Matches are distributed among these distances:
 21    20  0.61
 22     1  0.03
 23    12  0.36

ACGTcount: A:0.33, C:0.15, G:0.07, T:0.46

Consensus pattern (21 bp):
CTATGAAATTTTATAACCTTC

Found at i:106782 original size:2 final size:2

Alignment explanation

Indices: 106777--106806  Score: 60
Period size: 2  Copynumber: 15.0  Consensus size: 2

106767 TCTCTCTCTC

106777 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA
     1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA

106807 GAACTACTCC

Statistics
Matches: 28, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  2    28  1.00

ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50

Consensus pattern (2 bp):
TA

Found at i:115541 original size:21 final size:21

Alignment explanation

Indices: 115517--115565  Score: 80
Period size: 21  Copynumber: 2.3  Consensus size: 21

115507 GCGAGCTTGA

115517 CCGGGCAGGTGGCGCGGATGG
     1 CCGGGCAGGTGGCGCGGATGG

             *      *
115538 CCGGGCTGGTGGCTCGGATGG
     1 CCGGGCAGGTGGCGCGGATGG

115559 CCGGGCA
     1 CCGGGCA

115566 AGTGACTTGG

Statistics
Matches: 25, Mismatches: 3, Indels: 0
        0.89            0.11        0.00

Matches are distributed among these distances:
 21    25  1.00

ACGTcount: A:0.08, C:0.27, G:0.53, T:0.12

Consensus pattern (21 bp):
CCGGGCAGGTGGCGCGGATGG

Found at i:115733 original size:26 final size:27

Alignment explanation

Indices: 115699--115765  Score: 109
Period size: 26  Copynumber: 2.5  Consensus size: 27

115689 AAATTACCAA

                           *
115699 GGGCATTTTGGTCATTTTT-GCCTCAG
     1 GGGCATTTTGGTCATTTTTCACCTCAG

115725 GGGCATTTTGGTCATTTTTCACCTCAG
     1 GGGCATTTTGGTCATTTTTCACCTCAG

               *
115752 GGGCATTTAGGTCA
     1 GGGCATTTTGGTCA

115766 AGATTACTGG

Statistics
Matches: 38, Mismatches: 2, Indels: 1
        0.93            0.05        0.02

Matches are distributed among these distances:
 26    19  0.50
 27    19  0.50

ACGTcount: A:0.15, C:0.19, G:0.27, T:0.39

Consensus pattern (27 bp):
GGGCATTTTGGTCATTTTTCACCTCAG

Found at i:119310 original size:19 final size:19

Alignment explanation

Indices: 119272--119310  Score: 53
Period size: 19  Copynumber: 2.1  Consensus size: 19

119262 TTAAGCTAAA

                      *
119272 TAGTAAAAGAAACAGTGAT
     1 TAGTAAAAGAAACAGAGAT

119291 TAGTAAAAGAAGA-AGAGAT
     1 TAGTAAAAGAA-ACAGAGAT

119310 T
     1 T

119311 GGTTTAATTC

Statistics
Matches: 18, Mismatches: 1, Indels: 2
        0.86            0.05        0.10

Matches are distributed among these distances:
 19    17  0.94
 20     1  0.06

ACGTcount: A:0.54, C:0.03, G:0.23, T:0.21

Consensus pattern (19 bp):
TAGTAAAAGAAACAGAGAT

Found at i:119459 original size:47 final size:47

Alignment explanation

Indices: 119291--119808  Score: 867
Period size: 47  Copynumber: 11.0  Consensus size: 47

119281 AAACAGTGAT

119291 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG
     1 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

                                  *  *
119338 TAAGTAAAAGAAGAAGAGATTGGTTTAGTTATAGGTAATTAAACTAAG
     1 T-AGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

                        *   *        *                *
119386 TAAGTAAAAGAAGAAGATATTAGTTTAATTTTAGGTAATTAAACTAAT
     1 T-AGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

                                                     *
119434 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAA
     1 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

119481 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG
     1 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

                                        *
119528 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGTTAATTAAACTAAG
     1 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

                              *
119575 TAGTAAAAGAAGAAGAGATTGGTGTAATTCTAGGTAATTAAACTAAG
     1 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

119622 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG
     1 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

                         * *
119669 TAGTAAAAGAAGAAGAGACTAGTTTAATTCTAGGTAATTAAACTAAG
     1 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

                           *
119716 TAGTAAAAGAAGAAGAGATTAGTTTAATTCTAGGTAATTAAACTAAG
     1 TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

          * *               *
119763 TAGCAGAAGAAGGAA-AGATTAGTTTAATTCTAGGTAATTAAACTAA
     1 TAGTAAAAGAA-GAAGAGATTGGTTTAATTCTAGGTAATTAAACTAA

119809 AGAAGAGGTT

Statistics
Matches: 448, Mismatches: 21, Indels: 4
        0.95            0.04        0.01

Matches are distributed among these distances:
 47   357  0.80
 48    91  0.20

ACGTcount: A:0.46, C:0.04, G:0.20, T:0.30

Consensus pattern (47 bp):
TAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAG

Found at i:119821 original size:83 final size:85

Alignment explanation

Indices: 119725--119897  Score: 246
Period size: 83  Copynumber: 2.0  Consensus size: 85

119715 GTAGTAAAAG

                                *                         *
119725 AAGAAGAGATTAGTTTAATTCTAGGTAATTAAACTAAGTAGCAGAA-GAAGGA-AAG-ATTAGTT
     1 AAGAAGAGATTAGTTTAATTCTAGGAAATTAAACTAAGTAGCA-AATGAAGAACAAGAATTAGTT

119787 TAATTC-TAGGTAATTAAACTA
    65 TAA-TCATAGGTAATTAAACTA

               *
119808 AAGAAGAGGTTAGTTTAATTCTAGGAAATTAAACTAAGTAGCAAATGAAGAACAAGCAATTAGTT
     1 AAGAAGAGATTAGTTTAATTCTAGGAAATTAAACTAAGTAGCAAATGAAGAACAAG-AATTAGTT

              *          *
119873 TAATCATGGGTAATTAAATTA
    65 TAATCATAGGTAATTAAACTA

119894 AAGA
     1 AAGA

119898 GGATAAGAAG

Statistics
Matches: 80, Mismatches: 5, Indels: 7
        0.87            0.05        0.08

Matches are distributed among these distances:
 82     2  0.03
 83    46  0.57
 84     3  0.04
 85     2  0.03
 86    27  0.34

ACGTcount: A:0.46, C:0.06, G:0.19, T:0.29

Consensus pattern (85 bp):
AAGAAGAGATTAGTTTAATTCTAGGAAATTAAACTAAGTAGCAAATGAAGAACAAGAATTAGTTT
AATCATAGGTAATTAAACTA

Found at i:123508 original size:35 final size:35

Alignment explanation

Indices: 123469--123552  Score: 141
Period size: 35  Copynumber: 2.4  Consensus size: 35

123459 AATTTATGGG

                                       *
123469 CAGAACAATGGTTTGTAACCCTTAATTTCTATTCT
     1 CAGAACAATGGTTTGTAACCCTTAATTTCTATCCT

                         *         *
123504 CAGAACAATGGTTTGTAATCCTTAATTTTTATCCT
     1 CAGAACAATGGTTTGTAACCCTTAATTTCTATCCT

123539 CAGAACAATGGTTT
     1 CAGAACAATGGTTT

123553 TATGATATGG

Statistics
Matches: 46, Mismatches: 3, Indels: 0
        0.94            0.06        0.00

Matches are distributed among these distances:
 35    46  1.00

ACGTcount: A:0.30, C:0.18, G:0.13, T:0.39

Consensus pattern (35 bp):
CAGAACAATGGTTTGTAACCCTTAATTTCTATCCT

Found at i:132373 original size:26 final size:27

Alignment explanation

Indices: 132339--132399  Score: 88
Period size: 26  Copynumber: 2.3  Consensus size: 27

132329 AAATTACCAA

                *         *
132339 GGGCATTTTGGTAATTTT-TGCCTCAG
     1 GGGCATTTTCGTAATTTTCCGCCTCAG

                   *
132365 GGGCATTTTCGTTATTTTCCGCCTCAG
     1 GGGCATTTTCGTAATTTTCCGCCTCAG

132392 GGGCATTT
     1 GGGCATTT

132400 AGGTCAAGAT

Statistics
Matches: 31, Mismatches: 3, Indels: 1
        0.89            0.09        0.03

Matches are distributed among these distances:
 26    16  0.52
 27    15  0.48

ACGTcount: A:0.13, C:0.20, G:0.26, T:0.41

Consensus pattern (27 bp):
GGGCATTTTCGTAATTTTCCGCCTCAG

Found at i:132909 original size:19 final size:18

Alignment explanation

Indices: 132885--132920  Score: 54
Period size: 19  Copynumber: 1.9  Consensus size: 18

132875 TGAAGACTTA

132885 TTGAAGATAATTTGAAGAT
     1 TTGAAGATAA-TTGAAGAT

               *
132904 TTGAAGATCATTGAAGA
     1 TTGAAGATAATTGAAGA

132921 ATTATCTCAA

Statistics
Matches: 16, Mismatches: 1, Indels: 1
        0.89            0.06        0.06

Matches are distributed among these distances:
 18     7  0.44
 19     9  0.56

ACGTcount: A:0.42, C:0.03, G:0.22, T:0.33

Consensus pattern (18 bp):
TTGAAGATAATTGAAGAT

Found at i:137816 original size:35 final size:35

Alignment explanation

Indices: 137776--137843  Score: 127
Period size: 35  Copynumber: 1.9  Consensus size: 35

137766 TAATTAAACT

                               *
137776 AAGTAAGTAAAAGAAGAAGAGATTGGTTTAATTAC
     1 AAGTAAGTAAAAGAAGAAGAGATTAGTTTAATTAC

137811 AAGTAAGTAAAAGAAGAAGAGATTAGTTTAATT
     1 AAGTAAGTAAAAGAAGAAGAGATTAGTTTAATT

137844 CTAGGTAAAT

Statistics
Matches: 32, Mismatches: 1, Indels: 0
        0.97            0.03        0.00

Matches are distributed among these distances:
 35    32  1.00

ACGTcount: A:0.50, C:0.01, G:0.22, T:0.26

Consensus pattern (35 bp):
AAGTAAGTAAAAGAAGAAGAGATTAGTTTAATTAC

Found at i:137880 original size:47 final size:47

Alignment explanation

Indices: 137811--138237  Score: 782
Period size: 47  Copynumber: 9.0  Consensus size: 47

137801 GTTTAATTAC

                               *                *
137811 AAGTAAGTAAAAGAAGAAGAGATTAGTTTAATTCTAGGTAAATAAACT
     1 AAGT-AGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT

137859 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT
     1 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT

                                       *
137906 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTATAGGTAATTAAACT
     1 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT

                                                *
137953 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATCAAACT
     1 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT

138000 AAGTAAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT
     1 AAGT-AGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT

138048 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT
     1 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT

138095 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT
     1 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT

138142 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT
     1 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT

                              *        *
138189 AAGTAGTAAAAGAAGAAGAGATTAGTTTAATTATAGGTAATTAAACT
     1 AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT

138236 AA
     1 AA

138238 AGAAGAGGTT

Statistics
Matches: 370, Mismatches: 8, Indels: 3
        0.97            0.02        0.01

Matches are distributed among these distances:
 47   320  0.86
 48    50  0.14

ACGTcount: A:0.46, C:0.04, G:0.21, T:0.29

Consensus pattern (47 bp):
AAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT

Found at i:138037 original size:189 final size:189

Alignment explanation

Indices: 137811--138237  Score: 802
Period size: 189  Copynumber: 2.3  Consensus size: 189

137801 GTTTAATTAC

                                                *
137811 AAGTAAGTAAAAGAAGAAGAGATTAGTTTAATTCTAGGTAAATAAACTAAGTAGTAAAAGAAGAA
     1 AAGTAAGTAAAAGAAGAAGAGATTAGTTTAATTCTAGGTAATTAAACTAAGTAGTAAAAGAAGAA

137876 GAGATTGGTTTAATTCTAGGTAATTAAACTAAGTAGTAAAAGAAGAAGAGATTGGTTTAATTATA
    66 GAGATTGGTTTAATTCTAGGTAATTAAACTAAGTAGTAAAAGAAGAAGAGATTGGTTTAATTATA

137941 GGTAATTAAACTAAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATCAAACT
   131 GGTAATTAAACTAAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATCAAACT

                               *
138000 AAGTAAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAGTAGTAAAAGAAGAA
     1 AAGTAAGTAAAAGAAGAAGAGATTAGTTTAATTCTAGGTAATTAAACTAAGTAGTAAAAGAAGAA

                                                                     *
138065 GAGATTGGTTTAATTCTAGGTAATTAAACTAAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTA
    66 GAGATTGGTTTAATTCTAGGTAATTAAACTAAGTAGTAAAAGAAGAAGAGATTGGTTTAATTATA

                                                            *
138130 GGTAATTAAACTAAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATTAAACT
   131 GGTAATTAAACTAAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATCAAACT

                                        *
138189 AAGT-AGTAAAAGAAGAAGAGATTAGTTTAATTATAGGTAATTAAACTAA
     1 AAGTAAGTAAAAGAAGAAGAGATTAGTTTAATTCTAGGTAATTAAACTAA

138238 AGAAGAGGTT

Statistics
Matches: 232, Mismatches: 6, Indels: 1
        0.97            0.03        0.00

Matches are distributed among these distances:
188    43  0.19
189   189  0.81

ACGTcount: A:0.46, C:0.04, G:0.21, T:0.29

Consensus pattern (189 bp):
AAGTAAGTAAAAGAAGAAGAGATTAGTTTAATTCTAGGTAATTAAACTAAGTAGTAAAAGAAGAA
GAGATTGGTTTAATTCTAGGTAATTAAACTAAGTAGTAAAAGAAGAAGAGATTGGTTTAATTATA
GGTAATTAAACTAAGTAGTAAAAGAAGAAGAGATTGGTTTAATTCTAGGTAATCAAACT

Found at i:138242 original size:36 final size:36

Alignment explanation

Indices: 138201--138273  Score: 119
Period size: 36  Copynumber: 2.0  Consensus size: 36

138191 GTAGTAAAAG

                                *
138201 AAGAAGAGATTAGTTTAATTATAGGTAATTAAACTA
     1 AAGAAGAGATTAGTTTAATTATAGGAAATTAAACTA

               *           *
138237 AAGAAGAGGTTAGTTTAATTCTAGGAAATTAAACTA
     1 AAGAAGAGATTAGTTTAATTATAGGAAATTAAACTA

138273 A
     1 A

138274 GTAGCAAATG

Statistics
Matches: 34, Mismatches: 3, Indels: 0
        0.92            0.08        0.00

Matches are distributed among these distances:
 36    34  1.00

ACGTcount: A:0.47, C:0.04, G:0.18, T:0.32

Consensus pattern (36 bp):
AAGAAGAGATTAGTTTAATTATAGGAAATTAAACTA

Found at i:138253 original size:83 final size:85

Alignment explanation

Indices: 138154--138324  Score: 238
Period size: 83  Copynumber: 2.0  Consensus size: 85

138144 GTAGTAAAAG

                  *             *               *           *
138154 AAGAAGAGATTGGTTTAATTCTAGGTAATTAAACTAAGTAGTAAAAGAAGAA-GAG-ATTAGTTT
     1 AAGAAGAGATTAGTTTAATTCTAGGAAATTAAACTAAGTAGCAAAAGAAGAACAAGAATTAGTTT

          *
138217 AATTATAGGTAATTAAACTA
    66 AATCATAGGTAATTAAACTA

               *                                    *
138237 AAGAAGAGGTTAGTTTAATTCTAGGAAATTAAACTAAGTAGCAAATGAAGAACAAGCAATTAGTT
     1 AAGAAGAGATTAGTTTAATTCTAGGAAATTAAACTAAGTAGCAAAAGAAGAACAAG-AATTAGTT

              *          *
138302 TAATCATGGGTAATTAAATTA
    65 TAATCATAGGTAATTAAACTA

138323 AA
     1 AA

138325 AAGGAAAAGA

Statistics
Matches: 76, Mismatches: 9, Indels: 3
        0.86            0.10        0.03

Matches are distributed among these distances:
 83    47  0.62
 84     2  0.03
 86    27  0.36

ACGTcount: A:0.46, C:0.05, G:0.19, T:0.30

Consensus pattern (85 bp):
AAGAAGAGATTAGTTTAATTCTAGGAAATTAAACTAAGTAGCAAAAGAAGAACAAGAATTAGTTT
AATCATAGGTAATTAAACTA

Found at i:138416 original size:42 final size:42

Alignment explanation

Indices: 138311--138418  Score: 110
Period size: 44  Copynumber: 2.5  Consensus size: 42

138301 TTAATCATGG

                *       *                       **
138311 GTAATTAAATTAAAAAGGAAAAGAAAAAGTAAACAGAAATTGG
     1 GTAATTAAACT-AAAAGTAAAAGAAAAAGTAAACAGAAATTCA

                                  *     *   *
138354 GTTAATTAAACTAAAGAGTAAAAGAAAGAGTAAGCAGTAA-TCA
     1 G-TAATTAAACTAAA-AGTAAAAGAAAAAGTAAACAGAAATTCA

138397 GTAATTAAACTAAGAAGTAAAA
     1 GTAATTAAACTAA-AAGTAAAA

138419 AGTAGTCAAT

Statistics
Matches: 55, Mismatches: 7, Indels: 7
        0.80            0.10        0.10

Matches are distributed among these distances:
 42    19  0.35
 43     7  0.13
 44    29  0.53

ACGTcount: A:0.57, C:0.05, G:0.18, T:0.20

Consensus pattern (42 bp):
GTAATTAAACTAAAAGTAAAAGAAAAAGTAAACAGAAATTCA

Found at i:148600 original size:21 final size:21

Alignment explanation

Indices: 148574--148686  Score: 158
Period size: 21  Copynumber: 5.4  Consensus size: 21

148564 TGCTAGAAGT

148574 TCATTGGAGCAAGTTCCAAGC
     1 TCATTGGAGCAAGTTCCAAGC

148595 TCATTGGAGCAAGTTCCAAGC
     1 TCATTGGAGCAAGTTCCAAGC

                  *        *
148616 TCATTGGAGCAGGTTCCAAGT
     1 TCATTGGAGCAAGTTCCAAGC

                            *
148637 TCATTGGAG-AAGGTTCCAAGA
     1 TCATTGGAGCAA-GTTCCAAGC

                       *
148658 TCATTGGAG-AAGGTTTCAAGC
     1 TCATTGGAGCAA-GTTCCAAGC

148679 TCATTGGA
     1 TCATTGGA

148687 AATGCCTAAG

Statistics
Matches: 85, Mismatches: 6, Indels: 2
        0.91            0.06        0.02

Matches are distributed among these distances:
 20     1  0.01
 21    84  0.99

ACGTcount: A:0.28, C:0.19, G:0.27, T:0.27

Consensus pattern (21 bp):
TCATTGGAGCAAGTTCCAAGC

Found at i:149656 original size:23 final size:27

Alignment explanation

Indices: 149605--149659  Score: 73
Period size: 27  Copynumber: 2.2  Consensus size: 27

149595 CAAAGGAACA

                             *
149605 CAGATGGCAGAGAAGAAAAAGATGCAT
     1 CAGATGGCAGAGAAGAAAAAGAAGCAT

149632 CAGATGGCAG-GAA-AAAAA-AAG-AT
     1 CAGATGGCAGAGAAGAAAAAGAAGCAT

149655 CAGAT
     1 CAGAT

149660 TTCAAGAGCC

Statistics
Matches: 27, Mismatches: 1, Indels: 4
        0.84            0.03        0.12

Matches are distributed among these distances:
 23     7  0.26
 24     2  0.07
 25     5  0.19
 26     3  0.11
 27    10  0.37

ACGTcount: A:0.51, C:0.11, G:0.27, T:0.11

Consensus pattern (27 bp):
CAGATGGCAGAGAAGAAAAAGAAGCAT

Found at i:152594 original size:15 final size:16

Alignment explanation

Indices: 152561--152600  Score: 64
Period size: 15  Copynumber: 2.6  Consensus size: 16

152551 TTACTTTGCT

152561 TTGTTTTCTAGTATAA
     1 TTGTTTTCTAGTATAA

                   *
152577 TTGTTTTCT-GTTTAA
     1 TTGTTTTCTAGTATAA

152592 TTGTTTTCT
     1 TTGTTTTCT

152601 TTCAACCTCT

Statistics
Matches: 23, Mismatches: 1, Indels: 1
        0.92            0.04        0.04

Matches are distributed among these distances:
 15    14  0.61
 16     9  0.39

ACGTcount: A:0.15, C:0.07, G:0.12, T:0.65

Consensus pattern (16 bp):
TTGTTTTCTAGTATAA

Found at i:153047 original size:25 final size:24

Alignment explanation

Indices: 153011--153057  Score: 69
Period size: 26  Copynumber: 1.9  Consensus size: 24

153001 TCCTTCTATT

153011 CATCTATCATC-AAGTTTTTCATC
     1 CATCTATCATCAAAGTTTTTCATC

153034 CATCTCATCCATCAAAGTTTTTCA
     1 CATCT-AT-CATCAAAGTTTTTCA

153058 AATTTTCAAG

Statistics
Matches: 21, Mismatches: 0, Indels: 3
        0.88            0.00        0.12

Matches are distributed among these distances:
 23     5  0.24
 24     2  0.10
 25     4  0.19
 26    10  0.48

ACGTcount: A:0.28, C:0.28, G:0.04, T:0.40

Consensus pattern (24 bp):
CATCTATCATCAAAGTTTTTCATC

Done.