Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: AWUE01014667.1 Corchorus olitorius cultivar O-4 contig14700, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 54314
ACGTcount: A:0.33, C:0.16, G:0.17, T:0.34


Found at i:9515 original size:12 final size:11

Alignment explanation

Indices: 9498--9599 Score: 97 Period size: 12 Copynumber: 9.2 Consensus size: 11 9488 TAACTACTCT 9498 TATTAATATTAA 1 TATTAATA-TAA 9510 TATTAATATTAA 1 TATTAATA-TAA * 9522 TATATATTATAA 1 TAT-TAATATAA 9534 TATATAATATAA 1 TAT-TAATATAA 9546 TA-TAATATAA 1 TATTAATATAA 9556 TA-TAATATAA 1 TATTAATATAA 9566 TATT-ATAATAA 1 TATTAAT-ATAA 9577 TA-TAAT-TATA 1 TATTAATATA-A * 9587 TAATAATATAA 1 TATTAATATAA 9598 TA 1 TA 9600 ACATATTATA Statistics Matches: 81, Mismatches: 2, Indels: 15 0.83 0.02 0.15 Matches are distributed among these distances: 9 2 0.02 10 26 0.32 11 16 0.20 12 33 0.41 13 4 0.05 ACGTcount: A:0.55, C:0.00, G:0.00, T:0.45 Consensus pattern (11 bp): TATTAATATAA Found at i:9537 original size:5 final size:5 Alignment explanation

Indices: 9501--9599 Score: 96 Period size: 5 Copynumber: 19.0 Consensus size: 5 9491 CTACTCTTAT * 9501 TAATA TTAATA TTAATA TTAATA T-ATA TTATA -ATATA TAATA TAATA 1 TAATA -TAATA -TAATA -TAATA TAATA TAATA TA-ATA TAATA TAATA * 9548 TAATA TAATA TAATA TAATA TTATAA TAATA TAAT- TATATAA TAATA 1 TAATA TAATA TAATA TAATA TAAT-A TAATA TAATA TA-AT-A TAATA 9595 TAATA 1 TAATA 9600 ACATATTATA Statistics Matches: 83, Mismatches: 3, Indels: 15 0.82 0.03 0.15 Matches are distributed among these distances: 4 6 0.07 5 51 0.61 6 24 0.29 7 2 0.02 ACGTcount: A:0.56, C:0.00, G:0.00, T:0.44 Consensus pattern (5 bp): TAATA Found at i:9600 original size:8 final size:8 Alignment explanation

Indices: 9513--9600 Score: 88 Period size: 8 Copynumber: 10.4 Consensus size: 8 9503 ATATTAATAT 9513 TAATATTAA 1 TAATA-TAA * 9522 TATATATTA 1 TA-ATATAA 9531 TAATAT-A 1 TAATATAA 9538 TAATATAATA 1 TAATAT-A-A 9548 TAATATAATA 1 TAATAT-A-A 9558 TAATATAA 1 TAATATAA * 9566 TATTATAA 1 TAATATAA 9574 TAATATAA 1 TAATATAA * 9582 TTATATAA 1 TAATATAA 9590 TAATATAA 1 TAATATAA 9598 TAA 1 TAA 9601 CATATTATAG Statistics Matches: 70, Mismatches: 5, Indels: 9 0.83 0.06 0.11 Matches are distributed among these distances: 7 7 0.10 8 36 0.51 9 7 0.10 10 20 0.29 ACGTcount: A:0.57, C:0.00, G:0.00, T:0.43 Consensus pattern (8 bp): TAATATAA Found at i:10006 original size:2 final size:2 Alignment explanation

Indices: 10001--10057 Score: 64 Period size: 2 Copynumber: 28.0 Consensus size: 2 9991 TCATATGTAG * 10001 TA TA TA TA GTA TA -A TA TA TA TA TA TA TA TA TA TA TA TA GA TA 1 TA TA TA TA -TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA 10043 GTA -A TA TA GTA TA TA 1 -TA TA TA TA -TA TA TA 10058 ATGGCTTAAT Statistics Matches: 48, Mismatches: 2, Indels: 10 0.80 0.03 0.17 Matches are distributed among these distances: 1 2 0.04 2 40 0.83 3 6 0.12 ACGTcount: A:0.49, C:0.00, G:0.07, T:0.44 Consensus pattern (2 bp): TA Found at i:13832 original size:26 final size:27 Alignment explanation

Indices: 13798--13850 Score: 81 Period size: 28 Copynumber: 2.0 Consensus size: 27 13788 TATATATACT 13798 CCCTATGTTCC-TTTTATTTGTCATCA 1 CCCTATGTTCCTTTTTATTTGTCATCA * 13824 CCCTTTGTTCCTTTTTTATTTGTCATC 1 CCCTATGTTCC-TTTTTATTTGTCATC 13851 TTACACTACA Statistics Matches: 24, Mismatches: 1, Indels: 2 0.89 0.04 0.07 Matches are distributed among these distances: 26 10 0.42 28 14 0.58 ACGTcount: A:0.11, C:0.26, G:0.08, T:0.55 Consensus pattern (27 bp): CCCTATGTTCCTTTTTATTTGTCATCA Found at i:14073 original size:2 final size:2 Alignment explanation

Indices: 14066--14115 Score: 65 Period size: 2 Copynumber: 27.5 Consensus size: 2 14056 TGAATGGGAG 14066 TA TA TA TA TA TA TA TA TA TA TA TA TA TA T- TA TA -A TA -A TA 1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA 14105 -A TA -A TA TA TA T 1 TA TA TA TA TA TA T 14116 CATAGTTATT Statistics Matches: 43, Mismatches: 0, Indels: 10 0.81 0.00 0.19 Matches are distributed among these distances: 1 5 0.12 2 38 0.88 ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48 Consensus pattern (2 bp): TA Found at i:14111 original size:27 final size:29 Alignment explanation

Indices: 14081--14138 Score: 75 Period size: 30 Copynumber: 2.0 Consensus size: 29 14071 ATATATATAT 14081 ATATATAT-ATA-TATTATAATAATAATA 1 ATATATATCATAGTATTATAATAATAATA * * 14108 ATATATATCATAGTTATTATAGTAGTAATA 1 ATATATATCATAG-TATTATAATAATAATA 14138 A 1 A 14139 AATAGTTTTG Statistics Matches: 26, Mismatches: 2, Indels: 3 0.84 0.06 0.10 Matches are distributed among these distances: 27 8 0.31 28 3 0.12 30 15 0.58 ACGTcount: A:0.50, C:0.02, G:0.05, T:0.43 Consensus pattern (29 bp): ATATATATCATAGTATTATAATAATAATA Found at i:15279 original size:22 final size:21 Alignment explanation

Indices: 15216--15307 Score: 87 Period size: 22 Copynumber: 4.3 Consensus size: 21 15206 TATTTTTATG * * 15216 AAATTTTAATAATCAT-TATA 1 AAATTTTGATAATTATATATA * * 15236 AAATTTTGGTAATCTCTATATA 1 AAATTTTGATAAT-TATATATA 15258 AAATTTTGATAATTATACTATA 1 AAATTTTGATAATTATA-TATA * * * 15280 AAGTTTTTATAATAATATTATA 1 AAATTTTGATAATTATA-TATA 15302 AAATTT 1 AAATTT 15308 CGAGAACCTC Statistics Matches: 58, Mismatches: 11, Indels: 4 0.79 0.15 0.05 Matches are distributed among these distances: 20 11 0.19 21 4 0.07 22 43 0.74 ACGTcount: A:0.45, C:0.04, G:0.04, T:0.47 Consensus pattern (21 bp): AAATTTTGATAATTATATATA Found at i:15440 original size:22 final size:22 Alignment explanation

Indices: 15389--15440 Score: 61 Period size: 22 Copynumber: 2.4 Consensus size: 22 15379 ATAACCACTT 15389 AATGAAATTTTGATAATCACCC 1 AATGAAATTTTGATAATCACCC * * * 15411 TATAAAATTTTGATAA-CTTCCC 1 AATGAAATTTTGATAATC-ACCC 15433 AATGAAAT 1 AATGAAAT 15441 GTGAGTAAGT Statistics Matches: 24, Mismatches: 5, Indels: 2 0.77 0.16 0.06 Matches are distributed among these distances: 21 1 0.04 22 23 0.96 ACGTcount: A:0.42, C:0.15, G:0.08, T:0.35 Consensus pattern (22 bp): AATGAAATTTTGATAATCACCC Found at i:15471 original size:46 final size:44 Alignment explanation

Indices: 15402--15516 Score: 124 Period size: 46 Copynumber: 2.6 Consensus size: 44 15392 GAAATTTTGA ** * * 15402 TAATCACCCTATAAAATTTTGATAAC-TTCCCAATGAAATGTGAG 1 TAATCACATTATAAAATTTTGATAACATT-CCAATAAAATATGAG * * * * 15446 TAAGTGTACATTATGAAATTTTGATAACATTCCGATAAAATATTAG 1 TAA-T-CACATTATAAAATTTTGATAACATTCCAATAAAATATGAG 15492 TAATCACATTATAAAATTTTGATAA 1 TAATCACATTATAAAATTTTGATAA 15517 ACAAACCATG Statistics Matches: 58, Mismatches: 10, Indels: 6 0.78 0.14 0.08 Matches are distributed among these distances: 44 22 0.38 45 2 0.03 46 32 0.55 47 2 0.03 ACGTcount: A:0.42, C:0.12, G:0.10, T:0.36 Consensus pattern (44 bp): TAATCACATTATAAAATTTTGATAACATTCCAATAAAATATGAG Found at i:15552 original size:22 final size:21 Alignment explanation

Indices: 15527--15670 Score: 94 Period size: 22 Copynumber: 6.6 Consensus size: 21 15517 ACAAACCATG * 15527 AAATTGTGATAACCTTACTATA 1 AAATTTTGATAACCTT-CTATA * * 15549 AAATTTTTATAAACCTCCCTATA 1 AAATTTTGAT-AACCT-TCTATA * 15572 AAATTTTGATAACCTTCATTTGA 1 AAATTTTGATAACCTTC-TAT-A * * 15595 AAA-TTTGATAATCTTTCTATG 1 AAATTTTGATAA-CCTTCTATA * * * 15616 AATTTTTGATAACCTCCTTATG 1 AAATTTTGATAACCTTC-TATA * ** 15638 AAATTTT-ATTAACCTCCTACG 1 AAATTTTGA-TAACCTTCTATA 15659 AAATTTTGATAA 1 AAATTTTGATAA 15671 GAACTCTATT Statistics Matches: 99, Mismatches: 14, Indels: 19 0.75 0.11 0.14 Matches are distributed among these distances: 21 20 0.20 22 52 0.53 23 27 0.27 ACGTcount: A:0.36, C:0.15, G:0.07, T:0.42 Consensus pattern (21 bp): AAATTTTGATAACCTTCTATA Found at i:15663 original size:21 final size:22 Alignment explanation

Indices: 15625--15665 Score: 66 Period size: 21 Copynumber: 1.9 Consensus size: 22 15615 GAATTTTTGA * 15625 TAACCTCCTTATGAAATTTTAT 1 TAACCTCCTTACGAAATTTTAT 15647 TAACCTCC-TACGAAATTTT 1 TAACCTCCTTACGAAATTTT 15666 GATAAGAACT Statistics Matches: 18, Mismatches: 1, Indels: 1 0.90 0.05 0.05 Matches are distributed among these distances: 21 10 0.56 22 8 0.44 ACGTcount: A:0.32, C:0.22, G:0.05, T:0.41 Consensus pattern (22 bp): TAACCTCCTTACGAAATTTTAT Found at i:15808 original size:22 final size:22 Alignment explanation

Indices: 15780--15853 Score: 71 Period size: 22 Copynumber: 3.4 Consensus size: 22 15770 ATTTCACTAT * 15780 AAATTTTGATAAACTCATTATG 1 AAATTTTGATAAACTCATAATG * * 15802 AAATTTT-ATTAACTACACAATG 1 AAATTTTGATAAACT-CATAATG * * 15824 AAAATTTGATAACCTTC-TAATG 1 AAATTTTGATAAAC-TCATAATG 15846 AAATTTTG 1 AAATTTTG 15854 GTAACCATAC Statistics Matches: 41, Mismatches: 8, Indels: 6 0.75 0.15 0.11 Matches are distributed among these distances: 21 6 0.15 22 29 0.71 23 5 0.12 24 1 0.02 ACGTcount: A:0.42, C:0.11, G:0.08, T:0.39 Consensus pattern (22 bp): AAATTTTGATAAACTCATAATG Found at i:15868 original size:22 final size:22 Alignment explanation

Indices: 15775--15880 Score: 69 Period size: 22 Copynumber: 4.9 Consensus size: 22 15765 TGTTAATTTC 15775 ACTAT-AAATTTTGATAAACTCAT 1 ACTATGAAATTTTGAT-AAC-CAT * * 15798 --TATGAAATTTT-ATTAACTAC 1 ACTATGAAATTTTGA-TAACCAT * * 15818 ACAATGAAAATTTGATAACC-T 1 ACTATGAAATTTTGATAACCAT * * 15839 TCTAATGAAATTTTGGTAACCAT 1 ACT-ATGAAATTTTGATAACCAT * * 15862 ACTATGAATTTTTTATAAC 1 ACTATGAAATTTTGATAAC 15881 ATCTCCTTCT Statistics Matches: 62, Mismatches: 14, Indels: 15 0.68 0.15 0.16 Matches are distributed among these distances: 20 1 0.02 21 8 0.13 22 49 0.79 23 4 0.06 ACGTcount: A:0.41, C:0.12, G:0.08, T:0.40 Consensus pattern (22 bp): ACTATGAAATTTTGATAACCAT Found at i:15921 original size:22 final size:22 Alignment explanation

Indices: 15896--16025 Score: 86 Period size: 22 Copynumber: 6.0 Consensus size: 22 15886 CTTCTGAAAT * 15896 ACCACATTATAAAATTTTGATA 1 ACCACACTATAAAATTTTGATA * 15918 ACCACACTATGAAATTTTGATA 1 ACCACACTATAAAATTTTGATA * * * * * 15940 ATCTCCCTCTAAAATTTTGTTA 1 ACCACACTATAAAATTTTGATA * * * * 15962 A--TCTCTATGAAATTGTGATA 1 ACCACACTATAAAATTTTGATA ** * * 15982 ATTACACTATGAAATTTTGGTA 1 ACCACACTATAAAATTTTGATA * * 16004 ACAACACT-TGAAATTTTGATA 1 ACCACACTATAAAATTTTGATA 16025 A 1 A 16026 GCTCACTCTA Statistics Matches: 86, Mismatches: 20, Indels: 5 0.77 0.18 0.05 Matches are distributed among these distances: 20 15 0.17 21 13 0.15 22 58 0.67 ACGTcount: A:0.38, C:0.15, G:0.09, T:0.38 Consensus pattern (22 bp): ACCACACTATAAAATTTTGATA Found at i:16113 original size:22 final size:22 Alignment explanation

Indices: 16086--16328 Score: 150 Period size: 22 Copynumber: 10.8 Consensus size: 22 16076 TTAATCAGAG 16086 TATGAAATTTT-AGTAACCTCCC 1 TATGAAATTTTGA-TAACCTCCC * * 16108 TGTGAAATTTTGATAACCTTCC 1 TATGAAATTTTGATAACCTCCC * * * * 16130 AATGGAATTCTGATAACCTCCT 1 TATGAAATTTTGATAACCTCCC * * * 16152 TATAAAATTTTAATAACCTCCA 1 TATGAAATTTTGATAACCTCCC * ** 16174 TATAAAATTTTGATAATATCCC 1 TATGAAATTTTGATAACCTCCC * * * 16196 TGTGAAATTTAATTTTAATAACCTGCC 1 TATG--A---AATTTTGATAACCTCCC * 16223 TATGAAATTTTGATAACATCCC 1 TATGAAATTTTGATAACCTCCC * ** 16245 -ATGAAATTGTGATAA-CTACAT 1 TATGAAATTTTGATAACCT-CCC * * * 16266 TATAAAATTTTGATATCCTAACC 1 TATGAAATTTTGATAACCT-CCC * * ** 16289 TATGAAATTTTGGTAACCACAT 1 TATGAAATTTTGATAACCTCCC * 16311 TATAAAATTTTGATAACC 1 TATGAAATTTTGATAACC 16329 ACTAAGACAT Statistics Matches: 165, Mismatches: 47, Indels: 18 0.72 0.20 0.08 Matches are distributed among these distances: 20 1 0.01 21 15 0.09 22 112 0.68 23 19 0.12 24 1 0.01 25 1 0.01 27 16 0.10 ACGTcount: A:0.37, C:0.16, G:0.09, T:0.37 Consensus pattern (22 bp): TATGAAATTTTGATAACCTCCC Found at i:16217 original size:49 final size:48 Alignment explanation

Indices: 16157--16252 Score: 140 Period size: 49 Copynumber: 2.0 Consensus size: 48 16147 CTCCTTATAA * * 16157 AATTTTAATAACCT-CCATATAAAATTTTGATAATATCCCTGTGAAATTT 1 AATTTTAATAACCTGCC-TATAAAATTTTGATAACATCCC-ATGAAATTT * 16206 AATTTTAATAACCTGCCTATGAAATTTTGATAACATCCCATGAAATT 1 AATTTTAATAACCTGCCTATAAAATTTTGATAACATCCCATGAAATT 16253 GTGATAACTA Statistics Matches: 43, Mismatches: 3, Indels: 3 0.88 0.06 0.06 Matches are distributed among these distances: 48 7 0.16 49 34 0.79 50 2 0.05 ACGTcount: A:0.39, C:0.16, G:0.07, T:0.39 Consensus pattern (48 bp): AATTTTAATAACCTGCCTATAAAATTTTGATAACATCCCATGAAATTT Found at i:16250 original size:21 final size:22 Alignment explanation

Indices: 16206--16260 Score: 67 Period size: 21 Copynumber: 2.5 Consensus size: 22 16196 TGTGAAATTT * * * 16206 AATTTTAATAACCTGCCTATGA 1 AATTTTGATAACATGCCCATGA 16228 AATTTTGATAACAT-CCCATGA 1 AATTTTGATAACATGCCCATGA * 16249 AATTGTGATAAC 1 AATTTTGATAAC 16261 TACATTATAA Statistics Matches: 29, Mismatches: 4, Indels: 1 0.85 0.12 0.03 Matches are distributed among these distances: 21 17 0.59 22 12 0.41 ACGTcount: A:0.38, C:0.16, G:0.11, T:0.35 Consensus pattern (22 bp): AATTTTGATAACATGCCCATGA Found at i:16319 original size:115 final size:115 Alignment explanation

Indices: 16091--16320 Score: 243 Period size: 115 Copynumber: 2.0 Consensus size: 115 16081 CAGAGTATGA * * * * * 16091 AATTTTAGTAACCTCCCTGTGAAATTTTGATAACCTTCCAATGGAATTCTGATAACCTCCTTATA 1 AATTTTAATAACCTCCCTATGAAATTTTGATAACCATCCAATGAAATTCTGATAACCTCATTATA * * * * 16156 AAATTTTAATAACCTCCATATAAAATTTTGATAATATCCCTGTGAAATTT 66 AAATTTTAATAACCTCCATATAAAATTTTGATAACATCACTATAAAATTT * * * 16206 AATTTTAATAACCTGCCTATGAAATTTTGATAA-CATCCCATGAAATTGTGATAA-CTACATTAT 1 AATTTTAATAACCTCCCTATGAAATTTTGATAACCATCCAATGAAATTCTGATAACCT-CATTAT * * * * * 16269 AAAATTTTGATATCCTAACC-TATGAAATTTTGGTAACCA-CATTATAAAATTT 65 AAAATTTTAATAACCT--CCATATAAAATTTTGATAA-CATCACTATAAAATTT 16321 TGATAACCAC Statistics Matches: 94, Mismatches: 17, Indels: 8 0.79 0.14 0.07 Matches are distributed among these distances: 113 2 0.02 114 36 0.38 115 53 0.56 116 3 0.03 ACGTcount: A:0.37, C:0.17, G:0.09, T:0.38 Consensus pattern (115 bp): AATTTTAATAACCTCCCTATGAAATTTTGATAACCATCCAATGAAATTCTGATAACCTCATTATA AAATTTTAATAACCTCCATATAAAATTTTGATAACATCACTATAAAATTT Found at i:16760 original size:23 final size:23 Alignment explanation

Indices: 16734--16777 Score: 54 Period size: 23 Copynumber: 1.9 Consensus size: 23 16724 TTAAATCTAA * 16734 TATCCTTATACCT-ATTTTATTTT 1 TATCATTATA-CTAATTTTATTTT * 16757 TATCATTCTACTAATTTTATT 1 TATCATTATACTAATTTTATT 16778 AAAAAACTTA Statistics Matches: 18, Mismatches: 2, Indels: 2 0.82 0.09 0.09 Matches are distributed among these distances: 22 2 0.11 23 16 0.89 ACGTcount: A:0.25, C:0.16, G:0.00, T:0.59 Consensus pattern (23 bp): TATCATTATACTAATTTTATTTT Found at i:21592 original size:15 final size:15 Alignment explanation

Indices: 21574--21608 Score: 54 Period size: 15 Copynumber: 2.3 Consensus size: 15 21564 ACTAGTATAT 21574 ATAAATAAATA-ATAA 1 ATAAATAAATATA-AA 21589 ATAAATAAATATAAA 1 ATAAATAAATATAAA 21604 ATAAA 1 ATAAA 21609 ATAAGAAGCC Statistics Matches: 19, Mismatches: 0, Indels: 2 0.90 0.00 0.10 Matches are distributed among these distances: 15 18 0.95 16 1 0.05 ACGTcount: A:0.74, C:0.00, G:0.00, T:0.26 Consensus pattern (15 bp): ATAAATAAATATAAA Found at i:25948 original size:17 final size:17 Alignment explanation

Indices: 25928--25961 Score: 50 Period size: 17 Copynumber: 2.0 Consensus size: 17 25918 GCTAATAATA 25928 ATTATGAAATAATTATT 1 ATTATGAAATAATTATT ** 25945 ATTATTCAATAATTATT 1 ATTATGAAATAATTATT 25962 CCTCAATTTC Statistics Matches: 15, Mismatches: 2, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 17 15 1.00 ACGTcount: A:0.44, C:0.03, G:0.03, T:0.50 Consensus pattern (17 bp): ATTATGAAATAATTATT Found at i:27179 original size:8 final size:9 Alignment explanation

Indices: 27159--27203 Score: 62 Period size: 8 Copynumber: 5.4 Consensus size: 9 27149 AGCTAATTCA 27159 ACTATATAT 1 ACTATATAT 27168 ACTATA-AT 1 ACTATATAT 27176 ACTATA-AT 1 ACTATATAT 27184 A-TATATAT 1 ACTATATAT 27192 A-TATATAT 1 ACTATATAT 27200 ACTA 1 ACTA 27204 GTAAATACAC Statistics Matches: 34, Mismatches: 0, Indels: 4 0.89 0.00 0.11 Matches are distributed among these distances: 7 4 0.12 8 22 0.65 9 8 0.24 ACGTcount: A:0.49, C:0.09, G:0.00, T:0.42 Consensus pattern (9 bp): ACTATATAT Found at i:38694 original size:98 final size:100 Alignment explanation

Indices: 38561--38747 Score: 306 Period size: 98 Copynumber: 1.9 Consensus size: 100 38551 ATATAATGTA 38561 ATAATGTAATACAAAGTAAATACAAGTATTACTTTTTCGTTGTGCATATATATGTCATTCTTATG 1 ATAATGTAATA-AAAGTAAATACAAGTATTACTTTTTCGTTGTGCATATATATGTCATTCTTATG * 38626 ACATGCACACACAACCAAACTAGCATTATATATATT 65 ACATGCACACACAACAAAACTAGCATTATATATATT * * * 38662 ATAATGTAAT-AATGT-AATACAAGTATTACTTTTTCGTTGTGCATCTGTATGTCATTCTTATGA 1 ATAATGTAATAAAAGTAAATACAAGTATTACTTTTTCGTTGTGCATATATATGTCATTCTTATGA * 38725 CATGCACACACAACAAAATTAGC 66 CATGCACACACAACAAAACTAGC 38748 TAAGGTTTTT Statistics Matches: 81, Mismatches: 5, Indels: 3 0.91 0.06 0.03 Matches are distributed among these distances: 98 67 0.83 99 4 0.05 101 10 0.12 ACGTcount: A:0.37, C:0.16, G:0.11, T:0.36 Consensus pattern (100 bp): ATAATGTAATAAAAGTAAATACAAGTATTACTTTTTCGTTGTGCATATATATGTCATTCTTATGA CATGCACACACAACAAAACTAGCATTATATATATT Found at i:40229 original size:29 final size:29 Alignment explanation

Indices: 40195--40294 Score: 87 Period size: 29 Copynumber: 3.4 Consensus size: 29 40185 ATAACATTAA 40195 GCCCTTATTTGGCCAAATTAAAAGATCGG 1 GCCCTTATTTGGCCAAATTAAAAGATCGG ** ** 40224 GCCCTTATTTGAG-CATTTTCAATAACG-TTAG 1 GCCCTTATTTG-GCCAAATT-AA-AA-GATCGG * * 40255 GCCCTTATTTGGCCAAATTAAAATATCGA 1 GCCCTTATTTGGCCAAATTAAAAGATCGG * 40284 GCCCCTATTTG 1 GCCCTTATTTG 40295 AGTAATTAGC Statistics Matches: 54, Mismatches: 11, Indels: 12 0.70 0.14 0.16 Matches are distributed among these distances: 29 28 0.52 30 6 0.11 31 19 0.35 32 1 0.02 ACGTcount: A:0.28, C:0.22, G:0.17, T:0.33 Consensus pattern (29 bp): GCCCTTATTTGGCCAAATTAAAAGATCGG Found at i:40259 original size:60 final size:60 Alignment explanation

Indices: 40163--40296 Score: 205 Period size: 60 Copynumber: 2.2 Consensus size: 60 40153 TTCAACAACA * * 40163 GGCCCTTTTTTGAGCATTTTCCATAACATTAAGCCCTTATTTGGCCAAATTAAAAGATCG 1 GGCCCTTATTTGAGCATTTTCAATAACATTAAGCCCTTATTTGGCCAAATTAAAAGATCG * * * 40223 GGCCCTTATTTGAGCATTTTCAATAACGTTAGGCCCTTATTTGGCCAAATTAAAATATCG 1 GGCCCTTATTTGAGCATTTTCAATAACATTAAGCCCTTATTTGGCCAAATTAAAAGATCG * * 40283 AGCCCCTATTTGAG 1 GGCCCTTATTTGAG 40297 TAATTAGCCT Statistics Matches: 67, Mismatches: 7, Indels: 0 0.91 0.09 0.00 Matches are distributed among these distances: 60 67 1.00 ACGTcount: A:0.28, C:0.22, G:0.16, T:0.34 Consensus pattern (60 bp): GGCCCTTATTTGAGCATTTTCAATAACATTAAGCCCTTATTTGGCCAAATTAAAAGATCG Found at i:40261 original size:31 final size:31 Alignment explanation

Indices: 40162--40265 Score: 90 Period size: 31 Copynumber: 3.4 Consensus size: 31 40152 TTTCAACAAC * * * 40162 AGGCCCTTTTTTGAGCATTTTCCATAACATT 1 AGGCCCTTATTTGAGCATTTTCAATAACGTT * ** * 40193 AAGCCCTTATTTG-GCCAAATT-AA-AA-GATC 1 AGGCCCTTATTTGAG-CATTTTCAATAACG-TT * 40222 GGGCCCTTATTTGAGCATTTTCAATAACGTT 1 AGGCCCTTATTTGAGCATTTTCAATAACGTT 40253 AGGCCCTTATTTG 1 AGGCCCTTATTTG 40266 GCCAAATTAA Statistics Matches: 54, Mismatches: 13, Indels: 12 0.68 0.16 0.15 Matches are distributed among these distances: 29 18 0.33 30 5 0.09 31 30 0.56 32 1 0.02 ACGTcount: A:0.26, C:0.21, G:0.16, T:0.37 Consensus pattern (31 bp): AGGCCCTTATTTGAGCATTTTCAATAACGTT Found at i:40884 original size:21 final size:21 Alignment explanation

Indices: 40846--40890 Score: 54 Period size: 21 Copynumber: 2.1 Consensus size: 21 40836 ATAGTTTAGA ** 40846 TTTAATTTAATTTGTTTTATT 1 TTTAATTTAATTACTTTTATT * * 40867 TTTAGTTTAATTACTTTTCTT 1 TTTAATTTAATTACTTTTATT 40888 TTT 1 TTT 40891 TATTATTTTT Statistics Matches: 20, Mismatches: 4, Indels: 0 0.83 0.17 0.00 Matches are distributed among these distances: 21 20 1.00 ACGTcount: A:0.20, C:0.04, G:0.04, T:0.71 Consensus pattern (21 bp): TTTAATTTAATTACTTTTATT Found at i:40972 original size:54 final size:54 Alignment explanation

Indices: 40905--41019 Score: 203 Period size: 54 Copynumber: 2.1 Consensus size: 54 40895 ATTTTTTTAA * 40905 TCCCCTTTATTCTTGAGTGATTCTTAGAGAAGATTCATCTTTAGATAAGTCTAT 1 TCCCCTTTATTCTTGAGTGATTCTTAGAAAAGATTCATCTTTAGATAAGTCTAT * * 40959 TCCCCTTTGTTCTTGAGTGATTCTTAGAAAAGATTTATCTTTAGATAAGTCTAT 1 TCCCCTTTATTCTTGAGTGATTCTTAGAAAAGATTCATCTTTAGATAAGTCTAT 41013 TCCCCTT 1 TCCCCTT 41020 CTCAGTGGGA Statistics Matches: 58, Mismatches: 3, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 54 58 1.00 ACGTcount: A:0.24, C:0.18, G:0.14, T:0.43 Consensus pattern (54 bp): TCCCCTTTATTCTTGAGTGATTCTTAGAAAAGATTCATCTTTAGATAAGTCTAT Found at i:41194 original size:17 final size:17 Alignment explanation

Indices: 41169--41205 Score: 56 Period size: 17 Copynumber: 2.2 Consensus size: 17 41159 TCTCTAAGAA * 41169 TTTTCTTTTATTTTTTG 1 TTTTCTTTTAATTTTTG * 41186 TTTTTTTTTAATTTTTG 1 TTTTCTTTTAATTTTTG 41203 TTT 1 TTT 41206 AAGAACTTTT Statistics Matches: 18, Mismatches: 2, Indels: 0 0.90 0.10 0.00 Matches are distributed among these distances: 17 18 1.00 ACGTcount: A:0.08, C:0.03, G:0.05, T:0.84 Consensus pattern (17 bp): TTTTCTTTTAATTTTTG Found at i:43891 original size:21 final size:22 Alignment explanation

Indices: 43867--43907 Score: 66 Period size: 21 Copynumber: 1.9 Consensus size: 22 43857 AAATTAAATG * 43867 CAATTTGACCCCTG-TTTTATT 1 CAATTTGACCACTGATTTTATT 43888 CAATTTGACCACTGATTTTA 1 CAATTTGACCACTGATTTTA 43908 GAAATTATGC Statistics Matches: 18, Mismatches: 1, Indels: 1 0.90 0.05 0.05 Matches are distributed among these distances: 21 13 0.72 22 5 0.28 ACGTcount: A:0.24, C:0.22, G:0.10, T:0.44 Consensus pattern (22 bp): CAATTTGACCACTGATTTTATT Found at i:45780 original size:5 final size:5 Alignment explanation

Indices: 45770--45805 Score: 51 Period size: 5 Copynumber: 7.8 Consensus size: 5 45760 AAATGGTTGC 45770 TTTGT TTTG- TTTGT TTTG- TTTGT TTTG- TTTGT TTTG 1 TTTGT TTTGT TTTGT TTTGT TTTGT TTTGT TTTGT TTTG 45806 ATGTTTATTG Statistics Matches: 28, Mismatches: 0, Indels: 6 0.82 0.00 0.18 Matches are distributed among these distances: 4 12 0.43 5 16 0.57 ACGTcount: A:0.00, C:0.00, G:0.22, T:0.78 Consensus pattern (5 bp): TTTGT Found at i:45784 original size:9 final size:9 Alignment explanation

Indices: 45770--45805 Score: 72 Period size: 9 Copynumber: 4.0 Consensus size: 9 45760 AAATGGTTGC 45770 TTTGTTTTG 1 TTTGTTTTG 45779 TTTGTTTTG 1 TTTGTTTTG 45788 TTTGTTTTG 1 TTTGTTTTG 45797 TTTGTTTTG 1 TTTGTTTTG 45806 ATGTTTATTG Statistics Matches: 27, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 9 27 1.00 ACGTcount: A:0.00, C:0.00, G:0.22, T:0.78 Consensus pattern (9 bp): TTTGTTTTG Found at i:49445 original size:15 final size:15 Alignment explanation

Indices: 49414--49447 Score: 52 Period size: 15 Copynumber: 2.3 Consensus size: 15 49404 ACATGCTTTC 49414 AAAAGATTTTTCGAA 1 AAAAGATTTTTCGAA 49429 AAAATGATTTTTC-AA 1 AAAA-GATTTTTCGAA 49444 AAAA 1 AAAA 49448 ACAAACAAAA Statistics Matches: 18, Mismatches: 0, Indels: 2 0.90 0.00 0.10 Matches are distributed among these distances: 15 10 0.56 16 8 0.44 ACGTcount: A:0.53, C:0.06, G:0.09, T:0.32 Consensus pattern (15 bp): AAAAGATTTTTCGAA Found at i:49448 original size:16 final size:15 Alignment explanation

Indices: 49414--49448 Score: 52 Period size: 16 Copynumber: 2.3 Consensus size: 15 49404 ACATGCTTTC 49414 AAAAGATTTTTCGAAA 1 AAAAGATTTTTC-AAA * 49430 AAATGATTTTTCAAA 1 AAAAGATTTTTCAAA 49445 AAAA 1 AAAA 49449 CAAACAAAAA Statistics Matches: 17, Mismatches: 2, Indels: 1 0.85 0.10 0.05 Matches are distributed among these distances: 15 6 0.35 16 11 0.65 ACGTcount: A:0.54, C:0.06, G:0.09, T:0.31 Consensus pattern (15 bp): AAAAGATTTTTCAAA Found at i:51095 original size:27 final size:27 Alignment explanation

Indices: 51057--51131 Score: 132 Period size: 27 Copynumber: 2.8 Consensus size: 27 51047 CCCTGTTCAG * 51057 TTTATTTCAGTTGATCTAGGGCGATCT 1 TTTATTTCAGTTGATCCAGGGCGATCT 51084 TTTATTTCAGTTGATCCAGGGCGATCT 1 TTTATTTCAGTTGATCCAGGGCGATCT * 51111 TTTATTTCAGTTGACCCAGGG 1 TTTATTTCAGTTGATCCAGGG 51132 TGGTCCTTAT Statistics Matches: 46, Mismatches: 2, Indels: 0 0.96 0.04 0.00 Matches are distributed among these distances: 27 46 1.00 ACGTcount: A:0.19, C:0.17, G:0.23, T:0.41 Consensus pattern (27 bp): TTTATTTCAGTTGATCCAGGGCGATCT Found at i:51205 original size:20 final size:21 Alignment explanation

Indices: 51180--51220 Score: 57 Period size: 20 Copynumber: 2.0 Consensus size: 21 51170 TCTTTCCTTC 51180 AGTTTACTTC-AGTTAATCCA 1 AGTTTACTTCAAGTTAATCCA * * 51200 AGTTTATTTCAAGTTGATCCA 1 AGTTTACTTCAAGTTAATCCA 51221 GGGCGATCCT Statistics Matches: 18, Mismatches: 2, Indels: 1 0.86 0.10 0.05 Matches are distributed among these distances: 20 9 0.50 21 9 0.50 ACGTcount: A:0.29, C:0.17, G:0.12, T:0.41 Consensus pattern (21 bp): AGTTTACTTCAAGTTAATCCA Found at i:51242 original size:57 final size:57 Alignment explanation

Indices: 51144--51257 Score: 160 Period size: 57 Copynumber: 2.0 Consensus size: 57 51134 GTCCTTATTC 51144 AGTTTATTTCAATTGATCCAGGGCGATCTTTCCTTCAGTTTACTT-CAGTTAATCCA 1 AGTTTATTTCAATTGATCCAGGGCGATCTTTCCTTCAGTTTACTTGCAGTTAATCCA ** * * 51200 AGTTTATTTCAAGTTGATCCAGGGCGATCCTTT-CTTTGGTTTATTTGCAGTTGATCCA 1 AGTTTATTTCAA-TTGATCCAGGGCGAT-CTTTCCTTCAGTTTACTTGCAGTTAATCCA 51258 GAGCGATCTT Statistics Matches: 51, Mismatches: 4, Indels: 4 0.86 0.07 0.07 Matches are distributed among these distances: 56 12 0.24 57 25 0.49 58 14 0.27 ACGTcount: A:0.21, C:0.19, G:0.18, T:0.42 Consensus pattern (57 bp): AGTTTATTTCAATTGATCCAGGGCGATCTTTCCTTCAGTTTACTTGCAGTTAATCCA Found at i:51267 original size:94 final size:91 Alignment explanation

Indices: 51111--51279 Score: 221 Period size: 94 Copynumber: 1.8 Consensus size: 91 51101 AGGGCGATCT * * * 51111 TTTATTTCAGTTGACCCAGGGTGGTCCTTATTCAGTTTATTTCAATTGATCCAGGGCGATCTTTC 1 TTTATTTCAGTTGACCCAGGGCGATCCTTATTCAGTTTATTTCAATTGATCCAGAGCGATCTTTC * 51176 CTTCAGTTTACTTCAGTTAATCCAAG 66 CTTCAATTTACTTCAGTTAATCCAAG * * ** * 51202 TTTATTTCAAGTTGATCCAGGGCGATCCTTTCTTTGGTTTATTTGCAGTTGATCCAGAGCGATCT 1 TTTATTTC-AGTTGACCCAGGGCGATCC-TTATTCAGTTTATTT-CAATTGATCCAGAGCGATCT * 51267 TTTCTTCAATTTA 63 TTCCTTCAATTTA 51280 TTGTAGTTGA Statistics Matches: 65, Mismatches: 10, Indels: 3 0.83 0.13 0.04 Matches are distributed among these distances: 91 8 0.12 92 16 0.25 93 12 0.18 94 29 0.45 ACGTcount: A:0.20, C:0.20, G:0.18, T:0.43 Consensus pattern (91 bp): TTTATTTCAGTTGACCCAGGGCGATCCTTATTCAGTTTATTTCAATTGATCCAGAGCGATCTTTC CTTCAATTTACTTCAGTTAATCCAAG Found at i:51300 original size:36 final size:36 Alignment explanation

Indices: 51202--51584 Score: 297 Period size: 36 Copynumber: 10.4 Consensus size: 36 51192 TTAATCCAAG * * *** 51202 TTTATTTCAAGTTGATCCAGGGCGATCCTTTCTTTGG 1 TTTATTTC-AGTTGATCCAGGGTGATCTTTTCTTCAA * * 51239 TTTATTTGCAGTTGATCCAGAGCGATCTTTTCTTCAA 1 TTTATTT-CAGTTGATCCAGGGTGATCTTTTCTTCAA * * 51276 TTTATTGT-AGTTGATCTAGGGCGATCTTTTTCTTCATCA 1 TTTATT-TCAGTTGATCCAGGGTGATC-TTTTCTTCA--A * * * * * * 51315 GTTTATTTCGATTTGACCCAGGGTGGTC-CTTGTTCAG 1 -TTTATTTC-AGTTGATCCAGGGTGATCTTTTCTTCAA * * 51352 TTTATTTCAATTGAGCCAGGGTGATCTTTTCTTCAA 1 TTTATTTCAGTTGATCCAGGGTGATCTTTTCTTCAA * * * 51388 TTTATTGCAGTTGATCCAGGGCGATTTTTTCTTCATCA 1 TTTATTTCAGTTGATCCAGGGTGATCTTTTCTTCA--A * * * * * 51426 GTTTATTTCAATTTGACCCAGGGTGGTC-CTTGTTCAA 1 -TTTATTTC-AGTTGATCCAGGGTGATCTTTTCTTCAA * 51463 TTTATTTCAATTGATCCAGGGTGATCTTTTCTTCAA 1 TTTATTTCAGTTGATCCAGGGTGATCTTTTCTTCAA * * * * * 51499 TTTATTTTAATGGATCCAGGGTGGTCTTTTCTTCAG 1 TTTATTTCAGTTGATCCAGGGTGATCTTTTCTTCAA * ** * * 51535 TTTATTTCAATCAATCC-GAGGTGATCTTTTCTGCAG 1 TTTATTTCAGTTGATCCAG-GGTGATCTTTTCTTCAA 51571 TTTATTTCAGTTGA 1 TTTATTTCAGTTGA 51585 CCGAAAAATT Statistics Matches: 280, Mismatches: 51, Indels: 31 0.77 0.14 0.09 Matches are distributed among these distances: 35 31 0.11 36 147 0.52 37 46 0.16 38 3 0.01 39 21 0.08 40 19 0.07 41 13 0.05 ACGTcount: A:0.19, C:0.17, G:0.19, T:0.45 Consensus pattern (36 bp): TTTATTTCAGTTGATCCAGGGTGATCTTTTCTTCAA Found at i:51457 original size:111 final size:112 Alignment explanation

Indices: 51200--51545 Score: 450 Period size: 111 Copynumber: 3.1 Consensus size: 112 51190 AGTTAATCCA * * * * * ** * * * 51200 AGTTTATTTCAAGTTGATCCAGGGCGATCCTTTCTTTGGTTTATTTGCAGTTGATCCAGAGCGAT 1 AGTTTATTTCAATTTGACCCAGGGTGGTCC-TTGTTCAGTTTATTT-CAATTGATCCAGGGTGAT * * 51265 CTTTTCTTCAATTTATTGTAGTTGATCTAGGGCGATCTTTTTCTTCATC 64 CTTTTCTTCAATTTATTGCAGTTGATCCAGGGCGATCTTTTTCTTCATC * * 51314 AGTTTATTTCGATTTGACCCAGGGTGGTCCTTGTTCAGTTTATTTCAATTGAGCCAGGGTGATCT 1 AGTTTATTTCAATTTGACCCAGGGTGGTCCTTGTTCAGTTTATTTCAATTGATCCAGGGTGATCT 51379 TTTCTTCAATTTATTGCAGTTGATCCAGGGCGAT-TTTTTCTTCATC 66 TTTCTTCAATTTATTGCAGTTGATCCAGGGCGATCTTTTTCTTCATC * 51425 AGTTTATTTCAATTTGACCCAGGGTGGTCCTTGTTCAATTTATTTCAATTGATCCAGGGTGATCT 1 AGTTTATTTCAATTTGACCCAGGGTGGTCCTTGTTCAGTTTATTTCAATTGATCCAGGGTGATCT ** * * * * 51490 TTTCTTCAATTTATTTTAATGGATCCAGGGTGGTC-TTTTC-T--TC 66 TTTCTTCAATTTATTGCAGTTGATCCAGGGCGATCTTTTTCTTCATC 51533 AGTTTATTTCAAT 1 AGTTTATTTCAAT 51546 CAATCCGAGG Statistics Matches: 208, Mismatches: 23, Indels: 8 0.87 0.10 0.03 Matches are distributed among these distances: 108 15 0.07 110 1 0.00 111 107 0.51 112 48 0.23 113 12 0.06 114 25 0.12 ACGTcount: A:0.19, C:0.17, G:0.19, T:0.45 Consensus pattern (112 bp): AGTTTATTTCAATTTGACCCAGGGTGGTCCTTGTTCAGTTTATTTCAATTGATCCAGGGTGATCT TTTCTTCAATTTATTGCAGTTGATCCAGGGCGATCTTTTTCTTCATC Found at i:51874 original size:30 final size:30 Alignment explanation

Indices: 51838--52289 Score: 764 Period size: 30 Copynumber: 15.0 Consensus size: 30 51828 TGCTTCATAA 51838 CTTTATTTTAATCCTGGTTT-AGGATCATTG 1 CTTTATTTTAATCCT-GTTTGAGGATCATTG 51868 CTTTATTTTTAATCCTGTTTGAGGATCATTG 1 CTTTA-TTTTAATCCTGTTTGAGGATCATTG 51899 CTTTATTTTAATCCTGTTTGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG * 51929 CTTTATTTTAATCCTATTTGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG * * 51959 CTTTATTTTAATCCTGATCGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG * 51989 CTTCATTTTAATCCTGTTTGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG 52019 CTTTATTTTTAATCCTGTTTGAGGATCATTG 1 CTTTA-TTTTAATCCTGTTTGAGGATCATTG 52050 CTTTATTTTAATCCTGTTTGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG * 52080 CTTCATTTTAATCCTGTTTGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG 52110 CTTTATTTTAATCCTGTTTGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG 52140 CTTTATTTTAATCCTGTTTGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG * * 52170 CTTCATTTTAATCATGTTTGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG * 52200 CTTTATTTTAATCCTGGTTGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG * * 52230 CTTCATTTTAATCATGTTTGAGGATCATTG 1 CTTTATTTTAATCCTGTTTGAGGATCATTG 52260 CTTTATTTTAATCCTGGTTT-AGGATCATTG 1 CTTTATTTTAATCCT-GTTTGAGGATCATTG 52290 TTTCATCAGT Statistics Matches: 398, Mismatches: 20, Indels: 8 0.93 0.05 0.02 Matches are distributed among these distances: 30 339 0.85 31 59 0.15 ACGTcount: A:0.21, C:0.14, G:0.17, T:0.49 Consensus pattern (30 bp): CTTTATTTTAATCCTGTTTGAGGATCATTG Found at i:54059 original size:21 final size:22 Alignment explanation

Indices: 54035--54075 Score: 75 Period size: 21 Copynumber: 1.9 Consensus size: 22 54025 AAATTAAATG 54035 CAATTTGACCCCTG-TTTTATT 1 CAATTTGACCCCTGATTTTATT 54056 CAATTTGACCCCTGATTTTA 1 CAATTTGACCCCTGATTTTA 54076 GAAATTATGC Statistics Matches: 19, Mismatches: 0, Indels: 1 0.95 0.00 0.05 Matches are distributed among these distances: 21 14 0.74 22 5 0.26 ACGTcount: A:0.22, C:0.24, G:0.10, T:0.44 Consensus pattern (22 bp): CAATTTGACCCCTGATTTTATT Found at i:54081 original size:21 final size:21 Alignment explanation

Indices: 54036--54081 Score: 58 Period size: 21 Copynumber: 2.2 Consensus size: 21 54026 AATTAAATGC ** 54036 AATTTGACCCCTGTTTTATTC 1 AATTTGACCCCTGTTTTATGA 54057 AATTTGACCCCTGATTTTA-GA 1 AATTTGACCCCTG-TTTTATGA 54078 AATT 1 AATT 54082 ATGCTAAAAT Statistics Matches: 22, Mismatches: 2, Indels: 2 0.85 0.08 0.08 Matches are distributed among these distances: 21 17 0.77 22 5 0.23 ACGTcount: A:0.26, C:0.20, G:0.11, T:0.43 Consensus pattern (21 bp): AATTTGACCCCTGTTTTATGA Done.