Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: NTFQ01002737.1 Kokia drynarioides strain JFW-HI SEQ_115041, whole genome shotgun sequence

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 73847
ACGTcount: A:0.34, C:0.17, G:0.15, T:0.33


Found at i:2533 original size:29 final size:26

Alignment explanation

Indices: 2501--2583 Score: 67 Period size: 29 Copynumber: 2.9 Consensus size: 26 2491 TTCAAGGTAG 2501 AAATGAAATTTTTCAAAATTTTGGAGCA 1 AAAT-AAATTTTTCAAAATTTTGGAG-A * * 2529 TAAATAAAATTTTCTAAAAGTTTTAAGAGA 1 -AAATAAATTTTTC-AAAA-TTTT-GGAGA * 2559 AAATCATAATTTCTCAAAATTTTGG 1 AAAT-A-AATTTTTCAAAATTTTGG 2584 GGGCTAAAGC Statistics Matches: 44, Mismatches: 5, Indels: 11 0.73 0.08 0.18 Matches are distributed among these distances: 28 9 0.20 29 16 0.36 30 10 0.23 31 9 0.20 ACGTcount: A:0.45, C:0.07, G:0.11, T:0.37 Consensus pattern (26 bp): AAATAAATTTTTCAAAATTTTGGAGA Found at i:7353 original size:86 final size:83 Alignment explanation

Indices: 7214--7409 Score: 270 Period size: 86 Copynumber: 2.3 Consensus size: 83 7204 ACTTTTATAT * * * * * 7214 GTCTAAAATCAAACTAAATCAACCC--CAAAAATTTTCTAGAAATAACTATTCAAAGATCATGAA 1 GTCTAACAGCAAACTAAATCAACCCAAAAAAAAATTTCTAGAAATAACTATTCAAAAATCATGAA * 7277 AAGTCGTCACCTGTACAT 66 AAGTCGTCACCTGTACAA * 7295 GTCTCTAACAGCAAACTAAATCAACCCAAAAAGAAAATTTCTAGAAATGACTATTCAAAAATCAT 1 G--TCTAACAGCAAACTAAATCAACCCAAAAA-AAAATTTCTAGAAATAACTATTCAAAAATCAT * 7360 GAAAAGTCGTCACCTGTAGAA 63 GAAAAGTCGTCACCTGTACAA * 7381 GTCTAGCAGCAAACTAAATCAACCCAAAA 1 GTCTAACAGCAAACTAAATCAACCCAAAA 7410 GATAACTGAC Statistics Matches: 101, Mismatches: 9, Indels: 7 0.86 0.08 0.06 Matches are distributed among these distances: 81 1 0.01 83 22 0.22 84 27 0.27 85 2 0.02 86 49 0.49 ACGTcount: A:0.46, C:0.21, G:0.10, T:0.22 Consensus pattern (83 bp): GTCTAACAGCAAACTAAATCAACCCAAAAAAAAATTTCTAGAAATAACTATTCAAAAATCATGAA AAGTCGTCACCTGTACAA Found at i:11441 original size:30 final size:30 Alignment explanation

Indices: 11365--11442 Score: 86 Period size: 30 Copynumber: 2.6 Consensus size: 30 11355 TCTATATTTT * 11365 CCAAGTTCAA-AACCAAAATAGATCTAATTA 1 CCAAGTTCAAGAACC-AAATAGACCTAATTA * * * * * 11395 CCCAGTTTAAGTACCAAATGGACCTAGTTA 1 CCAAGTTCAAGAACCAAATAGACCTAATTA 11425 CCAAGTTCAAGAACCAAA 1 CCAAGTTCAAGAACCAAA 11443 AATTACATTA Statistics Matches: 38, Mismatches: 9, Indels: 2 0.78 0.18 0.04 Matches are distributed among these distances: 30 35 0.92 31 3 0.08 ACGTcount: A:0.44, C:0.23, G:0.12, T:0.22 Consensus pattern (30 bp): CCAAGTTCAAGAACCAAATAGACCTAATTA Found at i:28064 original size:4 final size:4 Alignment explanation

Indices: 28055--28084 Score: 51 Period size: 4 Copynumber: 7.5 Consensus size: 4 28045 AGGTACCATT * 28055 TGTA TGTA TGTA TGTA TGCA TGTA TGTA TG 1 TGTA TGTA TGTA TGTA TGTA TGTA TGTA TG 28085 CATGGTGTTG Statistics Matches: 24, Mismatches: 2, Indels: 0 0.92 0.08 0.00 Matches are distributed among these distances: 4 24 1.00 ACGTcount: A:0.23, C:0.03, G:0.27, T:0.47 Consensus pattern (4 bp): TGTA Found at i:28072 original size:12 final size:12 Alignment explanation

Indices: 28055--28088 Score: 59 Period size: 12 Copynumber: 2.8 Consensus size: 12 28045 AGGTACCATT * 28055 TGTATGTATGTA 1 TGTATGCATGTA 28067 TGTATGCATGTA 1 TGTATGCATGTA 28079 TGTATGCATG 1 TGTATGCATG 28089 GTGTTGAATT Statistics Matches: 21, Mismatches: 1, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 12 21 1.00 ACGTcount: A:0.24, C:0.06, G:0.26, T:0.44 Consensus pattern (12 bp): TGTATGCATGTA Found at i:33827 original size:5 final size:5 Alignment explanation

Indices: 33817--33850 Score: 68 Period size: 5 Copynumber: 6.8 Consensus size: 5 33807 ATACGCAAGT 33817 CATTG CATTG CATTG CATTG CATTG CATTG CATT 1 CATTG CATTG CATTG CATTG CATTG CATTG CATT 33851 TTCATTGCCT Statistics Matches: 29, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 5 29 1.00 ACGTcount: A:0.21, C:0.21, G:0.18, T:0.41 Consensus pattern (5 bp): CATTG Found at i:35454 original size:33 final size:33 Alignment explanation

Indices: 35416--35485 Score: 104 Period size: 33 Copynumber: 2.1 Consensus size: 33 35406 CTCTAATCAT * * * 35416 AACCCTAACTCTAAATACCAACACTAACCTTAA 1 AACCCTAACCCTAAACACCAACACTAACCCTAA * 35449 AACCCTAACCCTAAACACCAACCCTAACCCTAA 1 AACCCTAACCCTAAACACCAACACTAACCCTAA 35482 AACC 1 AACC 35486 TTACACTATC Statistics Matches: 33, Mismatches: 4, Indels: 0 0.89 0.11 0.00 Matches are distributed among these distances: 33 33 1.00 ACGTcount: A:0.44, C:0.40, G:0.00, T:0.16 Consensus pattern (33 bp): AACCCTAACCCTAAACACCAACACTAACCCTAA Found at i:35578 original size:26 final size:26 Alignment explanation

Indices: 35529--35578 Score: 73 Period size: 26 Copynumber: 1.9 Consensus size: 26 35519 ACTCTAAACT * 35529 TTAAACCCTATCCCTAACCCTAAAAA 1 TTAAACCCTATCCCTAACACTAAAAA * * 35555 TTAAACCTTATCCCTAACATTAAA 1 TTAAACCCTATCCCTAACACTAAA 35579 CCCTAACCCC Statistics Matches: 21, Mismatches: 3, Indels: 0 0.88 0.12 0.00 Matches are distributed among these distances: 26 21 1.00 ACGTcount: A:0.42, C:0.30, G:0.00, T:0.28 Consensus pattern (26 bp): TTAAACCCTATCCCTAACACTAAAAA Found at i:35603 original size:18 final size:18 Alignment explanation

Indices: 35577--35615 Score: 51 Period size: 18 Copynumber: 2.2 Consensus size: 18 35567 CCTAACATTA * 35577 AACCCTAACCCCAACCAT 1 AACCATAACCCCAACCAT * * 35595 AACCATAACCCTAACCCT 1 AACCATAACCCCAACCAT 35613 AAC 1 AAC 35616 ATTAACCCTA Statistics Matches: 18, Mismatches: 3, Indels: 0 0.86 0.14 0.00 Matches are distributed among these distances: 18 18 1.00 ACGTcount: A:0.41, C:0.46, G:0.00, T:0.13 Consensus pattern (18 bp): AACCATAACCCCAACCAT Found at i:35621 original size:18 final size:19 Alignment explanation

Indices: 35540--35664 Score: 69 Period size: 19 Copynumber: 6.3 Consensus size: 19 35530 TAAACCCTAT * 35540 CCCTAACCCTAAAAATTAAA 1 CCCTAACCCT-AACATTAAA * * 35560 CCTTATCCCTAACATTAAA 1 CCCTAACCCTAACATTAAA * * 35579 CCCTAACCCCAACCATAACCATAA 1 CCCTAACCCTAA-CAT---TA-AA 35603 CCCTAACCCTAACATT-AA 1 CCCTAACCCTAACATTAAA * 35621 CCCTAAACACC-AACACTAAA 1 CCCT-AAC-CCTAACATTAAA * * 35641 CCTTAAACCCT-ACA-CAAA 1 CCCT-AACCCTAACATTAAA 35659 CCCTAA 1 CCCTAA 35665 ATACCAACAC Statistics Matches: 83, Mismatches: 13, Indels: 21 0.71 0.11 0.18 Matches are distributed among these distances: 17 2 0.02 18 12 0.14 19 30 0.36 20 22 0.27 23 4 0.05 24 13 0.16 ACGTcount: A:0.43, C:0.38, G:0.00, T:0.18 Consensus pattern (19 bp): CCCTAACCCTAACATTAAA Found at i:35674 original size:19 final size:19 Alignment explanation

Indices: 35619--35684 Score: 71 Period size: 20 Copynumber: 3.4 Consensus size: 19 35609 CCCTAACATT 35619 AACCCTAAACACCAACACTA 1 AACCCTAAA-ACCAACACTA * * * 35639 AACCTTAAACCCTACAC-A 1 AACCCTAAAACCAACACTA 35657 AACCCTAAATACCAACACTA 1 AACCCTAAA-ACCAACACTA * 35677 AGCCCTAA 1 AACCCTAA 35685 CTTAAACCCT Statistics Matches: 37, Mismatches: 7, Indels: 4 0.77 0.15 0.08 Matches are distributed among these distances: 18 9 0.24 19 12 0.32 20 16 0.43 ACGTcount: A:0.47, C:0.38, G:0.02, T:0.14 Consensus pattern (19 bp): AACCCTAAAACCAACACTA Found at i:35697 original size:7 final size:7 Alignment explanation

Indices: 35687--35856 Score: 59 Period size: 7 Copynumber: 24.9 Consensus size: 7 35677 AGCCCTAACT 35687 TAAACCC 1 TAAACCC * * 35694 TAAATCT 1 TAAACCC * 35701 TAAACAC 1 TAAACCC * 35708 TAAA-CT 1 TAAACCC ** 35714 TAAACAG 1 TAAACCC * 35721 TAAACCA 1 TAAACCC * 35728 TAAACTC 1 TAAACCC * 35735 TAAACAC 1 TAAACCC 35742 TAAA-CC 1 TAAACCC * 35748 TTAACCC 1 TAAACCC * 35755 TAAACAC 1 TAAACCC 35762 -AAACCC 1 TAAACCC 35768 T-AACCC 1 TAAACCC * 35774 TAAAACCT 1 T-AAACCC * 35782 TACCATACCA 1 TA--A-ACCC * 35792 TAAA-CT 1 TAAACCC 35798 TAAACCC 1 TAAACCC 35805 T-AACCC 1 TAAACCC * * 35811 TAAAACA 1 TAAACCC * * 35818 TTAACAC 1 TAAACCC * 35825 TAATCCC 1 TAAACCC * 35832 -CAACCC 1 TAAACCC * 35838 T-AATCC 1 TAAACCC 35844 TAAACCC 1 TAAACCC 35851 TAAACC 1 TAAACC 35857 TTAATCACCA Statistics Matches: 115, Mismatches: 36, Indels: 24 0.66 0.21 0.14 Matches are distributed among these distances: 6 39 0.34 7 64 0.56 8 6 0.05 9 1 0.01 10 5 0.04 ACGTcount: A:0.46, C:0.34, G:0.01, T:0.20 Consensus pattern (7 bp): TAAACCC Found at i:35724 original size:27 final size:27 Alignment explanation

Indices: 35683--35765 Score: 71 Period size: 27 Copynumber: 3.1 Consensus size: 27 35673 ACTAAGCCCT * * * 35683 AACTTAAACCCTAAATCTTAAACACTA 1 AACTTAAACACTAAACCATAAACACTA * * 35710 AACTTAAACAGTAAACCATAAACTCTA 1 AACTTAAACACTAAACCATAAACACTA * * * 35737 AACACTAAAC-CTTAACCCTAAACAC-A 1 AAC-TTAAACACTAAACCATAAACACTA 35763 AAC 1 AAC 35766 CCTAACCCTA Statistics Matches: 45, Mismatches: 10, Indels: 3 0.78 0.17 0.05 Matches are distributed among these distances: 26 4 0.09 27 36 0.80 28 5 0.11 ACGTcount: A:0.51, C:0.28, G:0.01, T:0.20 Consensus pattern (27 bp): AACTTAAACACTAAACCATAAACACTA Found at i:35771 original size:19 final size:20 Alignment explanation

Indices: 35714--36001 Score: 83 Period size: 19 Copynumber: 14.6 Consensus size: 20 35704 ACACTAAACT * * * 35714 TAAACAGTAAACCATAAACTC 1 TAAACACTAAACCCT-AACCC * 35735 TAAACACTAAACCTTAACCC 1 TAAACACTAAACCCTAACCC 35755 TAAACAC-AAACCCTAACCC 1 TAAACACTAAACCCTAACCC * * * 35774 TAAA-ACCT-TACCAT-ACCA 1 TAAACA-CTAAACCCTAACCC * 35792 TAAAC-TTAAACCCTAACCC 1 TAAACACTAAACCCTAACCC * * 35811 TAAA-ACATTAACACTAATCCC 1 TAAACAC-TAAACCCTAA-CCC * * * 35832 -CAACCCT-AATCCTAAACCC 1 TAAACACTAAACCCT-AACCC * * 35851 TAAAC-CTTAATCACC-AACCT 1 TAAACAC-TAAAC-CCTAACCC * ** 35871 TAAAC-CTTAACATTAAACACC 1 TAAACACTAAACCCT-AAC-CC * * 35892 --AAC-CTTAAACCTTAACCT 1 TAAACAC-TAAACCCTAACCC * 35910 TAAACAC-CAACCCTAACCAC 1 TAAACACTAAACCCTAACC-C 35930 -AAATCA-TAAACCCTAACCC 1 TAAA-CACTAAACCCTAACCC * * * 35949 TAAATAC-CAACCTTAACCC 1 TAAACACTAAACCCTAACCC 35968 TTAAAC-CATAAACCCTAACCC 1 -TAAACAC-TAAACCCTAACCC * 35989 TTAAACCCTAAAC 1 -TAAACACTAAAC 36002 ATTAACATTA Statistics Matches: 197, Mismatches: 40, Indels: 60 0.66 0.13 0.20 Matches are distributed among these distances: 17 1 0.01 18 13 0.07 19 70 0.36 20 69 0.35 21 41 0.21 22 3 0.02 ACGTcount: A:0.44, C:0.35, G:0.00, T:0.20 Consensus pattern (20 bp): TAAACACTAAACCCTAACCC Found at i:35940 original size:39 final size:39 Alignment explanation

Indices: 35892--35994 Score: 118 Period size: 39 Copynumber: 2.6 Consensus size: 39 35882 ATTAAACACC * * 35892 AACCTTAAACCTTAACCTTAAACACCAACCCTAACCAC-A 1 AACCATAAACCCTAACCTTAAACACCAACCCTAACC-CTA * * * * 35931 AATCATAAACCCTAACCCTAAATACCAACCTTAACCCTTA 1 AACCATAAACCCTAACCTTAAACACCAACCCTAACCC-TA 35971 AACCATAAACCCTAACCCTTAAAC 1 AACCATAAACCCTAA-CCTTAAAC 35995 CCTAAACATT Statistics Matches: 52, Mismatches: 9, Indels: 4 0.80 0.14 0.06 Matches are distributed among these distances: 38 1 0.02 39 30 0.58 40 15 0.29 41 6 0.12 ACGTcount: A:0.44, C:0.37, G:0.00, T:0.19 Consensus pattern (39 bp): AACCATAAACCCTAACCTTAAACACCAACCCTAACCCTA Found at i:36019 original size:7 final size:7 Alignment explanation

Indices: 35957--36121 Score: 78 Period size: 7 Copynumber: 24.3 Consensus size: 7 35947 CCTAAATACC 35957 AACCTTA 1 AACCTTA * 35964 ACCCTTA 1 AACCTTA * 35971 AACCATA 1 AACCTTA * 35978 AACCCTA 1 AACCTTA * 35985 ACCCTTA 1 AACCTTA * 35992 AACCCTA 1 AACCTTA * 35999 AACATT- 1 AACCTTA * 36005 AACATTA 1 AACCTTA 36012 AACCTTA 1 AACCTTA * 36019 AACCCT- 1 AACCTTA * 36025 AA-CATA 1 AACCTTA * * 36031 ATCCAT- 1 AACCTTA 36037 AA-CTTA 1 AACCTTA 36043 AACCTTA 1 AACCTTA 36050 AACCTT- 1 AACCTTA * 36056 AA-CATA 1 AACCTTA * * 36062 ATCCAT- 1 AACCTTA 36068 AA-CTTA 1 AACCTTA * 36074 AACCCTA 1 AACCTTA 36081 AACCTTAACA 1 AACCTT---A 36091 CAACCTTA 1 -AACCTTA * 36099 AACCCTA 1 AACCTTA * 36106 AACATTA 1 AACCTTA 36113 AACCTTA 1 AACCTTA 36120 AA 1 AA 36122 AGATTAATGT Statistics Matches: 119, Mismatches: 26, Indels: 26 0.70 0.15 0.15 Matches are distributed among these distances: 5 8 0.07 6 18 0.15 7 85 0.71 8 1 0.01 10 1 0.01 11 6 0.05 ACGTcount: A:0.45, C:0.30, G:0.00, T:0.24 Consensus pattern (7 bp): AACCTTA Found at i:36047 original size:31 final size:31 Alignment explanation

Indices: 36004--36090 Score: 147 Period size: 31 Copynumber: 2.8 Consensus size: 31 35994 CCCTAAACAT * 36004 TAACATTAAACCTTAAACCCTAACATAATCCA 1 TAAC-TTAAACCTTAAACCTTAACATAATCCA 36036 TAACTTAAACCTTAAACCTTAACATAATCCA 1 TAACTTAAACCTTAAACCTTAACATAATCCA * 36067 TAACTTAAACCCTAAACCTTAACA 1 TAACTTAAACCTTAAACCTTAACA 36091 CAACCTTAAA Statistics Matches: 53, Mismatches: 2, Indels: 1 0.95 0.04 0.02 Matches are distributed among these distances: 31 49 0.92 32 4 0.08 ACGTcount: A:0.46, C:0.28, G:0.00, T:0.26 Consensus pattern (31 bp): TAACTTAAACCTTAAACCTTAACATAATCCA Found at i:36115 original size:14 final size:14 Alignment explanation

Indices: 35685--36121 Score: 104 Period size: 13 Copynumber: 32.9 Consensus size: 14 35675 TAAGCCCTAA * 35685 CTTAAACCCTAAAT 1 CTTAAACCCTAAAC * 35699 CTTAAACACTAAA- 1 CTTAAACCCTAAAC ** 35712 CTTAAACAGTAAAC 1 CTTAAACCCTAAAC * * 35726 CATAAACTCTAAAC 1 CTTAAACCCTAAAC * 35740 AC-TAAA-CCTTAAC 1 -CTTAAACCCTAAAC * * 35753 CCTAAACAC-AAAC 1 CTTAAACCCTAAAC * 35766 CCT-AACCCTAAAAC 1 CTTAAACCCT-AAAC * 35780 CTT--A-CC-ATAC 1 CTTAAACCCTAAAC * * 35790 CATAAA-CTTAAAC 1 CTTAAACCCTAAAC * 35803 CCT-AACCCTAAAAC 1 CTTAAACCCT-AAAC * * * 35817 ATT-AACACTAATC 1 CTTAAACCCTAAAC ** * 35830 C-CCAACCCTAATC 1 CTTAAACCCTAAAC 35843 C-TAAACCCTAAAC 1 CTTAAACCCTAAAC * 35856 CTTAATCACC--AAC 1 CTTAAAC-CCTAAAC * 35869 CTTAAA-CCTTAAC 1 CTTAAACCCTAAAC * 35882 ATTAAACACC--AAC 1 CTTAAAC-CCTAAAC * 35895 CTTAAA-CCTTAAC 1 CTTAAACCCTAAAC 35908 CTTAAACACC--AAC 1 CTTAAAC-CCTAAAC * * 35921 CCT-AACCAC-AAAT 1 CTTAAACC-CTAAAC * 35934 CATAAACCCT-AAC 1 CTTAAACCCTAAAC * * 35947 CCTAAATACC--AAC 1 CTTAAA-CCCTAAAC 35960 CTT-AACCCTTAAAC 1 CTTAAACCC-TAAAC * * 35974 CATAAACCCTAACC 1 CTTAAACCCTAAAC 35988 CTTAAACCCTAAAC 1 CTTAAACCCTAAAC * ** 36002 ATT-AACATTAAAC 1 CTTAAACCCTAAAC 36015 CTTAAACCCT-AA- 1 CTTAAACCCTAAAC * * * 36027 CATAATCCAT-AA- 1 CTTAAACCCTAAAC * 36039 CTTAAACCTTAAAC 1 CTTAAACCCTAAAC * 36053 CTTAACATAATCC-ATAA- 1 CTT---A-AACCCTA-AAC 36070 CTTAAACCCTAAAC 1 CTTAAACCCTAAAC * 36084 CTT-AACAC--AAC 1 CTTAAACCCTAAAC 36095 CTTAAACCCTAAAC 1 CTTAAACCCTAAAC * * 36109 ATTAAACCTTAAA 1 CTTAAACCCTAAA 36122 AGATTAATGT Statistics Matches: 317, Mismatches: 62, Indels: 88 0.68 0.13 0.19 Matches are distributed among these distances: 10 5 0.02 11 13 0.04 12 37 0.12 13 137 0.43 14 103 0.32 15 12 0.04 17 5 0.02 18 5 0.02 ACGTcount: A:0.45, C:0.33, G:0.00, T:0.22 Consensus pattern (14 bp): CTTAAACCCTAAAC Found at i:36724 original size:11 final size:11 Alignment explanation

Indices: 36705--36749 Score: 51 Period size: 11 Copynumber: 4.3 Consensus size: 11 36695 TATATAATAG 36705 TATATATTTAT 1 TATATATTTAT 36716 T-TGATATTTAT 1 TAT-ATATTTAT 36727 TATATATTTA- 1 TATATATTTAT * 36737 -ATAAATTTAT 1 TATATATTTAT 36747 TAT 1 TAT 36750 TTCCTTTAAT Statistics Matches: 29, Mismatches: 1, Indels: 8 0.76 0.03 0.21 Matches are distributed among these distances: 9 8 0.28 10 1 0.03 11 19 0.66 12 1 0.03 ACGTcount: A:0.38, C:0.00, G:0.02, T:0.60 Consensus pattern (11 bp): TATATATTTAT Found at i:38029 original size:58 final size:57 Alignment explanation

Indices: 37980--38382 Score: 279 Period size: 58 Copynumber: 7.0 Consensus size: 57 37970 TTTTGGTCCT * * * * * 37980 CAAACTTCCAAAATTACATTTTGACCCCTAAACTTTTCAAAAATTACATTTAAACCC 1 CAAACTTCCAAAATCACATTTTTACCCCTAAACTTCTCAAAAATTACATTTTACCCC * * * * * * 38037 TAAACTTCTAAAAATTACATTTTAACCCTTAAACTTCTAAAAAATTACATTTTTACCCC 1 CAAACTTC-CAAAATCACATTTTTACCCCTAAACTTCTCAAAAATTACA-TTTTACCCC ** * * * * * * 38096 TTAACTTCTAAAAATCACATTTTTA-CCCTCGAACTCCTCAAAAATTACATTTTGCTCT 1 CAAACTTC-CAAAATCACATTTTTACCCCT-AAACTTCTCAAAAATTACATTTTACCCC * * * * 38154 CTAACTTCCAAAAATCACATTTTTA-CCCTCGAACTCCT-AAAAATTACATTTTTGCCCC 1 CAAACTTCC-AAAATCACATTTTTACCCCT-AAACTTCTCAAAAATTACA-TTTTACCCC * * * 38212 CAAACTTTCAAAATCACATTTTTACCTCGT--ACTT-TCAAAAATTATATTTTACCCC 1 CAAACTTCCAAAATCACATTTTTACC-CCTAAACTTCTCAAAAATTACATTTTACCCC * * *** ** 38267 CAAATTTCCAAAAATTACA-TTTTACCCCCGTACTTC-CAAAAATCGCATTTTTA-CCC 1 CAAACTTCC-AAAATCACATTTTTACCCCTAAACTTCTCAAAAATTACA-TTTTACCCC * * * 38323 TAAACTTCTCAAAAAAT-ACATTTTTACCCCCAAACTTC-CAAAAATCACCATTTTACCCC 1 CAAACTTC-C--AAAATCACATTTTTACCCCTAAACTTCTCAAAAATTA-CATTTTACCCC 38382 C 1 C 38383 GAGCATCCAA Statistics Matches: 286, Mismatches: 42, Indels: 34 0.79 0.12 0.09 Matches are distributed among these distances: 54 1 0.00 55 23 0.08 56 41 0.14 57 41 0.14 58 128 0.45 59 52 0.18 ACGTcount: A:0.36, C:0.28, G:0.02, T:0.34 Consensus pattern (57 bp): CAAACTTCCAAAATCACATTTTTACCCCTAAACTTCTCAAAAATTACATTTTACCCC Found at i:38366 original size:28 final size:28 Alignment explanation

Indices: 37956--38396 Score: 252 Period size: 29 Copynumber: 15.4 Consensus size: 28 37946 TCCCTAAATT *** * 37956 TTCCAAAAATCACATTTTGGTCCTCAAAC 1 TTCCAAAAAT-ACATTTTTACCCCCAAAC * * * 37985 TTCCAAAATTACATTTTGACCCCTAAAC 1 TTCCAAAAATACATTTTTACCCCCAAAC * * * * 38013 TTTTCAAAAATTACA-TTTAAACCCTAAAC 1 -TTCCAAAAA-TACATTTTTACCCCCAAAC * * ** 38042 TTCTAAAAATTACATTTTAACCCTTAAAC 1 TTCCAAAAA-TACATTTTTACCCCCAAAC * ** 38071 TTCTAAAAAATTACATTTTTACCCCTTAAC 1 TTC-CAAAAA-TACATTTTTACCCCCAAAC * * * 38101 TTCTAAAAATCACATTTTTACCCTCGAAC 1 TTCCAAAAAT-ACATTTTTACCCCCAAAC * * * * * 38130 TCCTCAAAAATTACA-TTTTGCTCTCTAAC 1 TTC-CAAAAA-TACATTTTTACCCCCAAAC * * 38159 TTCCAAAAATCACATTTTTACCCTCGAAC 1 TTCCAAAAAT-ACATTTTTACCCCCAAAC * 38188 -TCCTAAAAATTACATTTTTGCCCCCAAAC 1 TTCC-AAAAA-TACATTTTTACCCCCAAAC * * ** 38217 TTTC-AAAATCACATTTTTA-CCTCGTAC 1 TTCCAAAAAT-ACATTTTTACCCCCAAAC * * * 38244 TTTCAAAAAT-TATATTTTACCCCCAAAT 1 TTCCAAAAATACAT-TTTTACCCCCAAAC ** 38272 TTCCAAAAATTACA-TTTTACCCCCGTAC 1 TTCCAAAAA-TACATTTTTACCCCCAAAC * * 38300 TTCCAAAAATCGCATTTTTA-CCCTAAAC 1 TTCCAAAAAT-ACATTTTTACCCCCAAAC 38328 TTCTCAAAAAATACATTTTTACCCCCAAAC 1 TTC-C-AAAAATACATTTTTACCCCCAAAC * * 38358 TTCCAAAAATCACCA-TTTTACCCCCGAGC 1 TTCCAAAAAT-A-CATTTTTACCCCCAAAC * 38387 ATCCAAAAAT 1 TTCCAAAAAT 38397 TACCATTTAT Statistics Matches: 330, Mismatches: 57, Indels: 50 0.76 0.13 0.11 Matches are distributed among these distances: 26 2 0.01 27 17 0.05 28 104 0.32 29 147 0.45 30 59 0.18 31 1 0.00 ACGTcount: A:0.36, C:0.27, G:0.03, T:0.34 Consensus pattern (28 bp): TTCCAAAAATACATTTTTACCCCCAAAC Found at i:39039 original size:2 final size:2 Alignment explanation

Indices: 39032--39056 Score: 50 Period size: 2 Copynumber: 12.5 Consensus size: 2 39022 CGAGTTCTAA 39032 AT AT AT AT AT AT AT AT AT AT AT AT A 1 AT AT AT AT AT AT AT AT AT AT AT AT A 39057 CAAATAATTT Statistics Matches: 23, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 23 1.00 ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48 Consensus pattern (2 bp): AT Found at i:47664 original size:42 final size:42 Alignment explanation

Indices: 47617--47700 Score: 159 Period size: 42 Copynumber: 2.0 Consensus size: 42 47607 CCCTTTTCAA * 47617 ATGTTCATCATCCATTTAGAGAGATGCAGTTAGGAGTTAATC 1 ATGTTCATCATCCATTTAGAAAGATGCAGTTAGGAGTTAATC 47659 ATGTTCATCATCCATTTAGAAAGATGCAGTTAGGAGTTAATC 1 ATGTTCATCATCCATTTAGAAAGATGCAGTTAGGAGTTAATC 47701 GTTGGATGAA Statistics Matches: 41, Mismatches: 1, Indels: 0 0.98 0.02 0.00 Matches are distributed among these distances: 42 41 1.00 ACGTcount: A:0.32, C:0.14, G:0.20, T:0.33 Consensus pattern (42 bp): ATGTTCATCATCCATTTAGAAAGATGCAGTTAGGAGTTAATC Found at i:51184 original size:17 final size:18 Alignment explanation

Indices: 51164--51209 Score: 78 Period size: 17 Copynumber: 2.7 Consensus size: 18 51154 ATTCATGTTG 51164 AGTTTAAATTTAGTTT-A 1 AGTTTAAATTTAGTTTGA 51181 AGTTTAAATTTAGTTTGA 1 AGTTTAAATTTAGTTTGA 51199 A-TTTAAATTTA 1 AGTTTAAATTTA 51210 AATTAATTAA Statistics Matches: 28, Mismatches: 0, Indels: 2 0.93 0.00 0.07 Matches are distributed among these distances: 17 26 0.93 18 2 0.07 ACGTcount: A:0.37, C:0.00, G:0.11, T:0.52 Consensus pattern (18 bp): AGTTTAAATTTAGTTTGA Found at i:51924 original size:25 final size:27 Alignment explanation

Indices: 51894--51965 Score: 87 Period size: 27 Copynumber: 2.7 Consensus size: 27 51884 TTAAATAGTT * 51894 ATTAATAAC-GTTAGTAAT-AA-ATAAC 1 ATTAATAACAG-TAATAATAAATATAAC 51919 ATTAATAACAGTAATAATAAATATAAC 1 ATTAATAACAGTAATAATAAATATAAC * * 51946 ATTAATAATATTAATAATAA 1 ATTAATAACAGTAATAATAA 51966 TAAAAAAAAA Statistics Matches: 41, Mismatches: 3, Indels: 4 0.85 0.06 0.08 Matches are distributed among these distances: 25 15 0.37 26 3 0.07 27 23 0.56 ACGTcount: A:0.57, C:0.06, G:0.04, T:0.33 Consensus pattern (27 bp): ATTAATAACAGTAATAATAAATATAAC Found at i:51962 original size:18 final size:18 Alignment explanation

Indices: 51913--51962 Score: 57 Period size: 18 Copynumber: 2.8 Consensus size: 18 51903 GTTAGTAATA * 51913 AATAACATTAATAACAGT 1 AATAATATTAATAACAGT * * 51931 AATAATA-AATATAACATT 1 AATAATATTA-ATAACAGT 51949 AATAATATTAATAA 1 AATAATATTAATAA 51963 TAATAAAAAA Statistics Matches: 26, Mismatches: 4, Indels: 4 0.76 0.12 0.12 Matches are distributed among these distances: 17 1 0.04 18 24 0.92 19 1 0.04 ACGTcount: A:0.60, C:0.06, G:0.02, T:0.32 Consensus pattern (18 bp): AATAATATTAATAACAGT Found at i:52354 original size:18 final size:19 Alignment explanation

Indices: 52320--52356 Score: 58 Period size: 19 Copynumber: 2.0 Consensus size: 19 52310 GGCTCGCCAT * 52320 GGTTTGAGGGAAAAAGGGA 1 GGTTTGAGAGAAAAAGGGA 52339 GGTTTGAGAG-AAAAGGGA 1 GGTTTGAGAGAAAAAGGGA 52357 AAAAAAGGAA Statistics Matches: 17, Mismatches: 1, Indels: 1 0.89 0.05 0.05 Matches are distributed among these distances: 18 8 0.47 19 9 0.53 ACGTcount: A:0.38, C:0.00, G:0.46, T:0.16 Consensus pattern (19 bp): GGTTTGAGAGAAAAAGGGA Found at i:53080 original size:119 final size:117 Alignment explanation

Indices: 52899--53274 Score: 431 Period size: 119 Copynumber: 3.2 Consensus size: 117 52889 CTAAACTATG * * 52899 CAAAAATTACCATTTTACCCCCAAACTTTCCAAAATTCCATTTTTAACTCCGA-TTTTCCAAAAA 1 CAAAAATTACCATTTTA-CCCCAAACTTTCCAAAATTCCATTTTTAACACCAATTTTTCCAAAAA * * 52963 TTACCATTTTACCCTCGAACT-TCCAAAAATTCCATTTTTTAC-CCTAATTTTTTTC 65 TTACCATTTTACCCTCGAACTCT-CAAAAATTCCATTTTTCACATC-AA--TTTTTC * 53018 CAAAAATTACCATTTTACCCTCAAA-TTTCCAAAATTCCATTTTTGACACCAATTTTTCCAAAAA 1 CAAAAATTACCATTTTACCC-CAAACTTTCCAAAATTCCATTTTTAACACCAATTTTTCCAAAAA 53082 TTACCATTTTACCCTCGAACTCTCAAAAATTCCATTTTTCACATCAATTTTTC 65 TTACCATTTTACCCTCGAACTCTCAAAAATTCCATTTTTCACATCAATTTTTC * ** * * * 53135 CAAAAATTACCATTTTACCCCTAAACTTTCAAAAATTGTATTTTTTACCCCAATTTTTTTCAAAA 1 CAAAAATTACCATTTTACCCC-AAACTTTCCAAAATTCCATTTTTAACACCAA-TTTTTCCAAAA * * ** * * * * 53200 ATTACCATTTTATCCTAAGATGTCT-AAAAATTCTATTTTTAACCTCGAA-CTTTC 64 ATTACCATTTTACCCT-CGAACTCTCAAAAATTCCATTTTTCACATC-AATTTTTC * * 53254 CCAAAATTACCATTTTGCCCC 1 CAAAAATTACCATTTTACCCC 53275 TGAATGTCCA Statistics Matches: 227, Mismatches: 21, Indels: 18 0.85 0.08 0.07 Matches are distributed among these distances: 116 1 0.00 117 29 0.13 118 49 0.22 119 139 0.61 120 9 0.04 ACGTcount: A:0.33, C:0.26, G:0.02, T:0.39 Consensus pattern (117 bp): CAAAAATTACCATTTTACCCCAAACTTTCCAAAATTCCATTTTTAACACCAATTTTTCCAAAAAT TACCATTTTACCCTCGAACTCTCAAAAATTCCATTTTTCACATCAATTTTTC Found at i:53217 original size:59 final size:59 Alignment explanation

Indices: 52899--53211 Score: 400 Period size: 58 Copynumber: 5.3 Consensus size: 59 52889 CTAAACTATG * * * * * 52899 CAAAAATTACCATTTTACCCCCAAACTTTCCAAAATTCCATTTTTAACTCCGA-TTTTC 1 CAAAAATTACCATTTTACCCTCAAACTTTCAAAAATTCCATTTTTTACACCAATTTTTC * * 52957 CAAAAATTACCATTTTACCCTCGAACTTCCAAAAATTCCATTTTTTAC-CCTAATTTTTTTC 1 CAAAAATTACCATTTTACCCTCAAACTTTCAAAAATTCCATTTTTTACACC-AA--TTTTTC * * 53018 CAAAAATTACCATTTTACCCTCAAA-TTTCCAAAATTCCATTTTTGACACCAATTTTTC 1 CAAAAATTACCATTTTACCCTCAAACTTTCAAAAATTCCATTTTTTACACCAATTTTTC * * * * 53076 CAAAAATTACCATTTTACCCTCGAACTCTCAAAAATTCCATTTTTCACATCAATTTTTC 1 CAAAAATTACCATTTTACCCTCAAACTTTCAAAAATTCCATTTTTTACACCAATTTTTC ** * * 53135 CAAAAATTACCATTTTACCC-CTAAACTTTCAAAAATTGTATTTTTTACCCCAATTTTTTT 1 CAAAAATTACCATTTTACCCTC-AAACTTTCAAAAATTCCATTTTTTACACCAA-TTTTTC 53195 CAAAAATTACCATTTTA 1 CAAAAATTACCATTTTA 53212 TCCTAAGATG Statistics Matches: 224, Mismatches: 23, Indels: 14 0.86 0.09 0.05 Matches are distributed among these distances: 57 2 0.01 58 75 0.33 59 73 0.33 60 43 0.19 61 31 0.14 ACGTcount: A:0.34, C:0.26, G:0.02, T:0.39 Consensus pattern (59 bp): CAAAAATTACCATTTTACCCTCAAACTTTCAAAAATTCCATTTTTTACACCAATTTTTC Found at i:53229 original size:60 final size:57 Alignment explanation

Indices: 52899--53269 Score: 354 Period size: 59 Copynumber: 6.3 Consensus size: 57 52889 CTAAACTATG * * * * * 52899 CAAAAATTACCATTTTACCCCCAAACTTTCCAAAATTCCATTTTTAACTCCGA-TTTTC 1 CAAAAATTACCATTTTA-CCCTAAA-TTTCAAAAATTCCATTTTTTACACCAATTTTTC * * 52957 CAAAAATTACCATTTTACCCTCGAACTTCCAAAAATTCCATTTTTTAC-CCTAATTTTTTTC 1 CAAAAATTACCATTTTACCCT--AAATTTCAAAAATTCCATTTTTTACACC-AA--TTTTTC * * 53018 CAAAAATTACCATTTTACCCTCAAATTTCCAAAATTCCATTTTTGACACCAATTTTTC 1 CAAAAATTACCATTTTACCCT-AAATTTCAAAAATTCCATTTTTTACACCAATTTTTC * * * * 53076 CAAAAATTACCATTTTACCCTCGAACTCTCAAAAATTCCATTTTTCACATCAATTTTTC 1 CAAAAATTACCATTTTACCCT--AAATTTCAAAAATTCCATTTTTTACACCAATTTTTC ** * * 53135 CAAAAATTACCATTTTACCCCTAAACTTTCAAAAATTGTATTTTTTACCCCAATTTTTTT 1 CAAAAATTACCATTTTA-CCCTAAA-TTTCAAAAATTCCATTTTTTACACCAA-TTTTTC * * * * * 53195 CAAAAATTACCATTTTATCCTAAGATGTCTAAAAATTCTATTTTTAAC-CTCGAA-CTTTC 1 CAAAAATTACCATTTTACCCTAA-ATTTC-AAAAATTCCATTTTTTACAC-C-AATTTTTC * 53254 CCAAAATTACCATTTT 1 CAAAAATTACCATTTT 53270 GCCCCTGAAT Statistics Matches: 266, Mismatches: 32, Indels: 29 0.81 0.10 0.09 Matches are distributed among these distances: 57 5 0.02 58 66 0.25 59 98 0.37 60 67 0.25 61 30 0.11 ACGTcount: A:0.34, C:0.25, G:0.02, T:0.39 Consensus pattern (57 bp): CAAAAATTACCATTTTACCCTAAATTTCAAAAATTCCATTTTTTACACCAATTTTTC Found at i:53232 original size:178 final size:177 Alignment explanation

Indices: 52899--53274 Score: 476 Period size: 178 Copynumber: 2.1 Consensus size: 177 52889 CTAAACTATG * * * 52899 CAAAAATTACCATTTTACCCCCAAACTTTCCAAAATTCCATTTTTAACTCCGATTTTCCAAAAAT 1 CAAAAATTACCATTTTACCCCCAAACTCTCAAAAATTCCATTTTTAACTCCAATTTTCCAAAAAT * * 52964 TACCATTTTACCCTCGAACTTCCAAAAATTCCATTTTTTACCCTAATTTTTTTCCAAAAATTACC 66 TACCATTTTACCCTCAAACTTCCAAAAATTCCATTTTTTACCCCAATTTTTTTCCAAAAATTACC * * * * 53029 ATTTTACCCTCAAATTTCCAAAATTCCATTTTTGACAC-C-AATTTTTC 131 ATTTTACCCTCAAATGTCAAAAATTCCATTTTTAAC-CTCGAA-CTTTC * * * 53076 CAAAAATTACCATTTTACCCTCGAACTCTCAAAAATTCCATTTTTCACAT-CAATTTTTCCAAAA 1 CAAAAATTACCATTTTACCCCCAAACTCTCAAAAATTCCATTTTTAAC-TCCAA-TTTTCCAAAA * ** 53140 ATTACCATTTTACCC-CTAAACTTTCAAAAATTGTATTTTTTACCCCAATTTTTTT-CAAAAATT 64 ATTACCATTTTACCCTC-AAACTTCCAAAAATTCCATTTTTTACCCCAATTTTTTTCCAAAAATT * * 53203 ACCATTTTATCCT-AAGATGTCTAAAAATTCTATTTTTAACCTCGAACTTTC 128 ACCATTTTACCCTCAA-ATGTC-AAAAATTCCATTTTTAACCTCGAACTTTC * * 53254 CCAAAATTACCATTTTGCCCC 1 CAAAAATTACCATTTTACCCC 53275 TGAATGTCCA Statistics Matches: 172, Mismatches: 20, Indels: 13 0.84 0.10 0.06 Matches are distributed among these distances: 176 2 0.01 177 71 0.41 178 97 0.56 179 2 0.01 ACGTcount: A:0.33, C:0.26, G:0.02, T:0.39 Consensus pattern (177 bp): CAAAAATTACCATTTTACCCCCAAACTCTCAAAAATTCCATTTTTAACTCCAATTTTCCAAAAAT TACCATTTTACCCTCAAACTTCCAAAAATTCCATTTTTTACCCCAATTTTTTTCCAAAAATTACC ATTTTACCCTCAAATGTCAAAAATTCCATTTTTAACCTCGAACTTTC Found at i:53442 original size:28 final size:28 Alignment explanation

Indices: 53404--53626 Score: 160 Period size: 28 Copynumber: 7.8 Consensus size: 28 53394 AAACTATCAA 53404 AAAATTACCATTTTACCCTCGAACTTCC 1 AAAATTACCATTTTACCCTCGAACTTCC ** 53432 AAAAGTT-CCATTTTTACCC-CGATTTATCC 1 AAAA-TTACCA-TTTTACCCTCGAACT-TCC 53461 AAAAAATTACCATTTTACCCTCGAACTTCC 1 --AAAATTACCATTTTACCCTCGAACTTCC * * 53491 AAAAATT-CTATTTTGACCCT--AATTTTGCC 1 -AAAATTACCATTTT-ACCCTCGAA-CTT-CC * * 53520 AAAATTACCATTTTACCCCCGAA-TATCT 1 AAAATTACCATTTTACCCTCGAACT-TCC * 53548 AAAATT-CCATTTTTGACCCT-AAACTTTCCC 1 AAAATTACCA-TTTT-ACCCTCGAAC-TT-CC * 53578 AAAATTACCATTTTGCCC-CTGAA-TGTCC 1 AAAATTACCATTTTACCCTC-GAACT-TCC 53606 AAAATTACCATTTTACCCTCG 1 AAAATTACCATTTTACCCTCG 53627 TGTATCCAAA Statistics Matches: 156, Mismatches: 15, Indels: 48 0.71 0.07 0.22 Matches are distributed among these distances: 27 5 0.03 28 64 0.41 29 44 0.28 30 29 0.19 31 14 0.09 ACGTcount: A:0.32, C:0.28, G:0.05, T:0.35 Consensus pattern (28 bp): AAAATTACCATTTTACCCTCGAACTTCC Found at i:53610 original size:58 final size:58 Alignment explanation

Indices: 53390--53619 Score: 224 Period size: 59 Copynumber: 3.9 Consensus size: 58 53380 CCCTAAAAGT * * 53390 CCCTAAACTATCAAAAAATTACCATTTTACCCTCGAACTTCCAAAAGTTCCATTTTT-A 1 CCCTAAACTTTCCAAAAATTACCATTTTACCCTCGAACTTCCAAAA-TTCCATTTTTGA ** * * 53448 CCC-CGATTTATCCAAAAAATTACCATTTTACCCTCGAACTTCCAAAAATTCTA-TTTTGA 1 CCCTAAACTT-TCC-AAAAATTACCATTTTACCCTCGAACTTCC-AAAATTCCATTTTTGA * * * 53507 CCCT-AATTTTGCC-AAAATTACCATTTTACCCCCGAA-TATCTAAAATTCCATTTTTGA 1 CCCTAAACTTT-CCAAAAATTACCATTTTACCCTCGAACT-TCCAAAATTCCATTTTTGA * * 53564 CCCTAAACTTTCCCAAAATTACCATTTTGCCC-CTGAA-TGTCCAAAATTACCATTTT 1 CCCTAAACTTTCCAAAAATTACCATTTTACCCTC-GAACT-TCCAAAATT-CCATTTT 53620 ACCCTCGTGT Statistics Matches: 146, Mismatches: 14, Indels: 23 0.80 0.08 0.13 Matches are distributed among these distances: 56 9 0.06 57 39 0.27 58 44 0.30 59 50 0.34 60 4 0.03 ACGTcount: A:0.33, C:0.28, G:0.05, T:0.34 Consensus pattern (58 bp): CCCTAAACTTTCCAAAAATTACCATTTTACCCTCGAACTTCCAAAATTCCATTTTTGA Found at i:61425 original size:23 final size:23 Alignment explanation

Indices: 61320--61425 Score: 124 Period size: 23 Copynumber: 4.5 Consensus size: 23 61310 AGCACGTCTC * 61320 GTGCTCTCTGTTCTTAGCACTGTGT 1 GTGCTCTCTG-T-TTAGCACTTTGT * 61345 GTGCTCTCTGATTAGCACTTTGT 1 GTGCTCTCTGTTTAGCACTTTGT * * 61368 GTGCTCTCTGATTAGTACTTTGT 1 GTGCTCTCTGTTTAGCACTTTGT * * 61391 GTACTCTTTGTTTAGCA-TTGTGT 1 GTGCTCTCTGTTTAGCACTT-TGT 61414 GTGCTCTCTGTT 1 GTGCTCTCTGTT 61426 GCCCAACACT Statistics Matches: 71, Mismatches: 9, Indels: 4 0.85 0.11 0.05 Matches are distributed among these distances: 22 2 0.03 23 59 0.83 25 10 0.14 ACGTcount: A:0.10, C:0.20, G:0.23, T:0.47 Consensus pattern (23 bp): GTGCTCTCTGTTTAGCACTTTGT Found at i:73822 original size:13 final size:13 Alignment explanation

Indices: 73804--73831 Score: 56 Period size: 13 Copynumber: 2.2 Consensus size: 13 73794 AAAAATCTAG 73804 AGAAATGATGAAA 1 AGAAATGATGAAA 73817 AGAAATGATGAAA 1 AGAAATGATGAAA 73830 AG 1 AG 73832 GCTATAAAAG Statistics Matches: 15, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 13 15 1.00 ACGTcount: A:0.61, C:0.00, G:0.25, T:0.14 Consensus pattern (13 bp): AGAAATGATGAAA Done.