Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Scaffold2528

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 23253
ACGTcount: A:0.33, C:0.19, G:0.16, T:0.33


Found at i:108 original size:28 final size:28

Alignment explanation

Indices: 77--134 Score: 107
Period size: 28 Copynumber: 2.1 Consensus size: 28

        67 TTACTCCTTT

        77 ATATTAAGATATTAAGTTATTATATATA
         1 ATATTAAGATATTAAGTTATTATATATA

                         *
       105 ATATTAAGATATTACGTTATTATATATA
         1 ATATTAAGATATTAAGTTATTATATATA

       133 AT
         1 AT

       135 GTGAATGTCT


Statistics
Matches: 29, Mismatches: 1, Indels: 0
        0.97            0.03        0.00

Matches are distributed among these distances:
   28       29  1.00

ACGTcount: A:0.45, C:0.02, G:0.07, T:0.47

Consensus pattern (28 bp):
ATATTAAGATATTAAGTTATTATATATA


Found at i:478 original size:22 final size:23

Alignment explanation

Indices: 451--502 Score: 61
Period size: 23 Copynumber: 2.3 Consensus size: 23

       441 CGTATGTCCA

       451 AAACTA-ACAACTTTTATTTTAC
         1 AAACTAGACAACTTTTATTTTAC

                  * ***
       473 AAACTAGTCTTTTTTTATTTTAC
         1 AAACTAGACAACTTTTATTTTAC

       496 AAACTAG
         1 AAACTAG

       503 TCTTTTTTTA


Statistics
Matches: 25, Mismatches: 4, Indels: 1
        0.83            0.13        0.03

Matches are distributed among these distances:
   22        6  0.24
   23       19  0.76

ACGTcount: A:0.37, C:0.15, G:0.04, T:0.44

Consensus pattern (23 bp):
AAACTAGACAACTTTTATTTTAC


Found at i:508 original size:22 final size:23

Alignment explanation

Indices: 462--513 Score: 104
Period size: 23 Copynumber: 2.3 Consensus size: 23

       452 AACTAACAAC

       462 TTTTATTTTACAAACTAGTCTTT
         1 TTTTATTTTACAAACTAGTCTTT

       485 TTTTATTTTACAAACTAGTCTTT
         1 TTTTATTTTACAAACTAGTCTTT

       508 TTTTAT
         1 TTTTAT

       514 ATGATAGTAT


Statistics
Matches: 29, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   23       29  1.00

ACGTcount: A:0.25, C:0.12, G:0.04, T:0.60

Consensus pattern (23 bp):
TTTTATTTTACAAACTAGTCTTT


Found at i:2418 original size:50 final size:50

Alignment explanation

Indices: 2303--2544 Score: 279
Period size: 50 Copynumber: 4.7 Consensus size: 50

      2293 GATTATAACA

                 *                         *     **      * *
      2303 TGCCAAAGCCATGTCCCAGACATGGTCTTACATGGGATGTTCTCATGTTGG
         1 TGCCAATGCCATGTCCCAGACATGGTCTTACAGGGGA-CCTCTCATCTCGG

                                          *
      2354 TGCCCAA-GCCATGTCCCAGACATGGTCTTATAGGGGACCTCTCATCTCGG
         1 TG-CCAATGCCATGTCCCAGACATGGTCTTACAGGGGACCTCTCATCTCGG

                 *                         *          *    *
      2404 TGCCAACGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCGTCTCAG
         1 TGCCAATGCCATGTCCCAGACATGGTCTTACAGGGGACCTCTCATCTCGG

               *                        *                      *
      2454 TGCCCATGCCATGTCCCAGACATGGTCTTGCAGGGGACCTCTCATGATCTTAAGG
         1 TGCCAATGCCATGTCCCAGACATGGTCTTACAGGGGACCTCTC---ATC-T-CGG

      2509 ATGCCAATGCCATGTCCCAGACATGGTCTTACAGGG
         1 -TGCCAATGCCATGTCCCAGACATGGTCTTACAGGG

      2545 ATCTCTTTAC


Statistics
Matches: 164, Mismatches: 19, Indels: 11
        0.85            0.10        0.06

Matches are distributed among these distances:
   49        4  0.02
   50       89  0.54
   51       30  0.18
   52        4  0.02
   53        2  0.01
   54        1  0.01
   55        1  0.01
   56       33  0.20

ACGTcount: A:0.21, C:0.29, G:0.24, T:0.25

Consensus pattern (50 bp):
TGCCAATGCCATGTCCCAGACATGGTCTTACAGGGGACCTCTCATCTCGG


Found at i:2491 original size:100 final size:104

Alignment explanation

Indices: 2302--2550 Score: 319
Period size: 100 Copynumber: 2.4 Consensus size: 104

      2292 TGATTATAAC

                                                  **      * **
      2302 ATGCCAAAGCCATGTCCCAGACATGGTCTTACATGGGATGTTCTCATGTTGGTGCCCAAGCCATG
         1 ATGCCAAAGCCATGTCCCAGACATGGTCTTACATGGGA-CCTCTCATCTCAGTGCCCAAGCCATG

                            *                   *
      2367 TCCCAGACATGGTCTTATAGGGGACCTCTC-ATC-T-CGG
        65 TCCCAGACATGGTCTTACAGGGGACCTCTCAATCTTAAGG

                  *                                    *            *
      2404 -TGCCAACGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCGTCTCAGTGCCCATGCCATGT
         1 ATGCCAAAGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCATCTCAGTGCCCAAGCCATGT

                          *
      2468 CCCAGACATGGTCTTGCAGGGGACCTCTCATGATCTTAAGG
        66 CCCAGACATGGTCTTACAGGGGACCTCTCA--ATCTTAAGG

                  *                              *
      2509 ATGCCAATGCCATGTCCCAGACATGGTCTTACA-GGGATCTCT
         1 ATGCCAAAGCCATGTCCCAGACATGGTCTTACATGGGACCTCT

      2551 TTACCCAAAT


Statistics
Matches: 128, Mismatches: 13, Indels: 9
        0.85            0.09        0.06

Matches are distributed among these distances:
  100       47  0.37
  101       36  0.28
  103        3  0.02
  104        1  0.01
  105       10  0.08
  106       31  0.24

ACGTcount: A:0.21, C:0.29, G:0.24, T:0.26

Consensus pattern (104 bp):
ATGCCAAAGCCATGTCCCAGACATGGTCTTACATGGGACCTCTCATCTCAGTGCCCAAGCCATGT
CCCAGACATGGTCTTACAGGGGACCTCTCAATCTTAAGG


Found at i:2666 original size:13 final size:13

Alignment explanation

Indices: 2645--2676 Score: 55
Period size: 13 Copynumber: 2.5 Consensus size: 13

      2635 GCTTGGATCA

               *
      2645 TCATCAAATAAAT
         1 TCATAAAATAAAT

      2658 TCATAAAATAAAT
         1 TCATAAAATAAAT

      2671 TCATAA
         1 TCATAA

      2677 TTTCTGGAAA


Statistics
Matches: 18, Mismatches: 1, Indels: 0
        0.95            0.05        0.00

Matches are distributed among these distances:
   13       18  1.00

ACGTcount: A:0.56, C:0.12, G:0.00, T:0.31

Consensus pattern (13 bp):
TCATAAAATAAAT


Found at i:7687 original size:21 final size:22

Alignment explanation

Indices: 7628--7687 Score: 52
Period size: 21 Copynumber: 2.8 Consensus size: 22

      7618 TCAGAAGCAT

               *       *       *
      7628 ATACAACACATAAAGTGCCTGA
         1 ATACGACACATATAGTGCCTCA

                     ** *
      7650 A-ACGACACACGTGGTGCC-CA
         1 ATACGACACATATAGTGCCTCA

      7670 ATACGACACATATAGTGC
         1 ATACGACACATATAGTGC

      7688 TTGATCGGCA


Statistics
Matches: 28, Mismatches: 9, Indels: 3
        0.70            0.22        0.08

Matches are distributed among these distances:
   20        2  0.07
   21       25  0.89
   22        1  0.04

ACGTcount: A:0.38, C:0.27, G:0.18, T:0.17

Consensus pattern (22 bp):
ATACGACACATATAGTGCCTCA


Found at i:11961 original size:12 final size:12

Alignment explanation

Indices: 11944--11974 Score: 62
Period size: 12 Copynumber: 2.6 Consensus size: 12

     11934 TAGGTAAATA

     11944 ATATATATACAT
         1 ATATATATACAT

     11956 ATATATATACAT
         1 ATATATATACAT

     11968 ATATATA
         1 ATATATA

     11975 ACTTAAAATA


Statistics
Matches: 19, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   12       19  1.00

ACGTcount: A:0.52, C:0.06, G:0.00, T:0.42

Consensus pattern (12 bp):
ATATATATACAT


Found at i:13611 original size:44 final size:44

Alignment explanation

Indices: 13563--13682 Score: 160
Period size: 40 Copynumber: 2.8 Consensus size: 44

     13553 ATCTTCGACT

                     *                         * *
     13563 TGCTCCACTATTGCTTAGGGAGATAAGATCTGGTTTCTTTCG-TC
         1 TGCTCCACTACTGCTTAGGGAGATAAGATCTGGTTTATGT-GATC

                                              *
     13607 TGCTCCACTACTGCTTAGGGAGATAAGA-C---TTGATGTGATC
         1 TGCTCCACTACTGCTTAGGGAGATAAGATCTGGTTTATGTGATC

     13647 TGCTCCACTACTGCTTAGGGAGATAAGATCTGGTTT
         1 TGCTCCACTACTGCTTAGGGAGATAAGATCTGGTTT

     13683 TCTTCACTCT


Statistics
Matches: 66, Mismatches: 5, Indels: 10
        0.81            0.06        0.12

Matches are distributed among these distances:
   39        1  0.02
   40       34  0.52
   41        1  0.02
   43        1  0.02
   44       29  0.44

ACGTcount: A:0.22, C:0.20, G:0.24, T:0.34

Consensus pattern (44 bp):
TGCTCCACTACTGCTTAGGGAGATAAGATCTGGTTTATGTGATC


Found at i:13837 original size:46 final size:46

Alignment explanation

Indices: 13787--13980 Score: 121
Period size: 46 Copynumber: 4.3 Consensus size: 46

     13777 ATCTGCTTCG

                            *
     13787 CTGCCAAATACAGGAAGGCAAGATCTGCAATCTTCAATTTATTCCA
         1 CTGCCAAATACAGGAAGACAAGATCTGCAATCTTCAATTTATTCCA

                         *   *    * *    **       *  *   *
     13833 CTGCCAAATACAGGGAGATAGAGTTAT-C-GGCTTCAATGTACTCCT
         1 CTGCCAAATACAGGAAGACA-AGATCTGCAATCTTCAATTTATTCCA

                * *      *  **  *      *      *    **    *
     13878 CTG--TAGT-CAGGGAGGTAAAATCTGCCATCTTCGATCTGCTT-CG
         1 CTGCCAAATACAGGAAGACAAGATCTGCAATCTTCAAT-TTATTCCA

                        *                        *
     13921 CTGCCAAATACAGAAAGACAAGATCTGCAATCTTCAATCTATTCCA
         1 CTGCCAAATACAGGAAGACAAGATCTGCAATCTTCAATTTATTCCA

     13967 CTGCCAAATACAGG
         1 CTGCCAAATACAGG

     13981 GAGATAGAAT


Statistics
Matches: 102, Mismatches: 38, Indels: 16
        0.65            0.24        0.10

Matches are distributed among these distances:
   41        3  0.03
   42       10  0.10
   43       12  0.12
   44        1  0.01
   45       19  0.19
   46       53  0.52
   47        4  0.04

ACGTcount: A:0.31, C:0.24, G:0.19, T:0.26

Consensus pattern (46 bp):
CTGCCAAATACAGGAAGACAAGATCTGCAATCTTCAATTTATTCCA


Found at i:13838 original size:134 final size:134

Alignment explanation

Indices: 13683--15027 Score: 2399
Period size: 134 Copynumber: 10.1 Consensus size: 134

     13673 GATCTGGTTT

                 *       *         *                             *     *
     13683 TCTTCACTCTATTCTACTGCCAAACACAGGGAGATAGAGTTATCGGCTTCAATGCACTCCACTGT
         1 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT

     13748 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT
        66 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT

     13813 GCAA
       131 GCAA

                   *
     13817 TCTTCAATTTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT
         1 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT

                                                               *   *
     13882 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGAAAGACAAGATCT
        66 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT

     13947 GCAA
       131 GCAA

                                                 *                     *
     13951 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAATTATCGGCTTCAATGTACTCCACTGT
         1 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT

                             *           *
     14016 AGTCAGGGAGGTAAAATCCGCCATCTTCGACCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT
        66 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT

     14081 GCAA
       131 GCAA

                                    *                 *
     14085 TCTTCAATCTATTCCACTGCC-AA-CCAGGGAGATAGAGTTATTGGCTTCAATGTACTCCTCTGT
         1 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT

                                      *        * *                 *
     14148 AGTCAGGGAGGTAAAATCTGCCATCTTTGATCTGCTACTCTGCCAAATACAGGAAGTCAAGATCT
        66 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT

     14213 GCAA
       131 GCAA

     14217 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT
         1 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT

     14282 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT
        66 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT

     14347 GCAA
       131 GCAA

     14351 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT
         1 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT

                                                            *
     14416 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATATAGGAAGGCAAGATCT
        66 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT

     14481 GCAA
       131 GCAA

     14485 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT
         1 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT

     14550 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT
        66 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT

     14615 GCAA
       131 GCAA

     14619 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT
         1 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT

                              *           *
     14684 AGTCAGGGAGGTAAAATCTTCCATCTTCGATATGCTTCGCTGCCAAATACAGGAAGGCAAGATCT
        66 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT

     14749 GCAA
       131 GCAA

                                                                       *  *
     14753 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCACTAT
         1 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT

                                                      *
     14818 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCTAAATACAGGAAGGCAAGATCT
        66 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT

           *
     14883 ACAA
       131 GCAA

                                                                       *  *
     14887 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCACTAT
         1 TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT

                                                                  *     *
     14952 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAAGCAAGGTCT
        66 AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT

     15017 G-ATA
       131 GCA-A

     15021 TCTTCAA
         1 TCTTCAA

     15028 CCAGCTCTAC


Statistics
Matches: 1163, Mismatches: 45, Indels: 6
        0.96            0.04        0.00

Matches are distributed among these distances:
  132      120  0.10
  133        5  0.00
  134     1038  0.89

ACGTcount: A:0.29, C:0.24, G:0.20, T:0.27

Consensus pattern (134 bp):
TCTTCAATCTATTCCACTGCCAAATACAGGGAGATAGAGTTATCGGCTTCAATGTACTCCTCTGT
AGTCAGGGAGGTAAAATCTGCCATCTTCGATCTGCTTCGCTGCCAAATACAGGAAGGCAAGATCT
GCAA


Found at i:15112 original size:86 final size:86

Alignment explanation

Indices: 14975--15324 Score: 556
Period size: 86 Copynumber: 4.1 Consensus size: 86

     14965 AAATCTGCCA

                              *     *      *     *
     14975 TCTTCGATCTGCTTCGCTGCCAAATACAGGAAAGCAAGGTCTGATATCTTCAACCAGCTCTACTA
         1 TCTTCGATCTGCTTCGCTGTC-AATGCAGGAAGGCAAGATCTGATATCTTCAACCAGCTCTACTA

     15040 CAAACGAGAGAGGCAAGGTTTG
        65 CAAACGAGAGAGGCAAGGTTTG

                                 *                   *
     15062 TCTTCGATCTGCTTCGCTGTCAGTGCAGGAAGGCAAGATCTGCTATCTTCAACCAGCTCTACTAC
         1 TCTTCGATCTGCTTCGCTGTCAATGCAGGAAGGCAAGATCTGATATCTTCAACCAGCTCTACTAC

                    *
     15127 AAACGAGAGTGGCAAGGTTTG
        66 AAACGAGAGAGGCAAGGTTTG

                                                     *
     15148 TCTTCGATCTGCTTCGCTGTCAATGCAGGAAGGCAAGATCTGCTATCTTCAACCAGCTCTACTAC
         1 TCTTCGATCTGCTTCGCTGTCAATGCAGGAAGGCAAGATCTGATATCTTCAACCAGCTCTACTAC

                    *
     15213 AAACGAGAGTGGCAAGGTTTG
        66 AAACGAGAGAGGCAAGGTTTG

                          *                             *              *
     15234 TCTTCGATCTGCTTCACTGTCAATGCAGGAAGGCAAGATCTGATAACTTCAACCAGCTCTGCTAC
         1 TCTTCGATCTGCTTCGCTGTCAATGCAGGAAGGCAAGATCTGATATCTTCAACCAGCTCTACTAC

           * *                 *
     15299 GACCGAGAGAGGCAAGGTTTA
        66 AAACGAGAGAGGCAAGGTTTG

     15320 TCTTC
         1 TCTTC

     15325 AATTTTTACT


Statistics
Matches: 247, Mismatches: 16, Indels: 1
        0.94            0.06        0.00

Matches are distributed among these distances:
   86      227  0.92
   87       20  0.08

ACGTcount: A:0.27, C:0.25, G:0.23, T:0.26

Consensus pattern (86 bp):
TCTTCGATCTGCTTCGCTGTCAATGCAGGAAGGCAAGATCTGATATCTTCAACCAGCTCTACTAC
AAACGAGAGAGGCAAGGTTTG


Found at i:15184 original size:42 final size:42

Alignment explanation

Indices: 15050--15270 Score: 107
Period size: 42 Copynumber: 5.2 Consensus size: 42

     15040 CAAACGAGAG

                                      *      *
     15050 AGGCAAGGTTTGTCTTCGATCTGCTTCGCTGTCAGTGCAGGA
         1 AGGCAAGGTTTGTCTTCGATCTGCTTCACTGTCAATGCAGGA

                  *      *     * * *          *    *
     15092 AGGCAAGATCTGCTATCTTCAACCAGC-TCTACT-ACAA-AC-GAGA
         1 AGGCAAGGT-T--TGTCTTCGATCTGCTTC-ACTGTCAATGCAG-GA

            *                          *
     15135 GTGGCAAGGTTTGTCTTCGATCTGCTTCGCTGTCAATGCAGGA
         1 -AGGCAAGGTTTGTCTTCGATCTGCTTCACTGTCAATGCAGGA

                  *      *     * * *          *    *
     15178 AGGCAAGATCTGCTATCTTCAACCAGC-TCTACT-ACAA-AC-GAGA
         1 AGGCAAGGT-T--TGTCTTCGATCTGCTTC-ACTGTCAATGCAG-GA

            *
     15221 GTGGCAAGGTTTGTCTTCGATCTGCTTCACTGTCAATGCAGGA
         1 -AGGCAAGGTTTGTCTTCGATCTGCTTCACTGTCAATGCAGGA

     15264 AGGCAAG
         1 AGGCAAG

     15271 ATCTGATAAC


Statistics
Matches: 123, Mismatches: 36, Indels: 40
        0.62            0.18        0.20

Matches are distributed among these distances:
   41       25  0.20
   42       33  0.27
   43       16  0.13
   44       25  0.20
   45       24  0.20

ACGTcount: A:0.25, C:0.24, G:0.25, T:0.27

Consensus pattern (42 bp):
AGGCAAGGTTTGTCTTCGATCTGCTTCACTGTCAATGCAGGA


Found at i:19478 original size:19 final size:20

Alignment explanation

Indices: 19454--19497 Score: 56
Period size: 19 Copynumber: 2.3 Consensus size: 20

     19444 CGTGAAAGTC

                  *       *
     19454 TAATGCATATG-ATGCAATG
         1 TAATGCAAATGCATGAAATG

     19473 TAATGCAAATGCATGAAATG
         1 TAATGCAAATGCATGAAATG

     19493 -AATGC
         1 TAATGC

     19498 CAAAAGAAAC


Statistics
Matches: 22, Mismatches: 2, Indels: 2
        0.85            0.08        0.08

Matches are distributed among these distances:
   19       15  0.68
   20        7  0.32

ACGTcount: A:0.41, C:0.11, G:0.20, T:0.27

Consensus pattern (20 bp):
TAATGCAAATGCATGAAATG


Found at i:21843 original size:10 final size:10

Alignment explanation

Indices: 21828--21856 Score: 58
Period size: 10 Copynumber: 2.9 Consensus size: 10

     21818 TAGGTAAATA

     21828 ATATATATAC
         1 ATATATATAC

     21838 ATATATATAC
         1 ATATATATAC

     21848 ATATATATA
         1 ATATATATA

     21857 ACTTAAAATA


Statistics
Matches: 19, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
   10       19  1.00

ACGTcount: A:0.52, C:0.07, G:0.00, T:0.41

Consensus pattern (10 bp):
ATATATATAC


Done.