Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Scaffold1791

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 17084
ACGTcount: A:0.31, C:0.23, G:0.15, T:0.31


Found at i:2310 original size:39 final size:40

Alignment explanation

Indices: 2265--2488  Score: 233
Period size: 40  Copynumber: 5.7  Consensus size: 40

    2255 GCTCCTCGTT

                         *  *     *           *
    2265 CAAATGCCTTCGGGACATAGCCCGGTTTTAGTAACTCACA
       1 CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA

                                       *
    2305 C-AATGCCTTCGGGACTTAACCCGGATTTAATAACTCGCA
       1 CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA

          *                               *
    2344 CGAATGCCTTCGGGACTTAACCCGGATTTAGTATCTCGCA
       1 CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA

             *                      *      *
    2384 CAAAGGCCTTCGGG-CTTAACCCGGAACTT-GTATCTCGCA
       1 CAAATGCCTTCGGGACTTAACCCGG-ATTTAGTAACTCGCA

                             **      *  * *    *
    2423 CAAATGCCTTC-GGATCTTAGTCCGGATATATTCACTTAGCA
       1 CAAATGCCTTCGGGA-CTTAACCCGGATTTAGTAAC-TCGCA

                            *
    2464 CAAA-GCCTTCGGGACTTAGCCCGGA
       1 CAAATGCCTTCGGGACTTAACCCGGA

    2489 CAGCATTCAA

Statistics
Matches: 157, Mismatches: 20, Indels: 14
         0.82             0.10        0.07

Matches are distributed among these distances:
  38      2  0.01
  39     66  0.42
  40     78  0.50
  41     11  0.07

ACGTcount: A:0.25, C:0.28, G:0.21, T:0.25

Consensus pattern (40 bp):
CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA


Found at i:2354 original size:79 final size:78

Alignment explanation

Indices: 2265--2489  Score: 247
Period size: 79  Copynumber: 2.8  Consensus size: 78

    2255 GCTCCTCGTT

                         *        *                 *
    2265 CAAATGCCTTCGGGACATAGCCCGGTTTTAGTAACTCACACAATGCCTTCGGGACTTAACCCGGA
       1 CAAATGCCTTCGGGACTTAGCCCGGATTTAGTAACTCACACAAAGCCTTCGGGACTTAACCCGGA

         *
    2330 TTTAATAACTCGCA
      66 CTT-ATAACTCGCA

          *                 *             *   *
    2344 CGAATGCCTTCGGGACTTAACCCGGATTTAGTATCTCGCACAAAGGCCTTCGGG-CTTAACCCGG
       1 CAAATGCCTTCGGGACTTAGCCCGGATTTAGTAACTCACACAAA-GCCTTCGGGACTTAACCCGG

              *  *
    2408 AACTTGTATCTCGCA
      65 -ACTTATAACTCGCA

                              *      *  * *   *                      *
    2423 CAAATGCCTTC-GGATCTTAGTCCGGATATATTCACTTAGCACAAAGCCTTCGGGACTTAGCCCG
       1 CAAATGCCTTCGGGA-CTTAGCCCGGATTTAGTAACTCA-CACAAAGCCTTCGGGACTTAACCCG

    2487 GAC
      64 GAC

    2490 AGCATTCAAT

Statistics
Matches: 121, Mismatches: 20, Indels: 10
         0.80             0.13        0.07

Matches are distributed among these distances:
  78      3  0.02
  79     91  0.75
  80     27  0.22

ACGTcount: A:0.25, C:0.28, G:0.21, T:0.25

Consensus pattern (78 bp):
CAAATGCCTTCGGGACTTAGCCCGGATTTAGTAACTCACACAAAGCCTTCGGGACTTAACCCGGA
CTTATAACTCGCA


Found at i:2497 original size:41 final size:41

Alignment explanation

Indices: 2420--2497  Score: 97
Period size: 40  Copynumber: 1.9  Consensus size: 41

    2410 CTTGTATCTC

                                *     * *
    2420 GCACAAATGCCTTCGGATCTTAGTCCGGATATATTCACTTA
       1 GCACAAATGCCTTCGGATCTTAGCCCGGACACATTCACTTA

    2461 GCACAAA-GCCTTCGGGA-CTTAGCCCGGACAGCATTCA
       1 GCACAAATGCCTTC-GGATCTTAGCCCGGACA-CATTCA

    2498 ATTAATCATG

Statistics
Matches: 32, Mismatches: 3, Indels: 4
         0.82            0.08       0.10

Matches are distributed among these distances:
  40     17  0.53
  41     15  0.47

ACGTcount: A:0.27, C:0.28, G:0.21, T:0.24

Consensus pattern (41 bp):
GCACAAATGCCTTCGGATCTTAGCCCGGACACATTCACTTA


Found at i:10238 original size:39 final size:40

Alignment explanation

Indices: 10193--10416  Score: 233
Period size: 40  Copynumber: 5.7  Consensus size: 40

   10183 GCTCCTCGTT

                         *  *     *           *
   10193 CAAATGCCTTCGGGACATAGCCCGGTTTTAGTAACTCACA
       1 CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA

                                       *
   10233 C-AATGCCTTCGGGACTTAACCCGGATTTAATAACTCGCA
       1 CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA

          *                               *
   10272 CGAATGCCTTCGGGACTTAACCCGGATTTAGTATCTCGCA
       1 CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA

             *                      *      *
   10312 CAAAGGCCTTCGGG-CTTAACCCGGAACTT-GTATCTCGCA
       1 CAAATGCCTTCGGGACTTAACCCGG-ATTTAGTAACTCGCA

                             **      *  * *    *
   10351 CAAATGCCTTC-GGATCTTAGTCCGGATATATTCACTTAGCA
       1 CAAATGCCTTCGGGA-CTTAACCCGGATTTAGTAAC-TCGCA

                            *
   10392 CAAA-GCCTTCGGGACTTAGCCCGGA
       1 CAAATGCCTTCGGGACTTAACCCGGA

   10417 CAGCATTCAA

Statistics
Matches: 157, Mismatches: 20, Indels: 14
         0.82             0.10        0.07

Matches are distributed among these distances:
  38      2  0.01
  39     66  0.42
  40     78  0.50
  41     11  0.07

ACGTcount: A:0.25, C:0.28, G:0.21, T:0.25

Consensus pattern (40 bp):
CAAATGCCTTCGGGACTTAACCCGGATTTAGTAACTCGCA


Found at i:10296 original size:79 final size:78

Alignment explanation

Indices: 10193--10417  Score: 247
Period size: 79  Copynumber: 2.8  Consensus size: 78

   10183 GCTCCTCGTT

                         *        *                 *
   10193 CAAATGCCTTCGGGACATAGCCCGGTTTTAGTAACTCACACAATGCCTTCGGGACTTAACCCGGA
       1 CAAATGCCTTCGGGACTTAGCCCGGATTTAGTAACTCACACAAAGCCTTCGGGACTTAACCCGGA

         *
   10258 TTTAATAACTCGCA
      66 CTT-ATAACTCGCA

          *                 *             *   *
   10272 CGAATGCCTTCGGGACTTAACCCGGATTTAGTATCTCGCACAAAGGCCTTCGGG-CTTAACCCGG
       1 CAAATGCCTTCGGGACTTAGCCCGGATTTAGTAACTCACACAAA-GCCTTCGGGACTTAACCCGG

              *  *
   10336 AACTTGTATCTCGCA
      65 -ACTTATAACTCGCA

                              *      *  * *   *                      *
   10351 CAAATGCCTTC-GGATCTTAGTCCGGATATATTCACTTAGCACAAAGCCTTCGGGACTTAGCCCG
       1 CAAATGCCTTCGGGA-CTTAGCCCGGATTTAGTAACTCA-CACAAAGCCTTCGGGACTTAACCCG

   10415 GAC
      64 GAC

   10418 AGCATTCAAT

Statistics
Matches: 121, Mismatches: 20, Indels: 10
         0.80             0.13        0.07

Matches are distributed among these distances:
  78      3  0.02
  79     91  0.75
  80     27  0.22

ACGTcount: A:0.25, C:0.28, G:0.21, T:0.25

Consensus pattern (78 bp):
CAAATGCCTTCGGGACTTAGCCCGGATTTAGTAACTCACACAAAGCCTTCGGGACTTAACCCGGA
CTTATAACTCGCA


Found at i:10425 original size:41 final size:41

Alignment explanation

Indices: 10348--10425  Score: 97
Period size: 40  Copynumber: 1.9  Consensus size: 41

   10338 CTTGTATCTC

                                *     * *
   10348 GCACAAATGCCTTCGGATCTTAGTCCGGATATATTCACTTA
       1 GCACAAATGCCTTCGGATCTTAGCCCGGACACATTCACTTA

   10389 GCACAAA-GCCTTCGGGA-CTTAGCCCGGACAGCATTCA
       1 GCACAAATGCCTTC-GGATCTTAGCCCGGACA-CATTCA

   10426 ATTAATCATG

Statistics
Matches: 32, Mismatches: 3, Indels: 4
         0.82            0.08       0.10

Matches are distributed among these distances:
  40     17  0.53
  41     15  0.47

ACGTcount: A:0.27, C:0.28, G:0.21, T:0.24

Consensus pattern (41 bp):
GCACAAATGCCTTCGGATCTTAGCCCGGACACATTCACTTA


Found at i:12198 original size:37 final size:37

Alignment explanation

Indices: 12135--12239  Score: 165
Period size: 37  Copynumber: 2.8  Consensus size: 37

   12125 AGCTCAGGCG

                      *   *   *
   12135 AAATCTCCACACGAAGTTATCAGGTCTTACCCGGACA
       1 AAATCTCCACACGTAGTCATCGGGTCTTACCCGGACA

                                            *
   12172 AAATCTCCACACGTAGTCATCGGGTCTTACCCGGATA
       1 AAATCTCCACACGTAGTCATCGGGTCTTACCCGGACA

         *
   12209 TAATCTCCACACGTAGTCATCGGGTCTTACC
       1 AAATCTCCACACGTAGTCATCGGGTCTTACC

   12240 GGAATATATT

Statistics
Matches: 63, Mismatches: 5, Indels: 0
         0.93            0.07       0.00

Matches are distributed among these distances:
  37     63  1.00

ACGTcount: A:0.28, C:0.30, G:0.17, T:0.25

Consensus pattern (37 bp):
AAATCTCCACACGTAGTCATCGGGTCTTACCCGGACA


Found at i:12525 original size:47 final size:47

Alignment explanation

Indices: 12446--12639  Score: 271
Period size: 47  Copynumber: 4.1  Consensus size: 47

   12436 TATATATATT

                *          *   *                  *
   12446 TCACATTAGCCATTCGGCTTTACCACATATATGCATGTTCATATTCA
       1 TCACATTGGCCATTCGGCCTTATCACATATATGCATGTTCACATTCA

         *                      *  **         *
   12493 CCACATTGGCCATTCGGCCTTATGACGCATATGCATGCTCACATTCA
       1 TCACATTGGCCATTCGGCCTTATCACATATATGCATGTTCACATTCA

   12540 TCACATTGGCCATTCGGCCTTATCACATATATGCATGTTCACATTCA
       1 TCACATTGGCCATTCGGCCTTATCACATATATGCATGTTCACATTCA

                                 *       *  **
   12587 TCACATTGGCCATTCGGCCTTATCTCATATATACACATTCACATTCA
       1 TCACATTGGCCATTCGGCCTTATCACATATATGCATGTTCACATTCA

   12634 TCACAT
       1 TCACAT

   12640 AAAATCCTAA

Statistics
Matches: 129, Mismatches: 18, Indels: 0
         0.88             0.12        0.00

Matches are distributed among these distances:
  47    129  1.00

ACGTcount: A:0.26, C:0.29, G:0.12, T:0.33

Consensus pattern (47 bp):
TCACATTGGCCATTCGGCCTTATCACATATATGCATGTTCACATTCA


Found at i:12567 original size:22 final size:22

Alignment explanation

Indices: 12539--12610  Score: 58
Period size: 22  Copynumber: 3.1  Consensus size: 22

   12529 GCTCACATTC

   12539 ATCACATTGGCCATTCGGCCTT
       1 ATCACATTGGCCATTCGGCCTT

                   *          * *
   12561 ATCACATATATG-CATGTTC-ACATT
       1 ATCACAT-T-GGCCA--TTCGGCCTT

   12585 CATCACATTGGCCATTCGGCCTT
       1 -ATCACATTGGCCATTCGGCCTT

   12608 ATC
       1 ATC

   12611 TCATATATAC

Statistics
Matches: 37, Mismatches: 6, Indels: 14
         0.65            0.11       0.25

Matches are distributed among these distances:
  22     13  0.35
  23      7  0.19
  24      7  0.19
  25     10  0.27

ACGTcount: A:0.24, C:0.29, G:0.14, T:0.33

Consensus pattern (22 bp):
ATCACATTGGCCATTCGGCCTT


Done.