Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Ga02g00001.P1

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 23871
ACGTcount: A:0.34, C:0.16, G:0.16, T:0.34


Found at i:4453 original size:2 final size:2

Alignment explanation

Indices: 4446--4476 Score: 62 Period size: 2 Copynumber: 15.5 Consensus size: 2 4436 TTCTGAGATA 4446 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A 4477 ACGACCTTTA Statistics Matches: 29, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 29 1.00 ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48 Consensus pattern (2 bp): AT Found at i:12952 original size:43 final size:43 Alignment explanation

Indices: 12851--12976 Score: 155 Period size: 43 Copynumber: 2.9 Consensus size: 43 12841 AATTAATAAT * * * * 12851 ATTATCCAAATGCATAATTAGTATTAC-ACTACATGATAACAACAAC 1 ATTATCCAAGTGCATAACTAATATTACAAC-ACAT--T-ATAACAAC 12897 ATTATCCAAGTGCATAACTAATATTACAACACATTATAACAAC 1 ATTATCCAAGTGCATAACTAATATTACAACACATTATAACAAC * * 12940 ATTGTCCAAGTGCATAACTAATATTACACCACATTAT 1 ATTATCCAAGTGCATAACTAATATTACAACACATTAT 12977 CAATATCTTC Statistics Matches: 73, Mismatches: 6, Indels: 5 0.87 0.07 0.06 Matches are distributed among these distances: 43 42 0.58 44 1 0.01 46 28 0.38 47 2 0.03 ACGTcount: A:0.44, C:0.21, G:0.06, T:0.29 Consensus pattern (43 bp): ATTATCCAAGTGCATAACTAATATTACAACACATTATAACAAC Found at i:15250 original size:897 final size:896 Alignment explanation

Indices: 13693--15485 Score: 3424 Period size: 897 Copynumber: 2.0 Consensus size: 896 13683 GATTTTCCCA * 13693 AAAATGGTTCATCATCTAATACACGAAACCGTTTCTTCAAAATCCCAAAAACACGTTCAATAGTG 1 AAAATGGTTCAGCATCTAATACACGAAACCGTTTCTTCAAAATCCCAAAAACACGTTCAATAGTG 13758 ATTCGCAATGATGAATGACGAAGATTAAAGAGTTCCTTTTCATTTTCAGGCCCCTGAGCACTAAA 66 ATTCGCAATGATGAATGACGAAGATTAAAGAGTTCCTTTTCATTTTCAGGCCCCTGAGCACTAAA * * 13823 CTCTTTTAAATGATATCGAATACCATGATATGGGGTAATATATCCATTACGGATGCCATATCCAG 131 CTCTTTTAAATGATATCGAACACCACGATATGGGGTAATATATCCATTACGGATGCCATATCCAG 13888 CATCAACAAGATAATATTTACCTACAATTTGACAAAACATAATTATTACTAATAAATTATGAGCT 196 CATCAACAAGATAATATTTACCTACAATTTGACAAAACATAATTATTACTAATAAATTATGAGCT 13953 TAGTAGAACTATTTGGTATTTAATGTTTTATAATTCTTACCTTCCGGAATTTTTAATCCTCTTGG 261 TAGTAGAACTATTTGGTATTTAATGTTTTATAATTCTTACCTTCCGGAATTTTTAATCCTCTTGG * 14018 GTGTGAAAGTGCATCACTTAAAATACGAGAATCATGTGCACTACCTTCCCAACCAGCTAGAACAT 326 GCGTGAAAGTGCATCACTTAAAATACGAGAATCATGTGCACTACCTTCCCAACCAGCTAGAACAT 14083 AGGAAAATTTCAAATCAAATGTAATGGCAGCCAATACATTTTGTGTCGTCCCCCCTTTACGGCTA 391 AGGAAAATTTCAAATCAAATGTAATGGCAGCCAATACATTTTGTGTCGTCCCCCCTTTACGGCTA * 14148 CGAAATCTTCCTTGAATGTTAAGTGGAACGGATGCACGAACATGAGTTCCATTTAATGCTCCAAT 456 CGAAATCTTCCTTGAATGTTAAGTGGAACGGATGCACGAACATGAGTTCCATCTAATGCTCCAAT * 14213 ACAATCTTTAAAATAAGGATTAAACCTTGGATTGTTTCTGATTTCACTAGGAGTTGACTCATCAG 521 ACAATCTTTAAAATAAGGATAAAACCTTGGATTGTTTCTGATTTCACTAGGAGTTGACTCATCAG * 14278 GTAATTTAATAACTAGCTTATACAATTTCAAAACAGCTCTCAATACAACCCTAAAGTAACGGTGA 586 GTAATCTAATAACTAGCTTATACAATTTCAAAACAGCTCTCAATACAACCCTAAAGTAACGGTGA 14343 ATTGTCTCAGTTGATCTATAATATCTAGATCCTATCACTCGAAACCTTACATTATGACCAATTAT 651 ATTGTCTCAGTTGATCTATAATATCTAGATCCTATCA-TCGAAACCTTACATTATGACCAATTAT * 14408 ATGTAAAAATATAACTACTTGCTCCCTAATATTCACTGATTTAGTTGATTGTAACAAATTATTTC 715 ATGTAAAAATATAACTACTTGCTCCCTAATATTCACCGATTTAGTTGATTGTAACAAATTATTTC * * 14473 TACTAAGAATATCACACAAATTAAAAAATGCAACCGGTCGCATCCTTATCATATCAATACAATGC 780 TACAAAGAATATCACACAAATTAAAAAATGCAACAGGTCGCATCCTTATCATATCAATACAATGC 14538 CGGTCACCACTATATAAAATACTATTAATATAATTTTCTCTTTCATAATCTC 845 CGGTCACCACTATATAAAATACTATTAATATAATTTTCTCTTTCATAATCTC * 14590 AAAATGGTTCAGCATCTAATACACGAAACCGTTTCTTCAAAATCCTAAAAACACGTTCAATAGTG 1 AAAATGGTTCAGCATCTAATACACGAAACCGTTTCTTCAAAATCCCAAAAACACGTTCAATAGTG * 14655 ATTCGCAATGATGAATGACGAAGATTAAAGAGTTCCTTTTCATTTTCAGGCCCTTGAGCACTAAA 66 ATTCGCAATGATGAATGACGAAGATTAAAGAGTTCCTTTTCATTTTCAGGCCCCTGAGCACTAAA 14720 CTCTTTTAAATGATATCGAACACCACGATATGGGGTAATATATCCATTACGGATGCCATATCCAG 131 CTCTTTTAAATGATATCGAACACCACGATATGGGGTAATATATCCATTACGGATGCCATATCCAG * 14785 CATCAGCAAGATAATATTTACCTACAATTTGACAAAACATAATTATTACTAATAAATTATGAGCT 196 CATCAACAAGATAATATTTACCTACAATTTGACAAAACATAATTATTACTAATAAATTATGAGCT 14850 TAGTAGAACTATTTGGTATTTAATGTTTTATAATTCTTACCTTCCGGAATTTTTAATCCTCTTGG 261 TAGTAGAACTATTTGGTATTTAATGTTTTATAATTCTTACCTTCCGGAATTTTTAATCCTCTTGG 14915 GCGTGAAAGTGCATCACTTAAAATACGAGAATCATGTGCACTACCTTCCCAACCAGCTAGAACAT 326 GCGTGAAAGTGCATCACTTAAAATACGAGAATCATGTGCACTACCTTCCCAACCAGCTAGAACAT * 14980 AGGAAAATTTCAAATCAAATGTAATGGCAGCCAATACATTTTGTGTCGTCTCCCCTTTACGGCTA 391 AGGAAAATTTCAAATCAAATGTAATGGCAGCCAATACATTTTGTGTCGTCCCCCCTTTACGGCTA 15045 CGAAATCTTCCTTGAATGTTAAGTGGAACGGATGCACGAACATGAGTTCCATCTAATGCTCCAAT 456 CGAAATCTTCCTTGAATGTTAAGTGGAACGGATGCACGAACATGAGTTCCATCTAATGCTCCAAT 15110 ACAATCTTTAAAATAAGGATAAAACCTTGGATTGTTTCTGATTTCACTAGGAGTTGACTCATCAG 521 ACAATCTTTAAAATAAGGATAAAACCTTGGATTGTTTCTGATTTCACTAGGAGTTGACTCATCAG * 15175 GTAATCTAATAACTAGTTTATACAATTTCAAAACAGCTCTCAATACAACCCTAAAGTAACGGTGA 586 GTAATCTAATAACTAGCTTATACAATTTCAAAACAGCTCTCAATACAACCCTAAAGTAACGGTGA 15240 ATTGTCTCAGTTGATCTATAATATCTAGATCCTATCATCGAAACCTTACATTATGACCAATTATA 651 ATTGTCTCAGTTGATCTATAATATCTAGATCCTATCATCGAAACCTTACATTATGACCAATTATA 15305 TGTAAAAATATAACTACTTGCTCCCTAATATTCACCGATTTAGTTGATTGTAACAAATTATTTCT 716 TGTAAAAATATAACTACTTGCTCCCTAATATTCACCGATTTAGTTGATTGTAACAAATTATTTCT * 15370 ACAAAGAATATCACACAAATTAAAAAATGCAACAGGTCTCATCCTTATCATATCAATACAATGCC 781 ACAAAGAATATCACACAAATTAAAAAATGCAACAGGTCGCATCCTTATCATATCAATACAATGCC * 15435 GGTCACCACTATATAAAATATTATTAATATAATTTTCTCTTTCATAATCTC 846 GGTCACCACTATATAAAATACTATTAATATAATTTTCTCTTTCATAATCTC 15486 GATTTACATG Statistics Matches: 879, Mismatches: 17, Indels: 1 0.98 0.02 0.00 Matches are distributed among these distances: 896 204 0.23 897 675 0.77 ACGTcount: A:0.35, C:0.20, G:0.13, T:0.32 Consensus pattern (896 bp): AAAATGGTTCAGCATCTAATACACGAAACCGTTTCTTCAAAATCCCAAAAACACGTTCAATAGTG ATTCGCAATGATGAATGACGAAGATTAAAGAGTTCCTTTTCATTTTCAGGCCCCTGAGCACTAAA CTCTTTTAAATGATATCGAACACCACGATATGGGGTAATATATCCATTACGGATGCCATATCCAG CATCAACAAGATAATATTTACCTACAATTTGACAAAACATAATTATTACTAATAAATTATGAGCT TAGTAGAACTATTTGGTATTTAATGTTTTATAATTCTTACCTTCCGGAATTTTTAATCCTCTTGG GCGTGAAAGTGCATCACTTAAAATACGAGAATCATGTGCACTACCTTCCCAACCAGCTAGAACAT AGGAAAATTTCAAATCAAATGTAATGGCAGCCAATACATTTTGTGTCGTCCCCCCTTTACGGCTA CGAAATCTTCCTTGAATGTTAAGTGGAACGGATGCACGAACATGAGTTCCATCTAATGCTCCAAT ACAATCTTTAAAATAAGGATAAAACCTTGGATTGTTTCTGATTTCACTAGGAGTTGACTCATCAG GTAATCTAATAACTAGCTTATACAATTTCAAAACAGCTCTCAATACAACCCTAAAGTAACGGTGA ATTGTCTCAGTTGATCTATAATATCTAGATCCTATCATCGAAACCTTACATTATGACCAATTATA TGTAAAAATATAACTACTTGCTCCCTAATATTCACCGATTTAGTTGATTGTAACAAATTATTTCT ACAAAGAATATCACACAAATTAAAAAATGCAACAGGTCGCATCCTTATCATATCAATACAATGCC GGTCACCACTATATAAAATACTATTAATATAATTTTCTCTTTCATAATCTC Found at i:15721 original size:2 final size:2 Alignment explanation

Indices: 15714--15742 Score: 58 Period size: 2 Copynumber: 14.5 Consensus size: 2 15704 AAGTGCATAC 15714 AT AT AT AT AT AT AT AT AT AT AT AT AT AT A 1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT A 15743 AGTTTCTCAA Statistics Matches: 27, Mismatches: 0, Indels: 0 1.00 0.00 0.00 Matches are distributed among these distances: 2 27 1.00 ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48 Consensus pattern (2 bp): AT Found at i:16223 original size:2 final size:2 Alignment explanation

Indices: 16212--16241 Score: 53 Period size: 2 Copynumber: 15.5 Consensus size: 2 16202 GAACCTTTAC 16212 TA TA -A TA TA TA TA TA TA TA TA TA TA TA TA T 1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T 16242 GGTTTCATTT Statistics Matches: 27, Mismatches: 0, Indels: 2 0.93 0.00 0.07 Matches are distributed among these distances: 1 1 0.04 2 26 0.96 ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50 Consensus pattern (2 bp): TA Found at i:18884 original size:46 final size:46 Alignment explanation

Indices: 18834--19009 Score: 207 Period size: 46 Copynumber: 3.8 Consensus size: 46 18824 TGGTTGAGCA 18834 TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAATG 1 TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAATG * * * 18880 TCCGAACTCGTTGAGTTGAGTCCGAGTTC-GTGA--GATG-TAACTAGG 1 TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAA-T--G * 18925 CATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAACG 1 --TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAATG * * * * 18973 CCCGAGCTCGTTGATTTGAGTCCGAGTTCGCTTATGG 1 TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGG 19010 GCGGGTTATA Statistics Matches: 110, Mismatches: 11, Indels: 18 0.79 0.08 0.13 Matches are distributed among these distances: 42 2 0.02 43 5 0.05 45 3 0.03 46 62 0.56 47 29 0.26 48 3 0.03 50 4 0.04 51 2 0.02 ACGTcount: A:0.20, C:0.21, G:0.29, T:0.30 Consensus pattern (46 bp): TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAATG Found at i:18992 original size:93 final size:93 Alignment explanation

Indices: 18831--19002 Score: 308 Period size: 93 Copynumber: 1.8 Consensus size: 93 18821 GGATGGTTGA * * 18831 GCATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAATGTCCGAACTCGTTGAGT 1 GCATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAACGCCCGAACTCGTTGAGT 18896 TGAGTCCGAGTTCGTGAGATGTAACTAG 66 TGAGTCCGAGTTCGTGAGATGTAACTAG * * 18924 GCATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAACGCCCGAGCTCGTTGATT 1 GCATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAACGCCCGAACTCGTTGAGT 18989 TGAGTCCGAGTTCG 66 TGAGTCCGAGTTCG 19003 CTTATGGGCG Statistics Matches: 75, Mismatches: 4, Indels: 0 0.95 0.05 0.00 Matches are distributed among these distances: 93 75 1.00 ACGTcount: A:0.21, C:0.22, G:0.29, T:0.28 Consensus pattern (93 bp): GCATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAACGCCCGAACTCGTTGAGT TGAGTCCGAGTTCGTGAGATGTAACTAG Found at i:20362 original size:18 final size:18 Alignment explanation

Indices: 20341--20387 Score: 51 Period size: 18 Copynumber: 2.6 Consensus size: 18 20331 TTAAATTGTT 20341 TAAATTTAAAAAATTAT-A 1 TAAATTTAAAAAA-TATCA * * 20359 TAAAATTATAAAATATCA 1 TAAATTTAAAAAATATCA 20377 TCAAATTTAAA 1 T-AAATTTAAA 20388 TTTAAAATCG Statistics Matches: 23, Mismatches: 4, Indels: 3 0.77 0.13 0.10 Matches are distributed among these distances: 17 3 0.13 18 13 0.57 19 7 0.30 ACGTcount: A:0.60, C:0.04, G:0.00, T:0.36 Consensus pattern (18 bp): TAAATTTAAAAAATATCA Found at i:20762 original size:55 final size:55 Alignment explanation

Indices: 20674--20779 Score: 185 Period size: 55 Copynumber: 1.9 Consensus size: 55 20664 AAGGAGGTGC * * 20674 TTTTACACCTAGAAGGATAACTGATTTTGGGGGTCAAAATTACGGTGAAAAATTG 1 TTTTACACCGAGAAGGATAACTGATTTTGGGGGTCAAAATTAAGGTGAAAAATTG * 20729 TTTTACACCGAGAAGGATAACTGATTTTTGGGGTCAAAATTAAGGTGAAAA 1 TTTTACACCGAGAAGGATAACTGATTTTGGGGGTCAAAATTAAGGTGAAAA 20780 TTTATTTTTC Statistics Matches: 48, Mismatches: 3, Indels: 0 0.94 0.06 0.00 Matches are distributed among these distances: 55 48 1.00 ACGTcount: A:0.36, C:0.10, G:0.24, T:0.30 Consensus pattern (55 bp): TTTTACACCGAGAAGGATAACTGATTTTGGGGGTCAAAATTAAGGTGAAAAATTG Found at i:20963 original size:31 final size:31 Alignment explanation

Indices: 20923--21014 Score: 125 Period size: 30 Copynumber: 3.0 Consensus size: 31 20913 TATTTTGTAC 20923 CATTATATTAAATTATTTAAATATAATTTAT 1 CATTATATTAAATTATTTAAATATAATTTAT * 20954 CATTGTATTAAATTATTT-AATATAATTTAT 1 CATTATATTAAATTATTTAAATATAATTTAT * * * * 20984 TATTATATTTAATTTTTTAAA-ATATTTTAT 1 CATTATATTAAATTATTTAAATATAATTTAT 21014 C 1 C 21015 TATCATTAAT Statistics Matches: 53, Mismatches: 7, Indels: 3 0.84 0.11 0.05 Matches are distributed among these distances: 30 34 0.64 31 19 0.36 ACGTcount: A:0.40, C:0.03, G:0.01, T:0.55 Consensus pattern (31 bp): CATTATATTAAATTATTTAAATATAATTTAT Found at i:21032 original size:28 final size:27 Alignment explanation

Indices: 20968--21040 Score: 62 Period size: 27 Copynumber: 2.7 Consensus size: 27 20958 GTATTAAATT * 20968 ATTTAATATAAT-TTATTATTATATTTA 1 ATTTAATAAAATATT-TTATTATATTTA ** 20995 ATTTTTTAAAATATTTTATCTATCA-TTA 1 ATTTAATAAAATATTTTAT-TAT-ATTTA * 21023 ATTTAAT-AAATACTTTAT 1 ATTTAATAAAATATTTTAT 21041 ATTAATTACT Statistics Matches: 37, Mismatches: 6, Indels: 6 0.76 0.12 0.12 Matches are distributed among these distances: 27 23 0.62 28 13 0.35 29 1 0.03 ACGTcount: A:0.40, C:0.04, G:0.00, T:0.56 Consensus pattern (27 bp): ATTTAATAAAATATTTTATTATATTTA Done.