Tandem Repeats Finder Program written by:

                 Gary Benson
      Program in Bioinformatics
          Boston University

Version 4.09

Sequence: Ga02g00001.P2

Parameters: 2 7 7 80 10 50 1000

Pmatch=0.80,Pindel=0.10
tuple sizes 0,4,5,7
tuple distances 0, 29, 159, 1000

Length: 23954
ACGTcount: A:0.34, C:0.16, G:0.16, T:0.34


Found at i:4536 original size:2 final size:2

Alignment explanation

Indices: 4529--4559  Score: 62
Period size: 2  Copynumber: 15.5  Consensus size: 2

    4519 TTCTGAGATA

    4529 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A
       1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT A

    4560 ACGACCTTTA

Statistics
Matches: 29, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  2   29  1.00

ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48

Consensus pattern (2 bp):
AT


Found at i:13035 original size:43 final size:43

Alignment explanation

Indices: 12934--13059  Score: 155
Period size: 43  Copynumber: 2.9  Consensus size: 43

   12924 AATTAATAAT

                  *       *  *                   *
   12934 ATTATCCAAATGCATAATTAGTATTAC-ACTACATGATAACAACAAC
       1 ATTATCCAAGTGCATAACTAATATTACAAC-ACAT--T-ATAACAAC

   12980 ATTATCCAAGTGCATAACTAATATTACAACACATTATAACAAC
       1 ATTATCCAAGTGCATAACTAATATTACAACACATTATAACAAC

            *                        *
   13023 ATTGTCCAAGTGCATAACTAATATTACACCACATTAT
       1 ATTATCCAAGTGCATAACTAATATTACAACACATTAT

   13060 CAATATCTTC

Statistics
Matches: 73, Mismatches: 6, Indels: 5
        0.87            0.07        0.06

Matches are distributed among these distances:
 43   42  0.58
 44    1  0.01
 46   28  0.38
 47    2  0.03

ACGTcount: A:0.44, C:0.21, G:0.06, T:0.29

Consensus pattern (43 bp):
ATTATCCAAGTGCATAACTAATATTACAACACATTATAACAAC


Found at i:15333 original size:897 final size:896

Alignment explanation

Indices: 13776--15568  Score: 3424
Period size: 897  Copynumber: 2.0  Consensus size: 896

   13766 GATTTTCCCA

                    *
   13776 AAAATGGTTCATCATCTAATACACGAAACCGTTTCTTCAAAATCCCAAAAACACGTTCAATAGTG
       1 AAAATGGTTCAGCATCTAATACACGAAACCGTTTCTTCAAAATCCCAAAAACACGTTCAATAGTG

   13841 ATTCGCAATGATGAATGACGAAGATTAAAGAGTTCCTTTTCATTTTCAGGCCCCTGAGCACTAAA
      66 ATTCGCAATGATGAATGACGAAGATTAAAGAGTTCCTTTTCATTTTCAGGCCCCTGAGCACTAAA

                             *    *
   13906 CTCTTTTAAATGATATCGAATACCATGATATGGGGTAATATATCCATTACGGATGCCATATCCAG
     131 CTCTTTTAAATGATATCGAACACCACGATATGGGGTAATATATCCATTACGGATGCCATATCCAG

   13971 CATCAACAAGATAATATTTACCTACAATTTGACAAAACATAATTATTACTAATAAATTATGAGCT
     196 CATCAACAAGATAATATTTACCTACAATTTGACAAAACATAATTATTACTAATAAATTATGAGCT

   14036 TAGTAGAACTATTTGGTATTTAATGTTTTATAATTCTTACCTTCCGGAATTTTTAATCCTCTTGG
     261 TAGTAGAACTATTTGGTATTTAATGTTTTATAATTCTTACCTTCCGGAATTTTTAATCCTCTTGG

          *
   14101 GTGTGAAAGTGCATCACTTAAAATACGAGAATCATGTGCACTACCTTCCCAACCAGCTAGAACAT
     326 GCGTGAAAGTGCATCACTTAAAATACGAGAATCATGTGCACTACCTTCCCAACCAGCTAGAACAT

   14166 AGGAAAATTTCAAATCAAATGTAATGGCAGCCAATACATTTTGTGTCGTCCCCCCTTTACGGCTA
     391 AGGAAAATTTCAAATCAAATGTAATGGCAGCCAATACATTTTGTGTCGTCCCCCCTTTACGGCTA

                                                             *
   14231 CGAAATCTTCCTTGAATGTTAAGTGGAACGGATGCACGAACATGAGTTCCATTTAATGCTCCAAT
     456 CGAAATCTTCCTTGAATGTTAAGTGGAACGGATGCACGAACATGAGTTCCATCTAATGCTCCAAT

                             *
   14296 ACAATCTTTAAAATAAGGATTAAACCTTGGATTGTTTCTGATTTCACTAGGAGTTGACTCATCAG
     521 ACAATCTTTAAAATAAGGATAAAACCTTGGATTGTTTCTGATTTCACTAGGAGTTGACTCATCAG

              *
   14361 GTAATTTAATAACTAGCTTATACAATTTCAAAACAGCTCTCAATACAACCCTAAAGTAACGGTGA
     586 GTAATCTAATAACTAGCTTATACAATTTCAAAACAGCTCTCAATACAACCCTAAAGTAACGGTGA

   14426 ATTGTCTCAGTTGATCTATAATATCTAGATCCTATCACTCGAAACCTTACATTATGACCAATTAT
     651 ATTGTCTCAGTTGATCTATAATATCTAGATCCTATCA-TCGAAACCTTACATTATGACCAATTAT

                                             *
   14491 ATGTAAAAATATAACTACTTGCTCCCTAATATTCACTGATTTAGTTGATTGTAACAAATTATTTC
     715 ATGTAAAAATATAACTACTTGCTCCCTAATATTCACCGATTTAGTTGATTGTAACAAATTATTTC

            *                              *
   14556 TACTAAGAATATCACACAAATTAAAAAATGCAACCGGTCGCATCCTTATCATATCAATACAATGC
     780 TACAAAGAATATCACACAAATTAAAAAATGCAACAGGTCGCATCCTTATCATATCAATACAATGC

   14621 CGGTCACCACTATATAAAATACTATTAATATAATTTTCTCTTTCATAATCTC
     845 CGGTCACCACTATATAAAATACTATTAATATAATTTTCTCTTTCATAATCTC

                                                      *
   14673 AAAATGGTTCAGCATCTAATACACGAAACCGTTTCTTCAAAATCCTAAAAACACGTTCAATAGTG
       1 AAAATGGTTCAGCATCTAATACACGAAACCGTTTCTTCAAAATCCCAAAAACACGTTCAATAGTG

                                                              *
   14738 ATTCGCAATGATGAATGACGAAGATTAAAGAGTTCCTTTTCATTTTCAGGCCCTTGAGCACTAAA
      66 ATTCGCAATGATGAATGACGAAGATTAAAGAGTTCCTTTTCATTTTCAGGCCCCTGAGCACTAAA

   14803 CTCTTTTAAATGATATCGAACACCACGATATGGGGTAATATATCCATTACGGATGCCATATCCAG
     131 CTCTTTTAAATGATATCGAACACCACGATATGGGGTAATATATCCATTACGGATGCCATATCCAG

              *
   14868 CATCAGCAAGATAATATTTACCTACAATTTGACAAAACATAATTATTACTAATAAATTATGAGCT
     196 CATCAACAAGATAATATTTACCTACAATTTGACAAAACATAATTATTACTAATAAATTATGAGCT

   14933 TAGTAGAACTATTTGGTATTTAATGTTTTATAATTCTTACCTTCCGGAATTTTTAATCCTCTTGG
     261 TAGTAGAACTATTTGGTATTTAATGTTTTATAATTCTTACCTTCCGGAATTTTTAATCCTCTTGG

   14998 GCGTGAAAGTGCATCACTTAAAATACGAGAATCATGTGCACTACCTTCCCAACCAGCTAGAACAT
     326 GCGTGAAAGTGCATCACTTAAAATACGAGAATCATGTGCACTACCTTCCCAACCAGCTAGAACAT

                                                           *
   15063 AGGAAAATTTCAAATCAAATGTAATGGCAGCCAATACATTTTGTGTCGTCTCCCCTTTACGGCTA
     391 AGGAAAATTTCAAATCAAATGTAATGGCAGCCAATACATTTTGTGTCGTCCCCCCTTTACGGCTA

   15128 CGAAATCTTCCTTGAATGTTAAGTGGAACGGATGCACGAACATGAGTTCCATCTAATGCTCCAAT
     456 CGAAATCTTCCTTGAATGTTAAGTGGAACGGATGCACGAACATGAGTTCCATCTAATGCTCCAAT

   15193 ACAATCTTTAAAATAAGGATAAAACCTTGGATTGTTTCTGATTTCACTAGGAGTTGACTCATCAG
     521 ACAATCTTTAAAATAAGGATAAAACCTTGGATTGTTTCTGATTTCACTAGGAGTTGACTCATCAG

                         *
   15258 GTAATCTAATAACTAGTTTATACAATTTCAAAACAGCTCTCAATACAACCCTAAAGTAACGGTGA
     586 GTAATCTAATAACTAGCTTATACAATTTCAAAACAGCTCTCAATACAACCCTAAAGTAACGGTGA

   15323 ATTGTCTCAGTTGATCTATAATATCTAGATCCTATCATCGAAACCTTACATTATGACCAATTATA
     651 ATTGTCTCAGTTGATCTATAATATCTAGATCCTATCATCGAAACCTTACATTATGACCAATTATA

   15388 TGTAAAAATATAACTACTTGCTCCCTAATATTCACCGATTTAGTTGATTGTAACAAATTATTTCT
     716 TGTAAAAATATAACTACTTGCTCCCTAATATTCACCGATTTAGTTGATTGTAACAAATTATTTCT

                                               *
   15453 ACAAAGAATATCACACAAATTAAAAAATGCAACAGGTCTCATCCTTATCATATCAATACAATGCC
     781 ACAAAGAATATCACACAAATTAAAAAATGCAACAGGTCGCATCCTTATCATATCAATACAATGCC

                             *
   15518 GGTCACCACTATATAAAATATTATTAATATAATTTTCTCTTTCATAATCTC
     846 GGTCACCACTATATAAAATACTATTAATATAATTTTCTCTTTCATAATCTC

   15569 GATTTACATG

Statistics
Matches: 879, Mismatches: 17, Indels: 1
        0.98            0.02        0.00

Matches are distributed among these distances:
896  204  0.23
897  675  0.77

ACGTcount: A:0.35, C:0.20, G:0.13, T:0.32

Consensus pattern (896 bp):
AAAATGGTTCAGCATCTAATACACGAAACCGTTTCTTCAAAATCCCAAAAACACGTTCAATAGTG
ATTCGCAATGATGAATGACGAAGATTAAAGAGTTCCTTTTCATTTTCAGGCCCCTGAGCACTAAA
CTCTTTTAAATGATATCGAACACCACGATATGGGGTAATATATCCATTACGGATGCCATATCCAG
CATCAACAAGATAATATTTACCTACAATTTGACAAAACATAATTATTACTAATAAATTATGAGCT
TAGTAGAACTATTTGGTATTTAATGTTTTATAATTCTTACCTTCCGGAATTTTTAATCCTCTTGG
GCGTGAAAGTGCATCACTTAAAATACGAGAATCATGTGCACTACCTTCCCAACCAGCTAGAACAT
AGGAAAATTTCAAATCAAATGTAATGGCAGCCAATACATTTTGTGTCGTCCCCCCTTTACGGCTA
CGAAATCTTCCTTGAATGTTAAGTGGAACGGATGCACGAACATGAGTTCCATCTAATGCTCCAAT
ACAATCTTTAAAATAAGGATAAAACCTTGGATTGTTTCTGATTTCACTAGGAGTTGACTCATCAG
GTAATCTAATAACTAGCTTATACAATTTCAAAACAGCTCTCAATACAACCCTAAAGTAACGGTGA
ATTGTCTCAGTTGATCTATAATATCTAGATCCTATCATCGAAACCTTACATTATGACCAATTATA
TGTAAAAATATAACTACTTGCTCCCTAATATTCACCGATTTAGTTGATTGTAACAAATTATTTCT
ACAAAGAATATCACACAAATTAAAAAATGCAACAGGTCGCATCCTTATCATATCAATACAATGCC
GGTCACCACTATATAAAATACTATTAATATAATTTTCTCTTTCATAATCTC


Found at i:15804 original size:2 final size:2

Alignment explanation

Indices: 15797--15825  Score: 58
Period size: 2  Copynumber: 14.5  Consensus size: 2

   15787 AAGTGCATAC

   15797 AT AT AT AT AT AT AT AT AT AT AT AT AT AT A
       1 AT AT AT AT AT AT AT AT AT AT AT AT AT AT A

   15826 AGTTTCTCAA

Statistics
Matches: 27, Mismatches: 0, Indels: 0
        1.00            0.00        0.00

Matches are distributed among these distances:
  2   27  1.00

ACGTcount: A:0.52, C:0.00, G:0.00, T:0.48

Consensus pattern (2 bp):
AT


Found at i:16306 original size:2 final size:2

Alignment explanation

Indices: 16295--16324  Score: 53
Period size: 2  Copynumber: 15.5  Consensus size: 2

   16285 GAACCTTTAC

   16295 TA TA -A TA TA TA TA TA TA TA TA TA TA TA TA T
       1 TA TA TA TA TA TA TA TA TA TA TA TA TA TA TA T

   16325 GGTTTCATTT

Statistics
Matches: 27, Mismatches: 0, Indels: 2
        0.93            0.00        0.07

Matches are distributed among these distances:
  1    1  0.04
  2   26  0.96

ACGTcount: A:0.50, C:0.00, G:0.00, T:0.50

Consensus pattern (2 bp):
TA


Found at i:18967 original size:46 final size:46

Alignment explanation

Indices: 18917--19092  Score: 207
Period size: 46  Copynumber: 3.8  Consensus size: 46

   18907 TGGTTGAGCA

   18917 TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAATG
       1 TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAATG

                                       * *        *
   18963 TCCGAACTCGTTGAGTTGAGTCCGAGTTC-GTGA--GATG-TAACTAGG
       1 TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAA-T--G

                                                       *
   19008 CATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAACG
       1 --TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAATG

         *    *        *              *
   19056 CCCGAGCTCGTTGATTTGAGTCCGAGTTCGCTTATGG
       1 TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGG

   19093 GCGGGTTATA

Statistics
Matches: 110, Mismatches: 11, Indels: 18
        0.79            0.08        0.13

Matches are distributed among these distances:
 42    2  0.02
 43    5  0.05
 45    3  0.03
 46   62  0.56
 47   29  0.26
 48    3  0.03
 50    4  0.04
 51    2  0.02

ACGTcount: A:0.20, C:0.21, G:0.29, T:0.30

Consensus pattern (46 bp):
TCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAATG


Found at i:19075 original size:93 final size:93

Alignment explanation

Indices: 18914--19085  Score: 308
Period size: 93  Copynumber: 1.8  Consensus size: 93

   18904 GGATGGTTGA

                                                        * *
   18914 GCATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAATGTCCGAACTCGTTGAGT
       1 GCATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAACGCCCGAACTCGTTGAGT

   18979 TGAGTCCGAGTTCGTGAGATGTAACTAG
      66 TGAGTCCGAGTTCGTGAGATGTAACTAG

                                                               *        *
   19007 GCATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAACGCCCGAGCTCGTTGATT
       1 GCATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAACGCCCGAACTCGTTGAGT

   19072 TGAGTCCGAGTTCG
      66 TGAGTCCGAGTTCG

   19086 CTTATGGGCG

Statistics
Matches: 75, Mismatches: 4, Indels: 0
        0.95            0.05        0.00

Matches are distributed among these distances:
 93   75  1.00

ACGTcount: A:0.21, C:0.22, G:0.29, T:0.28

Consensus pattern (93 bp):
GCATCCGAACTCGTTGAGTTGAGTCCGAGTTCACTTATGGATGCGAACGCCCGAACTCGTTGAGT
TGAGTCCGAGTTCGTGAGATGTAACTAG


Found at i:20445 original size:18 final size:18

Alignment explanation

Indices: 20424--20470  Score: 51
Period size: 18  Copynumber: 2.6  Consensus size: 18

   20414 TTAAATTGTT

   20424 TAAATTTAAAAAATTAT-A
       1 TAAATTTAAAAAA-TATCA

             *   *
   20442 TAAAATTATAAAATATCA
       1 TAAATTTAAAAAATATCA

   20460 TCAAATTTAAA
       1 T-AAATTTAAA

   20471 TTTAAAATCG

Statistics
Matches: 23, Mismatches: 4, Indels: 3
        0.77            0.13        0.10

Matches are distributed among these distances:
 17    3  0.13
 18   13  0.57
 19    7  0.30

ACGTcount: A:0.60, C:0.04, G:0.00, T:0.36

Consensus pattern (18 bp):
TAAATTTAAAAAATATCA


Found at i:20845 original size:55 final size:55

Alignment explanation

Indices: 20757--20862  Score: 185
Period size: 55  Copynumber: 1.9  Consensus size: 55

   20747 AAGGAGGTGC

                  *                                *
   20757 TTTTACACCTAGAAGGATAACTGATTTTGGGGGTCAAAATTACGGTGAAAAATTG
       1 TTTTACACCGAGAAGGATAACTGATTTTGGGGGTCAAAATTAAGGTGAAAAATTG

                                     *
   20812 TTTTACACCGAGAAGGATAACTGATTTTTGGGGTCAAAATTAAGGTGAAAA
       1 TTTTACACCGAGAAGGATAACTGATTTTGGGGGTCAAAATTAAGGTGAAAA

   20863 TTTATTTTTC

Statistics
Matches: 48, Mismatches: 3, Indels: 0
        0.94            0.06        0.00

Matches are distributed among these distances:
 55   48  1.00

ACGTcount: A:0.36, C:0.10, G:0.24, T:0.30

Consensus pattern (55 bp):
TTTTACACCGAGAAGGATAACTGATTTTGGGGGTCAAAATTAAGGTGAAAAATTG


Found at i:21046 original size:31 final size:31

Alignment explanation

Indices: 21006--21097  Score: 125
Period size: 30  Copynumber: 3.0  Consensus size: 31

   20996 TATTTTGTAC

   21006 CATTATATTAAATTATTTAAATATAATTTAT
       1 CATTATATTAAATTATTTAAATATAATTTAT

             *
   21037 CATTGTATTAAATTATTT-AATATAATTTAT
       1 CATTATATTAAATTATTTAAATATAATTTAT

         *        *    *          *
   21067 TATTATATTTAATTTTTTAAA-ATATTTTAT
       1 CATTATATTAAATTATTTAAATATAATTTAT

   21097 C
       1 C

   21098 TATCATTAAT

Statistics
Matches: 53, Mismatches: 7, Indels: 3
        0.84            0.11        0.05

Matches are distributed among these distances:
 30   34  0.64
 31   19  0.36

ACGTcount: A:0.40, C:0.03, G:0.01, T:0.55

Consensus pattern (31 bp):
CATTATATTAAATTATTTAAATATAATTTAT


Found at i:21115 original size:28 final size:27

Alignment explanation

Indices: 21051--21123  Score: 62
Period size: 27  Copynumber: 2.7  Consensus size: 27

   21041 GTATTAAATT

                 *
   21051 ATTTAATATAAT-TTATTATTATATTTA
       1 ATTTAATAAAATATT-TTATTATATTTA

             **
   21078 ATTTTTTAAAATATTTTATCTATCA-TTA
       1 ATTTAATAAAATATTTTAT-TAT-ATTTA

                      *
   21106 ATTTAAT-AAATACTTTAT
       1 ATTTAATAAAATATTTTAT

   21124 ATTAATTACT

Statistics
Matches: 37, Mismatches: 6, Indels: 6
        0.76            0.12        0.12

Matches are distributed among these distances:
 27   23  0.62
 28   13  0.35
 29    1  0.03

ACGTcount: A:0.40, C:0.04, G:0.00, T:0.56

Consensus pattern (27 bp):
ATTTAATAAAATATTTTATTATATTTA


Done.