Click on image to view larger version.

Fig. 2. Frequencies and sequences of the 60 most frequently observed alleles in environmental isolates. Horizontal bars represent the percent of all sequenced isolates, out of 941, that had the indicated sequence. At the right, the sequence of each allele is indicated as differences from a reference sequence, S69414 in GenBank, encoded as xny, where x = the base in the reference sequence, n = the position in the reference sequence at which the base is changed, and y = the base found in the allele. The length of each allele's list of sequence differences is proportional to the number of bases differing from S69414, as represented by the scale on the bottom axis.