Comparative genomics of gene-family size in closely related bacteria

Institute : University of Miguel Hernández, Alicante, SPAIN
Programming Language : PERL, Python
Operating System : Linux Redhat - 9.0
Software Tools : CVS, gnuplot, PyChart Library
Bioinformatics Tools : BLAST
Team Size : 3
Duration : Jul. 2003 - Dec. 2003


The wealth of genomic data in bacteria is helping microbiologists understand the factors involved in gene innovation. Among these, the expansion and reduction of gene families appears to have a fundamental role in this, but the factors influencing gene family size are unclear. The relative content of paralogous genes in bacterial genomes increases with genome size, largely due to the expansion of gene family size in large genomes. Bacteria undergoing genome reduction display a parallel process of redundancy elimination, by which gene families are reduced to one or a few members. Gene family size is also influenced by sequence divergence and physiological function. Large gene families show wider sequence divergence, suggesting they are probably older, and certain functions (such as metabolite transport mechanisms) are overrepresented in large families. The size of a given gene family is remarkably similar in strains of the same species and in closely related species, suggesting that homologous gene families are vertically transmitted and depend little on horizontal gene transfer (HGT). The remarkable preservation of copy numbers in widely different ecotypes indicates a functional role for the different copies rather than simply a back-up role. When different genera are compared, the increase in phylogenetic distance and/or ecological specialization disrupts this preservation, albeit in a gradual manner and maintaining an overall similarity, which also supports this view. HGT can have an important role, however, in nonhomologous gene families, as exemplified by a comparison between saprophytic and enterohemorrhagic strains of Escherichia coli.


Pushker R., Mira A. and Rodríguez-Valera F. (2004) Comparative Genomics of Gene Family Size in Closely Related Bacteria. Genome Biology, 5:R27.


© 2004 Pushker et al.; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.

The electronic version of this article is the complete one and can be found online at: