a novel approach for predicting the evolutionary time of acquisition of foreign genes by bacteria:...

6
A novel approach for predicting the evolutionary time of acquisition of foreign genes by bacteria: Application of codon usage analyses Sajib Chakraborty a , Mohammad Arif a, *, T.M. Zaved Waise a , Farhana Naznin a , Faizule Hassan a , Mohammad Faisal a , Md. Fakhrul Kabir a , Md. Moazzem Hossain Khan b , Md. Ehsanul Hoque Mazumder c , Yearul Kabir d , Mark A. Smith e a Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh b Department of Biotechnology, BRAC University, Dhaka, Bangladesh c Department of Pharmacy, Jahangirnagar University, Savar, Bangladesh d Department of Family Sciences, College for Women, Kuwait University, Kuwait City, Kuwait e Department of Pathology, Case Western Reserve University, Cleveland, OH, USA Received 26 November 2008; received in revised form 20 December 2008; accepted 28 December 2008 KEYWORDS Codon adaptation index; Synonymous substitution rate; Evolutionary time; Horizontal gene transfer Abstract Lysogenic bacteriophages are considered as a major player for the introduction of foreign genes into bacterial strains. At the time of introduction foreign genes do not fit well into the translation system of the recipient host bacterium as they tend to retain the charac- teristics of the donor bacterium from which they have been transferred. Consequently foreign genes are poorly transcribed at the early phase of their evolution within the host bacterium. This is largely due to the difference in the codon usage pattern between the horizontally trans- ferred genes and the host bacterium. In this study we present detailed analyses of various parameters of the codon usages such as codon adaptation index (CAI), mean difference (MD) of the relative adaptiveness, synonymous substitution rate (SSR) of six different phage en- coded toxin genes (cholera toxin, shiga toxin, diphtheria toxin, neurotoxin C1, enterotoxin type A and cytotoxin), and proposed conceptual relationship between the evolutionary time of acquisition of the foreign genes and the selected set of parameters of the codon usage. On the basis of the observed data we hypothesize that CAI, MD and SSR of the phage encoded toxin genes are correlated with the evolutionary time of their acquisition, and have developed a novel approach based on the analyses of these parameters, which can be used to predict the evolutionary time of their acquisition by the corresponding host bacterium. ª 2008 Published by Elsevier Ltd. * Corresponding author. Tel.: þ880 966 1900 59x7643; fax: þ880 2 861 5583. E-mail addresses: [email protected], [email protected] (M. Arif). ARTICLE IN PRESS 1756-2392/$ - see front matter ª 2008 Published by Elsevier Ltd. doi:10.1016/j.bihy.2008.12.003 available at www.sciencedirect.com journal homepage: www.elsevier.com/locate/bihy Bioscience Hypotheses (2009) xx,1e6 + MODEL Please cite this article in press as: Chakraborty S et al., A novel approach for predicting the evolutionary time of acquisition of foreign genes by bacteria: Application of codon usage analyses, Bioscience Hypotheses (2009), doi:10.1016/j.bihy.2008.12.003

Upload: independent

Post on 26-Nov-2023

0 views

Category:

Documents


0 download

TRANSCRIPT

ARTICLE IN PRESS

Bioscience Hypotheses (2009) xx, 1e6

+ MODEL

ava i lab le at www.sc ienced i rec t . com

journa l homepage : www.e lsev i er . com/ loca te /b ihy

A novel approach for predicting the evolutionarytime of acquisition of foreign genes by bacteria:Application of codon usage analyses

Sajib Chakraborty a, Mohammad Arif a,*, T.M. Zaved Waise a,Farhana Naznin a, Faizule Hassan a, Mohammad Faisal a, Md. Fakhrul Kabir a,Md. Moazzem Hossain Khan b, Md. Ehsanul Hoque Mazumder c,Yearul Kabir d, Mark A. Smith e

a Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladeshb Department of Biotechnology, BRAC University, Dhaka, Bangladeshc Department of Pharmacy, Jahangirnagar University, Savar, Bangladeshd Department of Family Sciences, College for Women, Kuwait University, Kuwait City, Kuwaite Department of Pathology, Case Western Reserve University, Cleveland, OH, USA

Received 26 November 2008; received in revised form 20 December 2008; accepted 28 December 2008

KEYWORDSCodon adaptationindex;Synonymoussubstitution rate;Evolutionary time;Horizontal gene transfer

* Corresponding author. Tel.: þ880 9E-mail addresses: marif567@yahoo

1756-2392/$ - see front matter ª 200doi:10.1016/j.bihy.2008.12.003

Please cite this article in press as: Cgenes by bacteria: Application of co

Abstract Lysogenic bacteriophages are considered as a major player for the introduction offoreign genes into bacterial strains. At the time of introduction foreign genes do not fit wellinto the translation system of the recipient host bacterium as they tend to retain the charac-teristics of the donor bacterium from which they have been transferred. Consequently foreigngenes are poorly transcribed at the early phase of their evolution within the host bacterium.This is largely due to the difference in the codon usage pattern between the horizontally trans-ferred genes and the host bacterium. In this study we present detailed analyses of variousparameters of the codon usages such as codon adaptation index (CAI), mean difference (MD)of the relative adaptiveness, synonymous substitution rate (SSR) of six different phage en-coded toxin genes (cholera toxin, shiga toxin, diphtheria toxin, neurotoxin C1, enterotoxintype A and cytotoxin), and proposed conceptual relationship between the evolutionary timeof acquisition of the foreign genes and the selected set of parameters of the codon usage.On the basis of the observed data we hypothesize that CAI, MD and SSR of the phage encodedtoxin genes are correlated with the evolutionary time of their acquisition, and have developeda novel approach based on the analyses of these parameters, which can be used to predict theevolutionary time of their acquisition by the corresponding host bacterium.ª 2008 Published by Elsevier Ltd.

66 1900 59x7643; fax: þ880 2 861 5583..com, [email protected] (M. Arif).

8 Published by Elsevier Ltd.

hakraborty S et al., A novel approach for predicting the evolutionary time of acquisition of foreigndon usage analyses, Bioscience Hypotheses (2009), doi:10.1016/j.bihy.2008.12.003

ARTICLE IN PRESS

2 S. Chakraborty et al.

+ MODEL

Introduction Pseudomonas aeruginosa, Staphylococcus aureus, Escher-

Degeneracy of the genetic code allows synonymous codons tocode for the same amino acid. In a particular species severalsynonymous codons are utilized more frequently than othersduring protein synthesis [1]. This non-randomness in theutilization of the synonymous codons is believed arise fromthe mutational biases and various selective forces [2,3]. GþCcomposition of the genome is a vital factor for codon usagevariation [4,5]. This variation mostly lies in the third positionof the codons (<20% to >90% GþC), as it is immune tochanges [6].GC-rich organisms tend to prefer GC-containingcodons over AT-containing ones. Consequently eachorganism has their optimal and non-optimal codons [7].

Bacteria canacquire foreigngenes throughHGT(horizontalgene transfer). Bacteriophages are the major player in theHGT phenomenon. Bacteriophages can mobilize geneticmaterial between distantly related bacterial species [8]. Atthe timeof introduction into the recipienthost bacterium, theforeign genes tend to retain the characteristics of the donorbacterium and it may vary significantly from the native genesof the recipient bacterium in terms of optimal codon usage.For the detection of the horizontally transferred foreign genesvarious parameters of the codon usage such as relativeadaptiveness (RA), mean difference of RA, codon adaptationindex (CAI), and synonymous substitution rate (SSR) betweencodons can be used [10e12]. CAI is a measure of similarity ofa gene’s synonymous codon usage to that of a standard set ofhighly expressed genes for that organism [13]. The meandeference (MD)of the relativeadaptiveness (RA)of thecodonsof the foreign genes from that of the native genes gives usa clue about to what extent the foreign gene varies from thenative genes in a host bacterium. Here we present a detailedanalysis of a selected set of parameters such as RA, MD of RA,CAI, and SSR of the codon usage pattern of the six phageencoded toxin genes. These are cholera toxin, shiga toxin,neurotoxin C1, enterotoxin type A, cytotoxin and diphtheriatoxin. To the best of our knowledge these parameters of thecodon usage have not been utilized previously in predictingthe time of acquisition of foreign genes. In a previous methodthe rate of horizontal gene transfer was estimated [9], but nottheir evolutionary timeofacquisition. In this studyweproposea hypothesis involving the conceptual relationship betweenthe evolutionary time of acquisition of the foreign genes andthe selected set of parameters of the codon usage, and adopta novel approach for thepredictionof thecomparative timeoftheacquisitionof the foreigngenesonthebasisof theanalysesthe selected parameters.

Methods and materials

Constructing the codon usage tables for toxin genesand corresponding host bacteria

The nucleotide sequence of the six toxin genes wereretrieved from the TIGR (The Institute of Genome Research)database (web add: www.tigr.org). Codon usage table withthe relative codon frequencies of individual codons for eachof the toxin gene was constructed by the Sequence Manipu-lation Suite web tool [14]. The codon usage tables ofa selected set of host bacteria (Vibrio cholerae O1,

Please cite this article in press as: Chakraborty S et al., A novel approgenes by bacteria: Application of codon usage analyses, Bioscience H

ichia coli O157, Clostridium botulinum, Corynebacteriumdiphtheriae) which harbor the toxin genes to be analyzed,were retrieved from the codon usage database of CUTG(Codon Usage Tabulated from GenBank (ftp distribution):Codon usage tables for NCBI listed organisms.

Graphical analyses of relative adaptiveness ofcodon usage frequencies

The Graphical Codon Usage Analyzer (GCUA) web tool wasutilized for comparing the codon usage tables of a particulartoxin gene and the host bacterium in which the toxin generesides. Here the bar diagrams were generated based on therelative adaptiveness of codon frequencies. The basic prin-ciple for deriving relative adaptiveness values out of codonusage frequency values is the following: for each amino acidthe codon with the highest frequency value is set to 100%relative adaptiveness. All other codons for the same aminoacid are scaled accordingly. The graphical codon usageanalyzer tool is accessible under http://gcua.schoedl.de/.

GC content analysis of toxin genes andcorresponding host bacterial genome

The GC content of the toxin gene nucleotide sequences andthe bacterial genome were established by the EMBOSS(European Biology Open Software Suite). Third letter GþC %was also counted by the web tools of the same package.

Calculating codon adaptation index

The CAI value indicates the expressivity of a given gene. Itis also useful to identify the poorly expressed genes. For thecalculation of CAI each codon is given a weight with respectto the subset of highly expressed genes defined for theconsidered organism. JCat (Java Codon Adaptation Tool),a very rapid and easy method for the estimation of CAI [15],was employed in this study to calculate the CAI values ofthe six toxin genes (Appendix A).

Estimating synonymous substitution rate in thetoxin genes

Substitution rate at the different synonymous sites in thetoxin genes were calculated by a computational methoddeveloped by Eyre-Walker and Bulmer [12]. This methodwas slightly modified to calculate the SSR in the phageencoded toxin genes with respect to their correspondinghost genome (Appendix B).

Results

Comparison of relative adaptiveness of toxin genecodons

Relative adaptiveness (RA) of the codon frequencies of thesix phage encoded toxin genes with respect to their cor-responding host genome were determined. The comparisonwas drawn between the RAs of both toxin gene codons and

ach for predicting the evolutionary time of acquisition of foreignypotheses (2009), doi:10.1016/j.bihy.2008.12.003

ARTICLE IN PRESS

Application of codon usage analyses 3

+ MODEL

overall codon frequencies of the host bacteria and meanvariance of the RA for each toxin gene from their host wascalculated. The difference between the RAs of each codonof the toxin genes and corresponding hosts are shown inFig. 1. From Fig. 1 it is evident that the differences of theRAs are greater for shiga toxin, diphtheria toxin and cyto-toxin from their respective hosts whereas difference in theRA are low for neurotoxin and enterotoxin. Mean difference(MD) between the RA of toxin genes and respective hostbacterium was also calculated. The lowest MD was foundfor neurotoxin which implies similarities with the hostbacterium C. botulinum. High MD was seen for shiga toxin,Diphtheria toxin and cholera toxin. Among these, MD fordiphtheria toxin was highest (Table 1).

Analysis and comparison of GC content of toxingenes with corresponding host organisms

GC content of the six phage encoded toxin genes wasestimated and compared with the total GC content of thespecific set of bacteria in which they occur. Significantsimilarity was observed for two toxins (neurotoxin andenterotoxin) with their hosts in terms of GþC percentage.The GþC% of four toxin genes, (shiga, cytotoxin, choleratoxin and diphtheria toxin) were significantly lower thantheir corresponding host bacterium (Table 1). The 3rd letterGC% in the toxin gene was also compared with that of thecorresponding host bacteria (Table 1). For neurotoxin and

Mean variance (MD) of the relative adaptiveness of

each toxin gene

-150

-100

-50

0

50

100

150

300 10 20 5040 60 70

CTxA

CTxB

Shiga toxin

Neurotoxin

Enteroroxin

Cytotoxin

Diptheria toxin

Figure 1 Difference in RA of each of the codons betweentoxin gene and host bacteria: The difference in the relativeadaptiveness (RA) of the codons between toxin genes andrespective bacteria was calculated and is shown in the scatterplot. When RA of Codontoxin gene < RA of the Codonhost bacteria,the difference was considered as a positive value whereasnegative values were obtained when Codontoxin gene > RA of theCodonhost bacteria. Each symbol in the graph represents RAdifference for a particular toxin. Each point in the scatter plotrepresents the difference in RA for a particular codon and thecorresponding host bacterium.

Please cite this article in press as: Chakraborty S et al., A novel approgenes by bacteria: Application of codon usage analyses, Bioscience Hy

enterotoxin the 3rd letter GC% are very similar to their hostbacteria whereas the rest of the toxin genes showed vari-ation in the 3rd letter GC% when compared with theirrespective bacteria (Table 1).

Estimating codon adaptation index

The codon adaptation index was calculated by JCat webtool for each of six toxin genes and is shown in Table 1. Onthe basis of CAI values genes can be categorized into fourclasses: very highly expressed genes (CAI > 0.6), highlyexpressed genes (0.5 > CAI > 0.6), moderately expressedgenes (0.35 > CAI > 0.5) and weakly expressed genes(CAI < 0.35). Neurotoxin C1 (CAI: 0.63) and enterotoxintype A (CAI: 0.38) genes can be considered as very highlyexpressed and moderately expressed genes respectively.CAI values of the rest of the toxin genes were below 0.35and can be considered as weakly expressed genes. Diph-theria and shiga toxins have the two lowest CAI values.

Calculating synonymous substitution rate (SSR)

SSR between eight pairs of synonymous codons (encodingeight amino acids: Ala, Leu, Arg, Gln, Gly, Lys, Phe and Ile)were calculated (Table 2). These amino acids were selectedfor the SSR calculation, because a significant difference in theRA of the codons was observed for these amino acids. Amongthe eight substitutions, two were transversion type and therest were transition type. The geometric mean of the SSR foroneparticular toxinwasdetermined(Table2).Thehighest SSRwas seen for neurotoxin C1. Enterotoxin and cytotoxin genehad relatively high SSR values whereas shiga toxin, diphtheriatoxin and cholera toxin exhibited similar (lower) values.

Predicting the evolutionary time of acquisition ofphage encoded toxin gene

We hypothesized that the CAI, MD and SSR of the codon ofthe toxin genes should correlate with their evolutionarytime of acquisition. We proposed that CAI and SSR shouldhave a linear relationship with the time of acquisition ofthe toxin gene, whereas the MD of the relative adaptivenessis inversely correlated with the evolutionary acquisitiontime (Fig. 2). In line with our proposed hypothesis weestimated the relative time of acquisition of the six phageencoded toxin genes by their respective host bacteria. Byconsidering all the parameters, we concluded that neuro-toxin C1 and enterotoxin type A were acquired in thedistant past by C. botulinum and S. aureus. On the otherhand shiga toxin and diphtheria toxin were introduced inthe genome of E. coli and C. diphtheriae respectively in therelatively recent past. Cholera toxin and cytotoxin wereacquired at some time between these events of acquisition.

Discussion

Detailed analyses of the CAI, RA and MD of the selected setof six phage encoded toxin genes showed that neurotoxinC1 and enterotoxin type A fit well in the translationalsystem in their respective host as they show high CAI valuesand low MD. On the other hand shiga toxin and diphtheria

ach for predicting the evolutionary time of acquisition of foreignpotheses (2009), doi:10.1016/j.bihy.2008.12.003

Table 1 CAI values and GC content of the toxin genes.

Gene Total no.of codons

Hostorganism

CAI Meandifferenceof RA

GC content in % GC contentof host

No of codonswith G/C at3rd position

3rd letterGC% in the

toxin gene

3rd letterGC% in thetotal CDs inthe host

Cholera toxin 384 V. cholerae 0.31 31.5 35.27 47.488 90 27.81 44.68Shiga toxin 316 E. coli O157 0.16 35.03 41.56 50.40 103 31 55.17Neurotoxin C1 1292 C. botulinum 0.63 5.91 26.16 28.21 1292 12.77 14.21Cytotoxin 287 P. aeruginosa 0.25 22.53 53.77 66.56 198 68 87.15Enterotoxin type A 258 S. aureus 0.38 15.61 31.13 32.85 61 22.68 23.64Diphtheria toxin 561 C. diphtheriae 0.15 35.73 42.54 53.48 192 34.22 56.12

ARTICLE IN PRESS

4 S. Chakraborty et al.

+ MODEL

toxin have low CAI but high MD values. This implies thatthese two toxins are weakly expressed as they show majordifferences in codon usage with host bacteria. Theremaining two toxins showed relatively medium CAI and MDvalues. The synonymous substitution rates (SSR) for eightpairs of synonymous codons were estimated and thegeometric mean of SSR for each toxin gene was calculated.Three toxins (neurotoxin C1, enterotoxin type A and cyto-toxin) showed high SSR values. The remaining three toxinsfall into the low SSR group. The high CAI and low MD valuesfor neurotoxin C1 and enterotoxin type A can be correlatedwith their high SSR values. In these two toxins high substi-tution rate in the synonymous codons may help in theconversion of non-optimal codons to the optimal ones, andthis explains their high CAI values and low MD values.Horizontally transferred genes are subjected to the muta-tional processes that also affect the recipient genome [12].With the progression of evolutionary time the acquiredforeign gene sequences will accumulate substitutions andeventually reflect the DNA composition of the recipienthost genome. This process of adjustment of a foreign genesequence in order to match up the base composition andcodon usage of the resident genome is known as ‘‘amelio-ration’’ [9,11]. So the high SSRs in the neurotoxin C1 andenterotoxin type A actually reflect the amelioration processby which the foreign gene gradually adjusts its base

Table 2 Synonymous substitution rate (SSR) between the eight

Substitution Substitutionmode

Correspondingamino acid

Substitution rate (SR)

CTxA inthe geneticbackground ofV. cholerae

Shiga toxiin thegeneticbackgrounof E. coliO157

GCT/GCC Transition Ala 0.79 0.77TTA/TTG Transition Leu 0.79 0.73CGG/CGT Tranversion Arg * 0.74CAA/CAG Transition Gln 0.93 0.82GGT/GGC Transition Gly 0.95 0.85ATA/ATC Tranversion Ile 0.70 0.65AAA/AAG Transition Lys 0.65 0.63TTT/TTC Transition Phe 0.73 1.18Mean SR 0.7968 0.8024

*Indicates null SSR values. Cholera toxin and enterotoxin do not have

Please cite this article in press as: Chakraborty S et al., A novel approgenes by bacteria: Application of codon usage analyses, Bioscience H

composition of the non-optimal codons and converts it intoan optimal one. Relatively low SSR values were found forthree toxins (cholera toxin, shiga toxin and diphtheriatoxin). These toxins also showed lower CAI values andhigher mean difference than neurotoxin C1 and enterotoxintype A. For cytotoxin a high SSR value was observed, whichis unusual because it also showed medium CAI andmoderate MD values.

In our proposed hypothesis of the conceptual relationshipbetween the evolutionary time and codon usage parameters(CAI, MD, SSR), we assume that the evolutionary time hasa linear relationship with CAI and SSR but an inverse corre-lation with MD. We hypothesize that a foreign gene must besubjected to a high rate of substitution for a longer period ofevolutionary time in order to achieve a high adaptationindex. We divide the evolutionary time of acquisition of thetoxin genes into three zones. These are distant past, recentpast and time span between the distant and recent past. Asboth the neurotoxin C1 and enterotoxin type A showed highCAI and low MD values they are presumed to reside in therecipient host genome for a relatively longer period of time.The longer period of evolutionary time and high SSR allowedthese toxins to adjust its codon usage to according to thehost bacteria, and this is reflected by their high CAI and lowMD values. Inversely when a horizontally transferred generesides within a host bacterium for a relatively short period

pairs of synonymous codons in the toxin genes.

n

d

Cytotoxinin thegeneticbackgroundof P.aeruginosa

Enterotoxin inthe geneticbackground ofS. aureus

Diphtheria toxinin the geneticbackground ofC. diphtheriae

Neurotoxin inthe geneticbackground ofC. botulinum

1.61 1.26 0.86 1.962.39 0.80 0.79 1.471.29 * 0.89 1.080.94 0.62 0.96 1.521.42 1.17 0.78 2.372.57 0.71 0.89 2.450.60 1.27 0.78 1.371.69 1.44 0.83 2.351.56 1.039 0.817 1.82

CGG codon and hence SSR values could not be calculated.

ach for predicting the evolutionary time of acquisition of foreignypotheses (2009), doi:10.1016/j.bihy.2008.12.003

Figure 2 Comparison of the evolutionary time of acquisition of toxin genes: The span of evolutionary time, in which the six toxingenes were introduced into their corresponding host bacteria, is divided into three zones: distant past, recent past, and time spanbetween distant and recent past. This graph was based on three parameters (CAI, SSR and log (MD)) of the toxin genes. The circlesrepresent the different zones of evolutionary time.

ARTICLE IN PRESS

Application of codon usage analyses 5

+ MODEL

of time, it will be subjected to less substitution andconsequently it might not be able to achieve optimal codonusage pattern for efficient translation. Such an incidentmight happen to shiga toxin and diphtheria toxin. Theyshowed relatively low SSR and CAI, and high MD values.Consequently we assume that these foreign genes may beacquired in the recent past, as they possess many non-optimal codons with respect to their recipient host. Simi-larly, by considering all the parameters we assume that bothcholera toxin and cytotoxin were introduced within the timespan between the distant and recent past. The method thatwe proposed here is a novel one for deducing the evolu-tionary time of acquisition of the foreign genes byprokaryotic genome. Moreover, this method can be appliedto any horizontally transferred genes in the prokaryoticgenome because the amelioration processes is the same inall prokaryotic genomes but their rate varies from bacteriato bacteria. In this method we utilized the difference of theamelioration rate among different foreign genes to estimatetheir acquisition time.

Conclusion

We conclude that the codon usage analyses of a foreigngene form a basis to estimate their evolutionary time ofacquisition by the host bacteria. The uniformity of theamelioration process and codon adaptation of the hori-zontally transferred genes among different bacteria helpthis method to be applied globally to the prokaryoticgenomes, which are believed to be assembled by horizontalgene transfer.

A

Wi Z fi/max{ fj, all synonymous for i},CAIðgeneÞZLOpL

iZ1Wi

In CAIðgeneÞZP64

iZ1 gi In WiZðIn WiÞ gene

Please cite this article in press as: Chakraborty S et al., A novel approgenes by bacteria: Application of codon usage analyses, Bioscience Hy

fi Z Frequency of Codon I, calculated over reference setSL Z Number of all codons in a genegi Z Frequency of codon I on a gene

B

Let us consider Ci and Cj are relative frequencies of twosynonymous codons for a particular amino acid. For toxingenes Ci and Cj is denoted by Citox and Cjtox whereas in thehost bacteria these are called Cibac and Cjbac. Substitutionrate between these two codons is calculated by thefollowing formula:

SZ� b lnð1� p=bÞ

where b Z 1 � {(f1 þ f2)(f1 þ f3)/n2 � (f3 þ f4)(f2 þ f4)/n2}[f1 Z Citox/Cibac, f2 Z Citox/Cjbac, f3 Z Cjtox/Cibac,f4 Z Cjtox/Cjbac]; p Z (f2 þ f3)/n where n Z Sum of all thefour frequencies.

References

[1] Grantham RC, Gautier C, Gouy M, Mercier R, Pave A. Codoncatalog usage and the genome hypothesis. Nucleic Acids Res1980;8:49e79.

[2] Bulmer M. The selection-mutation-drift theory of synonymouscodon usage. Genetics 1991;129:897e907.

[3] Sharp PM, Stenico M, Peden JF, Lloyd AT. Codon usage:mutational bias, translational selection, or both? Biochem SocTrans 1993;21(4):835e41.

[4] Sharp PM, Bailes E, Grocock RJ, Peden JF, Sockett RE. Varia-tion in the strength of selected codon usage bias amongbacteria. Nucleic Acids Res 2005;33:1141e53.

[5] Chen SL, Lee W, Hottes AK, Shapiro L, McAdams HH. Codonusage between genomes is constrained by genome-widemutational processes. Proc Natl Acad Sci USA 2004;101(10):3480e5. 9.

ach for predicting the evolutionary time of acquisition of foreignpotheses (2009), doi:10.1016/j.bihy.2008.12.003

ARTICLE IN PRESS

6 S. Chakraborty et al.

+ MODEL

[6] Muto A, Osawa S. The guanine and cytosine content ofgenomic DNA and bacterial evolution. Proc Natl Acad Sci USA1987;84(1):166e9.

[7] Cutter AD, Wasmuth JD, Blaxter ML. The evolution of biasedcodon and amino acid usage in nematode genomes. Mol BiolEvol 2006;23(12):2303e15.

[8] Lawrence JG. Horizontal and vertical gene transfer: the lifehistory of pathogens Contrib Microbiol 2005;12:255e71.

[9] Lawrence JG, Ochman H. Molecular archaeology of the Escher-ichia coli genome. Proc Natl Acad Sci USA 1998;95:9413e7.

[10] Sharp PM, Li WH. The rate of synonymous substitution inenterobacterial genes is inversely related to codon usage bias.Mol Biol Evol 1987;4:222e30.

Please cite this article in press as: Chakraborty S et al., A novel approgenes by bacteria: Application of codon usage analyses, Bioscience H

[11] Ermolaeva MD. Synonymous codon usage in bacteria. CurrIssues Mol Biol 2001;3(4):91e7.

[12] Walker AE, Bulmer M. Synonymous substitution rates inEnterobacteria. Genetics 1995;140:1407e12.

[13] Sharp PM Li WH. The Codon Adaptation Indexda measure ofdirectional synonymous codon usage bias, and its potentialapplications. Nucleic Acids Res 1987;15:1281e95.

[14] Stothard P. JavaScript programs for analyzing and formattingprotein and DNA sequences. Biotechniques 2000;28:1102e4.

[15] Grote A, Hiller K, Scheer M, Munch R, Nortemann B,Hempel DC, Jahn D. JCat: a novel tool to adapt codon usage ofa target gene to its potential expression host. Nucleic AcidsRes 2005;33:W526e31.

ach for predicting the evolutionary time of acquisition of foreignypotheses (2009), doi:10.1016/j.bihy.2008.12.003