mycobacterium tbc pathogenesis

Upload: ivan-choirul-wiza

Post on 13-Oct-2015

28 views

Category:

Documents


0 download

DESCRIPTION

patogenesa tb

TRANSCRIPT

  • CLINICAL MICROBIOLOGY REVIEWS, July 2003, p. 463496 Vol. 16, No. 30893-8512/03/$08.000 DOI: 10.1128/CMR.16.3.463496.2003Copyright 2003, American Society for Microbiology. All Rights Reserved.

    Mycobacterium tuberculosis Pathogenesis and MolecularDeterminants of Virulence

    Issar Smith*TB Center, Public Health Research Institute, International Center for Public Health,

    Newark, New Jersey 07103-3535

    INTRODUCTION .......................................................................................................................................................464HISTORY OF TUBERCULOSIS..............................................................................................................................464CLINICAL MANIFESTATIONS ..............................................................................................................................466M. TUBERCULOSIS VIRULENCE AND THE DISEASE PROCESS.................................................................467Events in the Infectious Process...........................................................................................................................467Early events .........................................................................................................................................................467Later events .........................................................................................................................................................468

    DEFINING M. TUBERCULOSIS VIRULENCE .....................................................................................................468Models for Measuring M. tuberculosis Virulence................................................................................................469Animal models.....................................................................................................................................................469Macrophages........................................................................................................................................................470

    GENETICS OF M. TUBERCULOSIS.......................................................................................................................471Description of the M. tuberculosis Genome..........................................................................................................471Methods of Genetic Analysis in Mycobacteria ...................................................................................................472Initial genetic studies .........................................................................................................................................473Current genetic methods....................................................................................................................................473(i) Directed gene disruption ..........................................................................................................................473(ii) Global gene inactivation..........................................................................................................................474(iii) Complementation ....................................................................................................................................475(iv) Antisense methods...................................................................................................................................475

    Other (nongenetic) methods..............................................................................................................................475(i) Reporter fusions and promoter traps.....................................................................................................475(ii) Hybridization-based methods .................................................................................................................476(iii) Proteomics................................................................................................................................................477

    Validation of results obtained from genetic and gene expression studies..................................................477M. TUBERCULOSIS VIRULENCE FACTORS.......................................................................................................478Cell Secretion and Envelope Function.................................................................................................................478Culture filtrate proteins.....................................................................................................................................478(i) HspX (Rv2031c, hspX) ..............................................................................................................................478(ii) Esat6/CF-10 (Rv3875, Rv3874)...............................................................................................................479(iii) 19-kD protein (Rv3763, lpqH) ...............................................................................................................479(iv) Glutamine synthase (Rv2220, glnA1) ....................................................................................................479

    Cell surface components ....................................................................................................................................479(i) Erp (Rv3810, erp) ......................................................................................................................................480(ii) Mas (Rv2940c, mas).................................................................................................................................480(iii) FadD26 (Rv2930, fadD26) ......................................................................................................................480(iv) FadD28 (Rv2941, fadD28) ......................................................................................................................480(v) MmpL7 (Rv2942, mmpL7).......................................................................................................................480(vi) FbpA (Rv3804c, fbpA) .............................................................................................................................480(vii) MmaA4 (Rv0642c, mmaA4)...................................................................................................................480(viii) PcaA (Rv0470c, pcaA) ...........................................................................................................................480(ix) OmpA (Rv0899, ompA) ...........................................................................................................................481(x) HbhA (Rv0475, hbhA) ..............................................................................................................................481(xi) LAM ..........................................................................................................................................................481

    Enzymes Involved in General Cellular Metabolism ..........................................................................................481Lipid and fatty acid metabolism.......................................................................................................................481(i) Icl (Rv0467, icl or aceA) ...........................................................................................................................481(ii) LipF (Rv3487c, lipF) ................................................................................................................................481(iii) FadD33 (Rv1345, fadD33) ......................................................................................................................482

    * Mailing address: TB Center, Public Health Research Institute,International Center for Public Health, 225 Warren St., Newark NJ07103-3535. Phone: (973) 854 3260. Fax: (973) 854 3261. E-mail:[email protected].

    463

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • (iv) Phospholipases C (Rv2351c, Rv2350c, Rv2349c, Rv1755c, plcA, plcB, plcC, plcD) .........................482(v) PanC/PanD (Rv3602c, Rv3601c, panC, panD) ......................................................................................482

    Amino acid and purine biosynthetic genes .....................................................................................................482(i) LeuD (Rv2987c, leuD) ...............................................................................................................................482(ii) TrpD (Rv2192c, trpD) ..............................................................................................................................482(iii) ProC (Rv0500, proC)...............................................................................................................................482(iv) PurC (Rv0780, purC)...............................................................................................................................482

    Metal uptake........................................................................................................................................................483(i) MgtC (Rv1811, mgtC) ...............................................................................................................................483(ii) MbtB (Rv2383c, mbtB) ............................................................................................................................483(iii) IdeR (Rv2711, ideR)................................................................................................................................483

    Anaerobic respiration and oxidative stress proteins .....................................................................................483(i) Nitrate reductase (Rv1161, narG) ...........................................................................................................483(ii) KatG (Rv1908c, katG) .............................................................................................................................484(iii) AhpC (Rv2428, ahpC) .............................................................................................................................484(iv) SodA (Rv3846, soda)................................................................................................................................484(v) SodC (Rv0342, sodC) ................................................................................................................................484

    Transcriptional Regulators....................................................................................................................................484Sigma factors.......................................................................................................................................................484(i) Sigma A (Rv2703, sigA) ............................................................................................................................484(ii) Sigma F (Rv3286c, sigF)..........................................................................................................................485(iii) Sigma E (Rv1221, sigE)..........................................................................................................................485(iv) Sigma H (Rv3223c, sigH)........................................................................................................................485

    Response regulators ...........................................................................................................................................486(i) PhoP (Rv0757, phoP) ................................................................................................................................486(ii) PrrA (Rv0903c, prrA) ...............................................................................................................................486(iii) Rv0981 (Rv0981, mprA)..........................................................................................................................486

    Other transcriptional regulators ......................................................................................................................487(i) HspR (Rv0353, hspR)................................................................................................................................487(ii) WhiB3 (Rv3416, whiB3)...........................................................................................................................487

    FUTURE RESEARCH................................................................................................................................................487Genetic Approaches ................................................................................................................................................487New Ways To Study M. tuberculosis-Host Interactions......................................................................................488M. tuberculosis-macrophage interactions..........................................................................................................488

    PROSPECTS FOR NEW ANTITUBERCULAR AGENTS....................................................................................489ACKNOWLEDGMENTS ...........................................................................................................................................489REFERENCES ............................................................................................................................................................490

    INTRODUCTION

    Tuberculosis (TB), one of the oldest recorded human afflic-tions, is still one of the biggest killers among the infectiousdiseases, despite the worldwide use of a live attenuated vaccineand several antibiotics. New vaccines and drugs are needed tostem the worldwide epidemic of TB that kills two million peo-ple each year. To rationally develop new antitubercular agents,it is essential to study the genetics and physiology of M. tuber-culosis and related mycobacteria. It is equally important tounderstand the M. tuberculosis-host interaction to learn howthese bacteria circumvent host defenses and cause disease. Theapproaches described in this review identify M. tuberculosisgenes that are or are potentially involved in virulence. In thefuture, some of these genes and the proteins they encode, aswell as newly discovered ones, should provide new bacterialtargets that can be used for creating vaccines and drugs as wellas more selective diagnostic reagents. To help the reader betterunderstand the context for these approaches, a summary ofvarious aspects of TB is presented initially, including a historyof the disease, its clinical manifestations, as well as host andbacterial responses during infection. Because of space consid-erations, this initial discussion must omit important areas andcan only touch on the many topics covered. For more extensive

    background material, there are many excellent books and re-views (30, 100, 116, 245).

    HISTORY OF TUBERCULOSIS

    TB, as described in the next section, can present in variousforms, including one that attacks bone and causes skeletaldeformities. Hard tissues like bone can be preserved for thou-sands of years, allowing the almost certain identification ofindividuals with bone TB who died more than 4,000 years ago.The frequency of unearthed skeletons with apparent tubercu-lar deformities in ancient Egypt suggests that the disease wascommon among that population. The discovery of similarlydeformed bones in various Neolithic sites in Italy, Denmark,and countries in the Middle East also indicates that TB wasfound throughout the world up to 4,000 years ago. The originof M. tuberculosis, the causative agent of TB, has been thesubject of much recent investigation, and it is thought that thebacteria in the genus Mycobacterium, like other actimomycetes,were initially found in soil and that some species evolved to livein mammals. The domestication of cattle, thought to haveoccurred between 10,000 and 25,000 years ago, would haveallowed the passage of a mycobacterial pathogen from domes-ticated livestock to humans, and in this adaptation to a new

    464 SMITH CLIN. MICROBIOL. REV.

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • host, the bacterium would have evolved to the closely relatedM. tuberculosis. Specifically, it has been hypothesized that M.bovis, which causes a TB-like disease in cattle, was the hypo-thetical evolutionary precursor of M. tuberculosis (274). Thishypothesis is now considered doubtful in the light of new data,since it was formulated before the genomes in the M. tubercu-losis complex, including the human and animal pathogens M.africanum, M. microti, and M. canetti, as well as M. tuberculosisand M. bovis, were characterized by DNA sequencing andrelated methods. These studies have shown a greater than99.9% similarity of DNA sequence among the members of theM. tuberculosis complex (38), but the existence of rare synon-ymous single-nucleotide polymorphisms (sSNP) allows dis-crimination between these closely related bacteria. sSNP anal-yses suggest that M. bovis evolved at the same time as M.tuberculosis (273), and a study of the distribution of deletionsand insertions in the genomes of the M. tuberculosis complexprovides strong evidence for the independent evolution of bothM. tuberculosis and M. bovis from another precursor species,possibly related to M. canetti (38).

    In recorded history, Assyrian clay tablets describe patientscoughing blood in the seventh century B.C., and Hippocrates(fifth century B.C.) writes of patients with consumption (theGreek term is phthisis), i.e., wasting away associated with chestpain and coughing, frequently with blood in the sputum. Bythis time, the frequency of descriptions of patients with TB-likesymptoms indicates that the disease was already well en-trenched. It is thought that TB may have been introduced intothese regions by the migration of Indo-European cattle herderswho were carrying it by virtue of their exposure to cattle in-fected with the tubercle bacillus. Analysis of various humanphenotypic traits, like lactose tolerance, that are associatedwith the raising of cattle and selection for the ability to utilizemilk, as well as the resulting exposure to M. tuberculosis, hasalso suggested that Indo-Europeans spread the disease to Eu-rope and Asia during their migrations into these regions (118).

    Europe, with its population explosion in the second millen-nium A.D. and the growth of large urban centers, become theepicenter for many TB epidemics starting in the 16th and 17thcenturies. This disease peaked in Europe in the first half of the19th century, and it is estimated that one-quarter Europeansdied of TB. In one study in a Paris hospital at this time, 250 of696 cadavers examined showed that the individuals had died ofthis disease (77). In the last half of the 19th century, mortalitydue to TB decreased, largely due to improved sanitation andhousing, of which the best-known example is the urban renewalof Paris in the 1850s, initiated and directed by Baron GeorgesHaussmann. Of course, the motivation for this massive projectwas not only public health concerns but also political consid-erations, since the wide, straight boulevards of the rebuiltRight Bank allowed better control of the increasingly radical-ized working class by Louis Bonapartes troops (51). It has alsobeen postulated that natural selection of humans resistant toTB may have played a major role in the 19th-century decreasein the incidence of this disease, but the decline has been toorapid to be explained by these changes (165).

    European immigrants to the New World brought the diseasewith them, and while the mortality rate never reached thelevels found in Europe, large urban centers like Boston andNew York had TB death rates of 6 to 7 per 1,000 in 1800,

    declining to 4 per 1,000 in 1860 to 1870 (62). Presumably publichealth measures also played a role in these declining mortalityrates.

    TB morbidity and mortality rates due to TB steadily droppedduring the 20th century in the developed world, aided by betterpublic health practices and widespread use of the M. bovisBCG vaccine (discussed below), as well as the development ofantibiotics in the 1950s. This downward trend ended and thenumbers of new cases started increasing in the mid-1980s. Themajor causes of this were increased homelessness and povertyin the developed world and the emergence of AIDS, with itsdestruction of the cell-mediated immune response in coin-fected persons. Only by massive expenditures of funds andhuman resources, mainly by directly monitored antibiotic de-livery, has this miniepidemic of new TB cases been reversedin Europe and the United States (99).

    However, the underdeveloped world is still suffering fromTB, as shown by the following statistics. The incidence of TBranges from less than 10 per 100,000 in North America to 100to 300 per 100,000 in Asia and Western Russia to over 300 per100,000 in Southern and Central Africa. There is one deathfrom TB every 15 s (over two million per year), and eightmillion people develop TB every year. Without treatment, upto 60% of people with the disease will die (152). Essentially allthese cases are in the Third World (318a), reflecting the pov-erty and the lack of healthy living conditions and adequatemedical care (301). This global crisis is compounded by theemergence of multidrug resistance in countries like the formerSoviet Union, South Africa, and India, where some antibioticsare available but are of inferior quality or are not used for asufficient time to control the disease according to recom-mended regimens (140, 203).

    Throughout the centuries, doctors and scientists have de-scribed TB in its many forms and sought to understand theorigins of the disease, in order to use this information forbetter diagnoses, prevention, and cures. Hippocrates thoughtthe disease was largely inherited, while Aristotle (4th centuryB.C.) stressed its contagious nature, as did Galen, greatest ofRoman physicians, in the 2nd century A.D. This opposing viewof the origins of TB reemerged in the second half of 17thcentury, where Italian physicians, continuing Galens ideas andinfluencing countries in the Mediterranean basin, still main-tained that TB was contagious. Conversely, doctors and savantsin Northern countries favored constitutional or hereditarycauses of this disease. Reflecting the empiricism of medicalauthorities of the time like Paracelsus of Switzerland, it wasbelieved that the Southern theory of contagion was not rigor-ously proven scientifically and did not explain why some peoplein urban settings did not get TB even where there was a highincidence of the disease (118). This philosophic difference,which can be paraphrased as the well-known nature-versus-nurture conundrum, came to its high point in the 19th century.In 1865, Jean-Antoine Villemin, a French military physician,reported that he had been able to give TB to laboratory rabbitsby inoculating them with tuberculous tissue from a cadaver.This report was immediately assailed by the French medicalestablishment, notably Herman Pidoux, who strongly main-tained that there had to be more modern and more socialsolutions to the problem of TB, which he and others felt arosein the poorer (working) classes from external causes like mal-

    VOL. 16, 2003 M. TUBERCULOSIS PATHOGENESIS AND VIRULENCE DETERMINANTS 465

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • nutrition, poor sanitation, and overwork. The report by RobertKoch 17 years later (155), which conclusively showed that TBwas indeed caused by a bacterium discredited many of Pidouxsarguments. However, belief in the societal causes of TB stillcontinued into the early 20th century as the revolutionarysyndicalist movement in France, in their struggle for an 8-hworking day, used TB as an example of a disease that wascaused by overwork and malnutrition. Contemporary expo-nents of this view tried to discredit Kochs conclusive experi-ments, using arguments similar to those of Northern Europeandoctors of the 17th century and Pidoux and his colleagues (17).

    Starting with Edward Trudeaus work in the late 19th andthe early 20th centuries, the apparent dichotomy in explainingthe etiology of tuberculosis was resolved. In a classic experi-ment, which by todays standards might be considered statisti-cally limited, he showed that TB could be induced in rabbitswith a purified culture of virulent M. tuberculosis but that theenvironmental conditions in which the animals were main-tained greatly influenced the course of the disease (290). Inthis study, five M. tuberculosis-infected rabbits were kept in acrowded, dark cage with minimal food. Of these, four died ofTB within 3 months, and one became severely ill with thedisease. When five similarly infected animals were allowed tolive outdoors on a small island with additional food, one rabbitdied within a month of infection but the other four were stillalive after 6 months, with no sign of the disease. The controlseries, i.e., five uninfected rabbits confined to a dark, crowdedcage with little food, became malnourished and clearly un-happy but did not get TB (290). This simple experiment gavescientific validity to the treatment of TB (fresh air and amplefood) that was the basis of the TB sanitarium movementstarted by European physicians in the mid-1800s and that wasalso used by Trudeau in his Saranac Lake TB treatment centerthat opened in 1884. The history of research and treatment ofTB at the Trudeau Institute has been described in a fascinatingand informative review (57).

    Thus, TB is caused by a bacterium, but environmental fac-tors play a major role, an idea that Rene Dubos clearly rear-ticulated 50 years ago (77). To Dubos, purely medical solutionsalone would not work to cure and prevent TB. Unfortunately,the events of the last half of the 20th century have shown howprescient he was. The antibiotic era, begun by the discovery ofstreptomycin by Schatz and Waksman in the 1940s and its useto treat TB and followed by the introduction of many otherantibiotics like isoniazid, rifampin, and pyrazinamide that areuseful against TB, has not eliminated the disease (248). Like-wise, the widespread use of BCG, an attenuated vaccine strainproduced by the sequential passage of a virulent M. bovis strainby Calmette and Guerin in Paris in the 1920s, has not loweredthe incidence of TB in recent years (4), and there is more TBtoday than ever before (301). Clearly, new vaccines and drugsare needed for TB control, and approaches discussed in thisreview are designed to help in this search. However, it is alwaysimportant to remember Dubos cautionary words, whichstressed the social nature of TB.

    CLINICAL MANIFESTATIONS

    As discussed earlier in this review, pulmonary TB has beenvariously described as consumption and phthisis, both terms

    indicating the severe wasting and the coughing of blood asso-ciated with later stages of the disease. Potts disease or spinaltuberculosis, marked by spinal deformity and other bone de-fects, was named after an 18th-century English physician, butHippocrates thought there was a great similarity between thisbone disease and pulmonary tuberculosis and possibly a com-mon origin. Scrofula, or cervical lymphadenitis, was a commondisease in the middle ages that presented with swelling oflymph nodes in the neck. It was also called The Kings Evilbecause of the myth that it could be cured by the touch of areigning monarch. Villemin (mentioned above) showed in the1860s that scrofula and pulmonary TB had an identical cause.Tuberculosis also can develop in the central nervous system, inwhich case meningitis is the predominant form of the disease,and also in the urogenital tract, the digestive system, and cu-taneously in the form named lupus vulgaris. The incidence ofthese various extrapulmonary forms of tuberculosis varies fromcountry to country, such that on the average between 1964 and1989, 20% of the 20,000 new cases of TB in the United Stateswere extrapulmonary while 5 to 10% of the approximatelyseven million new cases each year in the developing countrieswere extrapulmonary (283). This distribution also can be af-fected by origin of the individuals within a country. In onestudy of TB patients in England, 20% of patients of Europeanorigin had extrapulmonary TB, of which lymph node, bone andjoint, and genitourinary involvement accounted for almost90%. Of patients whose origin was on the Indian subcontinent,45% had extrapulmonary tuberculosis, and 60% of these sitesof infection were in lymph nodes and in bones and joints (319).Autopsies of deceased human immunodeficiency virus (HIV)-negative TB patients in another study in New York Cityshowed that 68% had extrapulmonary TB whose lesions werewidely and randomly distributed throughout the body with noapparent predilection for a limited number of sites as noted inthe English study (143).

    In modern times, most TB infections are initiated by therespiratory route of exposure now that milk products are gen-erally pasteurized, at least in the developed world. One studyin 1978, prior to the AIDS epidemic, showed that 85% of newTB cases were pulmonary (132). Thus, the different forms ofthe disease discussed above usually arise from dissemination ofthe bacilli from infected lungs. TB in many cases follows ageneral pattern as described by Wallgren, who divided theprogression and resolution of the disease into four stages(302). In the first stage, dating from 3 to 8 weeks after M.tuberculosis contained in inhaled aerosols becomes implantedin alveoli, the bacteria are disseminated by the lymphatic cir-culation to regional lymph nodes in the lung, forming theso-called primary or Ghon complex. At this time, conversion totuberculin reactivity occurs. The second stage, lasting about 3months, is marked by hematogenous circulation of bacteria tomany organs including other parts of the lung; at this time insome individuals, acute and sometimes fatal disease can occurin the form of tuberculosis meningitis or miliary (disseminat-ed) tuberculosis. Pleurisy or inflammation of the pleural sur-faces can occur during the third stage, lasting 3 to 7 monthsand causing severe chest pain, but this stage can be delayed forup to 2 years. It is thought that this condition is caused byeither hematogenous dissemination or the release of bacteriainto the pleural space from subpleural concentrations of bac-

    466 SMITH CLIN. MICROBIOL. REV.

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • teria in the lung. The free bacteria or their components arethought to interact with sensitized CD4 T lymphocytes that areattracted and then proliferate and release inflammatory cyto-kines (149). The last stage or resolution of the primary com-plex, where the disease does not progress, may take up to 3years. In this stage, more slowly developing extrapulmonarylesions, e.g, those in bones and joints, frequently presenting aschronic back pain, can appear in some individuals. However,most humans who are infected with TB do not exhibit progres-sion of the disease. One-third of exposed HIV-negative indi-viduals become infected, and of this number 3 to 5% developTB in the first year. An additional 3 to 5% of those infecteddevelop TB later in their lives. It is thought that most adult TBin non-HIV-infected patients is caused by reactivation of pre-existing infection (104). HIV-positive persons infected with M.tuberculosis have a 50% chance of developing reactivation(postprimary) TB at some time in their lives. These individualsand others who are immunosuppressed can also be newly in-fected with M. tuberculosis and in many cases show rapid pro-gression to active disease (105). Adult TB, whether resultingfrom activation or new infection in HIV-infected patients, isalmost always pulmonary and is associated with differing de-grees of lung involvement and damage, notably necrosis, cav-itation, and bleeding (143).

    M. TUBERCULOSIS VIRULENCE AND THEDISEASE PROCESS

    The previous section described the different stages of humanTB at the level of the infected patient and the involved organsystems. This section summarizes events in infection from thecellular and molecular viewpoint of both the infecting bacte-rium and its host. An excellent review of host innate immunityand responses to M. tuberculosis infection has recently ap-peared in this journal (297), and to avoid repetition, someareas are mentioned only briefly.

    Events in the Infectious Process

    Early events. As discussed above, M. tuberculosis usuallyenters the alveolar passages of exposed humans in an aerosoldroplet, where its first contact is thought be with residentmacrophages, but it is also possible that bacteria can be initiallyingested by alveolar epithelial type II pneumocytes. This celltype is found in greater numbers than macrophages in alveoli,and M. tuberculosis can infect and grow in these pneumocytesex vivo (24, 190). In addition, dendritic cells play a very im-portant role in the early stages of infection since they are muchbetter antigen presenters than are macrophages (286) and pre-sumably play a key role in activating T cells with specific M.tuberculosis antigens (31, 114). Since dendritic cells are migra-tory, unlike differentiated macrophages (164), they also mayplay an important role in dissemination of M. tuberculosis.However, this discussion is limited to the much more exten-sively studied and better understood M. tuberculosis-macro-phage interaction. The bacteria are phagocytosed in a processthat is initiated by bacterial contact with macrophage mannoseand/or complement receptors (254). Surfactant protein A, aglycoprotein found on alveolar surfaces, can enhance the bind-ing and uptake of M. tuberculosis by upregulating mannose

    receptor activity (107). On the other hand, surfactant proteinD, similarly located in alveolae, inhibits phagocytosis of M.tuberculosis by blocking mannosyl oligosaccharide residues onthe bacterial cell surface (90), and it is proposed that thisprevents M. tuberculosis interaction with mannose receptors onthe macrophage cell surface. Cholesterol in cell plasma mem-branes is thought to be important for this process, since re-moval of this steroid from human neutrophils decreases thephagocytosis of M. kansasii (221) and similar depletion exper-iments prevented the entry of M. bovis BCG into mouse mac-rophages (106). The human toll-like receptor 2 (TLR2) alsoplays a role in M. tuberculosis uptake (201), and this importantinteraction with bacterial components is discussed later in thisreview. On entry into a host macrophage, M. tuberculosis andother intracellar pathogens initially reside in an endocytic vac-uoule called the phagosome. If the normal phagosomal matu-ration cycle occurs, i.e., phagosome-lysosome fusion, thesebacteria can encounter a hostile environment that includesacid pH, reactive oxygen intermediates (ROI), lysosomal en-zymes, and toxic peptides. Reactive nitrogen intermediates(RNIs) produced by activated mouse macrophages are majorelements in antimicrobial activity (197), and mice with muta-tions in the gene encoding the macrophage-localized cytokine-inducible nitric oxide synthase gene are more susceptible tovarious pathogens, including Leishmania major (311), Listeriamonocytogenes (169), and M. tuberculosis (168). The M. tuber-culosis result is consistent with the results of other experimentsshowing that RNIs are the most significant weapon againstvirulent mycobacteria in mouse macrophages (48, 50) and theobservation that resistance to RNIs among various strains ofM. tuberculosis correlates with virulence (48, 50, 202). Thepresence of RNIs in human macrophages and their potentialrole in disease has been the subject of controversy, but thealveolar macrophages of a majority of TB-infected patientsexhibit iNOS activity (200).

    Since most macrophage killing of bacteria occurs in thephagolysosome (89), intracellular pathogens have evolvedmany ways to avoid this hostile vacuolar microenvironment.Listeria and Shigella physically escape the phagosome and rep-licate in the cytoplasm (252), and Legionella inhibits phago-some-lysosome fusion (134). Salmonella enterica serovar Ty-phimurium phagosomes also are diverted from the normalendocytic pathway of phagosome-lysosoma fusion (42, 233),and this bacterium requires acidification of the phagosome tosurvive in macrophages (234). Pathogenic mycobacteria alsoinhibit phagosome-lysosome fusion (6, 98), but unlike the sit-uation for Salmonella, the mycobacterial phagosome is notacidified (60). This is presumably due to the exclusion of pro-ton ATPases from the mycobacterial phagosome (281), but it isnot clear that the blocking of endosomal maturation is essen-tial for M. tuberculosis survival in macrophages. Live M. tuber-culosis cells can made to traffic to late endosomes by opsoniza-tion with polyclonal antibodies against M. tuberculosis H37Rv,which presumably directs bacterial binding to Fc receptors.However, this rerouting has no effect on bacterial growth inmouse peritoneal macrophages (6). On the other hand, a re-cent study in which human monocyte-derived macrophages(MDMs) were infected with M. tuberculosis Erdmann opso-nized with a polyclonal antibody raised against the M. tuber-culosis cell surface glycolipid lipoarabinomannan (LAM)

    VOL. 16, 2003 M. TUBERCULOSIS PATHOGENESIS AND VIRULENCE DETERMINANTS 467

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • showed that this treatment causes 80% loss of bacteria as wellas increased trafficking to late, more acidic endosomes (175).The different results in these two experiments have not beenresolved but could be in part due to the source of the macro-phages, the nature of the antibodies, and the bacterial strainsused. An interesting finding in the latter work is that Ca2

    signaling is inhibited when M. tuberculosis enters human mac-rophages but not when killed M. tuberculosis or antibody-op-sonized M. tuberculosis cells are phagocytosed (175). This ef-fect was correlated with trafficking to late endosomes; i.e.,elevated Ca2 levels were associated with phagolysosome for-mation. Since Ca2 can stimulate many host responses to in-fection, e.g., the respiratory burst as well as NO and cytokineproduction, preventing increases in Ca2 levels would help M.tuberculosis avoid these host defense mechanisms. It has alsobeen postulated that a selective advantage to M. tuberculosis ofstaying in an early endosome is that there would be less hostimmunosurveillance by CD4 T cells. In agreement with thisidea, there is a decrease in the expression of major histocom-patibility complex class II (MHC-I) proteins and in theMHC-II presentation of bacterial antigens in macrophagesafter M. tuberculosis infection (201). As discussed below, thiseffect seems to be induced by presence of the secreted orsurface-exposed M. tuberculosis 19-kDa lipoprotein, which isthought to interact with TLR2 in the early phase of bacterialentry into macrophages (287). The mechanism by which viru-lent mycobacteria prevent phagosomal maturation is notknown, but in the normal maturation of the mycobacterialphagosome there is a successive recruitment of Rab proteins,which are small GTPases involved in endosome trafficking; i.e.,Rab5 associates with early endosomes, and Rab7 is found inlater endosomes. The M. tuberculosis phagosome that doescontain Rab5 does not recruit Rab7 (298). Also, TACO, amember of the coronin family of actin binding proteins, ispreferentially recruited to the mycobacterial phagosome ofinfected murine macrophages, where it was reported to beretained in phagosomes containing live and not killed M. bovisBCG (91). However, a more recent study, in which phago-somes and other macrophage organelles were isolated, hasshown that the association of coronin with phagosomes con-taining live M. bovis BCG in both murine and human macro-phages is transient and is retained only on phagosomes con-taining clumped bacteria (257). These latter results suggestthat coronin is not involved with the arrest in phagosometrafficking observed in M. tuberculosis infections of macro-phages. It is also not known whether the exclusion of Rab7and/or the decreased Ca2 signaling discussed above is directlyresponsible for this block in phagosome maturation or is asecondary consequence.Later events. The relative ease of working with tissue culture

    has provided many data on M. tuberculosis entrance and traf-ficking in the macrophage and on other responses of the in-fected cells, as discussed above, but there is much less infor-mation on how the bacterium survives and grows during laterstages of infection in the lung. It is known that infected mac-rophages in the lung, through their production of chemokines,attract inactivated monocytes, lymphocytes, and neutrophils(297), none of which kill the bacteria very efficiently (89).Then, granulomatous focal lesions composed of macrophage-derived giant cells and lymphocytes begin to form. This process

    is generally an effective means of containing the spread of thebacteria. As cellular immunity develops, macrophages loadedwith bacilli are killed, and this results in the formation of thecaseous center of the granuloma, surrounded by a cellular zoneof fibroblasts, lymphocytes, and blood-derived monocytes (63).Although M. tuberculosis bacilli are postulated to be unable tomultiply within this caseous tissue due to its acidic pH, the lowavailability of oxygen, and the presence of toxic fatty acids,some organisms may remain dormant but alive for decades.The strength of the host cellular immune response determineswhether an infection is arrested here or progresses to the nextstages. This enclosed infection is referred to as latent or per-sistent TB and can persist throughout a persons life in anasymptomatic and nontransmissible state. In persons with ef-ficient cell-mediated immunity, the infection may be arrestedpermanently at this point. The granulomas subsequently heal,leaving small fibrous and calcified lesions. However, if an in-fected person cannot control the initial infection in the lung orif a latently infected persons immune system becomes weak-ened by immunosuppressive drugs, HIV infection, malnutri-tion, aging, or other factors, the granuloma center can becomeliquefied by an unknown process and then serves as a richmedium in which the now revived bacteria can replicate in anuncontrolled manner. At this point, viable M. tuberculosis canescape from the granuloma and spread within the lungs (activepulmonary TB) and even to other tissues via the lymphaticsystem and the blood (miliary or extrapulmonary TB). Whenthis happens, the person becomes infectious and requires an-tibiotic therapy to survive (63).

    Currently, there is little information concerning how M. tu-berculosis responds to the environment of the lung, preventingthe development of rational strategies for treating latent andchronic infections as well acute manifestations of the disease.Some experiments on M. tuberculosis persistence or laten-cy have been performed, using two chronic-infection modelsin mice in which bacteria can be maintained in a steady state inthe absence of disease or are actually not cultivatible (reviewedin reference 95). However, it is still not certain whether bac-teria in the chronic-disease models are actually viable but non-growing, which would reflect a true latent state, or whetherthey are growing and dying at the same rate. The fact that M.tuberculosis in a chronically infected mouse model is suscepti-ble to isoniazid (INH) (238), a drug that is effective onlyagainst growing M. tuberculosis (308), provides evidence for thelatter explanation, i.e., balanced growth and death. There isbiochemical evidence that the intermediary metabolism of M.tuberculosis changes, during the course of chronic mouse in-fections, from an aerobic, carbohydrate-metabolizing mode toone that is more anaerobic and lipid utilizing (258, 259). It isonly in recent years that the full significance of these apparentchanges in intermediary metabolism for acute or chronic in-fection has become apparent, as discussed later in this review.

    DEFINING M. TUBERCULOSIS VIRULENCE

    What makes M. tuberculosis virulent? Unfortunately, there isno simple answer yet, despite the knowledge obtained in thelast 100 or more years. M. tuberculosis does not have classicalvirulence factors like those which are the major causes ofdiseases due to other bacterial pathogens, e.g., toxins produced

    468 SMITH CLIN. MICROBIOL. REV.

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • by Corynebacterium diphtheriae, Escherichia coli O157:H7, Shi-gella dysenteriae, and Vibrio cholerae. The goal of much of theresearch discussed in this review is to help answer this ques-tion, i.e., to define M. tuberculosis virulence by finding thefactors that are important for the progression of TB. Whilethere is limited knowledge of how M. tuberculosis causes dis-ease, its virulence can be measured. This quantitative view canthen used to ascertain the effects of modifying the bacteriumon the disease process. The standard terms mortality andmorbidity are used for a description of M. tuberculosis viru-lence and can be defined in the following ways: mortality is thepercentage of infected animals that die and is also measured asthe time taken for an animal to die after being infected. An-other important parameter that is usually associated with vir-ulence is bacterial load or burden, i.e., numbers of bacteriafound in the infected host after the initial infection. This in-formation allows a comparison of the fitness of different bac-terial strains to survive host responses during an infection. Inaddition, M. tuberculosis virulence mutants that have lowerbacterial loads during animal infections exhibit differentgrowth curves during this process; in one publication, theywere grouped into various classes: sgiv (for severe growth invivo) (i.e., mutants that do not replicate at all but either arecleared rapidly or can persist with no increase in cell numbers),giv (for growth in vivo) (i.e., mutants that grow initially but atlower rates than the wild type), and per (for persistencegenes) (i.e., those growing normally at earlier stages but de-clining in numbers at the onset of cell-mediated immunity)(109). This classification of mutations, especially when moreare obtained, will be useful for understanding how the stages ofthe infectious process are controlled by different bacterialgenes. In this review, to conform to standard genetic nomen-clature, M. tuberculosis mutants that show attenuated growth inmice are classified with the same terminology but as pheno-types, i.e., SGIV, GIV, and PER. Morbidity, as measured byhistopathology analyses, is important to characterize anotherclass of M. tuberculosis mutants affecting virulence, i.e., thosethat do not affect the bacterial load. An example of one ofthese is the M. tuberculosis sigH mutant, which shows normalM. tuberculosis growth and survival in macrophages and mice(151, 181) but whose histopathology in infected mouse lungs ismuch lower than that caused by wild-type M. tuberculosis (151).

    To better measure morbidity and mortality caused by M.tuberculosis it is important to understand the pathogenesisassociated with TB. Uncontrolled M. tuberculosis growth in itshuman host, given the usual site of the infection, is associatedwith extensive lung damage that ultimately causes death bysuffocation due to insufficient oxygen. This anoxia is caused theobliteration of lung parenchymal cells involved in oxygen up-take as well as obstruction of bronchiolar passages by granu-lomatous growths and by blood released during the rupture ofliquefied granulomas in adjacent lung tissue (104). Other un-treated forms of tuberculosis such as tubercular meningitis,which occurs in the meningial membranes of the brain, canresult in death because of inflammation in brain tissue and theresulting hydrocephalus and seizures. Tuberculomas, anotherform of TB in the brain, are large structures formed by by theenlargement of brain granulomas, also due to inflammatoryresponses, and they are also associated with seizures (329).

    Inflammatory responses are also thought to play a role in otherextrapulmonary manifestations of TB, e.g., in bone (189).

    Inflammation is a keyword here, since the growth of M.tuberculosis elicits inflammatory host responses that are neces-sary to control infections but can also cause extensive tissuedamage. Among the cellular agents involved in tissue destruc-tion are various proteases like cathepsin D (196) that are alsothought to be major factors in the liquefaction of granulomas(58). In addition, M. tuberculosis uptake can cause apoptosis ofmacrophages (153, 157), and this could play a role in adjacenttissue damage. A key cytokine in the inflammatory or Th1response of the cellular immune system is tumor necrosis fac-tor (TNF-), which is necessary to control infection. Micethat are unable to produce or respond to TNF- cannot formgranulomas to restrict bacterial dissemination (260). However,when this cytokine is present in large amounts during an aero-sol model of mouse infection, severe lung inflammation andearly death occurs (21). TNF- is a major determinant ofdisease in a rabbit model of TB meningitis, since there is adirect correlation between the extent of disease caused byseveral M. bovis and M. tuberculosis strains and levels of thiscytokine in the cerebrospinal fluid (291). However, data fromanalyses of cytokine responses and virulence in mice infectedwith various M. tuberculosis strains indicate that there are fac-tors additional to TNF- in TB progression. M. tuberculosisCDC1551 is a clinical strain that was originally thought to behighly virulent (142); it has more recently been shown thatCDC1551 induces levels of cytokines, including TNF-, thatare higher than those induced by other M. tuberculosis strainsin mice. However, it is not more virulent than the other strains,as defined by bacterial load and mortality (177). Similar resultswere obtained when the virulence of CDC1551 and H37Rvwere compared in a rabbit model of infection (29). Also, an-other study compared the ability of two clinical M. tuberculosisstrains, HN878 and NHN5, to cause disease and to elicit acytokine response in a mouse model. HN878, the more virulentof the two as determined by mortality measurements, inducedsmaller amounts of TNF- and other inflammatory cytokinesthan NHN5 did (178). Interestingly, HN878 induced higherlevels of alpha interferon (IFN-) and Th2 cytokines like in-terleukin-4 (IL-4).

    It has also been reported that the apoptosis that is some-times observed when M. tuberculosis infects macrophages isdependent on TNF- and that more virulent M. tuberculosisstrains cause less apoptosis (10). The above experiments illus-trate the complexity of the immune system and its effectors,since their results are not consistent with a simple direct rela-tionship between the levels of one or a few cytokines such asTNF- and the progression of the disease in clinical or modelsettings. Clearly, an optimal balance of these immunomodula-tors is crucial. Despite the different results, which make itdifficult to interpret the data in these studies, they are of valuebecause they show that some M. tuberculosis strains are morevirulent than others in animal models (78).

    Models for Measuring M. tuberculosis Virulence

    Animal models. M. tuberculosis virulence is studied both intissue culture, mainly using macrophages and more recentlydendritic cells (31, 126, 145) and pneumocytes (24, 28), and in

    VOL. 16, 2003 M. TUBERCULOSIS PATHOGENESIS AND VIRULENCE DETERMINANTS 469

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • animal models. While tissue culture models are easier to workwith and give faster results, they are limited to the early stagesof infection. Thus, the animal models are ultimately betterbecause all the stages of TB can be studied. The choice ofanimal models for virulence studies is important, and the threemajor models, mice, guinea pigs, and rabbits, each have theiradvantages and disadvantages. Mice are the most frequentlyused in vivo models because of their well-studied genetics (theexistence of inbred strains, including some that have mutationsin the immune system) the availability of reagents to measurecytokine levels, and their low costs of maintenance relative toother animal models (205). Another big advantage to mouseexperiments is that there are inbred strains that show widelydifferent levels of resistance to M. tuberculosis and other patho-gens, which allows the mapping of pertinent loci. In one suchstudy that analyzed the offspring of a mating between therelatively sensitive C3H mouse strain and the resistant B6strain, a locus, sst1, was mapped that had a major effect onsusceptibility to tuberculosis (156). However, it was shown inthese experiments that resistance or susceptibility to M. tuber-culosis is a complex trait and that other, unidentified loci mustalso play a role. In an ambitious and significant study, a set ofrecombinant congenic mouse strains that were made by matingstrains A/J and C57/BL6 is now being used to chromosomallymap loci conferring resistance or sensitivity to bacterial infec-tions (97). Hopefully, these mouse strains will be used in thenear future to map the other loci determining host resistanceto M. tuberculosis infections. The progression of TB in mice isunlike that in humans, in that the granulomas formed are notas distinct, but the fact that mice are generally not as sensitiveto the disease as other animal models and can become chron-ically infected is more like the human situation. Guinea pigsare very sensitive to M. tuberculosis infection, and the stages ofthe disease, including early stages of granuoma formation, inthis animal are similar to those in humans (205). The disad-vantages are the lack of inbred strains and reagents as well asthe high maintenance costs. The rabbit model has one bigadvantage over the other animal system in that the lung gran-ulomas formed during the disease show the same progressionof stages, i.e., caseation, liquefaction, and cavitation, as ob-served in advanced cases of human TB (58). The disadvantagesof rabbits are similar to those of guinea pigs, but their upkeepis even more expensive.Macrophages. Since M. tuberculosis is an intracellular patho-

    gen and infects macrophages primarily, these phagocytic cellsare also used to analyze the virulence of M. tuberculosis strainsand mutants. These ex vivo studies serve as a model for theearly stages of infection, which involve the phagocytosis of M.tuberculosis by resident macrophages in the lung alveoli. Sincehuman alveolar macrophages are difficult to obtain, modelmacrophage systems are generally used. These can be frommice or humans and can be primary cultures or immortalizedcell lines. Macrophages from other animals are not as usefulbecause of the lack of suitable reagents. Mouse macrophagescan be primary, isolated from bone marrow, from lung alveoliby bronchial lavage, and from peritoneal exudates after injec-tion of thioglycolic acid into the peritoneal cavities of mice.Primary macrophages are natural (not immortalized) and aremore representative of the actual in vivo situation, but they areusually harder to obtain and are more variable, especially mac-

    rophages from human donors. There are many mouse macro-phage cell lines available, including the widely used J774 lineand MH-S cells, the latter being an immortalized alveolarmacrophage cell line whose behavior is very similar to that ofprimary mouse alveolar macrophages (191). Since the mouse isthe most widely used animal model, there are advantages tousing macrophages from this mammal, including reagent avail-ability, as discussed above. In addition, the use of primarymouse cells allows the preparation of macrophages from ani-mals with defined mutations so that the effect of specific hostfactors on interactions with M. tuberculosis at the macrophagelevel can be tested (83). Activation of primary mouse macro-phages or macrophage cell lines by the addition of IFN- andlipopolysaccharide, which induces the levels of the iNOS en-zyme needed to form NO, is necessary to observe the M.tuberculosis-killing activity of these cells (50).

    Human macrophages are also widely used, and a big advan-tage of these studies is that the early stages of human diseasecan be studied. Human macrophages used in these experi-ments are primary cultures derived from peripheral bloodmonocytes (MDMs) that are allowed to differentiate into mac-rophages, as well as the more difficult-to-obtain alveolar mac-rophages derived by branchiolar lavage. Transformed mono-cytic cell lines like THP-1, which can be differentiated intomacrophage-like phagocytic cells by the addition of phorbolesters (292), are frequently used, as are similar immortalizedmonocytic cell lines like U937 (310). Studies have shown thatdifferentiated THP-1 macrophages are quite similar to humanMDMs in their response to M. tuberculosis infection (279). Inaddition, many reagents are available to study human hostresponses to M. tuberculosis infection. Human macrophagesfrom TB-negative donors or cell lines do not kill M. tuberculosiswell, presumably because they do not produce NO (69). This isan advantage if one is testing the survival of bacterial mutants,since the wild-type M. tuberculosis cells usually grow well andmutant growth phenotypes can be more accurately comparedthan in activated mouse macrophages (182).

    There are certain caveats in using macrophages for virulencestudies, however. Some M. tuberculosis mutants do not exhibitan attenuated phenotype in macrophages yet are defective forgrowth in mice and/or cause fewer histopathologic changes, asdiscussed later in this review (74, 275). Thus, genetic selectionsor screens for M. tuberculosis virulence phenotypes that usemacrophages alone may miss some attenuated M. tuberculosismutants. Another problem is that macrophages isolated fromdifferent organs of the same animal may respond differently toM. tuberculosis infection. A dramatic example of these tissuedifferences comes from a recent comparative study of the in-teractions between M. tuberculosis H37Rv and primary macro-phages isolated from the sensitive I/St and resistant A/S mousestrains (174). In these experiments, macrophages isolated fromlungs of the sensitive mice were killed more readily by M.tuberculosis and permitted better bacterial growth than didtheir counterparts from the resistant strain. On the other hand,peritoneal macrophages from the sensitive strain were moreresistant to M. tuberculosis killing and restricted bacterialgrowth more than did the resistant peritoneal macrophagesisolated from the resistant mice.

    470 SMITH CLIN. MICROBIOL. REV.

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • GENETICS OF M. TUBERCULOSIS

    Until quite recently, the genetics of M. tuberculosis was aneglected subject because of difficulties in working with theorganism and lack of suitable tools. A review published asrecently as 1994 stated that this field . . . is still in its infancy(55), but the study of mycobacterial genetics has blossomed inrecent years, as demonstrated by the publication an entirebook dedicated to this topic (122). This is due to the develop-ment of many genetic methods, mainly by the Gicquel andJacobs laboratories (109, 218), and to the DNA sequencing

    and annotation of the M. tuberculosis H37Rv genome (53) andthose of related mycobacteria that have been or are currentlybeing completed by The Institute for Genomic Research (94)and by the Sanger Center-Pasteur Institute consortium.

    Description of the M. tuberculosis Genome

    The M. tuberculosis H37Rv genome consists of 4.4 106 bpand contains approximately 4,000 genes (Fig. 1) (53). Annota-tion of the M. tuberculosis genome shows that this bacterium

    FIG. 1. Circular map of the chromosome of M. tuberculosis H37Rv. The outer circle shows the scale in megabases, with 0 representing the originof replication. The first ring from the exterior denotes the positions of stable RNA genes (tRNAs are blue, and others are pink) and thedirect-repeat region (pink cube); the second ring shows the coding sequence by strand (clockwise, dark green; anticlockwise, light green); the thirdring depicts repetitive DNA (insertion sequences, orange; 13E12 REP family, dark pink; prophage, blue); the fourth ring shows the positions ofthe PPE family members (green); the fifth ring shows the positions of the PE family members (purple, excluding PGRS); and the sixth ring showsthe positions of the PGRS sequences (dark red). The histogram (center) represents the GC content, with 65% GC in yellow and 65% GCin red. The figure was generated with software from DNASTAR. Reprinted from reference 53 with permission from the senior author and thepublisher.

    VOL. 16, 2003 M. TUBERCULOSIS PATHOGENESIS AND VIRULENCE DETERMINANTS 471

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • has some unique features. Over 200 genes are annotated asencoding enzymes for the metabolism of fatty acids, compris-ing 6% of the total (Table 1). Among these are approximately100 that are predicted to function in the -oxidation of fattyacids, while E. coli only has 50 enzymes involved in fatty acidmetabolism. The distantly related actinomycete Streptomycescoelicolor has a total of 115, corresponding to a little more than1% of the proteins, of which 59 are annotated as being in-volved in fatty acid degradation (23). This large number of M.tuberculosis enzymes that putatively use fatty acids may berelated to the ability of this pathogen to grow in the tissues ofthe infected host, where fatty acids maybe the major carbonsource. This important aspect of M. tuberculosis physiologyduring infection is described later in this review.

    Another unusual feature of the M. tuberculosis genome is thepresence of the unrelated PE and PPE families of acidic, gly-cine-rich proteins. The names come from the Pro-Glu (PE)and Pro-Pro-Glu (PPE) sequences found in the two conservedN-terminal regions in each of these protein families that areapproximately 110 and 180 amino acids long, respectively. The172 genes, 104 of the PE class and 68 of the PPE variety,comprise over 4% of the genes in M. tuberculosis, and similarlevels of abundance are noted in other members of the M.tuberculosis complex for which sequence data are available(39). PE and PPE genes are not unique to members of the M.tuberculosis complex, since M. leprae has 26 genes for these twofamilies. Nineteen of these are pseudogenes, reflecting theextensive physical and genetic downsizing of the M. lepraegenome by deletion and mutation during the evolution of thisobligate parasite (54). M. marimum, a pathogenic mycobacte-rium that infects frogs and fish and causes a TB-like disease,has some PE genes that are involved in virulence (231). Theseproteins are not restricted to pathogenic mycobacteria, sinceM. smegmatis has some proteins of the PE-PGRS family (13).In M. tuberculosis, proteins encoded by the 104 PE genes canbe further subdivided into three classes, containing 29 proteinswith the PE region alone, 8 proteins in which the PE region isfollowed by unrelated C-terminal sequences, and 67 proteinsthat form the PE-PGRS subfamily. This group of proteins hasthe conserved PE domain followed by C-terminal extensionswith multiple repeats of Gly-Gly-Ala or Gly-Gly-Asn that arein the PGRS (for polymorphic GC-rich repetitive sequences)domains. The function of these large families of related pro-teins is unknown, but size variation has been observed in mem-

    bers of the PE-PGRS subfamily family in clinical TB strains,and many of these proteins have been localized in the cell walland cell membrane (13). These data and the antigenicity ofthese proteins have led to the hypothesis that at least some ofthese proteins may be involved in antigenic variation of M.tuberculosis during infection (13). Significantly, many of the PEgenes that encode proteins containing only the 110-amino-aciddomain are closely followed by a gene encoding a PPE protein.In one case, a tandem PE-PPE pair, Rv2431c-Rv2430c, arecoexpressed and can form a complex (M. Strong and D. Eisen-berg, personal communication).

    Otherwise, M. tuberculosis is not remarkable in its annotatedcontent of most groups of genes and their products that arefound in microorganisms. For example, among transcriptionalregulatory proteins, M. tuberculosis has 13 sigma factors (pro-teins that confer transcriptional specificity on RNA poly-merases), corresponding to 0.3% of the total genes and, 22other regulatory proteins, including 13 two-component re-sponse regulators (usually transcriptional regulators that areactivated by and serve to transduce environmental signals),corresponding to 0.6% of the total. These numbers are quitesimilar to the frequency of genes that encode these regulatorsin the genomes of C. diphtheriae, Bacillus subtilis, and E. coli.This is much lower than the corresponding numbers for thesoil-dwelling, spore-forming S. coelicolor, which has 55 sigmafactor genes, 0.7% of the total, and 74 response regulatorgenes, over 2% of the total (23). It has been postulated that thesoil environment in which Streptomyces species are found hasselected for the ability of these microorganisms to adapt toradically changing conditions. This would occur by by geneduplication and divergent evolution that would give rise tomany transcriptional regulators, allowing appropriate bacterialresponses to a changeable environment. In agreement with thisidea of environmental selection, the number of predictedtransport proteins encoded in the S. coelicolor genome is 614,corresponding to 8% of the total genes. The equivalent num-ber in M. tuberculosis is 125 annotated genes for transportfunctions, corresponding to 3% of the total. Circumstantialevidence for this idea of gene duplication is provided by thefact that the S. coelicolor genome is over 8.5 106 bp, twice asbig as that of M. tuberculosis and encoding twice as manyproteins (23). There must be additional reasons for these dra-matic differences in the Streptomyces genome compared toother eukaryotes. B. subtilis is also a spore-forming soildweller, but its 4 106-bp genome has levels of regulatory andtransporter proteins that are similar to bacteria other thanStreptomyces.

    Methods of Genetic Analysis in Mycobacteria

    The complete sequence and annotation of the M. tuberculo-sis genome has allowed many new genetic approaches to stud-ies of the physiology and pathogenicity of this organism, butmuch important work was done in this area before the ge-nome-sequencing project was initiated. These pregenome ap-proaches dealt largely with developing methods for creatingmutations in specific genes. The choice of which genes to useand ultimately inactivate in order to study virulence was fre-quently based on on the existence of naturally mutations oc-curing in normally virulent strains that affected pathogenicity,

    TABLE 1. General classification of M. tuberculosis genes

    Function No. ofgenes% oftotal

    % of Totalcoding capacity

    Lipid metabolism 225 5.7 9.3Information pathways 207 5.2 6.1Cell wall and cell processes 517 13.0 15.5Stable RNAs 50 1.3 0.2IS elements and bacteriophages 137 3.4 2.5PE and PPE proteins 167 4.2 7.1Intermediary metabolism and

    respiration877 22.0 24.6

    Regulatory proteins 188 4.7 4.0Virulence, detoxification and adaptation 91 2.3 2.4Conserved hypothetical function 911 22.9 18.4Proteins of unknown function 607 15.3 9.9

    472 SMITH CLIN. MICROBIOL. REV.

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • e.g., sigA and katG (56, 317), or predictions of which genesshould be important in some aspect of M. tuberculosis virulenceand/or physiology by inference from studies of other bacterialpathogens, e.g., genes encoding sigma factors and iron acqui-sition regulators (267). The following discussion addressesmethods used for genetic analysis of mycobacterial species.Their application to the identification and characterization ofM. tuberculosis genes that play a role in virulence is discussedlater in this review. There are some methods, such as bacterialconjugation (211) and generalized bacteriophage-mediatedtransduction (282), that have been used for genetic studies ofM. smegmatis but are not discussed here because they have notyet been successfully applied to M. tuberculosis.Initial genetic studies. Early studies on the creation of mu-

    tations in mycobacteria concentrated on the faster-growingnonpathogenic species because of the relative ease of workingwith these bacteria. There is no requirement for biosafety level3 containment facilities, and the experiments are relativelyrapid; e.g., M. smegmatis has a 3-h generation time while thatof M. tuberculosis is 20 to 24 h. Several techniques were devel-oped to inactivate genes in these bacteria, with the first reportsof M. smegmatis and M. bovis BCG transformation being pub-lished in 1988 (268) and the subsequent development of ahighly transformable M. smegmatis strain, MC2155, being pub-lished in 1990 (269). These and more recent articles have beenreviewed (187). While these methods had some success ininvestigating M. tuberculosis, the process was still difficult. Inaddition to its extremely slow growth, which makes it time-consuming to do the standard types of gene inactivation (ittakes 3 weeks for a single M. tuberculosis cell to become avisible colony on solid media), this bacterium was thought tohave lower rates of homologous recombination and higherrates of illegitimate recombination than other mycobacteria,which would complicate gene disruption by standard gene re-placement techniques. In fact, earlier attempts to inactivate M.tuberculosis genes by allelic exchange resulted in illegitimaterecombination (148, 316). The original observation that the M.tuberculosis RecA protein was synthesized with an intein (pro-tein intron) that had to be spliced out (65) led to speculationthat this process contributed to the reported low levels ofhomologous recombination in this species (65, 187). However,later results indicated that the M. tuberculosis recA gene isequally competent in restoring function to a M. smegmatis recAmutant as is the M. smegmatis inteinless gene, suggesting thatthe splicing out of inteins does not affect the capability of RecAfor homologous recombination (101, 208). In addition, morerecent experiments have shown that M. tuberculosis has similarrates of homologous recombination to those of the faster-growing nonpathogenic M. smegmatis (214).Current genetic methods. Despite the problems mentioned

    in the previous section, several current techniques have beensuccessful in inactivating M. tuberculosis genes. Gene disrup-tion techniques in mycobacteria, as described below, can bedivided into directed and global methods but generally requirea selectable phenotype, usually resistance to an antibiotic. Themost frequently used antibiotic resistance cassettes in myco-bacteria are those conferring resistance to kanamycin, hygro-mycin, and streptomycin (47). These antibiotics are also usefulfor selection in E. coli, allowing most cloning procedures totake place in this organism with appropriate plasmid vectors.

    Selection for kanamycin resistance (Kanr) is favored in manymycobacteria like M. smegmatis because of the generally lowlevels of spontaneous Kanr mutations and the high stability andlow cost of the antibiotic. However, kanamycin resistance is nota good selective marker in M. tuberculosis because there is ahigh spontaneous mutation rate that results from the presenceof only one rRNA (rrn) cistron in which the 16S rRNA genecan undergo mutations to Kanr at a significant level. M. smeg-matis has two rrn cistrons, and mutations to Kanr in one ofthese is masked by the dominance of the second, sensitive rrncistron (251). The high background to Kanr in M. tuberculosiscan be avoided by introducing a second resistance marker, suchas a streptomycin resistance cassette, into the plasmid con-struct and doing a double-antibiotic selection for kanamycinand the second antibiotic (streptomycin in this case) (244), butthis modification requires a more complex cloning strategy.Hygromycin resistance is the preferred antibiotic selection inM. tuberculosis in spite of its cost, because the frequency ofspontaneous mutation to this drug is very low.(i) Directed gene disruption. Directed gene inactivation en-

    tails the insertion of an antibiotic resistance cassette in themiddle of the gene of interest and then the transformation ofthis DNA into mycobacteria as a linear or circular molecule,using electroporation. The desired result is allelic replacementof the chromosomal gene by the mutated one. In members ofthe M. tuberculosis complex, directed gene disruptions havebeen made with long linear molecules, up to 40 kb (9), orshorter ones, in the range of 4 kb (1, 240). The use of single-stranded linear DNA increases allelic replacement by homo-logus recombination in Streptomyces (204), and this effect isalso observed in gene inactivation experiments in M. smegmatisand in the M. tuberculosis complex (127). The advantage of thelinear-DNA method is that cloning is relatively easy, especiallywhen short DNA fragments are used. A disadvantage is thatunique restriction enzyme sites are required unless one em-ploys more elaborate manipulation of the DNA sequence ofinterest, and this problem is magnified as the DNA increases inlength. This is true for linear DNAs and plasmid-based systemsdescribed below. One early way of avoiding this problem wasby the use of transposition systems that function in E. coli todisrupt genes contained in large segments of mycobacterialDNA that were cloned into plasmids with E. coli replicons.Many transposition insertion events in several genes containedin a 7-kbp M. smegmatis chromosomal fragment were obtainedin E. coli using transposon m-200 (96). The desired disrup-tion in the ideR gene contained in the M. smegmatis DNAfragment, determined by restriction enzyme digestion analyses,was then used to create an M. smegmatis ideR mutant gene byhomologous recombination, using techniques described in thenext paragraph (79). A newer transposon delivery system usesa cell-free approach to disrupt mycobacterial genes. A com-plex, called the transpososome, is made between the transpos-able element Tn 5 containing a selectable antibiotic resistancemarker and its transposase, the enzyme responsible for theintegration of the transposon into other DNAs. The transposo-some is commercially available and has been used to makerandom mutations in cloned mycobacterial DNAs as well asintact mycobacteria after electroporation of the transposo-some into recipient bacteria (70).

    VOL. 16, 2003 M. TUBERCULOSIS PATHOGENESIS AND VIRULENCE DETERMINANTS 473

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • Most directed mutations in mycobacteria are performedwith bifunctional or shuttle plasmid vectors that can be main-tained in E. coli, and many are available for this use (47).Circular plasmid integration, using a vector that cannot repli-cate in the recipient or has a temperature-sensitive (ts) repli-con (216), is the most widely used method for directed genedisruption by allelic replacement in mycobacteria (187). It usu-ally entails a two-step process in which the plasmid containingthe desired gene disrupted with an antibiotic resistance cas-sette integrates into the genome by a single crossover event(Campbell-type integration) at the region of homology, select-ing for the antibiotic resistance cassette. This event forms adirect repeat at one of two positions relative to the antibioticcassette. In a second crossover event, the plasmid backbone isexcised by means of recombination at the other direct repeatthan the one initially used, when the antibiotic selection ismaintained, resulting in the desired gene disruption. This tech-nique can be relatively efficient when coupled with a counters-election to facilitate plasmid elimination in the second step.Two methods have been used for counterselection; the first isresistance to streptomycin (Strr), using wild-type rpsL in theplasmid vector and a mutated rpsL (Strr) allele in the chromo-some of the recipient (250). Since the sensitive rpsL allele isdominant, resistance to streptomycin is observed only when itis lost along with the plasmid backbone, leaving the unique rpsl(Strr) chromosomal gene. The second counterselection is su-crose resistance when sacB encoding levansucrase is in theplasmid backbone. This enzyme converts sucrose to levans thatare toxic to cells such as mycobacteria that lack a functionallevanase that coverts levans to fructose and glucose (215). Inboth cases, the presence of the vector in the bacterial chromo-some prevents bacterial growth on selective media, i.e., eitherstreptomycin or sucrose. Of the two methods, the sacB selec-tion is easier because one does not have make a specific Strr

    recipient in which rpsL, but not rrnA, is mutated. Anotheradvantage to the sacB sucrose counterselection is that it can beused to introduce silent or unmarked mutations into the chro-mosome of the recipient, dispensing with the need for anyantibiotic resistance marker. This method was used with M.smegmatis (217) and later with M. tuberculosis (214) and isnecessary for the development of live, attenuated M. tubercu-losis vaccine strains, as discussed later in this review. The draw-backs of the two-step procedure are, again, the necessity for aunique restriction site in the gene of interest coupled with thetime required for the process, i.e., approximately 3 to 5 monthsfrom the initial transformation to the verification, by DNAanalysis, of the gene disruption in survivors of the counter-selection. In addition, mutations that inactivate the plasmid-borne rpsl or sacB during the selection and counterselectionprocedure will be erroneously scored as desired events inwhich the plasmid has been excised from the chromosome. Away of avoiding these false-positive results is to introduce an-other easily screenable marker in the plasmid backbone thatshould also be eliminated with the vector. An Strr cassette(244) and reporter fusions that give a visible plate phenotypehave been used for this purpose, including fusions with xylE(327) and lacZ (209).

    A simpler and faster one-step variation of single-crossovergene inactivation with circular DNA has been used to inacti-vate mycobacterial genes. This technique takes advantage of

    the property of internal gene fragments carried on a circularDNA (e.g., a nonreplicating plasmid) to disrupt the corre-sponding gene when the DNA integrates into the chromosomeby a single Campbell-type event. It has been possible to disruptthe M. smegmatis sodA gene with DNA fragments as short as180 bp by this method (80). These events can be unstable,however, since recombination at the direct repeats in the chro-mosome formed during the plasmid integration, which is thebasis of the two-step plasmid procedure, can lead to excision ofthe plasmid, restoring the intact wild-type gene, in the absenceof the selecting antibiotic.

    The major problem with the above methods is they are forthe most part inefficient since the frequency of introducingDNA into mycobacterial species, especially M. tuberculosis, bythe standard electroporation technique is quite low, even withthe utilization of modifications that increase transformation,e.g., raising the temperature at which the DNA is introducedinto M. tuberculosis (304). A new method of directed geneinactivation, termed specialized transduction, using a ts bacte-riophage delivery system has been described recently that by-passes these problems and is the current method of choice fordirected mutation (108). A recent review by the recently de-ceased S. Bardarov, the inventor of the mycobacterial special-ized-transduction system, is quite useful because it gives adetailed description of the method. This article also shows howthe resolvase system can be used to make silent mutationsin mycobacteria, allowing the creation of multiply mutatedbacterial strains (14). In the specialized-transduction system,the gene of interest that has been disrupted with an antibioticresistance cassette is cloned into a plasmid containing themycobacteriophage phAE87 genome with a ts replicon andalso has a bacteriophage lambda-packaging site. This allowsthe formation of viable transducing bacteriophage particles byan in vitro packaging reaction that can be transduced into E.coli. Cosmid DNA is prepared in E. coli and is then trans-formed into M. smegmatis at 30C, the pemissive temperaturewhich allows the formation of infectious mycobacteriophage.M. smegmatis lysates with the bacteriophage construct are thenused to infect M. tuberculosis at 37C the nonpermissive tem-perature, with high enough multiplicities to transduce all of thebacteria. Transductants are then selected on antibiotic-con-taining media for events in which the M. tuberculosis gene hasintegrated into the chromosome. This technique is very effi-cient because essentially all of the recipient cells can be trans-duced and the selection is robust.(ii) Global gene inactivation. The principle of global gene

    inactivation is the insertion of foreign DNA, usually a trans-posable element, into many sites in the bacterial genome, ide-ally on a completely random basis. These events require aselectable phenotype, generally an antibiotic resistance markercarried within the transposable element. Two groups have de-veloped efficient transposition systems that produce integra-tion events in the genomes of mycobacterial species, includingM. tuberculosis. These systems use transposable elements car-ried by vectors that cannot replicate at temperatures above39C: in one case, a ts plasmid that carries Tn 1096 (216), andin the other, a ts bacteriophage, similar to the one described inthe previous section, that carries Tn 5367 or Tn 5370 (15). Theadvantage of these systems is that one can start with a trans-formed or infected population of cells and easily obtain many

    474 SMITH CLIN. MICROBIOL. REV.

    by on May 4, 2009

    cmr.a

    sm.o

    rgD

    ownloaded from

  • integrative events by passage at the restrictive temperature.Both systems have been used to identify M. tuberculosis viru-lence genes by using the signature-tagged mutagenesis (STM)method developed to clone Salmonella genes essential for bac-terial survival during mouse infections (124). In this technique,uniquely tagged transposon M. tuberculosis mutants were madein broth cultures and were negatively screened by hybridizationfor those that did not survive during animal infections (46, 59).

    A drawback of these systems when used with M. tuberculosisis that the transposition events are not completely random inthat there seem to be hot spots of transposition integration,based on the genes found in initial STM screens, as discussedbelow (46, 59). Since the M. tuberculosis genome has approx-imately 4,000 open reading frames (ORFs), many events wouldbe needed to saturate the genome and mutagenize every genewhen using a nonrandom inactivating system. A transpositionsystem has been developed more recently from the marinertransposable element Himar-1 that overcomes some of thesedrawbacks. The mariner element recognition sequence is sim-ply an A-T base pair and is be expected to be truly random inits integration, unlike Tn 1096 and Tn 5367, which recognizemuch larger integration sites. The Himar-1 system has beenused to introduce transposition events in M. smegmatis (247),M. bovis BCG (253), and, more recently, M. tuberculosisH37Rv, where 100,000 transposon insertions have been ob-tained. DNA sequencing and other analyses indicate that over2,600 M. tuberculosis genes have been inactivated thus far (E.Rubin, personal communication). STM tagging was not used tomake these mariner transposon mutations, and so there is nosimple global screening method for virulence genes as thereare with the two transposon systems described above. Thetranspososome method mentioned above can also be used tomake random mutations in mycobacterial genomes (71), but atagging system has not been developed for this method either,limiting its usefulness in screening for mutants with attenuatedvirulence.(iii) Complementation. Genetic complementation has also

    been used to identify M. tuberculosis virulence genes. Thesestudies use M. tuberculosis strains that are known to be aviru-lent or nonpathogenic as recipients for genes that can be se-lected on the basis of encoding a virulence phenotype, usingassays described earlier in this review. In vivo complementa-tion has been made possible by using integration-proficientvectors, which allow the stable propagation of genomic librar-ies as well as individual genes in bacteria during animal infec-tions (160). It was shown that the random cloning of an M.tuberculosis cosmid library into an avirulent M. bovis strainlocalized the attenuating mutation in sigA, encoding the majormycobacterial sigma factor, since the wild-type sigA restoredvirulence in a guinea pig morbidity (spleen focus) assay (56). Asimilar M. tuberculosis cosmid library transformed into theavirulent M. tuberculosis strain H37Ra permitted the isolationof a DNA fragment that increased bacterial survival in mousespleens but not lungs (212). The nonpathogen M. smegmatiswas also used as the host in similar experiments. In one case, acosmid library made with M. bovis BCG DNA was transformedinto M. smegmatis, allowing the identification of a chromo-somal fragment that modestly increased bacterial survival inmouse spleens and in mouse peritoneal macrophages (87). AnM. tuberculosis plasmid library was used to identify eis, a gene

    that increased the survival of M. smegmatis in the human mac-rophage-like cell line U937 (310), and another plasmid libraryallowed the isolation of mce1, a gene that increases the entry ofE. coli into HeLa cells that are nonphagocytic (7). A relatedapproach but using a specific gene has also been tested. The M.leprae thioredoxin-thioredoxin reductase gene, cloned into aplasmid vector, was able to increase the survival of M. smeg-matis in human MDMs (313).(iv) Antisense methods. Antisense RNAs are used to reduce

    the expression of specific genes because they prevent the trans-lation of the mRNAs to which they are complementary. Theyare especially useful in systems where gene inactivation is dif-ficult and also when genes are essential because antisenseinhibition of translation is rarely, if ever, complete. A generalsystem for conditionally controlling the production of anti-sense RNA in mycobacteria was developed, using the regulat-able acetamide/acetamidase system (210). In a demonstrationof the usefulness of this method, a prototrophic M. smegmatisstain was made into a histidine auxotroph when a hisD5 anti-sense RNA was induced by acetamide. Other applications ofthe antisense method were in M. bovis to lower the levels ofAhpC (315) and in M. tuberculosis H37Rv to reduce theamounts of SodA (82). These are discussed later in this review.A related antisense approach, but using phosphorothioate an-tisense deoxyoligonucleotides, was used to decrease levels ofthe M. tuberculosis glutamine synthetase in growing cells (121).Other (nongenetic) methods. Gene inactivation, either di-

    rected or global, and the subsequent analysis of mutant phe-notypes is the most straightforward way to identify and char-acterize genes and proteins that are involved in a specificprocess and, for M. tuberculosis, is virulence. A major problemwith this approach is that some genes may be essential andcannot be disrupted. Thus, methods that do not rely on theabsence of a function are also useful, and these nongeneticscreens usually rely on the differential expression of genes andtheir products in different environments. As in the geneticmethods, they can be used to characterize the expression ofindividual genes identified by other means or can function inglobal searches for genes that show the desired pattern of geneexpression. The output of these methods can be enzyme activ-ity using reporter genes, levels of RNA or proteins, and insome cases direct selection of genes using a selectable orscreenable phenotype.(i) Reporter fusions and promoter traps. A general method

    for studying bacterial gene expression is that of reporter genemethodology. This utilizes plasmid vectors that contain a pro-moterless gene encoding a protein that, in most cases, catalyzesan easily assayable enzymatic reaction. Specific promoter se-quences from known genes o