Escherichia coli
Bacteria · other
Escherichia coli
Name meaning: "Escherichia honors Theodor Escherich (1857–1911), the German-Austrian pediatrician who first isolated and described this bacterium in 1885; coli is the genitive of the Latin neuter noun colon (the large intestine), reflecting the bacterium's primary habitat"
Discovery
Microorganism traits
Classification
Habitat
Classification history
1885: Escherich described it as Bacterium coli commune. 1895: Migula reclassified it as Bacillus coli. 1919: Castellani and Chalmers established the genus Escherichia and reclassified it as Escherichia coli. 1980: Included in the Approved Lists of Bacterial Names, establishing formal taxonomic validity. Listed in LPSN as the correct name explicitly recommended for medical use
Clinical significance
Most strains are harmless commensals, but pathogenic strains cause diverse diseases: EHEC/STEC (hemorrhagic colitis, HUS), ETEC (traveler's diarrhea), UPEC (UTIs, responsible for 80–90% of cases), NMEC (neonatal meningitis). ESBL-producing and carbapenem-resistant strains are classified as WHO critical-priority pathogens

Escherichia coli (Migula 1895) Castellani and Chalmers 1919 is a Gram-negative, rod-shaped, facultatively anaerobic bacterium belonging to the family Enterobacteriaceae and the genus Escherichia. It normally inhabits the lower intestinal tract (large intestine) of warm-blooded animals and is the most representative commensal bacterium of the gut microbiota. Possessing a circular double-stranded DNA genome of approximately 4.6 Mb, E. coli can reproduce remarkably rapidly under optimal conditions, with a generation time of about 20 minutes. While the vast majority of strains are non-pathogenic commensals, certain pathotypes such as EHEC (enterohemorrhagic E. coli) and UPEC (uropathogenic E. coli) cause serious infectious diseases. Depending on the strain, biosafety classification ranges from BSL-1 (laboratory strains such as K-12) to BSL-2 (pathogenic strains), and up to BSL-3 (EHEC/STEC) in some national regulations.
E. coli is the most extensively studied prokaryotic model organism and has played a pivotal role in the history of molecular biology and genetics. From the discovery of bacterial genetic recombination in 1958 to green fluorescent protein (GFP) research in 2008, studies utilizing E. coli have contributed to 11 Nobel Prize-winning discoveries (NCBI Bookshelf, NBK562895). In biotechnology, it serves as the principal host organism for recombinant DNA technology; since its use in producing the world's first recombinant pharmaceutical—human insulin (Humulin), approved by the FDA on October 28, 1982—it has been widely employed for the mass production of therapeutic proteins and industrial enzymes to this day.
Meanwhile, antibiotic-resistant E. coli has emerged as a serious global public health threat. The 2024 WHO Bacterial Priority Pathogens List (BPPL) classifies ESBL (extended-spectrum beta-lactamase)-producing Enterobacterales and carbapenem-resistant Enterobacterales as critical-priority threats (WHO, 2024), and the CDC likewise designates ESBL-producing E. coli as a serious threat (CDC, 2019).
1. Overview
The genus name Escherichia honors the German-Austrian pediatrician Theodor Escherich (1857–1911), who first isolated and described this bacterium from the feces of healthy infants in 1885, in the laboratory of Otto von Bollinger in Munich. Escherich originally named the organism Bacterium coli commune (the common colon bacillus). In 1919, Aldo Castellani and Albert John Chalmers established the new genus Escherichia and reclassified the organism under its current binomial, Escherichia coli. The species epithet coli is the genitive of the Latin neuter noun colon (the large intestine), reflecting the bacterium's primary habitat (LPSN: co'li, L. gen. neut. n. coli, of the colon).
According to LPSN (List of Prokaryotic names with Standing in Nomenclature), E. coli is a validly published correct name and is explicitly recommended for medical use. Its NCBI Taxonomy ID is 562, and the taxonomic authority is recorded as (Migula 1895) Castellani and Chalmers 1919 (Approved Lists 1980). E. coli is the type species of the genus Escherichia, which in turn is the type genus of the family Enterobacteriaceae.
In a single sentence, E. coli can be summarized as a multifaceted bacterium that is simultaneously a human gut commensal, the most important model organism in the history of molecular biology, and—through certain pathogenic strains and antibiotic resistance—a global public health threat.
2. Classification and Phylogeny
The full taxonomic placement of E. coli is as follows: Domain Bacteria, Phylum Pseudomonadota (formerly Proteobacteria), Class Gammaproteobacteria, Order Enterobacterales, Family Enterobacteriaceae, Genus Escherichia, Species E. coli. The type strain is ATCC 11775 (= DSM 30083 = NCTC 9001), and its complete genome was sequenced as part of the GEBA (Genomic Encyclopedia of Bacteria and Archaea) project (Meier-Kolthoff et al., 2014).
E. coli is a genetically and phenotypically extraordinarily diverse species. Pangenome analyses have identified over 30,000 gene families across the species, yet the core genome shared by virtually all strains comprises only about 2,000–3,000 genes—roughly 20% of a typical E. coli genome (Lukjancenko et al., 2010; Abram et al., 2021). A large-scale Mash-based phylogenomic study published in 2021 analyzed 10,667 E. coli genomes and identified up to 14 phylogroups, expanding on the traditionally recognized groups A, B1, B2, C, D, E, F, and G (Abram et al., 2021). Phylogroups B2 and D are predominantly associated with extraintestinal infections (urinary tract infections, sepsis), while A and B1 are mainly commensals.
DNA–DNA hybridization and whole-genome analyses have established that the four species of the genus Shigella (S. dysenteriae, S. flexneri, S. boydii, S. sonnei) fall genetically within E. coli (Brenner, 1984; LPSN). However, because of their medical importance as causative agents of bacillary dysentery and longstanding historical convention, they are maintained as a separate genus—a phenomenon referred to in taxonomy as "taxa in disguise." Furthermore, a 2023 study reported that some genomes classified as E. coli in NCBI may represent reproductively isolated, distinct species, and the debate over species boundaries within E. coli continues (Gonzalez-Alba et al., 2023).
Below the species level, E. coli is extensively classified by serotyping based on three major surface antigens. O antigens (somatic antigens, components of the lipopolysaccharide layer) encompass approximately 190 known serogroups. H antigens are based on the flagellar protein flagellin (approximately 53 types), and K antigens are based on capsular polysaccharides (approximately 80 types). For example, O157:H7 denotes a serotype possessing O antigen type 157 and H antigen type 7, one of the most widely recognized pathogenic serotypes.
3. Morphology and Structure
E. coli is a straight, rod-shaped (bacillus) bacterium with rounded ends. Cell dimensions vary with growth conditions but typically measure approximately 1.0–3.0 μm in length (commonly about 2.0 μm), 0.25–1.0 μm in width (commonly about 0.5 μm), with a cell volume of approximately 0.6–0.7 μm³ (about 1 fL) (BioNumbers). Cells exist singly or in pairs, and under nutrient depletion or stress conditions, morphology can range from coccoid to elongated filamentous forms.
E. coli possesses the characteristic double-membrane cell envelope of Gram-negative bacteria. The inner membrane (cytoplasmic membrane) is a phospholipid bilayer involved in nutrient transport and energy production (electron transport chain). The peptidoglycan layer (murein layer) is relatively thin at approximately 2–7 nm—much thinner than in Gram-positive bacteria—and serves as the structural scaffold maintaining cell shape. The outer membrane is the most distinctive feature of the E. coli cell wall; its inner leaflet consists of phospholipids, while the outer leaflet contains lipopolysaccharide (LPS). LPS is composed of three regions: lipid A (endotoxin activity), core oligosaccharide, and O-antigen polysaccharide. In Gram-negative sepsis, LPS is the primary molecule responsible for endotoxic shock. The outer membrane contains porins (e.g., OmpF, OmpC)—water-filled protein channels that allow selective passage of small hydrophilic molecules.
E. coli has a peritrichous flagellar arrangement, with approximately 5–10 flagella distributed over the entire cell surface. Flagella are composed of flagellin protein and rotate by proton motive force, enabling swimming at speeds of approximately 25 μm per second (roughly 10 body lengths per second) in liquid environments. However, not all strains are motile; some pathogenic strains (e.g., EIEC) are non-motile. Pili (fimbriae) are short, thin proteinaceous appendages crucial for adhesion to host cell surfaces. Type 1 fimbriae carry the FimH adhesin, which binds mannose residues, while P pili (pap pili) are critical for UPEC adhesion to P-blood group antigens on urinary epithelial cells. E. coli does not form spores.
When subjected to Gram staining, E. coli stains pink to red with the counterstain safranin, because the thin peptidoglycan layer fails to retain the crystal violet–iodine complex during ethanol decolorization.
4. Genome and Molecular Biology
The E. coli genome consists of a single circular double-stranded DNA chromosome. The complete genome of the best-known laboratory reference strain, K-12 MG1655, was sequenced and published by Blattner et al. in 1997 and deposited in GenBank (accession U00096, subsequently updated to U00096.3). The genome size is 4,641,652 bp (approximately 4.6 Mb), with a GC content of 50.8% and approximately 4,300 protein-coding genes. However, genome sizes of naturally isolated E. coli strains range considerably, from approximately 3.98 Mb (the minimized K-12 derivative MDS42) to 5.86 Mb (pathogenic strain O26:H11 11368).
The E. coli pangenome is highly open, with over 30,000 identified gene families across the species. Only approximately 2,000–3,000 core genes are conserved in virtually all strains; the remainder constitutes the accessory genome, largely acquired through horizontal gene transfer (HGT) via conjugation, transduction, and transformation. An estimated 18% of the K-12 MG1655 genome was horizontally acquired since the divergence from Salmonella (Lawrence and Ochman, 1998). Many virulence genes (toxins, adhesins, iron acquisition systems) are carried on pathogenicity islands, plasmids, or lysogenic phages—the transfer of Shiga toxin genes to E. coli O157:H7 via transduction being a prime example. According to EcoCyc, approximately 38% of genes in the K-12 genome remain functionally uncharacterized, and elucidating their roles is an active area of research.
DNA replication initiates at a single origin of replication (oriC) and proceeds bidirectionally. The C period (time to replicate the entire chromosome) is approximately 40 minutes. Under rapid growth conditions, the generation time (approximately 20 minutes) can be shorter than the C period, made possible by overlapping replication: new rounds of replication initiate before the previous round is completed, so that multiple replication forks coexist simultaneously within a single cell.
5. Pathogenicity and Clinical Manifestations
The vast majority of E. coli strains (over 90%) are harmless commensals, but strains that have acquired specific virulence factors cause a wide spectrum of infectious diseases. Diarrheagenic E. coli (DEC) strains are classified into six major pathotypes (Nataro and Kaper, 1998; Kaper et al., 2004).
Enterohemorrhagic E. coli (EHEC/STEC) produce Shiga toxins (Stx1 and/or Stx2), causing hemorrhagic colitis and hemolytic uremic syndrome (HUS). HUS is characterized by acute renal failure, microangiopathic hemolytic anemia, and thrombocytopenia, and can be fatal, particularly in children under five and the elderly. The best-known serotype, O157:H7, was first recognized in 1982 during outbreaks of bloody diarrhea traced to contaminated hamburgers in Oregon and Michigan, USA. Enterotoxigenic E. coli (ETEC) produce heat-labile (LT) and heat-stable (ST) enterotoxins and are a leading cause of traveler's diarrhea in developing countries. Enteropathogenic E. coli (EPEC) use the adhesin intimin to form attaching and effacing (A/E) lesions and primarily cause watery diarrhea in infants. Enteroinvasive E. coli (EIEC), like Shigella, invade intestinal mucosal epithelial cells, producing dysentery-like illness. Enteroaggregative E. coli (EAEC) is associated with persistent diarrhea, and diffusely adherent E. coli (DAEC) is linked primarily to childhood diarrhea.
Regarding extraintestinal infections, uropathogenic E. coli (UPEC) is the most common causative agent of urinary tract infections (UTIs), accounting for approximately 80–90% of community-acquired UTI cases (Foxman, 2014). UPEC employs virulence factors including type 1 fimbriae (FimH), P pili, alpha-hemolysin (HlyA), and aerobactin to adhere to and invade urinary epithelial cells. Neonatal meningitis-associated E. coli (NMEC) carrying the K1 capsular antigen is a leading cause of neonatal bacterial meningitis. E. coli is also a major agent in healthcare-associated infections, including bloodstream infections, catheter-associated UTIs, intra-abdominal infections, and ventilator-associated pneumonia.
Diagnosis relies on culture (characteristic colony morphology on MacConkey or EMB agar), biochemical testing (lactose fermentation positive, indole positive, citrate utilization negative, etc.), serotyping, and PCR-based detection of virulence genes.
6. Epidemiology and Transmission
E. coli is distributed worldwide and is found in the intestines of over 90% of healthy humans. Pathogenic E. coli is transmitted primarily via the fecal-oral route, through contaminated food (undercooked beef, leafy greens, unpasteurized milk or juice), contaminated drinking water, direct contact with infected individuals, and contact with animals—particularly ruminants, the primary reservoir for STEC.
Notable large-scale STEC/EHEC outbreaks include the 1996 Sakai City, Japan outbreak (approximately 9,000 cases of O157:H7 linked to contaminated white radish sprouts), the 2006 US outbreak (O157:H7 traced to contaminated spinach), and the 2011 German outbreak (O104:H4 linked to fenugreek sprouts, resulting in 54 deaths and approximately 850 HUS cases). The 2011 German outbreak was caused by a hybrid pathotype combining EAEC and STEC characteristics, demonstrating the evolutionary dynamism of E. coli pathogenicity.
ETEC-associated diarrhea accounts for hundreds of millions of cases worldwide annually and is a major cause of mortality in children under five in developing countries. UTI caused by UPEC is one of the most common bacterial infections globally, with an estimated 50–60% of women experiencing at least one episode in their lifetime.
7. Immunity, Treatment, and Prevention
The immune response to E. coli infection involves both innate and adaptive immunity. Secretory IgA at the intestinal mucosa serves as a first line of defense, LPS activates a potent innate immune response via TLR4 (Toll-like receptor 4), and flagellin is recognized by TLR5. However, pathogenic strains possess various immune evasion mechanisms, including complement inhibition by capsular polysaccharides and host cell signaling manipulation via type III secretion systems.
Antibiotic use in STEC/EHEC infections is controversial. Because antibiotics may promote Shiga toxin release and increase the risk of HUS, antibiotic therapy is generally not recommended for STEC infections; supportive care (fluid replacement, electrolyte correction) is the standard approach. For UTIs caused by UPEC, trimethoprim-sulfamethoxazole, nitrofurantoin, fosfomycin, and fluoroquinolones are commonly used, but rising resistance rates are making empirical antibiotic selection increasingly difficult.
Antibiotic resistance is among the most serious public health issues associated with E. coli. ESBL-producing E. coli are resistant to most beta-lactam antibiotics, including third-generation cephalosporins. Carbapenem-resistant Enterobacterales (CRE) are resistant even to carbapenems—antibiotics of last resort. The plasmid-mediated colistin resistance gene mcr-1, first reported in China in 2015, confers transferable resistance to colistin, the last-line antimicrobial. According to the CDC's 2019 Antibiotic Resistance Threats Report, over 2.8 million antibiotic-resistant infections occur annually in the United States, causing more than 35,000 deaths, with ESBL-producing Enterobacterales classified as a serious threat. The 2024 WHO BPPL designated both carbapenem-resistant and third-generation cephalosporin-resistant Enterobacterales as critical-priority pathogens.
Emerging therapeutic approaches include bacteriophage therapy. In the largest phage therapy clinical trial to date (2024), a six-strain phage cocktail by Locus Biosciences demonstrated in vitro efficacy against 94% of clinical E. coli isolates (C&EN, 2024). Additionally, engineered phages carrying CRISPR-Cas systems have been shown to selectively eliminate target bacteria within E. coli biofilms (Kiga et al., 2023). Prevention strategies include thorough handwashing, complete cooking of meats (ground beef to an internal temperature of at least 71°C / 160°F), avoiding unpasteurized dairy products, adequate washing of fruits and vegetables, preventing cross-contamination during food preparation, and avoiding contaminated water.
8. Ecology and Environmental Role
The primary habitat of E. coli is the lower intestinal tract (large intestine) of warm-blooded animals, including humans. Although facultative anaerobes including E. coli constitute only about 0.1% of the total gut microbiota, E. coli is found in over 90% of healthy adults, and an average of approximately five distinct E. coli strains coexist in the healthy human gut (Tenaillon et al., 2010). Within the intestine, E. coli produces vitamin K2 (menaquinone) and vitamin B12 for the host (Bentley and Meganathan, 1982), provides colonization resistance against pathogenic bacteria, and consumes oxygen to create favorable conditions for obligate anaerobes. The E. coli Nissle 1917 strain has been commercialized as a probiotic pharmaceutical (Mutaflor) for the prevention of intestinal infections.
Metabolically, E. coli is a chemoheterotroph that performs aerobic respiration in the presence of oxygen and mixed acid fermentation or anaerobic respiration in its absence. Mixed acid fermentation products include lactate, succinate, ethanol, acetate, and formate (which is further cleaved to CO₂ and H₂). Hydrogen gas production is important for interspecies hydrogen transfer with methanogenic archaea and sulfate-reducing bacteria in the gut. The phenomenon of catabolite repression—preferential glucose consumption over lactose when both are available—became the classical model of gene regulation through Jacob and Monod's research on the lac operon.
Outside the host, E. coli can survive for limited periods in the environment and is therefore widely used as an indicator organism for fecal contamination in water quality and food safety monitoring. Recent research has identified environmentally persistent E. coli lineages capable of prolonged survival and even growth in soil and freshwater, necessitating caution in interpreting the presence of E. coli solely as evidence of recent fecal contamination.
9. Research History and Scientific Significance
The scientific history of E. coli begins in 1885, when Theodor Escherich first isolated and described this bacterium from the feces of healthy infants at the laboratory of Otto von Bollinger in Munich, Germany (Escherich, 1885). Escherich used the novel staining technique developed by Hans Christian Gram to characterize the organism. In 1922, a strain isolated from the stool of a convalescent diphtheria patient was deposited in the Stanford University strain collection as K-12, and this strain would later become the central experimental tool of molecular biology.
In 1946, Joshua Lederberg and Edward Tatum used the K-12 strain to discover bacterial genetic recombination (conjugation), a landmark event that inaugurated the field of bacterial genetics (Lederberg and Tatum, 1946) and led to the 1958 Nobel Prize in Physiology or Medicine. Subsequently, E. coli was instrumental in the deciphering of the genetic code (1968 Nobel Prize), the discovery of gene regulation mechanisms (the operon concept; 1965 Nobel Prize), the discovery of restriction enzymes (1978 Nobel Prize), recombinant DNA technology (1980 Nobel Prize), ATP synthesis mechanisms (1997 Nobel Prize), protein signal sequences (1999 Nobel Prize), and green fluorescent protein (GFP; 2008 Nobel Prize)—a total of 11 Nobel Prize-winning discoveries (NCBI Bookshelf, NBK562895).
Biotechnology milestones include the first recombinant DNA experiment by Cohen and Boyer in 1973, Genentech's production of recombinant human insulin in E. coli in 1978, and the FDA approval of Humulin on October 28, 1982—the first recombinant pharmaceutical ever approved (FDA, 2022). Today, E. coli remains the most widely used host organism for recombinant protein production, employed in the manufacture of insulin, growth hormone, interferon, and industrial enzymes.
The E. coli Long-Term Evolution Experiment (LTEE), initiated by Richard Lenski in 1988, tracks the evolution of 12 independent populations descended from a single ancestral strain in real time—the world's longest-running evolution experiment. The populations surpassed 80,000 generations in August 2024 in Jeffrey Barrick's laboratory at UT Austin, continued to 82,000 generations, and in September 2025 returned to Michigan State University, where the experiment continues (the-ltee.org, 2025). The evolution of aerobic citrate utilization in one population at approximately generation 31,500 is regarded as a landmark case of observed microbial speciation in the laboratory.
10. Comparison with Related Microorganisms
| Feature | E. coli | Salmonella enterica | Shigella spp. | Klebsiella pneumoniae |
|---|---|---|---|---|
| Gram stain | Negative | Negative | Negative | Negative |
| Cell morphology | Rod | Rod | Rod | Rod (encapsulated) |
| Motility | Mostly positive | Mostly positive | Negative | Negative |
| Lactose fermentation | Positive | Negative | Negative | Positive |
| H₂S production | Negative | Positive | Negative | Negative |
| Indole production | Positive | Negative | Variable | Negative |
| Citrate utilization | Negative | Positive | Negative | Positive |
| Urease | Negative | Negative | Negative | Positive |
| Primary habitat | Mammalian intestine | Diverse animal intestines | Human/primate intestine | Soil, water, human intestine/respiratory tract |
| Major diseases | Diarrhea, UTI, sepsis, meningitis | Typhoid fever, gastroenteritis | Bacillary dysentery | Pneumonia, UTI, sepsis |
| Genome size | 4.6–5.9 Mb | 4.5–4.9 Mb | 4.0–4.8 Mb | 5.0–5.8 Mb |
| Divergence relationship | Reference | Diverged ~100 Ma | Within E. coli species | Separate genus, same family |
Salmonella enterica diverged from E. coli approximately 102 million years ago from a common ancestor. The two genera share high genomic synteny, but Salmonella is distinguished by its inability to ferment lactose, its production of H₂S, and its intracellular parasitism. Shigella, as noted above, is genetically within E. coli but is maintained as a separate genus for medical convention, distinguished by its non-motility, inability to ferment lactose, and capacity for intracellular invasion.
Fun Facts
E. coli has contributed to 11 Nobel Prize-winning discoveries—more than any other single organism—spanning from bacterial recombination (1958) to GFP research (2008).
If E. coli divided every 20 minutes under ideal conditions, a single cell could theoretically produce approximately 4.7 × 10²¹ cells (about 4,700 kg) in one day—but nutrient depletion and waste accumulation make this impossible in practice.
Complete replication of the E. coli genome takes about 40 minutes, yet cells can divide every 20 minutes. This is possible through overlapping replication, where new rounds of DNA replication begin before previous ones are finished.
The K-12 strain was isolated from a convalescent diphtheria patient's stool in 1922 and deposited at Stanford University. Over a century later, it remains the workhorse strain of molecular biology laboratories worldwide.
In Lenski's Long-Term Evolution Experiment, one E. coli population evolved the ability to metabolize citrate aerobically at approximately generation 31,500—a trait whose absence had been a defining characteristic of the species.
Recombinant human insulin produced in E. coli in 1978 became the world's first recombinant pharmaceutical (Humulin, FDA-approved October 1982). Previously, producing one year's supply of insulin for a single diabetic patient required pancreases from about 50 pigs.
An average of about five different E. coli strains coexist in the healthy human gut, yet they constitute only about 0.1% of the total intestinal microbiota.
E. coli and Salmonella diverged from a common ancestor approximately 102 million years ago—during the mid-Cretaceous, when dinosaurs dominated the Earth.
A single E. coli cell has a volume of about 0.7 μm³ (approximately 1 femtoliter)—about 90 times smaller than a human red blood cell—yet packs roughly 43 million protein molecules into that tiny space.
All four species of Shigella are genetically within E. coli but are maintained as a separate genus for medical reasons—a classic case of 'taxa in disguise' in taxonomy.
The LTEE surpassed 80,000 generations in August 2024, continued to 82,000 generations, and returned to MSU in September 2025—making it the world's longest continuously running evolution experiment.
The E. coli pangenome contains over 30,000 gene families, but only about 2,000–3,000 core genes are shared by nearly all strains—meaning roughly 80% of any given strain's genome may be unique or shared with only some other strains.
FAQ
No. The vast majority of E. coli strains (over 90%) are harmless commensals that reside in the human gut. Some strains actively benefit their hosts by producing vitamin K2 and vitamin B12, and by providing colonization resistance against pathogenic bacteria. An average of approximately five different E. coli strains coexist in the healthy human gut. The Nissle 1917 strain is even used commercially as a probiotic medication (Mutaflor).
Effective prevention measures include thorough handwashing, complete cooking of meats (ground beef to an internal temperature of at least 71°C / 160°F), avoiding unpasteurized dairy products, adequate washing of fruits and vegetables, avoiding contaminated drinking water, and preventing cross-contamination during food preparation. Hand hygiene after contact with ruminant animals (cattle, sheep) is also important.
E. coli is the ideal model organism due to its rapid growth (generation time of approximately 20 minutes under optimal conditions), low-cost and simple culture requirements, well-established genetic manipulation techniques, fully sequenced genome (since 1997), and the non-pathogenic nature of the K-12 strain (BSL-1). These attributes have made it the flagship bacterium of molecular biology, contributing to 11 Nobel Prize-winning discoveries.
E. coli O157:H7 is the most well-known pathogenic serotype (EHEC/STEC), first recognized in 1982 during outbreaks of bloody diarrhea linked to contaminated hamburgers in the United States. It produces Shiga toxins (Stx1 and Stx2), which can cause hemorrhagic colitis and hemolytic uremic syndrome (HUS). HUS is characterized by acute renal failure, hemolytic anemia, and thrombocytopenia, and can be fatal, particularly in children under five and the elderly.
Under optimal laboratory conditions (37°C, rich growth medium, pH 7.0, aerobic), the generation time is approximately 20 minutes, making it one of the fastest-growing known bacteria. However, within the human intestinal tract, generation time is estimated at 12–15 hours due to nutritional competition and environmental constraints, and in wild environments it exceeds approximately 15 hours.
In 1978, Genentech used recombinant DNA technology to produce human insulin in E. coli for the first time. The human insulin gene is inserted into a plasmid and introduced into E. coli, whereupon the bacterium's protein synthesis machinery mass-produces the human insulin protein. On October 28, 1982, the FDA approved Eli Lilly's Humulin, making it the world's first recombinant pharmaceutical—a process that took only five months of review.
ESBL-producing strains are resistant to most beta-lactam antibiotics including penicillins and cephalosporins. CRE (carbapenem-resistant Enterobacterales) are resistant even to last-resort carbapenems, and strains carrying the mcr-1 gene show resistance to colistin—the ultimate last-line antimicrobial. The 2024 WHO classified these as critical-priority pathogens. With therapeutic options shrinking, antibiotic-resistant E. coli poses a severe and growing public health crisis.
DNA relatedness and whole-genome analyses show that all four Shigella species genetically fall within the E. coli species. However, due to the medical importance of Shigella as the causative agent of bacillary dysentery and longstanding historical convention, it is maintained as a separate genus. LPSN explicitly notes this relationship. This phenomenon is referred to in taxonomy as 'taxa in disguise.'
It depends on the strain. Laboratory reference strains such as K-12, B, and BL21 are classified as BSL-1. Pathogenic strains (EPEC, ETEC, UPEC, O157:H7, etc.) are classified as BSL-2. In some countries (Germany, France, UK, Belgium), EHEC/STEC (Shiga toxin-producing) strains are classified as BSL-3. Under US CDC/NIH guidelines, pathogenic strains are generally BSL-2.
The LTEE, initiated by Richard Lenski in 1988, is the world's longest-running evolution experiment, tracking 12 independent populations derived from a single non-pathogenic E. coli ancestral strain. The populations surpassed 80,000 generations in August 2024 and continued to 82,000 generations before returning to Michigan State University in September 2025, where the experiment continues. Notably, one population evolved the ability to metabolize citrate aerobically at approximately generation 31,500—a landmark case of observed microbial speciation in the laboratory.
📚References
- Escherich, T. (1885). Die Darmbakterien des Neugeborenen und Säuglings. Fortschritte der Medizin, 3, 515–522.
- Castellani, A. & Chalmers, A.J. (1919). Manual of Tropical Medicine, 3rd ed. Williams Wood and Co., New York.
- Blattner, F.R., Plunkett, G., Bloch, C.A., et al. (1997). The complete genome sequence of Escherichia coli K-12. Science, 277(5331), 1453–1462. https://doi.org/10.1126/science.277.5331.1453
- Nataro, J.P. & Kaper, J.B. (1998). Diarrheagenic Escherichia coli. Clinical Microbiology Reviews, 11(1), 142–201. https://doi.org/10.1128/CMR.11.1.142
- Kaper, J.B., Nataro, J.P. & Mobley, H.L. (2004). Pathogenic Escherichia coli. Nature Reviews Microbiology, 2(2), 123–140. https://doi.org/10.1038/nrmicro818
- Tenaillon, O., Skurnik, D., Picard, B. & Denamur, E. (2010). The population genetics of commensal Escherichia coli. Nature Reviews Microbiology, 8(3), 207–217. https://doi.org/10.1038/nrmicro2298
- Lederberg, J. & Tatum, E.L. (1946). Gene recombination in Escherichia coli. Nature, 158, 558. https://doi.org/10.1038/158558a0
- Jacob, F. & Monod, J. (1961). Genetic regulatory mechanisms in the synthesis of proteins. Journal of Molecular Biology, 3(3), 318–356. https://doi.org/10.1016/S0022-2836(61)80072-7
- Cohen, S.N., Chang, A.C., Boyer, H.W. & Helling, R.B. (1973). Construction of biologically functional bacterial plasmids in vitro. Proceedings of the National Academy of Sciences, 70(11), 3240–3244. https://doi.org/10.1073/pnas.70.11.3240
- Bentley, R. & Meganathan, R. (1982). Biosynthesis of vitamin K (menaquinone) in bacteria. Microbiological Reviews, 46(3), 241–280.
- Brenner, D.J. (1984). Family I. Enterobacteriaceae. In: Krieg, N.R. & Holt, J.G. (eds), Bergey's Manual of Systematic Bacteriology, 1st ed., vol. 1, Williams & Wilkins, Baltimore, pp. 408–420.
- Lenski, R.E., Rose, M.R., Simpson, S.C. & Tadler, S.C. (1991). Long-term experimental evolution in Escherichia coli. I. Adaptation and divergence during 2,000 generations. The American Naturalist, 138(6), 1315–1341. https://doi.org/10.1086/285289
- Lawrence, J.G. & Ochman, H. (1998). Molecular archaeology of the Escherichia coli genome. Proceedings of the National Academy of Sciences, 95(16), 9413–9417. https://doi.org/10.1073/pnas.95.16.9413
- Hayashi, K., Morooka, N., Yamamoto, Y., et al. (2006). Highly accurate genome sequences of Escherichia coli K-12 strains MG1655 and W3110. Molecular Systems Biology, 2, 2006.0007. https://doi.org/10.1038/msb4100049
- Foxman, B. (2014). Urinary tract infection syndromes: occurrence, recurrence, bacteriology, risk factors, and disease burden. Infectious Disease Clinics of North America, 28(1), 1–13. https://doi.org/10.1016/j.idc.2013.09.003
- Meier-Kolthoff, J.P., Hahnke, R.L., Petersen, J., et al. (2014). Complete genome sequence of DSM 30083T, the type strain (U5/41T) of Escherichia coli, and a proposal for delineating subspecies in microbial taxonomy. Standards in Genomic Sciences, 9, 2. https://doi.org/10.1186/1944-3277-9-2
- CDC. (2019). Antibiotic Resistance Threats in the United States, 2019. U.S. Department of Health and Human Services, CDC. https://www.cdc.gov/antimicrobial-resistance/data-research/threats/index.html
- Abram, K., Udaondo, Z., Bleker, C., et al. (2021). Mash-based analyses of Escherichia coli genomes reveal 14 distinct phylogroups. Communications Biology, 4, 117. https://doi.org/10.1038/s42003-020-01626-5
- WHO. (2024). WHO bacterial priority pathogens list, 2024. Geneva: World Health Organization. https://www.who.int/publications/i/item/9789240093461
- Riggs, A.D. (2021). Making, cloning, and the expression of human insulin genes in bacteria: The path to Humulin. Endocrine Reviews, 42(3), 374–380. https://doi.org/10.1210/endrev/bnaa029
Gallery
5 images
Escherichia coliEscherichia coli
Escherichia coliEscherichia coli
Escherichia coliEscherichia coli
Escherichia coliEscherichia coli
Escherichia coliEscherichia coli