COVID-19 is caused by a virus called SARS-CoV-2, a type of coronavirus that first infected humans no later than fall 2019 in Wuhan, China. How exactly the virus made the jump to humans remains one of the most consequential unsolved questions in modern science, with two leading hypotheses still on the table: spillover from an animal and a laboratory-related incident.
The Virus Behind COVID-19
SARS-CoV-2 belongs to the beta group of coronaviruses, the same group as the viruses behind the original SARS outbreak in 2003 and MERS. Its genetic code is carried on a strand of RNA roughly 30,000 nucleotides long. The virus particle has four main structural proteins, but the most important one for understanding how it causes disease is the spike protein, which studs the virus’s outer surface and gives coronaviruses their crown-like appearance under a microscope.
The spike protein locks onto a protein called ACE2, which sits on the surface of cells in the human lungs, heart, kidneys, and other organs. This is the virus’s doorway into the body. SARS-CoV-2 binds to ACE2 roughly seven times more tightly than the original SARS virus did, which helps explain why it spreads so much more efficiently between people. Once attached, a special feature on the spike protein gets cut by an enzyme the body naturally produces, triggering a shape change that lets the virus fuse with the cell membrane and slip inside. This two-step entry process is unusually effective and is a key reason the virus became a pandemic pathogen rather than a localized outbreak.
When and Where It Started
The first recognized cluster of COVID-19 cases appeared in Wuhan, China, in December 2019, but the virus was circulating before anyone noticed. Regional newspaper reports place COVID-19 diagnoses in Hubei province as early as November 17, 2019. Modeling by researchers at UC San Diego pushed the timeline back further, estimating that SARS-CoV-2 began circulating in Hubei by mid-October 2019, though the number of infected people likely remained below one (meaning sporadic, unsustained transmission) until around November 4.
Many of the earliest identified cases were linked to the Huanan Seafood Market, a large wholesale market in Wuhan that sold live animals alongside seafood and other goods. That geographic clustering drew immediate attention and shaped the early investigation into where the virus came from.
The Animal Spillover Hypothesis
The most widely supported explanation among scientists is that SARS-CoV-2 jumped from an animal to a human, either directly or through an intermediate host species. This is the same pattern behind SARS in 2003, which spread from bats to civet cats to people, and MERS, which passed through camels.
Investigators from Chinese disease control authorities swabbed the floors, walls, cages, carts, drains, and sewers of the Huanan Seafood Market in early January 2020. They found SARS-CoV-2 genetic material in some of the same stalls where live wildlife had been sold. The market had housed raccoon dogs (small foxlike animals) and civet cats, among other species. In some cases, genetic material from the virus and from these animals turned up on the same swabs.
That evidence is suggestive but not conclusive. No animal at the market was confirmed to be infected with SARS-CoV-2, and finding viral and animal DNA on the same surface doesn’t prove the animal was the source. Scientists still haven’t identified the intermediate host species, if one exists. The closest known relative of SARS-CoV-2 in bats, a virus called RaTG13, shares about 96.2% of its genome with the pandemic virus. That sounds high, but the remaining gap represents decades of evolutionary distance, meaning a more direct ancestor or a closer intermediate virus has yet to be found.
The Laboratory Incident Hypothesis
The Wuhan Institute of Virology (WIV), located in the same city where the outbreak began, is one of the world’s leading centers for bat coronavirus research. Starting in at least 2016, WIV researchers conducted experiments involving RaTG13 and other bat coronaviruses. The institute has a published record of gain-of-function research, in which scientists engineer chimeric (hybrid) viruses to study how they might infect human cells. The lab also studied animals including mice, bats, and pangolins.
A 2021 U.S. State Department fact sheet noted that the WIV had engaged in classified research on behalf of the Chinese military since at least 2017. Proponents of the lab hypothesis point to this body of work, the geographic coincidence of the outbreak, and the lack of a confirmed animal source as reasons to take the possibility seriously. Under this scenario, a researcher could have been accidentally infected during fieldwork collecting bat samples, while handling animals in the lab, or while working with live virus in experiments.
No direct evidence of a lab accident has been made public. Chinese authorities have not granted independent investigators full access to WIV records, safety logs, or virus databases, which were taken offline in September 2019. That lack of transparency has kept the hypothesis alive.
What Investigations Have Concluded
The U.S. Intelligence Community released a declassified assessment stating that SARS-CoV-2 probably emerged through an initial small-scale exposure no later than November 2019. All agencies agreed on several points: the virus was not developed as a biological weapon, Chinese officials did not have foreknowledge of the outbreak, and both the natural spillover and laboratory incident hypotheses remain plausible.
Beyond that consensus, agencies split. Four intelligence agencies and the National Intelligence Council assessed with low confidence that natural animal exposure was the most likely origin. One agency assessed with moderate confidence that a laboratory incident was more likely. Three agencies could not settle on either explanation, with individual analysts spread across both hypotheses.
The Lancet Commission, an independent panel of experts convened by one of the world’s most prominent medical journals, reached a similar impasse. Its final report stated that the proximal origin of SARS-CoV-2 remains unknown and that both hypotheses require further investigation. Commissioners themselves held diverse views. The report specifically flagged poor enforcement of biosafety regulations globally as a systemic failure, noting it raised the plausibility of a lab-related outbreak.
Why the Answer Still Matters
The origin question is not purely academic. If the virus jumped from animals, it means the wildlife trade and wet markets remain a source of pandemic risk, and surveillance of animal populations is the most important preventive measure. If it escaped from a lab, the implications point toward stricter international biosafety standards and oversight of gain-of-function research. Both paths lead to policy changes, but very different ones.
The genetic evidence alone has not settled the debate. Most intelligence agencies assessed with low confidence that SARS-CoV-2 was probably not genetically engineered, but two agencies said there was not enough evidence to make a determination either way. A natural origin does not require genetic engineering; the virus could have adapted to humans through repeated exposure in animals or even through passage in lab cultures without deliberate modification.
Without access to early patient records from Wuhan, complete virus databases from the WIV, and comprehensive sampling of wildlife in the region, the question of what set off the COVID-19 pandemic may remain unresolved for years. What is certain is the biological cause: a highly transmissible coronavirus with an unusual ability to bind human cells, one that slipped into the population undetected and spread globally before the world understood what it was facing.