What Is Exome Sequencing and How Does It Work?

Exome sequencing is a genetic testing tool that allows scientists to examine specific parts of an individual’s DNA. This technique focuses on the regions of the genome that provide instructions for making proteins, which are the building blocks and workers of our cells. By concentrating on these areas, exome sequencing helps identify genetic variations that may contribute to various health conditions. It has become a significant method in understanding the genetic basis of both rare and complex diseases, offering insights into inherited predispositions and acquired changes.

The Exome: A Key Part of Your DNA

The human genome contains an individual’s genetic information, composed of approximately 3 billion DNA building blocks. Within this vast genetic blueprint, only a small fraction directly codes for proteins. This crucial subset of the genome is known as the exome.

The exome consists of the protein-coding segments of genes, called exons. While these exons make up only about 1% to 2% of the entire human genome, they are highly significant for human health. The majority of known disease-causing genetic variations, estimated to be around 85%, are found within these protein-coding regions. This concentration of clinically relevant information makes the exome a focal point for genetic investigations into disease.

How Exome Sequencing Works

Exome sequencing involves several steps to isolate and read genetic information contained within the exome. The process begins with DNA extraction, where a sample of an individual’s DNA, typically from blood or saliva, is collected. This extracted DNA is then fragmented into smaller, manageable pieces.

Following fragmentation, specific DNA adapters are attached to these pieces. A step called target enrichment then occurs, where probes are used to specifically capture and isolate exonic regions from the fragmented DNA. These captured exonic fragments are then sequenced using high-throughput technology, which reads the order of the DNA building blocks. The final stage involves data analysis, where the sequenced reads are aligned to a reference human genome, and specialized computational tools identify any genetic variations present.

Why Exome Sequencing is Used

Exome sequencing is widely applied in medicine and research due to its ability to efficiently identify genetic changes in protein-coding regions. One primary application is in the diagnosis of rare genetic diseases, particularly those with an unknown genetic cause. For individuals with suspected Mendelian disorders, where a single gene mutation causes the condition, exome sequencing has a reported diagnostic yield ranging from 25% to 50%. This includes conditions like Freeman Sheldon syndrome and certain primary immunodeficiencies, where the technique has successfully identified causative mutations.

Exome sequencing also plays a significant role in cancer genomics, helping to identify mutations that drive cancer development and progression. It can reveal both inherited predispositions to cancer and somatic mutations that arise within tumor cells. Exome sequencing also aids scientists in discovering new genes associated with complex diseases and shedding light on disease mechanisms. This allows for a deeper understanding of various health conditions and potential therapeutic targets.

Exome Sequencing vs. Whole Genome Sequencing

Exome sequencing is often compared with whole genome sequencing (WGS), another genetic testing method. The primary difference lies in the scope of DNA analyzed. Exome sequencing focuses exclusively on the protein-coding regions. In contrast, whole genome sequencing determines the sequence of an individual’s entire DNA, including both coding and non-coding regions.

In terms of cost, exome sequencing has been more affordable because it analyzes a smaller portion of the DNA. While whole genome sequencing costs have decreased over time, exome sequencing remains a cost-effective option for identifying protein-altering variants. Exome sequencing also generates a smaller, more manageable amount of data, which can simplify the analysis process compared to the data produced by whole genome sequencing. However, WGS offers a more comprehensive view, capable of detecting a broader range of genetic variations, including those in non-coding regions and certain structural changes that exome sequencing might miss.

What Exome Sequencing Can Tell Us and Its Limitations

Exome sequencing can effectively identify variations within protein-coding regions that are known or likely to be associated with disease. These findings can include pathogenic variants, which are clearly linked to a condition, or variants of uncertain significance (VUS), whose impact on health is not fully understood. The interpretation of these results often requires specialized knowledge due to the complexity of genetic information.

Despite its utility, exome sequencing has specific limitations. It primarily focuses on coding regions and may not detect mutations located in the non-coding parts of the genome, such as regulatory elements that control gene activity or introns, the intervening sequences within genes. Exome sequencing can also struggle to identify certain types of genetic changes, including large structural variants like deletions, duplications, inversions, or translocations. The technology does not always achieve complete coverage of every exonic region, meaning some disease-causing variants could be missed due to gaps in sequencing or challenges with complex genomic areas.