Biocompatibility testing is the process of evaluating whether a medical device or material is safe to use in or on the human body without causing harmful biological reactions. Every medical device that touches your skin, blood, or internal tissues, from a simple bandage to a hip implant, must go through some form of this evaluation before it can reach the market. The specific tests required depend on two factors: where the device contacts the body and how long it stays there.
How Devices Are Categorized
The FDA and international standards group medical devices into categories based on the nature of body contact and its duration. These two variables determine exactly which biological tests a manufacturer needs to run.
For body contact, devices fall into three broad groups. Surface devices touch intact skin, mucous membranes, or compromised skin. External communicating devices have an indirect blood path or contact tissue, blood, or circulating blood from outside the body (think catheters or dialysis tubing). Implant devices are placed directly into tissue, bone, or blood and stay there.
Duration is split into three windows: limited (24 hours or less), prolonged (more than 24 hours up to 30 days), and long-term or permanent (more than 30 days). A device that sits on intact skin for a few hours faces far fewer testing requirements than a permanent bone implant. The combination of contact type and duration creates a matrix that tells manufacturers which biological endpoints they need to evaluate.
Core Tests in Biocompatibility Evaluation
Cytotoxicity
Cytotoxicity testing is the most basic and most common biocompatibility test. It checks whether a material kills or damages living cells. Rather than testing the finished device directly, labs typically soak the material in an extraction medium for 24 hours at body temperature (37°C), then expose mammalian cells to the extract. The cells are then assessed for changes in growth, replication, and shape. If the extract kills a significant portion of the cells, the material fails. This is an in vitro test, meaning it happens entirely in a lab dish, with no animals involved.
Sensitization
Sensitization testing determines whether a material could trigger an allergic response after repeated exposure. The two main methods are the Guinea Pig Maximization Test and the Local Lymph Node Assay. The Guinea Pig Maximization Test involves exposing animals to the material and watching for an immune reaction over time. The Local Lymph Node Assay measures immune cell activity in lymph nodes after skin exposure and is evaluated by the FDA on a case-by-case basis for medical device extracts.
There are important limitations. The Local Lymph Node Assay cannot be used for nickel-containing metals, nanomaterials, or novel materials that have never been used in a legally marketed device. For those, the Guinea Pig Maximization Test is recommended because it includes injection under the skin, ensuring the substance actually reaches the immune system regardless of whether it can penetrate skin on its own. A third method, the Buehler test, is accepted only for devices that contact intact skin.
Irritation
Irritation testing checks for localized inflammation, redness, swelling, or tissue damage at the contact site. This is distinct from sensitization: irritation is an immediate, non-immune response, while sensitization is an allergic reaction that develops over time. If irritation testing is not built into the sensitization study, and something goes wrong, it can be difficult to tell which type of reaction occurred, potentially requiring additional studies.
A newer option under ISO 10993-23 allows skin irritation to be evaluated without animals. Reconstructed human epidermis models, lab-grown layers of human skin cells, are exposed to device extracts and then assessed for tissue viability. This in vitro approach is now accepted for medical device biocompatibility evaluation.
Systemic Toxicity
Systemic toxicity testing looks at whether chemicals released from a device could harm organs or systems beyond the contact site. In acute toxicity tests, animals receive a single relatively high dose of the test substance and are observed for up to 14 days. Researchers watch for signs of poisoning, delayed effects, and organ damage. At the end of the observation period, tissues are examined under a microscope to identify damage to specific organs. Devices with longer contact durations may require subacute or subchronic studies that extend the exposure and observation periods.
Chemical Characterization as an Alternative
Not every biocompatibility question requires a biological test. Chemical characterization identifies and quantifies the specific chemicals that could leach out of a device into the body. When combined with a toxicological risk assessment, comparing the amounts released against known safety thresholds, this approach can replace biological testing for certain endpoints entirely.
This is a significant shift in how biocompatibility is evaluated. Chemical characterization can address multiple safety endpoints at once, often faster than running separate biological studies for each one. It also reduces the need for animal testing. The FDA recognizes this approach under ISO 10993-18 and encourages it as part of a risk-based evaluation strategy. It is also useful when a manufacturer changes materials or manufacturing processes for an existing device, since chemical equivalence to the original can be demonstrated without repeating a full suite of biological tests.
Why Material Choice Matters
The material a device is made from shapes both its biocompatibility profile and the concerns that testing needs to address. Titanium, one of the most widely used implant materials, is generally well tolerated but can trigger allergic or immune reactions in susceptible individuals. It also has a stiffness (elastic modulus around 110 GPa) far higher than surrounding bone (roughly 13 GPa for cortical bone). This mismatch means the implant absorbs mechanical load that would normally stimulate the bone, a phenomenon called stress shielding. Over time, the bone receives less stimulation, loses density, and can resorb, potentially loosening the implant.
PEEK (polyetheretherketone) has emerged as an alternative with a stiffness of 3 to 4 GPa, much closer to natural bone. This transfers more load to surrounding tissue, which can preserve bone health. The trade-off is that pure PEEK bonds less readily to bone than titanium does. Surface treatments like hydroxyapatite coatings can improve this, but the material requires its own set of biocompatibility studies to validate those modifications. Each material brings a different biological risk profile, and biocompatibility testing is how those risks are identified and measured.
The Regulatory Framework
Biocompatibility testing is governed primarily by the ISO 10993 series of international standards, with the FDA providing additional guidance for devices sold in the United States. ISO 10993-1 is the overarching standard that frames biocompatibility evaluation within a risk management process. It does not prescribe a fixed checklist of tests for every device. Instead, it asks manufacturers to consider the device’s materials, contact type, duration, and intended use, then justify which evaluations are necessary.
The FDA’s guidance builds on this by incorporating risk-based approaches to determine whether biological testing is even needed. For some low-risk scenarios, such as devices made from well-characterized materials that only contact intact skin, a manufacturer may be able to justify skipping certain tests altogether based on existing data. The guidance also includes specific recommendations for newer technologies: devices with nanoscale components, materials that polymerize inside the body, and absorbable materials that break down over time all have additional considerations beyond the standard matrix.
For manufacturers, biocompatibility evaluation is not a single test but a strategy. It combines material history, chemical analysis, existing literature, and targeted biological testing to build a safety case. The goal is not to prove a device is perfectly inert, since almost nothing placed in the body is truly inert, but to demonstrate that any biological response it causes falls within acceptable limits for its intended use.