David Bioinformatics Jun 2026

DAVID uses a modified Fisher’s Exact Test (EASE Score) to measure gene-enrichment.

Highly frequent data updates; robust API support for programmatic R and Python automation.

The engine that powers this discovery is . Grounded in the Fisher’s Exact Test (a statistical cousin of the hypergeometric distribution), DAVID asks a simple but powerful question: Given a background set (e.g., all genes on a microarray), is a particular biological term found in your gene list more often than would be expected by chance? The output—an EASE score (a modified, more conservative Fisher p-value)—is a statistical whisper that points toward biological causality. A low p-value for the term “glycolysis” in a list of genes upregulated under low oxygen does not prove a mechanism, but it provides a high-confidence hypothesis, a starting gun for further experimental validation. david bioinformatics

Traditional enrichment analysis lists significant terms independently. This approach creates massive redundancy because multiple terms often describe identical biological processes.

InterPro, Pfam, and STRING networks.

Sherman, B.T., et al. Nucleic Acids Research (2022). URL: https://david.ncifcrf.gov/

To reduce redundancy in reporting (e.g., reporting "Cell Death" and "Apoptosis" separately), DAVID offers: DAVID uses a modified Fisher’s Exact Test (EASE

Review the Group Enrichment Scores for each cluster. Focus on clusters with high enrichment scores ( >1.3is greater than 1.3 , which corresponds to a non-log adjusted p-value of

At its core, DAVID addresses the fundamental problem of scale. The human mind struggles to derive meaning from a list of 500 gene symbols. But if those 500 genes are collapsed into a handful of biological themes—"cell cycle," "DNA repair," "apoptosis"—a story emerges. DAVID’s most celebrated contribution is . This is not a simple keyword search; it is an agglomerative algorithm that uses the fuzzy logic of biological knowledge. It recognizes that the terms "apoptosis" (from GO Biological Process), "caspase activity" (from GO Molecular Function), and "death domain" (from InterPro domains) all describe the same underlying phenomenon. By grouping redundant and related annotations, DAVID identifies the true biological “themes” that are overrepresented in a user’s gene list, suppressing the noise of semantic variation. Grounded in the Fisher’s Exact Test (a statistical

Over its history, the platform has occasionally experienced long intervals between data synchronizations. Always check the current data release notes on the website. Ensure that the reference annotations align with the newest versions of Gene Ontology or KEGG.

[Paste Gene List] -> [Select Identifier Type] -> [Choose List Type] -> [Submit Query] │ [Analyze Chart/Clustering] <- [Select Background Population] <──────────────┘ Step 1: Uploading Data Open the tool page.