History and Legacy of the Human Metabolome Project
The Human Metabolome Project (HMP) is a pioneering large-scale scientific initiative that transformed metabolomics from a fragmented, underdeveloped discipline into a comprehensive, data-driven “omics” science. Conceived in the early 2000s and formally launched in 2004–2005, the HMP aimed to systematically identify, quantify, and characterize all metabolites in the human body while building the databases, tools, and technologies necessary to make metabolomics broadly accessible to researchers worldwide. As of 2026 the HMP is still continuing with funds provided by donors, sponsors and national funding agencies. If you are interested in contributing or supporting the HMP, please contact us.
- The Human Metabolome Project (HMP) was conceived and designed by Dr. David Wishart in 2003 and officially awarded funding in April 2004, with full project funding secured by January 2005. From its inception, Dr. Wishart has led, coordinated, and directed the project, ensuring its continuity, growth, and long-term scientific vision.
- Originally proposed under the title “Building the Metabolomics Toolbox,” the project was later renamed the Human Metabolome Project based on recommendations from funding agencies and peer review committees, aligning it more closely with other large-scale “omics” initiatives such as the Human Genome Project.
- The HMP was explicitly modeled after the Human Genome Project (launched in 1990), with a similar philosophy of comprehensive data collection, infrastructure development, and open data sharing. It also preceded and helped inspire the Human Proteome Project (launched in 2010), positioning metabolomics as a central pillar in systems biology.
- Like the Human Genome Project, the HMP emphasized the systematic collection of large-scale biological data, the development of robust bioinformatics infrastructure, and the creation of new analytical technologies. A key guiding principle was the FAIR model—ensuring that all data and tools were Findable, Accessible, Interoperable, and Reusable by the global scientific community.
- The initial HMP team was composed primarily of researchers from the University of Alberta, with expertise spanning analytical chemistry, medicine, computer science, and biochemistry. Core members included Dr. David Wishart (NMR, GC-MS, informatics, AI), Dr. Russell Greiner (AI, informatics), Dr. Fiona Bamforth (clinical chemistry and rare diseases), Dr. Tom Marrie (infectious diseases), Dr. Brian Sykes (NMR spectroscopy), Dr. Liang Li (LC-MS), and Dr. Derrick Clive (organic synthesis), along with Dr. Hans Vogel (NMR) from the University of Calgary.
- The HMP was—and remains—a uniquely Canadian initiative, largely based in Alberta, representing one of the most significant national contributions to global “omics” science. Its success demonstrates how focused, collaborative efforts within a relatively small geographic region can achieve worldwide scientific impact.
- Initial funding for the HMP consisted of $3.61 million from Genome Canada, matched by an additional $3.61 million from provincial and industry partners. Over time, the project has secured more than $18 million in additional funding from agencies including Genome Canada, the Canada Foundation for Innovation (CFI), Alberta Innovates, the Canadian Institutes of Health Research (CIHR), and the Natural Sciences and Engineering Research Council (NSERC) , supporting its continued expansion and innovation.
- In its early years, the HMP focused on building foundational data resources and experimental capabilities. This included the development of the Human Metabolome Database (HMDB), the creation of chemical reference standards and libraries, and the comprehensive measurement of metabolites in major human biofluids such as serum, plasma, urine, cerebrospinal fluid, stool, and saliva. These efforts also established normal concentration ranges and disease-associated thresholds for many metabolites, along with standardized nomenclature and metabolite identifiers.
- A major institutional milestone occurred in 2011 with the establishment of The Metabolomics Innovation Centre (TMIC), which has provided ongoing infrastructure, technical expertise, and support for the HMP and related metabolomics initiatives.
- One of the most important legacies of the HMP is that it laid the groundwork for making metabolomics a robust, scalable, and widely accessible “omics” science. By standardizing methods, developing databases, and providing open tools, the HMP effectively democratized metabolomics, enabling researchers around the world to adopt and apply metabolomic techniques.
- The HMP pioneered the concept of absolutely quantitative metabolomics, moving beyond relative measurements to provide precise, reproducible concentration data for metabolites across different biological samples. This innovation significantly improved the reliability and clinical utility of metabolomic data.
- The project also introduced and advanced the concept of large-scale literature mining as a means of expanding metabolome coverage. By integrating automated text-mining approaches—including recent advances using large language models— the HMP has enabled continuous growth and refinement of metabolomic databases, ensuring their relevance for decades to come.
- A central achievement of the HMP was the creation of the Human Metabolome Database (HMDB) , the first comprehensive, organism-specific metabolomics database. Beyond HMDB, the project expanded its scope to include related databases covering dietary compounds (FooDB), drugs (DrugBank), toxic substances (T3DB), environmental exposures (ExposomeExplorer), and microbial metabolites (MiMeDB), thereby integrating multiple dimensions of human chemical exposure and metabolism.
- The HMP was also the first initiative to comprehensively characterize the metabolomes of major human biofluids using multiple analytical platforms. It generated the first large-scale metabolite reference libraries, developed pathway databases such as the Small Molecule Pathway Database (SMPD) and PathBank, and contributed to early mass spectral libraries, including collaborative efforts such as the T-Rex library with Bruker. Additionally, it pioneered chemical isotope labeling (CIL) strategies to enhance sensitivity and throughput in mass spectrometry-based metabolomics.
- The project produced a suite of widely used computational tools for metabolomic data analysis, including MetaboAnalyst, MetPA, MSEA, and MetATT. It also developed advanced artificial intelligence tools for spectral prediction and metabolite identification, such as CFM-ID and FraGNNet for MS/MS spectra, CFM-EI for GC-MS spectra, and highly accurate predictors for retention indices (RIPred), retention times (RTPred), and NMR spectra (PROSPRE, CASPRE). Automated NMR analysis tools such as Bayesil and MagMet further streamlined metabolomic workflows.
- The HMP continues to evolve, with recent efforts focused on comprehensive characterization of standardized reference materials such as NIST SRM-1950 plasma, as well as the development of high-throughput metabolomics kits (e.g., PRIME, MEGA, and GIGA) capable of quantitatively measuring up to 1,700 metabolites in a single assay.
- The tools, databases, and methodologies developed through the HMP have inspired numerous related projects aimed at characterizing metabolomes in other organisms, including E. coli (ECMDB), yeast (YMDB), cattle, and many additional species, thereby extending the project’s impact beyond human biology.
- The Human Metabolome Database (HMDB) itself has undergone rapid and sustained growth. HMDB 1.0 (2007) began with 2,180 metabolites; HMDB 2.0 (2009) expanded to 6,408 compounds; HMDB 3.0 (2013) grew to over 40,000 compounds; HMDB 4.0 (2018) reached 114,100 compounds; and HMDB 5.0 (2022) expanded further to over 217,000 annotated metabolites along with a large number of predicted compounds.
- As of early 2026, the HMDB contains more than 253,000 metabolites, representing over a 100-fold increase since its initial release. This growth highlights both the expanding knowledge of human metabolism and the effectiveness of the HMP’s data integration strategies.
- The scale of the HMDB now far exceeds that of other biological “omes,” containing approximately 12 times more metabolites than known human genes and 13 times more than known human proteins. This underscores the chemical complexity of human biology and the importance of metabolomics in understanding phenotype.
- Perhaps most significantly, the HMP demonstrated that comprehensive, global “omics” characterization can be achieved with relatively modest budgets and a small number of highly coordinated research groups. Through strong leadership, interdisciplinary collaboration, and a commitment to open science, the HMP established a model for future large-scale biological projects.