Proteomics and the Biomarker Revolution That's Actually Happening

High-throughput plasma proteomics is now fast enough and cheap enough to use at biobank scale. What that means for biomarker discovery — and which companies are positioned to turn it into clinical products.

Mass spectrometry proteomics equipment

The proteomics story has been the same for twenty years: genomics is what you inherit, proteomics is what is actually happening in the body right now. Proteins are the effectors of biology. They are what drugs bind. They are what disease processes alter. The proteome is, in a very direct sense, a real-time functional readout of what is going on in a tissue or in systemic circulation.

The problem has always been measurement. Proteins are chemically heterogeneous in ways that DNA and RNA are not. There is no PCR equivalent for proteins — no tool that amplifies a specific protein molecule to detectable abundance before measurement. The analytical tools required to measure thousands of proteins simultaneously in a single sample have been slow, expensive, and technically demanding in ways that made proteomics a research tool rather than a clinical or biomarker discovery tool at meaningful scale.

That has changed. The change is bigger than the life sciences investment community has fully priced in.

What Happened to the Technology

Two platform approaches have reached the throughput and cost thresholds required for large-scale use. Aptamer-based affinity proteomics platforms can now measure over 7,000 proteins simultaneously from a microliter of plasma, with run times and costs compatible with processing thousands of samples in the context of a biobank study or a clinical trial. Mass spectrometry-based approaches have improved through data-independent acquisition methods that make untargeted proteomics more reproducible and scalable than it was when single-shot DDA was the only option.

Neither technology is perfect. Affinity-based platforms measure protein abundance but may have variable specificity for specific protein forms, and coverage of low-abundance proteins remains a challenge. Mass spectrometry offers superior molecular precision and can detect post-translational modifications and specific proteoforms, but at lower throughput. The right approach depends on the question being asked.

What matters from an investment perspective is that both technologies have now been applied to biobank samples at scale — the UK Biobank proteomics project has profiled over 50,000 plasma samples, producing one of the largest proteomic datasets in human biology. The publications coming out of that effort are generating biomarker hypotheses at a rate that would have been impossible to contemplate even five years ago.

We have tracked more than 30 disease-associated proteomic signatures emerge from large-scale biobank studies in the last eighteen months. The conversion rate from discovery to validated clinical biomarker will be substantially lower than 30 — it always is — but the starting pipeline is deeper than anything the field has had before.

The Biomarker Discovery Opportunity

The immediate application of large-scale proteomics data is biomarker discovery. Most existing clinical biomarkers were identified through hypothesis-driven research — someone had a biological rationale for why a specific protein might change in disease and then measured it. The data-driven proteomic approach finds associations without pre-specified hypotheses, which creates both opportunity and risk.

The opportunity is finding biomarkers in protein space that no one was looking for — disease-associated signatures that do not follow from existing understanding of disease mechanism and that conventional targeted assay approaches would never have identified. Some of the most interesting signals coming out of large-scale proteomics are in neurological diseases where peripheral blood biomarkers for CNS pathology have been notoriously difficult to establish. Plasma proteins are now being identified as sensitive indicators of neurodegeneration several years before clinical symptom onset.

The risk is the same as any data-driven discovery approach: false associations at scale. With 7,000 proteins measured across 50,000 samples, the statistical multiple testing problem is severe. Companies that do not apply rigorous replication standards — pre-specified validation in held-out cohorts, replication in independent datasets, orthogonal validation with independent assay technologies — will generate compelling discovery data that does not translate to clinical utility.

The Path to Clinical Products

Biomarker discovery and clinical diagnostic product development are different things. The path from a statistically significant proteomic association in a biobank study to an FDA-cleared diagnostic test involves analytical validation of the assay, clinical validation of the biomarker in the intended use population, and health economic evidence that the test result changes clinical management in ways payers will cover.

The companies we find most interesting in the space are those treating the biobank discovery work as a starting point, not an endpoint. They are pairing large-scale discovery with targeted assay development — building mass spec or immunoassay-based tests for the most promising candidates that can be validated and eventually commercialized without requiring the full proteomics platform to be deployed at the point of care.

Disease areas with high unmet need in biomarker-guided treatment are the obvious priority. Alzheimer's disease, where blood-based biomarkers for amyloid and tau pathology are moving rapidly toward clinical adoption, has been the leading example of proteomics-driven biomarker development succeeding at scale. Oncology, autoimmune disease, and cardiovascular risk stratification are all areas where the discovery pipeline is now producing candidates with serious clinical validation programs behind them.

Where the Investment Opportunity Is

The large-scale proteomics platform companies are primarily public and valued for their data generation and platform positions. The private investment opportunity is in the downstream applications: companies building targeted diagnostic products based on proteomic biomarker discoveries, companies building clinical decision support tools that integrate multi-protein panels into risk scores, and companies using proteomic biomarkers to stratify patient populations in clinical trials for therapeutic programs.

The last category is particularly interesting. Patient stratification using proteomic biomarkers can reduce the sample size and cost of clinical trials by enriching for the subpopulation most likely to respond. For programs where biological heterogeneity is driving high trial failure rates, a validated protein-based patient selection strategy can change the clinical development economics materially. This is an area where VC-backed biomarker companies can create value not primarily by developing diagnostic products but by enabling partner therapeutic programs — a business model that fits well with the risk profile of early-stage biomarker development.