Peptidomics - Creative Proteomics
Neoantigen Discovery and Target Identification Services

Overcoming In Silico Limitations in Tumor Antigen Discovery

The field of immuno-oncology has rapidly evolved beyond tracking simple tumor mutational burden (TMB). While next-generation sequencing (NGS) allows for the rapid identification of somatic mutations, relying solely on genomic data and in silico binding algorithms presents significant limitations.

Algorithms cannot accurately model the highly complex intracellular antigen processing machinery—including proteasomal cleavage, TAP transport, and ERAP trimming. This biological disconnect frequently leads to high false-positive rates, funneling resources into predicted neoantigens that are never naturally presented on the tumor cell surface.

To overcome this, modern neoantigen discovery demands an evidence-based approach. Whether it involves mapping the wild-type landscape via baseline HLA Peptidomics Analysis or actively hunting for tumor-specific mutations via high-resolution mass spectrometry, identifying true neoepitopes requires integrating physical peptide detection with deep genomic and transcriptomic context.

Intracellular antigen processing and presentation pathway

Immuno-Oncology Applications for Neoantigen Identification

Our neoantigen target identification hub is designed to support diverse translational research and therapeutic development pipelines:

Personalized Cancer Vaccines
Personalized Cancer Vaccines
Discover highly specific, patient-restricted neoepitopes to formulate mRNA or peptide vaccines that elicited robust T cell responses while bypassing central tolerance.
TCR-T / Adoptive Cell Therapy
TCR-T / Adoptive Cell Therapy
Identify and physically confirm the presentation of tumor-restricted targets to safely engineer T-cell receptors, mitigating the risk of on-target, off-tumor toxicities.
Biomarker Discovery & Patient Stratification
Biomarker Discovery & Patient Stratification
Profile the naturally presented neoantigen landscape to uncover mechanisms of immune evasion and identify predictive biomarkers for immune checkpoint inhibitor (ICI) response.

Comprehensive Neoantigen Discovery Solutions

As a fully integrated CRO, Creative Proteomics offers a complete suite of neoantigen discovery and prioritization services. We help you select the optimal route based on your project goals, integrating our specialized capabilities into a seamless workflow.

Mass Spectrometry-Based Neoantigen Discovery
For projects requiring direct physical evidence of antigen presentation without the need for prior patient sequencing, our Mass Spectrometry-Based Neoantigen Discovery Service provides an untargeted, deeply sensitive physical readout of the tumor ligandome using advanced LC-MS/MS and specialized de novo sequencing.
Proteogenomics-Driven Discovery
The ultimate multi-omics approach. Our Neoantigen Discovery by Proteogenomics Services strictly integrates patient-specific WES and RNA-seq data to build customized variant databases. We then use ultra-sensitive MS to search against this bespoke database, identifying true somatic variations with a strictly controlled FDR.
Tumor Neoantigen Validation
For clients who have already generated computational predictions and need to prioritize candidates. Our Tumor Neoantigen Discovery and Validation Services utilize targeted MS strategies (e.g., PRM/SureQuant) and heavy-isotope standards to physically confirm the presentation of your predicted shortlist.
In Silico Prediction and Prioritization
A computational starting point for early-stage screening. We ingest raw WES/RNA-seq data to call variants, calculate transcript expression thresholds, and utilize AI-driven algorithms alongside tools referencing the Immune Epitope Database (IEDB) to predict HLA binding affinities and generate initial candidate lists.
Cryptic & Non-Canonical Antigen Mining
Standard pipelines often miss non-coding regions. We offer specialized discovery of immunogenic cryptic peptides derived from non-coding regions, alternative splicing, and translated UTRs, utilizing 6-frame translated databases.
Baseline Immunopeptidome Profiling
Utilizing our core Immunopeptidomics Service, we provide global profiling of HLA Class I and Class II peptide repertoires across healthy and diseased tissues to establish vital baseline presentation maps.

Route Selection: Comparing Neoantigen Discovery Methodologies

Selecting the right discovery route depends heavily on your current research phase and sample availability. Our consultative approach ensures you invest in the appropriate level of evidence.

Discovery Route Primary Methodology Output Evidence Level Best Suited For
In Silico Prediction WES + RNA-seq + Algorithms Low: Theoretical binding scores and expression support. Early target screening; establishing mutation burden.
MS-Based Discovery HLA Immunoaffinity + LC-MS/MS Medium-High: Physical detection of tumor ligandomes. Direct discovery without sequencing; profiling cell lines.
Proteogenomics WES/RNA-seq + LC-MS/MS High: Custom database matching for unbiased discovery. Comprehensive novel target discovery from clinical biopsies.
Targeted Validation Pre-existing List + Targeted MS Highest: Absolute confirmation using synthetic standards. Late-stage pre-clinical validation; finalizing IND targets.

Detectable Tumor-Specific Antigen Types

Integrating RNA-seq with high-resolution mass spectrometry unlocks the ability to discover non-canonical targets that traditional DNA-centric predictions overlook.

Antigen Type Biological Origin Discovery Strategy
SNV-Derived Neoantigens Point mutations in protein-coding exons. Direct mapping of MS/MS spectra to variant databases.
Indel-Derived Neoantigens Frameshift mutations altering the reading frame. Identification of novel sequences via custom FASTA search.
Fusion-Derived Neoantigens Chromosomal translocations producing chimeric transcripts. Detection of peptides physically spanning the fusion junction.
Splice-Derived Neoantigens Dysregulated splicing or splice site mutations. RNA-seq guided mapping of intron-retention events.
Cryptic / Non-Canonical Translation of non-coding regions (UTRs, ncRNAs). Searching spectra against 6-frame translated RNA-seq data.

Core Platform Advantages

Creative Proteomics bridges the gap between genomics and proteomics, offering unique technical advantages that ensure your neoantigen discovery is accurate, reproducible, and ready for clinical translation.

Ultra-Low Sample Inputs
Optimized immunoaffinity protocols and highly sensitive MS platforms allow us to profile tumor ligandomes from as little as 50 mg of tissue, enabling analysis of rare clinical needle biopsies.
Strict False-Positive Control
We utilize custom, non-specific decoy database strategies to enforce a strict <1% False Discovery Rate (FDR) at the Peptide-Spectrum Match (PSM) level, ensuring only true HLA ligands are reported.
Dual-Class HLA Profiling
Simultaneous or separate enrichment of both HLA Class I (CD8+ targets) and HLA Class II (CD4+ targets) complexes from the same limited sample.
Isobaric Ambiguity Resolution
Advanced spectral scoring algorithms capable of accurately differentiating between Leucine and Isoleucine, preventing wild-type sequences from being falsely validated as mutated antigens.
Custom FASTA Database Generation
By directly utilizing patient-specific RNA-seq and WES data, our pipelines seamlessly identify individual somatic mutations, moving beyond reference wild-type sequences to ensure true personalized neoantigen discovery.
Unbiased Non-Canonical Antigen Detection
Leveraging 6-frame translation and splice-variant mapping, our multi-omics platform captures cryptic peptides and fusion-derived targets that traditional predictive algorithms inherently miss.

End-to-End Neoantigen Discovery Workflow

Regardless of the selected route, our operations follow a rigorous, quality-controlled wet-lab and dry-lab workflow to ensure data integrity from sample preparation to final reporting.

Project Consultation & Route Selection
Define HLA haplotype & pipeline
Sample Processing & Immunoaffinity
HLA complex immunoprecipitation
Multi-Omics Data Acquisition
High-resolution LC-MS/MS & NGS
Bioinformatics Filtering
FDR control & motif clustering
Prioritization & Target Reporting
Ranked neoepitope matrix
1
Project Consultation & Route Selection
We systematically evaluate your available biological inputs (tissues, cell pellets, FastQ files), define the patient-specific HLA haplotype background, and select the optimal bioinformatic and mass spectrometry pipeline.
2
Sample Processing & Immunoaffinity Enrichment
Tissue samples undergo gentle mechanical lysis and membrane solubilization. We utilize highly specific monoclonal antibodies (pan-HLA or allele-specific) to immunoprecipitate intact HLA-peptide complexes, followed by mild acidic elution to release the naturally presented peptides without destroying their structure.
3
Multi-Omics Data Acquisition
Eluted peptides are sequenced using cutting-edge liquid chromatography-tandem mass spectrometry (LC-MS/MS) via Data-Dependent (DDA) or Data-Independent Acquisition (DIA) modes. Concurrently, DNA and RNA are sequenced (WES/RNA-seq) if a proteogenomic route is selected.
4
Bioinformatics Filtering & Evidence Integration
MS spectra are matched against standard human proteomes and patient-specific mutated FASTA databases. We apply rigorous FDR controls (<1%), sequence motif clustering, and transcriptomic expression thresholding (TPM) to filter out wild-type noise and artifactual matches.
5
Prioritization & Target Reporting
Validated neoantigens are mapped to source genes and systematically ranked in a prioritization matrix based on physical MS abundance, RNA expression, and algorithmically predicted HLA binding affinity (%Rank), yielding a ready-to-use shortlist for functional testing.

High-Resolution Mass Spectrometry Platforms for Immunopeptidomics

Detecting neoantigens is exceptionally challenging due to their incredibly low abundance among thousands of endogenous wild-type peptides. Following the complex enrichment workflow, Creative Proteomics relies on industry-leading mass spectrometry hardware to ensure unmatched sensitivity, resolution, and scanning speed.

MS Platform Comparison

Platform Core Technology Key Advantage Best Suited For
Orbitrap Astral™ Asymmetric Track Lossless (Astral) analyzer Unparalleled scanning speed (>200 Hz) and high dynamic range. Deep proteogenomic ligandome discovery from limited samples.
timsTOF Ultra Trapped Ion Mobility Spectrometry (TIMS) & PASEF® 4D-Proteomics with extreme sensitivity and isobaric resolution. Ultra-low input biopsies and complex HLA ligand mixtures.
Orbitrap Exploris™ 480 High-Field Orbitrap Robust, reproducible high-resolution mass accuracy. Standard baseline immunopeptidomics and large cohort profiling.
TSQ Altis / SureQuant™ Triple Quadrupole / Targeted Orbitrap Absolute heavy-isotope quantification and multiplexing. Targeted pre-clinical validation of shortlisted neoantigens.

Comprehensive Sample Requirements for Neoantigen Profiling

Our flexible pipelines accommodate a wide variety of starting materials. Because HLA immunopeptidomics relies on extracting intact, non-crosslinked protein complexes, Formalin-Fixed Paraffin-Embedded (FFPE) tissues are strictly incompatible with MS-based discovery.

Sample Type Minimum Input Requirement Ideal Input for Deep Profiling Notes & Preservation
Fresh Frozen Tumor Tissue ≥ 50 mg 100 - 200 mg Must be snap-frozen immediately in liquid N2. Do NOT use OCT embedding.
Matched Normal Tissue ≥ 30 mg 50 mg Essential for proteogenomics to filter out patient-specific germline variants.
Cell Lines / Cultured Cells ≥ 1×10⁸ cells 5×10⁸ cells Wash thoroughly with cold PBS before snap-freezing the cell pellet.
Blood / PBMCs ≥ 1×10⁸ viable cells ≥ 1×10⁸ viable cells Suitable for baseline profiling or characterizing antigen-presenting cells (APCs).
Nucleic Acids (For In Silico) 1 µg DNA / 2 µg RNA 2 µg DNA / 5 µg RNA Required if starting from physical samples rather than raw sequencing data.
Raw Sequencing Data N/A N/A FastQ, BAM, or VCF formats. Acceptable for pure bioinformatic prediction.

Multi-Omics Bioinformatics & Data Visualization

Our bioinformatics pipelines generate publication-ready, Nature-grade data visualizations, providing transparent insights into the discovered neoantigen landscape.

Differential Neoantigen Presentation (Volcano Plot)

Volcano plot showing differential presentation of tumor neoantigens versus normal tissue

Differential Ligandome Analysis: Volcano plot highlighting significantly over-presented tumor-associated antigens and uniquely detected somatic neoepitopes compared to matched normal tissues.

HLA-Restricted Ligandome Motif Analysis

Sequence logo plots demonstrating HLA-restricted binding motifs of discovered neoantigens

HLA Binding Motif Verification: Sequence logo clustering confirms that physically detected peptides adhere to the anchor residue motifs of the patient's HLA allotypes, validating immunoprecipitation quality.

Extracted Ion Chromatogram (XIC) Quantification

Extracted ion chromatogram demonstrating quantitative MS validation of a neoepitope using a heavy isotope standard

Quantitative Physical Validation: Extracted Ion Chromatograms (XIC) demonstrating perfect retention time alignment between the endogenous neoantigen and a synthetic standard, ensuring high-confidence identification.

High-Resolution MS/MS Sequence Tagging

Annotated high-resolution MS2 fragmentation spectrum confirming the somatic mutation sequence of a neoantigen

Definitive Sequence Confirmation: Annotated tandem mass spectrum (MS2) providing incontrovertible physical proof of the neoantigen's exact sequence, pinpointing the somatic mutation site.

Neoantigen Data Deliverables & Downstream Validation

Creative Proteomics delivers highly structured, actionable data packages designed to empower your downstream clinical and research decision-making.

  • Target Prioritization Matrix: An integrated dataset ranking discovered neoantigens based on empirical MS abundance, transcriptomic expression (TPM), and HLA binding affinities (%Rank).
  • Comprehensive Variant Annotation: Detailed genomic contextualization of every validated peptide, identifying source genes and variant types.
  • Raw Multi-Omics Data: Full access to generated FastQ files, BAM alignments, and high-resolution mass spectrometry RAW files for internal archiving.
  • Executive Summary Reports: Methodological details, quality control metrics (FDR control, HLA yield), and advanced graphical visualizations.

Frequently Asked Questions

Which neoantigen discovery route is right for my project? +
If you only have genomic data, In Silico Prediction is the starting point. If you have tissue but no sequencing data, MS-Based Discovery is ideal. For the most rigorous, unbiased discovery with minimal false positives, the Proteogenomics route is recommended.
Do I need to provide both genomic data and physical tissue? +
Only for the Proteogenomics and Validation routes. Standard MS-Based Discovery only requires the physical tissue to profile the ligandome. Pure in silico prediction only needs DNA/RNA extracts or raw FastQ data.
How do you distinguish between prediction and physical discovery? +
Prediction uses algorithms to guess what might be presented based on DNA mutations. Physical discovery uses mass spectrometry to physically extract and measure actual peptides bound to HLA molecules, providing definitive proof of presentation.

T-cell receptor binding to an MHC-peptide complex

Can you identify structural variants like fusions and splice variants? +
Yes. By integrating deep RNA-seq data into our search databases, we can identify and physically validate neoantigens derived from gene fusions and cryptic non-canonical peptides that standard SNV-focused algorithms miss.
How is the False Discovery Rate (FDR) controlled in MS-based discovery? +
We utilize specialized decoy database generation algorithms to strictly enforce a <1% FDR at the peptide-spectrum match (PSM) level, ensuring high confidence in every reported sequence.
Do you support both HLA Class I and Class II profiling? +
Yes. Our immunopeptidomics pipelines utilize specific pan-HLA antibodies to independently enrich for both HLA Class I (CD8+ targets) and HLA Class II (CD4+ targets) complexes.
What is the next step after neoantigen discovery? +
Typically, the next step involves synthesizing top peptide candidates for functional T-cell assays (e.g., ELISpot) to confirm immunogenicity, or proceeding to vaccine formulation.

Case Study: Proteogenomic Identification of Melanoma Neoepitopes

Journal: Nature Communications

Published: Volume 7, 2016


Summary

Using a multi-omics proteogenomic approach, researchers integrated WES, transcriptomics, and high-resolution mass spectrometry to identify naturally presented neoantigens directly from melanoma tissues. The study proved that mass spectrometry significantly reduces the false-positive rates of in silico predictions, leading to the discovery of physically validated neoepitopes capable of eliciting strong T cell responses.

Methods & CP Support

This study highlights a rigorous proteogenomic pipeline to physically validate algorithmic predictions. To support research following these principles, Creative Proteomics offers specialized services including:

  • Custom variant database construction from patient-specific WES/RNA-seq.
  • High-resolution HLA immunopeptidomics profiling on Orbitrap/timsTOF platforms.
  • Multi-layer evidence integration to filter false positives for ranked candidate reporting.

Results

  • MS Validation: Only a highly specific subset of computationally predicted mutations was physically detected by MS on the tumor cell surface.
  • Novel Targets: The approach identified mutant neoepitopes (e.g., from NCAPG2 and SYTL4 genes) that were completely absent in matched normal tissues.
  • Functional Activation: T-cell assays confirmed that MS-validated neoepitopes triggered robust IFN-γ secretion from the patient's own CD8+ T cells.

Reference

  1. Bassani-Sternberg, M., et al. "Direct identification of clinically relevant neoepitopes presented on native human melanoma tissue by mass spectrometry." Nature Communications 7, 13404 (2016). DOI: 10.1038/ncomms13404

For Research Use Only. Not for use in diagnostic procedures.

Get a Quote