IgGM: A Generative Model for Functional Antibody and Nanobody Design

Rubo Wang *

University of Chinese Academy of Sciences

Fandi Wu *

Tencent AI Lab

Xingyu Gao

Chinese Academy of Sciences

Jiaxiang Wu

Tencent AI Lab

Peilin Zhao

Tencent AI Lab

Jianhua Yao

Tencent AI Lab

ICLR 2025

*Equal contribution, Corresponding author

Abstract

Immunoglobulins are crucial proteins produced by the immune system to identify and bind to foreign substances, playing an essential role in shielding organisms from infections and diseases. Designing specific antibodies opens new pathways for disease treatment. With the rise of deep learning, AI-driven drug design has become possible, leading to several methods for antibody design. However, many of these approaches require additional conditions that differ from real-world scenarios, making it challenging to incorporate them into existing antibody design processes. Here, we introduce IgGM, a generative model for the de novo design of immunoglobulins with functional specificity. IgGM simultaneously generates antibody sequences and structures for a given antigen, consisting of three core components: a pre-trained language model for extracting sequence features, a feature learning module for identifying pertinent features, and a prediction module that outputs designed antibody sequences and the predicted complete antibody-antigen complex structure. IgGM effectively predicts structures and designs novel antibodies and nanobodies. This makes it highly applicable in a wide range of practical situations related to antibody and nanobody design.

Introduction

For more details, please refer to the paper.

An antibody consists of a symmetric Y-shaped structure, which includes variable regions (VH, VL) and constant regions (CH, CL). In practical antibody design, the focus is on the variable regions, which comprise the framework regions (FRs) and the complementarity-determining regions (CDRs).

De novo antibody design refers to the process of creating a novel antibody that can bind to a given antigen, where the framework regions can be selected based on sequences with favorable physicochemical properties. Existing co-design methods require the simultaneous provision of both the structure and sequence of the framework regions; however, in practical antibody design, the structures are often unknown.

The forward process and reverse process for Co-Design of Antibody Sequences and Structures.
The forward process and reverse process for Co-Design of Antibody Sequences and Structures.

Model

IgGM Model Framework Diagram.
IgGM Model Framework Diagram.

IgGM is a generative model that performs simultaneous co-design of antibody sequence and structure. IgGM employs a multi-level network architecture. It first utilizes a pre-trained protein language model to extract evolutionary features of sequences. Then, a feature encoder studies the interactions between antigens and antibodies. Finally, a prediction module outputs the structures and sequences of the antibodies. IgGM leverages the interplay between sequence and structure to generate accurate antibody designs, even when only partial sequences of the framework region are available. This capability aligns with practical application scenarios and offers new possibilities for antibody design. IgGM excels at generating the CDR regions and their structures and can dock the generated structure to the corresponding epitope. It supports multiple design scenarios and can adapt to various conditions without the need for retraining, such as predicting antigen-antibody complexes, designing the CDR H3 region of antibodies, and designing multiple CDR regions. Furthermore, it can be extended to nanobodies, which are small single-domain antibodies that exhibit strong binding affinity to antigens and high stability. The experimental results indicate that IgGM achieves superior performance in multiple design tasks, demonstrating accuracy in structure prediction tasks that is comparable to existing structure prediction methods.

Design Example

Samples of generated antibodies and nanobodies by IgGM.
Samples of generated antibodies and nanobodies by IgGM.

Main Results

Complex structure prediction

MethodTM-Score↑lDDT↑RMSD↓DockQ↑iRMS↓LRMS↓SR↑
IgFold†→HDock0.95770.90192.17150.021816.651948.15710.0000
tFold-Ag*‡0.96340.91421.94890.25226.795721.03460.4068
AlphaFold 3‡0.97290.93051.50630.295110.964532.40800.3684
dyMEAN†*0.95720.88822.25210.10058.922727.42340.0667
IgGM†*0.95910.89562.19970.29866.219519.48880.4667
IgGM†*(AF3)0.95800.89412.14220.36303.863511.26470.6667

De novo design of antibodies for specific antigen

MetricsDiffAb (IgFold)DiffAb (AF3)MEAN (IgFold)MEAN (AF3)dyMEANIgGMIgGM (AF3)
AAR↑L10.5970.608--0.6330.7500.737
L20.5980.599--0.6340.7430.735
L30.4210.424--0.5700.6350.602
H10.6420.637--0.7420.7400.739
H20.3630.394--0.6270.6440.639
H30.2140.2260.2480.2460.2940.3600.330
RMSD↓L10.7830.749--0.8640.5890.659
L20.4710.466--0.4810.3780.395
L31.0021.017--0.9410.8470.903
H10.6500.623--0.6330.5550.590
H20.6410.586--0.7050.4860.566
H32.7412.6462.3572.3002.4542.1312.155
DockingDockQ↑0.0220.2080.0220.2070.0790.2460.326
iRMS↓17.0349.73116.8388.9689.6986.5794.030
LRMS↓48.16327.55948.10427.55728.76419.67811.229
SR↑0.0000.3680.0000.3540.0490.4330.627

Structure prediction of Nanobody

MethodTM-Score↑lDDT↑RMSD↓DockQ↑iRMS↓LRMS↓SR↑
tFold-Ag0.93440.93031.67220.28816.349015.08100.4296
AlphaFold 30.95190.92861.18850.286711.219432.67600.3885
IgGM0.93180.89311.99250.29077.987922.01680.4400

Design of Nanobody

MethodCDR1↑CDR2↑CDR3↑RMSD↓DockQ↑iRMS↓LRMS↓SR↑
DiffAb (AF3)0.5330.2910.1562.2740.21113.26535.8050.346
IgGM0.5650.3300.1831.9800.2676.92714.9660.415

Citation

    @inproceedings{
wang2025iggm,
title={Ig{GM}: A Generative Model for Functional Antibody and Nanobody Design},
author={Rubo Wang and Fandi Wu and Xingyu Gao and Jiaxiang Wu and Peilin Zhao and Jianhua Yao},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
url={https://openreview.net/forum?id=zmmfsJpYcq}
}