In the realm of energy and biotechnology, a team of researchers from the University of Chinese Academy of Sciences and the University of Macau has made significant strides in enzyme design, a field with profound implications for various industries, including energy. The team, comprising Zefeng Lin, Zhihang Zhang, Weirong Zhu, Tongchang Han, Xianyong Fang, Tianfan Fu, and Xiaohua Xu, has developed a novel framework called EnzyPGM that could revolutionize the way we design enzymes for specific substrates.
Enzyme design is a critical challenge in protein engineering, as the catalytic activity of enzymes depends on the precise interaction between their binding pockets and substrates. Current generative models, while dominant in functional protein design, fall short in modeling these pocket-substrate interactions, limiting the creation of enzymes with precise catalytic environments. To address this, the researchers proposed EnzyPGM, a unified framework that jointly generates enzymes and substrate-binding pockets conditioned on functional priors and substrates. This focus on accurate pocket-substrate interactions sets EnzyPGM apart from existing models.
At the heart of EnzyPGM are two main modules: the Residue-atom Bi-scale Attention (RBA) and the Residue Function Fusion (RFF). The RBA module jointly models intra-residue dependencies and fine-grained interactions between pocket residues and substrate atoms. Meanwhile, the RFF module incorporates enzyme function priors into residue representations, enhancing the model’s ability to design enzymes with specific functions. To train and validate their model, the researchers curated EnzyPock, an extensive enzyme-pocket dataset comprising 83,062 enzyme-substrate pairs across 1,036 four-level enzyme families.
The results of extensive experiments on EnzyPock demonstrate that EnzyPGM achieves state-of-the-art performance. Notably, EnzyPGM reduces the average binding energy by 0.47 kcal/mol compared to EnzyGen, a previous model, showcasing its superior performance in substrate-specific enzyme design. This advancement could lead to more efficient and precise enzyme design, with potential applications in various industries, including energy, where enzymes play a crucial role in processes like biofuel production and biomass conversion.
The researchers plan to release the code and dataset, which will further facilitate research in this field. This work was published in the journal Nature Communications, a reputable source for cutting-edge scientific research. As the energy industry continues to seek sustainable and efficient solutions, advancements in enzyme design like EnzyPGM bring us one step closer to achieving these goals.
This article is based on research available at arXiv.

