24 months

Development of ML/MM architectures for enzymatic reactivity and applications to rubisco

Development of ML/MM architectures for enzymatic reactivity
Nicolas Chéron - Contact
24 rue Lhomond, 75005 Paris

A two-year-funded post-doctoral position is available to work in the CPCV laboratory from Ecole Normale Superieure, Sorbonne University and CNRS, in Paris (France).

Context: The study of small chemical systems has been revolutionized in recent years by machine learned interatomic potentials (MLIP), and questions that were impossible to tackle five years ago can now be addressed. The state-of-the-art approach for MLIP applied to chemical reactivity implies performing quantum calculations on the full simulation box, which has two issues: (i) the MLIP are system-specific and cannot be easily extrapolated to a related system, (ii) it is computationally prohibitive for enzymes. Thus, enzymatic reactions are still mainly studied with QM/MM calculations or QM/MM-MD simulations. These approaches cannot be used on a large scale due to their cost, but also, they can’t capture global motions.

Objectives: The goal of the current project is to develop MLIP that will replace QM/MM, leading to ML/MM simulations that will be usable for enzymatic reactions. A key step in using MLIP for reactivity is having a training set that includes structures of reactant, product and transition state: this construction can be achieved through an active learning procedure such as the one included in the ArcaNN1 software that was developed in our laboratory. This project will merge an architecture that focused on ML/MM for spectroscopy2 to ArcaNN, aiming at smoothing the creation of a diverse training set with enhanced and biased sampling simulations.

As a proof-of-concept, we will first study the carboxylation step catalyzed by the enzyme rubisco,3 which happens to be the most abundant protein on Earth. Our goal will be to compute the free energy barrier for the wild-type and for mutants obtained through directed evolution,4 aiming at providing chemical insights in these recent results. We will then expand the study towards computing the full free energy profiles of both the carboxylation and the oxidation reactions. Finally, we will compare orthologs and explain why different species can display very different kinetic traits for rubisco.

Keywords: machine learning, enzymatic reactions, QM/MM, reactivity, MD simulations, CO2.

Profile: The candidate should have a PhD in computational (bio)chemistry and expertise in MD simulations, quantum chemistry or machine learning. Knowledge of biosystems, analysis skills (Python), scripting skills (bash and/or Python) and machine learning are assets.

Environment: The project will take place in the CPCV laboratory (ENS-PSL, SU, CNRS), hosted by the Ecole Normale Supérieure in the Latin Quarter of Paris. The theoretical chemistry group of CPCV is multidisciplinary in terms of methods and applications, and is widely recognized for its activities. The group welcomes 15 to 20 interns, PhD students, and post-doctoral fellows. During the project, we will collaborate with Guillaume Stirnemann and Damien Laage who will bring their expertise in ML, and with Julien Henri who will bring his expertise on photosynthesis.

Contact: Interested candidates can contact Nicolas Chéron (nicolas.cheron@ens.psl.eu) to get more information. Applicants must send a CV, a motivation letter, and the name of references. Please include “[Application]” in the object from the email. Emails written with ChatGPT or equivalent will not be answered.

References: [1] David et al., Digit. Discov. 2025, 54-72. [2] Mazzeo et al., Digit. Discov. 2024, 2560-2571. [3] Prywes et al., Ann. Rev. Biochem. 2023, 385-410. [4] Wilson et al., J. Biol. Chem. 2018, 18-27.

Autres Recrutements

Return to recruitment list
1 an

Gestionnaire administratif et financier - Chimie

Dans un environnement mutualisé des fonctions supports au sein du département de chimie, réaliser des activités de gestion administrative et comptable pour la communauté des UMR8004 IMAP, UMR8640 PASTEUR et UMR 7203 LBM en tenant compte de l'organisation collective.