Research People Partners Internship About Us

Language Models for Molecule Generation

We build AI-enhanced algorithms for molecular design with the goal of reaching superhuman performance. Our aim is to create systems that generate drug candidate molecules with properties specified by medicinal chemists, either in a single step or through iterative optimization. We have developed language models that understand 2D molecular graphs and basic molecular properties, and we combined them with evolutionary algorithms to generate property-conditioned molecules beyond existing databases. We are currently extending these models with 3D molecular understanding and developing more realistic, practically relevant benchmarks for molecular design.

Language Models for Molecule Generation

Scaling Laws for LLM-based Molecular Optimization Algorithms

Towards Molecular Conformer Generation with Language Models

Small Molecule Optimization with Large Language Models

BARTSmiles: large-scale generative masked language models for molecular representations

Improved molecular representations for property prediction based on VAEs

Research

Lab

Donate