AI, Engineering Biology and Beyond 2026

15-16 January 2026, Bristol, UK
Version

Goto:

Talks

Keynotes and Spotlights

Toward AI-driven genetic circuit design

Caleb Bashor (Rice University, USA)

Engineering synthetic regulatory circuits with precise input-output behavior—a central goal in synthetic biology—remains encumbered by the inherent molecular complexity of cells. Non-linear, high-dimensional interactions between genetic parts and host cell machinery make it difficult to design circuits using first principles biophysical models. Adopting data-driven approaches that integrate modern machine learning (ML) tools and high-throughput experimental approaches into the synthetic biology design/build/test/learn process could dramatically accelerate the pace and scope of circuit design, yielding workflows that rapidly and systematically discern design principles and achieve quantitatively precise behavior. I will argue that the application of ML to circuit design can occur at three distinct scales, and describe work in my group making progress in each of them: 1) learning relationships between part sequence and function; 2) determining how part composition determines circuit behavior; 3) understanding how function varies with genomic/host-cell context. The work points toward a future where ML-driven genetic design is used to program robust solutions to complex problems across diverse biotechnology domains.

Generative and Multimodal AI for Protein Design and Interaction Screening

Haiping Lu (University of Sheffield, UK)

Generative AI is opening new possibilities for protein design, while interaction screening remains essential for prioritising candidates in engineering biology workflows. In this talk, I will present generative and multimodal AI methods for protein sequence design and protein–ligand interaction screening. I will highlight a diffusion-based approach for structure-guided protein sequence design and an interpretable model for protein–small-molecule interaction screening. Together, these examples illustrate how multimodal representations, rigorous evaluation, benchmarking, and deployment-aware design are critical for translating AI advances into reliable design–screen pipelines for engineering biology.

Scaling Genetic Tractability Across the Tree of Life

Henry Lee (Cultivarium, USA)

Engineering biology is limited by the small number of organisms we know how to manipulate. This talk presents Cultivarium’s approach to systematically identifying conditions for growth, DNA delivery, and functional molecular tools across diverse microbes. We build datasets toward foundation models for engineering biology, enabling inference of genetic tractability for organisms at scale. This capability expands where and how biology can be engineered on Earth and beyond.

AI for Synthetic Biology

Huimin Zhao (University of Illinois at Urbana-Champaign, USA)

Synthetic biology aims to design novel or improved biological systems using engineering principles, which has broad applications in medical, chemical, food, and agricultural industries. Thanks to rapid advances in artificial intelligence/machine learning (AI/ML) and laboratory automation, a new AI-driven autonomous experimentation paradigm for synthetic biology is rapidly emerging. In this talk, I will highlight our recent work on the development of AI/ML tools and a self-driving biofoundry to accelerate the design-build-test-learn cycle in synthetic biology. Examples include but are not limited to: (a) ECNet: a deep learning model for protein engineering; (b) CLEAN: an AI tool for enzyme function prediction; (c) EZSpecificity: an AI tool for enzyme substrate specificity prediction; (d) design of novel mitochondrial targeting sequences using generative AI, and (e) BioAutomata: an AI-powered self-driving biofoundry for protein engineering, pathway engineering, and metabolic engineering. These advances will lead to the development of a new autonomous experimentation paradigm for basic and applied biology and medicine.

Molecular Machine Learning for Sustainable Molecular Designs

Jana M. Weber (TU Delft, The Netherlands)

Designing sustainable molecules requires navigating vast design spaces with sparse information. In this talk, I will present key building blocks of molecular machine learning and show how they enable tailored exploration of chemical space. I will illustrate these concepts through two case studies. The first focuses on the design of photocatalytic polymers, highlighting challenges in polymer representation, predictive modelling with limited data, and inverse design strategies for discovering promising candidates. The second examines what is needed for reaction system evaluation and optimisation of these in vast chemical space.

Cell-machine interfaces for control and learning

Jean-Baptiste Lugagne (University of Oxford, UK)

Engineering biology increasingly depends on the integration of data, models, and automation. Advances at the intersection of synthetic biology, control theory, and machine learning have led to the development of cell-machine interfaces: experimental platforms that directly couple living cells to computers through microfluidics, automated microscopy, and optogenetics. These cybergenetic systems establish a foundation for real-time learning and closed-loop control of cellular behaviour. In the first part of this talk, I will present how such interfaces have been used to dissect and manipulate antibiotic-resistance dynamics in bacteria. By combining data-driven control algorithms, high-throughput imaging, and single-cell optogenetics, we were able to precisely regulate gene expression in thousands of individual cells and generate rich datasets that reveal the regulatory principles underpinning tolerance and resistance acquisition. I will then describe how my group is extending this strategy to new biological systems and new application domains. In particular, I will present how we plan to use such platforms to emulate real-world, time-varying conditions in high throughput and at the microfluidic scale for training and validating predictive models, notably for biomanufacturing and scale-up. Finally, I will discuss how these interfaces create a bridge between biological experimentation and advanced AI/ML methods, and support autonomous experimentation and optimisation of biological systems. This work aims to establish experimental-computational infrastructures that enable AI to directly interact with, learn from, and ultimately design biological systems across scales.

Open-Ended, Quality Diversity, and AI-Generating Algorithms in the Era of Foundation Models

Jeff Clune (University of British Columbia, Canada)

Foundation models (e.g. large language models) create exciting new opportunities in our longstanding quests to produce open-ended and AI-generating algorithms, wherein agents can truly keep innovating and learning forever. In this talk, I will introduce quality diversity, open-ended, and AI-generating algorithms and share some of our recent work harnessing the power of foundation models to unleash their potential. I will cover our recent work including OMNI (Open-endedness via Models of human Notions of Interestingness), Video Pre-Training (VPT), Automatically Designing Agentic Systems (ADAS), the Darwin Gödel Machine, and The AI Scientist.

A Slice of Infinity: Building Robust, Interpretable AI for Open-ended Biodesign

Michiel Stock (Ghent University, Belgium)

The goal of synthetic biology is to design new-to-nature components, pathways, and organisms, often by combining, adapting, or changing existing biological parts in creative ways. Any computational tools to aid the biodesign process must be flexible, composable, and adaptable, as biology itself is. Hyperdimensional computing (HDC) is a promising brain-inspired learning paradigm that leverages the peculiar properties of high-dimensional spaces. In this talk, we explore how HDC can be used to efficiently learn from multi-modal, structured data and generate interpretable, robust models for biodesign.

Bottlenecks to Biological Progress

Niko McCarty (Asimov Press, USA)

Our ability to predictably engineer biology will greatly improve in the 21st century, aided in part by AI. These advances could prove useful in creating a more flourishing future; not only for medicine, but also for animal welfare, terraforming planets, and so on. In this talk, I discuss two classes of bottlenecks that I think are limiting biology progress, explain how they are being solved today, and discuss the good outcomes which bioengineers can help build as a result.

Engineering Swarms Across Scales

Sabine Hauert (University of Bristol, UK)

Building on 20 years of progress, swarm engineering is now ready to enable out-of-the-box solutions in real-world environments that adapt, scale, and are robust. At the nano- and micro-scale, these swarms can interface with the body, enabling applications in cancer treatment, wound healing, or tissue engineering. At the macro scale, robot swarms leverage AI, integrate advanced local perception, and share information not only locally but also quasi-globally to coordinate seamlessly in environments such as construction sites, farms, logistics hubs, and natural ecosystems. The goal is to foster a symbiotic ecosystem where next-generation robot swarms collaborate with each other --- and with nature --- at scale.

RNA strand exchange circuits as a general-purpose molecular programming language for synthetic biology

Samuel Schaffter (National Institute of Standards and Technology, USA)

A major goal of synthetic biology is to program biological systems with the same precision with which we program electronic devices, ultimately enabling the next generation of diagnostics, therapeutics, and biotechnologies. Great strides have been made, but a general challenge is most molecular programming paradigms are developed to operate in a narrow set of environments or applications. Further, sensing, information processing, and signal transduction are often coupled within a single device or design, making interoperability difficult. In contrast, in electronic computing inputs are converted into a universal machine language for information processing, and outputs of the machine instructions are mapped back to an application specific response. This enables seamless communication and integration across devices and applications. To enable similarly broad programmability of biological systems, we are developing a molecular equivalent of these machine instructions based on genetically encoded RNA strand exchange circuits called ctRSD circuits. RNA strand exchange reactions are easily programmed via predictable base pairing interactions universal across cell types, enabling wide operability. Further, in vitro nucleic acid strand exchange reactions are the most scalable and programmable biomolecular systems to date, demonstrating, for example, neural network-based classification of hundreds of inputs. I will provide an overview of our work operating ctRSD circuits across different environments and connecting these RNA circuits to different classes of inputs (metabolites and proteins) and outputs (proteins). Lastly, I will discuss the additional measurements needed to fully characterize circuit performance and properly inform AI/ML models that could ultimately automate the molecular circuit design process.

Contributed Talks

Deep mutational learning for effective enzyme engineering

Alperen Dalkiran (University of Edinburgh, UK)

Deep learning is transforming our ability to engineer proteins by identifying the complex rules that map genotype to phenotype, reducing costly rounds of trial-and-error experimentation. In this talk, I will present our deep mutational learning system designed to accelerate enzyme optimization. We first learned a high-quality fitness landscape of peroxidase activity of human myoglobin, using a yeast display assay of more than 6,000 variants, protein language models and supervised neural networks. Using the pretrained model, we computationally screened more than 4M unseen double mutants, and selected 20 of the highest-scoring myoglobin variants for experimental validation. This approach led to a 100% validation success rate, with every tested variant showing improved peroxidase function compared to the wild type. Our best variant displayed a nearly 5-fold improvement in catalytic efficiency. Furthermore, we validated the superior performance of the three top variants in soluble form. Our work presents a robust and scalable framework for data-driven enzyme engineering. It demonstrates how protein language models can be utilized to guide the discovery of high-performance biocatalysts, which can then be leveraged to construct novel metabolic pathways. I will discuss current extensions of these technologies to design novel gene regulatory elements in cyanobacteria for environmental applications as part of the CYBER Mission Award in Engineering Biology.

Generative design and construction of functional plasmids with a DNA language model

Angus Cunningham (University College London, UK)

DNA language models are emerging as a new approach for biological sequence generation, but their practical capabilities remain unclear. In this work, we explore whether such models can move beyond producing plausible DNA sequences to generating plasmids that are experimentally testable. We focus on Escherichia coli plasmids as a tractable system and present an end-to-end workflow spanning large-scale plasmid data curation, model fine-tuning, sequence generation, and bioinformatic screening. Using a GPT-style DNA language model, we generate synthetic plasmid backbones under both minimal and function-conditioned prompts, followed by stringent filtering to identify viable candidates. Selected designs were synthesised and evaluated in vivo, assessing growth, antibiotic resistance, and reporter expression. This work aims to probe the current limits of DNA language models and highlight both their potential and constraints as tools for generative biological design.

Computational inference of complex feedstock composition

Charlotte Merzbacher (Differential Biology GmBH, Germany)

Increasingly, cellular production systems use complex plant biomass feedstocks containing many different macromolecules as nutrient sources. Understanding and modeling how diverse carbon sources are utilized by host metabolism is important for the valorization of these feedstocks. However, there are few existing computational methods for estimating the metabolic inputs and impacts of complex feedstocks absent costly experimental data from mass spectrometry. In this work, we present the first computational method to infer how complex carbon sources integrate into microbial metabolism based on limited growth assay data alone. The method uses flux balance analysis on a genome-scale model of Escherichia coli metabolism with a top-layer optimization routine to fit model exchange parameters to experimental growth rate data. We train a machine learning surrogate model to select candidate exchange values; once selected, optimal values can be used to predict the effects of complex feedstocks on microbial metabolism. We explore the effects of various perturbations to cellular growth as well as to explore the high-dimensional shape of the metabolic space. Our method allows for forward-looking feedback composition modelling which can inform industrial decision-making and accelerate bioprocess optimization.

Reprogramming Development through Morphogenetic Perturbations

Elias Najarro (IT University of Copenhagen, Denmark)

Developmental systems rely on a variety of mechanisms to robustly produce their target morphologies. One such mechanism is morphogenetic gradients: non-uniform concentrations of biomolecules that scaffold patterning. We first present a computational model showing how morphogenetic gradients enable symmetry breaking in isotropic, purely local developmental programs. We then demonstrate how morphogenetic fields can be hijacked to steer development toward novel patterns through targeted perturbations—without modifying the developmental rules themselves.

Iterative NAND Hybrid Riboswitch Design by Deep Batch Bayesian Optimization

Erik Kubaczka (TU Darmstadt, Germany)

Designing large genetic circuits requires genetic regulatory devices that perform complex logic operations without placing an excessive metabolic burden on the host. Hybrid riboswitches are synthetically enhanced, compact RNA elements (less than 100 nucleotides) that form a tertiary structure and can specifically bind multiple different target molecules. They can be used to design genetic regulators that emulate Boolean logic. However, their functionality is sensitive to even single-nucleotide sequence changes, posing a challenge to targeted engineering. This study's goal is to design 2-input hybrid riboswitches that emulate Boolean NAND logic in yeast. We therefore developed a novel sequence design framework that combines in vivo screenings with uncertainty-aware batch Bayesian optimization, enabling score-driven sequence design. Through an initial screening, we discovered a hybrid riboswitch exhibiting NAND behavior. We then further engineered the riboswitch sequence by iteratively refining it in a design-build-test-learn (DBTL) cycle. Core to this is Bayesian optimization with an ensemble sequence-to-function surrogate trained on in vivo characterization data and pre-trained on custom riboswitch sequencing data. Ultimately, the hybrid riboswitch exhibits near digital NAND behavior. With its focus on model-based and score-driven design, our proposed method can complement experiment-driven approaches by enabling fine-grained adaptation of functionality, including constructs sensitive to single-nucleotide changes.

Development of an AI-Native Open Data Lakehouse for Automated Synthetic Biology

Matt Burridge (Newcastle University, UK)

Synthetic Biology is a highly interdisciplinary field that generates both computational and physical experimental data. To truly understand the result of an experiment, you must integrate information from every stage of an experimental workflow. Advances in lab automation and AI-driven experimental design have increased both the scale and complexity of these workflows. Biofoundries provide integrated infrastructure to handle this, however many laboratories rely on isolated instruments and software systems that operate as disconnected silos. These systems often produce heterogeneous data formats with limited structured metadata, hindering data integration, reproducibility, and downstream computational analysis. To address this, we present a data architecture that combines RO-Crates, domain ontologies and a vendor-agnostic data lakehouse paradigm. We demonstrate this architecture through a semi-automated enzyme characterisation use case, ingesting data from a metagenomic enzyme discovery workflow, spanning computational pipelines and automated laboratory equipment. Within the lab, we developed a lightweight automation data hub for RO-Crate ingestion, a Rust-based library (ro-crate-rs), and a labware barcoding system to capture provenance during both automated and manual work that is backed by PROV-O based data models. This open-source, vendor-independent approach enables robust, trustworthy, machine-readable wet-lab datasets that can be integrated with computational data to support future engineering biology workflows.

Protein design enables highly accurate sequence-to-expression models for an integral membrane protein

Paul Curnow (University of Bristol, UK)

The recombinant expression of integral membrane proteins is a major bottleneck across the biosciences. This includes industrial and academic research in bioengineering and synthetic biology. Computational sequence-to-expression models would be potentially transformative in addressing this challenge but the data needed to build such models is absent. Here, we describe a novel approach to generating ML-ready expression data that exploits membrane protein libraries derived from computational sequence design. We determine the recombinant expression phenotype of this library in E. coli, obtain gene sequences exhibiting high or low expression, and use this new dataset to develop very accurate predictive classifiers. We find that these sequence data are compatible with different embeddings and ML architectures, including explainable models, and use feature analysis to reveal that particular amino acid residues exert a strong influence on recombinant protein production. This residue-level insight is validated experimentally via model-guided engineering of membrane protein expression levels. While the results presented here are specific to our training data and conditions, we propose that similar methods could be applied to other membrane proteins to derive a general model of the expression fitness landscape in common recombinant chassis.

Upscaling Bacteriophage Therapy: AI and synthetic biology driven framework

Victor Németh (Ghent University, Belgium)

The rise of antibiotic resistance poses a significant global health threat, necessitating innovative solutions like phage therapy. Phages, viruses that target specific bacteria, offer a promising alternative due to their inexhaustible evolution potential, specificity and synergy with most antibiotics. However, two main challenges hinder their efficacy: the time-intensive process of identifying suitable phages for infections and the rapid development of bacterial resistance. My project addresses these challenges through leveraging the advances in synthetic biology and an evolution-aware machine learning framework. The latter incorporates protein language models, phage-host interaction classifiers and evolutionary algorithms to design improved receptor-binding proteins, which are key determinants of phage infectivity and host range. The resulting databases and models aid in the transition from the current empirical phage screening methods toward knowledge-driven bio design of engineered phages with improved and adaptable antibacterial activity.

Translation and Impact

FateView: AI-Powered, non-destructive live cell analytics and cell fate forecasting

Christopher Gribben (CellVoyant, UK)

A major bottleneck in engineering biology and cell manufacturing is that critical quality attributes (e.g., identity, state, and trajectory) are often assessed using destructive, end-point assays. This creates long feedback loops, limits the ability to intervene during culture, and increases the risk of batch failure during scale-up. Here, we present FateView, an AI-native approach for non-destructive, label-free monitoring of cell fate decisions during differentiation and manufacturing. FateView combines longitudinal in-process measurements (e.g., brightfield imaging and process metadata) with machine-learning models that first detect and segment cells, then extract post-detection morphological and dynamic features over time to predict downstream outcomes ahead of traditional QC. This approach allows earlier decisions that improve consistency, reduce cost, and support GMP-aligned manufacturing workflows.

Twist Bioscience - Changing How We Explore Sequence Space In The AI Era

Sarah Jackson (Twist Bioscience, USA)

Learn about how Twist Bioscience is supporting bioengineers in creating the biotechnologies of the future.

Biohm – Invention Led by Nature

Vittorio Bartoli (BIOHM, UK)

Biohm is a multi-award winning invention house where radical deep tech is conceived, incubated, evolved, and spun out. We are revolutionising industrial and economic systems to secure the future of humanity without impeding other species' evolutionary pathways. Hear more about how we have used our approach to develop Terraphyll --- the world's first high-performance, carbon-negative and scalable insulation platform technology engineered from agricultural waste and mycelium.

AI, Engineering Biology and the BBSRC - a funder’s perspective

Daniela Hensen (UKRI-BBSRC, UK)

The presentation will outline BBSRC’s approach towards support for AI, engineering biology and their convergence. We will share high level insights from our current portfolio of activities and outline our thinking for the future of the field.

AI, Engineering Biology and Bristol - a regional perspective

Kerstin Kinkelin & Kathleen Sedgley (University of Bristol, UK)

The talk will provide an overview of Bristol’s vibrant innovation ecosystem: from training the next generation of leaders in AI x Engineering Biology, supporting the latest innovative research in this space, to developing a regional hub. We will showcase the breadth of innovation at Bristol and beyond, and how we work with partners across sectors to drive AI x Engineering Biology innovation.

Posters

  1. Optimization of regulatory DNA with active learning
    Yuxin Shen (University of Edinburgh, UK)

  2. An atomistic graph-based computational platform for prioritisation of allosteric and functional residues across the proteome
    Long-Hung Pham (Imperial College London, UK)

  3. Machine-learning driven Bayesian inference of cell motility
    Daniel O'Hanlon (Imperial College London, UK)

  4. A Geometry-Based Approach for Computing Protein Transition Pathways from Molecular Dynamics Samples
    Zhaocheng Liu (University of Edinburgh, UK)

  5. Testing reversible disassembly of ferritin nanocages for cargo encapsulation
    Eli Hughes (University of Bristol, UK)

  6. Benchmarking AI Structure Prediction Tools for TCR-pHLA Drug Design
    Tania Gardasevic (University of Bristol, UK)

  7. Adapting Machine Learning Models to Noisy, Small-Sample Biological Data
    Chaithali Shashikanth Kundapur (University of Bristol, UK)

  8. Engineering keto reductases for improved stability with in silico tool EvoSelect®
    Lauren Turner (Isomerase Therapeutics Ltd.)

  9. Evolutionary algorithms to design biocomputing circuits beyond logic gates
    Lewis Grozinger (CNB-CSIC, Spain)

  10. Evolutionary Algorithm based Design and Control of Nature-Inspired 4D-Printed Hygromorphic Actuators
    Charles de Kergariou (University of Bristol, UK)

  11. Data driven design of combination therapies for non small cell lung cancer
    Antonella La Regina (University of Bristol, UK)

  12. SeqProFT: Sequence-only Protein Property Prediction with LoRA Finetuning
    Shuo Zhang (University of Birmingham, UK)

  13. Computational De-Risking of DNA Origami Assembly via Multi-Objective Off-Target Scoring
    Silvia Adriana Navarro (Newcastle University, UK)

  14. CelFDrive: Artificial Intelligence assisted microscopy for automated detection of rare events
    Scott Brooks (University of Warwick, UK)

  15. Accurate and efficient multiscale modelling of enzymatic Diels-Alder reactions with machine learning potentials
    Magdalena Kazmierczak (University of Bristol, UK)

  16. From Natural Modularity to Full Programmability: AI-Shaped Design Principles for Customizable Transcription Factor Biosensors
    Brecht De Paepe (Ghent University, Belgium)

  17. Computational prediction of protein-protein interactions to engineer microbial chitooligosaccharide cell factories
    Hamza Malik (Ghent University, Belgium)

  18. Learning the language of membrane targeting: AI-driven optimisation of plant cytochrome P450s in bacteria
    Miguel De Block (Ghent University, Belgium)

  19. Automating high-throughput protein production by synthetic biology and AI
    Shan Jiang (Ailurus Biotech)

  20. Boundary-Aware Metabolic-Informed Neural Networks for Multi-Substrate Phenotype Prediction and Strain Design
    Wenxing Ji (Newcastle University, UK)

  21. AdipoMap: Advancing Body Fat Assessment Beyond BMI Using AI and Whole-Body MRI
    Yi Yin (University of Bristol, UK)

  22. Computation and Communication Cooperation for Molecular Network
    Jichun Li (Newcastle University, UK)

  23. State-Aware Functional Annotations for Multi-State Design of Dynamic Proteins
    Wenrui Fan (Sheffield University, UK)

  24. Evolving complex gene regulation via the SCRaMbLE'ing of neo-chromosomes
    Qing Hsuan Ong (University of Bristol, UK)

  25. A Golden Gate compatible CRISPR-associated transposon tool for multiplexed genome editing
    Thea Irvine (University of Bristol, UK)

  26. Integrating optogenetic control for Vibrio natriegens
    Max Shail (University of Bristol, UK)

  27. Evaluating Genome Integration Architectures for Tunable Genetic Circuits in E. coli
    Riesa Rohmat (University of Bristol, UK)

  28. Exploring the Hidden Mechanisms of NK–Tumour Cytotoxic Interactions with Bayesian Inference
    Elephes Fanxin Sung Song (Imperial College London, UK)

  29. Using genetic programming and SINDy-PI to discover mathematical models from data
    Maria Tasca (University of Bristol, UK)

  30. CMA: Algorithmic Enhancement of Soluble Functional Protein Expression for Protein Engineering
    Vaibhav Tyagi (Northumbria University, UK)

  31. Towards high-throughput transcriptional termination screening for training datasets
    Felipe Buson (University of Bristol, UK)

  32. Construction of hyper-transcription E. coli strains with inducible control over three orthogonal phage RNA polymerases
    Shivang Joshi (University of Bristol, UK)

  33. On Flower Design
    Nicholas Desnoyer (The Sainsbury Laboratory, UK)

  34. Dataset Generation for Machine Learning-Guided Genome Design in Yeast
    Livia Soro (Imperial College London, UK)

  35. Exploring Respiratory Stress Tolerance in an Obligate Aerobe Yeast using Adaptive Laboratory Evolution (ALE)
    Mohamed Al Marei (Imperial College London, UK)

  36. Model-Based PD-1 High/Low Gating in CD8 T Cells for Tumour Microenvironment Profiling from Single-Cell Surface Proteomics
    Moahmmod Suvon (Sheffield University, UK)

  37. High-throughput biochemical assays and sequencing characterise diverse bacterial communities enriched on plastic additives
    Matthew Tarnowski (Swansea University, UK)

  38. Interpretable Protein Language Models and Batch Bayesian Optimization for the Design of a Novel Class of Antimicrobials
    Steff Taelman (Ghent University, Belgium)

  39. Automated Recognition of Doctor's handwritten prescriptions using R-CRNN and CTC
    Garima Saroj (GitLife Biotech)