See how AI predicts cell morphology from gene expression—then apply the same predict-first approach to cut lab costs, timelines, and energy use.

Predict Cell Morphology With AI—Before You Run Tests
A single high-content imaging campaign can burn through weeks of instrument time, thousands of plates, and a lot of electricity—incubators, robotic handlers, microscopy lasers, GPUs for image analysis, HVAC for clean rooms. Most teams accept this as the cost of doing phenotypic drug discovery.
I don’t think that assumption holds up anymore.
A new class of generative AI models for biology is starting to answer a blunt question: What if you could preview what cells will look like after a perturbation before you run the experiment? A recent research effort dubbed MorphDiff uses a diffusion model conditioned on gene expression to generate realistic multi-channel microscopy images. It’s built for drug discovery, but the interesting lesson for this series—AI in Pharmaceuticals & Drug Discovery—is bigger: predict-first workflows are becoming practical. And the same logic maps cleanly to energy and utilities: you don’t optimize a grid by testing every configuration in the field; you simulate, prioritize, then verify.
Below: what MorphDiff is, why it matters, and how the “predict-then-confirm” pattern can cut both R&D cost and lab energy use, including in biotech areas tied to the energy transition (biofuels, biomanufacturing, carbon utilization, and enzyme engineering).
MorphDiff in plain terms: transcriptome-to-microscopy
MorphDiff predicts cell morphology from transcriptomics. Concretely, it takes an L1000 gene expression signature (a vector describing which genes are up- or down-regulated after a perturbation) and generates what a Cell Painting image would look like after that perturbation.
That flips a familiar workflow:
- Traditional phenotypic screening: run perturbation → image cells → extract features → interpret mechanism
- MorphDiff-style workflow: measure or obtain gene signature → generate predicted morphology → prioritize what to image for confirmation
The practical appeal is immediate. There’s far more gene-expression data available at scale than paired “gene signature + high-quality morphology image” datasets. If your new compound already has (or can cheaply get) a transcriptomic profile, you can use that as the conditioning signal to simulate phenotypes you otherwise would’ve had to image.
What it actually generates
MorphDiff supports two modes:
- Gene-to-image (G2I): start from noise and generate a post-perturbation image conditioned on the gene expression signature.
- Image-to-image (I2I): take a control image and “push” it toward the perturbed state using the gene signature.
The second mode is underrated. When scientists argue about whether a generative model is “right,” the I2I framing anchors the change relative to a baseline. That’s closer to how lab teams reason: What changed, and is that change consistent with the biology we expect?
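To make the distinction concrete, here’s a toy sketch of the two modes. Everything in it is illustrative: the function names are hypothetical, the denoiser is a stand-in for a real trained model, and the 978-dimensional vector mirrors the landmark-gene count of the L1000 assay.

```python
import numpy as np

rng = np.random.default_rng(0)

def denoise_step(x, t, signature):
    """Placeholder for one reverse-diffusion step conditioned on the
    gene signature. A real model would predict and remove noise here."""
    return x - 0.02 * t * x + 0.001 * signature.mean()  # toy dynamics only

def generate_g2i(signature, shape=(5, 64, 64), steps=50):
    """Gene-to-image: start from pure noise, denoise under conditioning."""
    x = rng.standard_normal(shape)
    for t in reversed(range(steps)):
        x = denoise_step(x, t, signature)
    return x

def generate_i2i(control_img, signature, strength=0.6, steps=50):
    """Image-to-image: partially re-noise a control image, then denoise
    it toward the perturbed state (SDEdit-style editing)."""
    start = int(steps * strength)  # how far back toward noise to go
    x = control_img + rng.standard_normal(control_img.shape) * strength
    for t in reversed(range(start)):
        x = denoise_step(x, t, signature)
    return x

signature = rng.standard_normal(978)  # L1000-sized conditioning vector
perturbed = generate_g2i(signature)
edited = generate_i2i(rng.standard_normal((5, 64, 64)), signature)
```

Note how I2I starts partway along the noise schedule rather than from scratch: the control image anchors everything the perturbation didn’t change.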
Why diffusion models fit biological images better than most people expect
Diffusion models are good at making small, structured changes without collapsing diversity. That’s a big deal for cell morphology, where the signal isn’t a single obvious object but subtle shifts across multiple channels—textures, granularity, compartment boundaries, correlations between stains.
MorphDiff’s design has two key pieces:
- A morphology VAE compresses 5-channel microscopy images into a latent representation while preserving perceptual fidelity.
- A latent diffusion model denoises in that latent space while being guided by the L1000 gene expression vector (using attention).
This architecture is essentially saying: learn a good “language” for cell images first, then learn how gene-expression conditions steer that language. That’s a sensible stance for lab data, which is noisy, batch-affected, and never as clean as the datasets used for consumer image generation.
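Here’s a minimal PyTorch sketch of that two-piece design, assuming toy dimensions throughout (5 channels, 64×64 images, a 978-gene vector). It does not reproduce MorphDiff’s actual architecture; it just shows the shape of “VAE latent + gene-conditioned attention denoiser”:

```python
import torch
import torch.nn as nn

class MorphVAE(nn.Module):
    """Toy stand-in for the morphology VAE: compresses a 5-channel
    image into a small latent grid and reconstructs it."""
    def __init__(self, channels=5, latent=4):
        super().__init__()
        self.enc = nn.Conv2d(channels, latent, 4, stride=4)    # 64x64 -> 16x16
        self.dec = nn.ConvTranspose2d(latent, channels, 4, stride=4)

    def encode(self, x):
        return self.enc(x)

    def decode(self, z):
        return self.dec(z)

class ConditionedDenoiser(nn.Module):
    """Toy latent denoiser that attends to the gene-expression vector."""
    def __init__(self, latent=4, dim=64, gene_dim=978):
        super().__init__()
        self.embed = nn.Conv2d(latent, dim, 1)
        self.gene_proj = nn.Linear(gene_dim, dim)
        self.attn = nn.MultiheadAttention(dim, num_heads=4, batch_first=True)
        self.out = nn.Conv2d(dim, latent, 1)

    def forward(self, z, genes):
        b, _, h, w = z.shape
        tokens = self.embed(z).flatten(2).transpose(1, 2)      # (B, HW, dim)
        cond = self.gene_proj(genes).unsqueeze(1)              # (B, 1, dim)
        attended, _ = self.attn(tokens, cond, cond)            # cross-attention
        attended = attended.transpose(1, 2).reshape(b, -1, h, w)
        return self.out(attended)                              # predicted noise

vae, denoiser = MorphVAE(), ConditionedDenoiser()
img = torch.randn(2, 5, 64, 64)
genes = torch.randn(2, 978)
z = vae.encode(img)
noise_pred = denoiser(z, genes)        # one conditioned denoising step
recon = vae.decode(z - noise_pred)
```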
The part that matters for drug discovery: mechanism-of-action retrieval
Pretty images don’t get budgets approved. Downstream utility does.
MorphDiff’s most compelling promise is that simulated morphology can support mechanism-of-action (MOA) retrieval—finding reference compounds that behave similarly even if their chemical structures don’t.
In the reported results, MorphDiff-generated morphologies:
- outperform using transcriptomics alone for MOA retrieval, and
- approach the performance you’d get with real images.
The practical interpretation: simulated phenotypes contain complementary information—not just “style transfer,” but signal that helps separate mechanisms.
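Retrieval itself is simple once you have feature profiles. Here’s a sketch with synthetic data, assuming cosine nearest-neighbor lookup against a reference library of known MOAs; the feature dimensions and majority-vote rule are illustrative choices, not MorphDiff’s evaluation protocol:

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(1)

# Hypothetical feature profiles (e.g., CellProfiler-style features or
# embeddings) for a reference library with known MOA labels.
ref_profiles = rng.standard_normal((500, 128))
ref_moa = rng.integers(0, 10, size=500)          # 10 known mechanisms

# Profiles extracted from *generated* morphologies of query compounds.
query_profiles = rng.standard_normal((20, 128))

nn = NearestNeighbors(n_neighbors=5, metric="cosine").fit(ref_profiles)
_, idx = nn.kneighbors(query_profiles)

# Majority vote over retrieved neighbors = predicted mechanism.
for q, neighbors in enumerate(idx):
    votes = np.bincount(ref_moa[neighbors])
    print(f"query {q}: predicted MOA class {votes.argmax()}")
```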
If you run an early discovery team, that changes how you spend money:
- Use transcriptomics (often cheaper and faster than large-scale imaging) to create signatures.
- Generate predicted morphology profiles.
- Cluster and prioritize only the most informative or novel phenotypic neighborhoods for real imaging.
That is a direct analog to how utilities prioritize field work: you model first, then roll trucks for the cases with the highest uncertainty or highest economic impact.
The energy angle: “predict-then-confirm” reduces lab energy waste
Wet labs are energy-intensive, and imaging-heavy workflows are particularly demanding. Even without quoting facility-specific numbers, the drivers are obvious:
- microscopes and lasers running for days
- incubators and environmental controls maintaining tight temperature and CO₂
- robotics and plate handlers
- downstream compute for segmentation, embeddings, and QC
A predict-first screening funnel can reduce the number of plates you run and re-run.
Here’s a concrete, operational way to think about it (a back-of-envelope sketch follows the list):
- If you can cut confirmatory imaging by 30–50% because you’re only imaging prioritized compounds, that’s not just time saved.
- It’s fewer instrument hours, less cold storage cycling, lower clean-room HVAC load, and fewer “just in case” repeats when the first batch is noisy.
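The arithmetic is easy to run yourself. Every number below is made up for illustration; swap in your own plate counts, instrument power draw, and facility overhead:

```python
# Back-of-envelope savings with illustrative numbers only: adjust
# plate counts, hours, and power draw to match your own facility.
plates_per_campaign = 2000
imaging_hours_per_plate = 0.5
instrument_kw = 1.5          # microscope + robotics, rough average draw
hvac_overhead = 0.8          # extra facility load per instrument kWh

for cut in (0.3, 0.5):       # 30% and 50% cuts to confirmatory imaging
    plates_saved = plates_per_campaign * cut
    hours_saved = plates_saved * imaging_hours_per_plate
    kwh_saved = hours_saved * instrument_kw * (1 + hvac_overhead)
    print(f"{cut:.0%} cut: {plates_saved:.0f} plates, "
          f"{hours_saved:.0f} instrument-hours, ~{kwh_saved:.0f} kWh")
```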
And if your org is already under pressure to report Scope 2 reductions (common in pharma manufacturing and increasingly in R&D), AI-enabled experiment reduction becomes a sustainability lever you can defend with a spreadsheet.
Why this matters for bioenergy and biomanufacturing
This blog post sits in our drug discovery series, but the same workflow helps biotech programs tied to energy transition priorities:
- Enzyme engineering for biomass conversion (predict phenotypes of edits before running full assays)
- Microbial strain optimization for biofuels or biochemicals (prioritize edits that likely change organelle morphology, stress responses, or production pathways)
- Carbon utilization biology (reduce iteration cycles when exploring pathway rewiring)
Morphology isn’t always the end metric in those domains, but it’s often a proxy for cellular health, stress, and pathway activation—signals that correlate with yield and stability.
What energy & utilities teams can learn from MorphDiff
The headline isn’t “cells look like grids.” It’s that the same AI pattern keeps showing up in complex systems:
When experiments are expensive, you win by predicting outcomes early, then validating only what matters.
Utilities have been doing this for years with different names:
- predictive maintenance (model failure risk → inspect assets)
- grid optimization (simulate dispatch → run constraints)
- digital twins for substations, turbines, batteries (simulate dynamics → test selectively)
MorphDiff is essentially a digital twin for cellular phenotype, where the conditioning input is a molecular readout.
If you work in energy and are evaluating AI investments, this is a useful benchmark question:
- Are we using AI to make existing processes faster… or to run fewer expensive processes in the first place?
The second one is where ROI tends to stack up.
How to apply a MorphDiff-like approach in a real discovery pipeline
You don’t need MorphDiff specifically to benefit from the workflow shift. The implementable idea is a staged decision system that uses cheaper modalities to decide when to spend imaging budget.
Step 1: Treat multimodal data as a routing problem
Build a triage funnel where each compound (or perturbation) gets routed based on uncertainty and expected value.
A pragmatic routing policy often looks like this (a toy version in code follows the list):
- Structure + basic assays for early filtering
- Transcriptomics for a broad signature map
- Predicted morphology (generative model or surrogate) to cluster and detect novelty
- High-content imaging for confirmation on selected candidates
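Here’s what that policy can look like as code. The thresholds, field names, and routing labels are all hypothetical; the point is that the decision to spend imaging budget becomes an explicit, auditable function:

```python
from dataclasses import dataclass

@dataclass
class Candidate:
    compound_id: str
    passed_basic_assays: bool
    has_signature: bool
    predicted_novelty: float       # distance to nearest known phenotype cluster
    prediction_uncertainty: float  # model's own confidence estimate

def route(c: Candidate) -> str:
    """Toy triage policy: spend imaging budget only where the cheap
    modalities say the answer is uncertain or potentially novel."""
    if not c.passed_basic_assays:
        return "drop"
    if not c.has_signature:
        return "transcriptomics"   # get the conditioning signal first
    if c.prediction_uncertainty > 0.7:
        return "imaging"           # model can't be trusted here; verify
    if c.predicted_novelty > 0.8:
        return "imaging"           # new phenotypic neighborhood; confirm
    return "in-silico only"        # predicted morphology is enough

print(route(Candidate("cmpd-001", True, True, 0.9, 0.2)))  # -> imaging
print(route(Candidate("cmpd-002", True, True, 0.1, 0.1)))  # -> in-silico only
```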
Step 2: Decide what “good enough” means for simulated phenotypes
Simulated images are not ground truth. In practice, you’ll want acceptance criteria tied to your downstream tasks:
- Does simulated morphology preserve neighborhood structure (similar perturbations cluster together)?
- Does it support retrieval accuracy for known MOAs?
- Do feature distributions match real data well enough for your decision thresholds?
This is where many AI projects fail: teams evaluate with generic image metrics, then wonder why science stakeholders don’t trust it. Evaluate on your decision, not on photorealism.
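The first criterion, neighborhood preservation, is straightforward to measure. A sketch, assuming you have matched feature matrices for real and simulated morphologies; the k-NN overlap metric here is one reasonable choice among several:

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

def knn_overlap(real, simulated, k=10):
    """Fraction of each sample's k nearest neighbors that agree between
    the real and simulated feature spaces: 1.0 = structure preserved."""
    idx_r = NearestNeighbors(n_neighbors=k + 1).fit(real).kneighbors(real)[1][:, 1:]
    idx_s = NearestNeighbors(n_neighbors=k + 1).fit(simulated).kneighbors(simulated)[1][:, 1:]
    overlaps = [len(set(a) & set(b)) / k for a, b in zip(idx_r, idx_s)]
    return float(np.mean(overlaps))

rng = np.random.default_rng(2)
real = rng.standard_normal((200, 64))                      # real morphology features
simulated = real + rng.standard_normal((200, 64)) * 0.3    # noisy simulation
print(f"neighborhood preservation: {knn_overlap(real, simulated):.2f}")
```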
Step 3: Plan for the known limitations
Diffusion-based phenotype generation still faces constraints that you should design around:
- Inference speed: diffusion can be slow; plan batching, caching, and faster samplers (see the caching sketch after this list).
- Missing conditions: time and concentration often aren’t encoded well; your model may blur these effects.
- Out-of-distribution risk: predictions degrade when you move far from training data (new cell lines, new staining protocols, new lab sites).
- Dependency on transcriptomics: if you can’t get a gene signature, you need upstream models to predict it (or alternate conditioning signals).
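Caching is the cheapest of those mitigations. A minimal sketch, assuming predictions are deterministic per signature: hash a rounded copy of the gene vector so repeat queries skip the expensive diffusion call (the rounding precision and the stand-in model are illustrative):

```python
import hashlib
import numpy as np

_cache: dict[str, np.ndarray] = {}

def signature_key(signature: np.ndarray, decimals: int = 3) -> str:
    """Hash a rounded copy of the gene signature so repeat (and
    near-duplicate) queries reuse the cached prediction."""
    return hashlib.sha256(np.round(signature, decimals).tobytes()).hexdigest()

def predict_morphology(signature: np.ndarray) -> np.ndarray:
    key = signature_key(signature)
    if key not in _cache:
        # Stand-in for the expensive diffusion call. In practice you'd
        # also batch uncached signatures into a single GPU pass and
        # consider a reduced-step sampler for triage-grade previews.
        _cache[key] = np.random.default_rng(0).standard_normal((5, 64, 64))
    return _cache[key]

sig = np.random.default_rng(3).standard_normal(978)
first = predict_morphology(sig)          # computed (slow path)
second = predict_morphology(sig.copy())  # cache hit (fast path)
assert first is second
```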
The best stance is not “trust the model.” It’s “use the model to reduce the search space, then verify.”
The lead-worthy takeaway: AI that reduces experiments beats AI that summarizes experiments
A lot of “AI for drug discovery” spend is going into copilots, document search, and reporting. Helpful, but incremental.
Predictive phenotype generation is different because it can change unit economics:
- fewer plates
- fewer imaging runs
- fewer repeats
- faster hypothesis pruning
That’s also why it maps so well to energy and utilities. Both domains are capital- and resource-intensive. The most valuable AI systems don’t just produce insights; they reduce expensive real-world actions.
If you’re building an AI roadmap—whether in pharma R&D, biomanufacturing, or energy operations—this is the question I’d keep on the whiteboard:
Where can we predict outcomes early enough to avoid doing 80% of the work?
Because once you can reliably skip that 80%, budgets, timelines, and emissions all start to move in the right direction.