Nine Ruby
AI-generated abstract visualization representing machine learning and email optimization

How We Built an AI That Writes Better Email Subject Lines Than Humans

Email subject lines are deceptively simple. They are a handful of words that determine whether your carefully crafted message gets opened or buried beneath a hundred others. For years, the best subject lines came from seasoned copywriters with sharp instincts and decades of pattern recognition. But last year, we asked a different question: what if a model trained on 14 million subject-line-to-open-rate pairs could do it better?

The answer, after six months of development and rigorous A/B testing across 32 client campaigns, was a decisive yes. Our internal model, which we call Ruby Subject, now outperforms human copywriters on open rates by an average of 23%. More importantly, it does so consistently -- across industries, audience segments, and campaign types. The variance in performance dropped by nearly half compared to human-written lines, meaning clients could rely on predictable results rather than hoping for a good day from their creative team.

The Architecture Behind Ruby Subject

We started with a fine-tuned language model, but the real breakthrough came from the reward signal. Most AI writing tools optimize for fluency or engagement proxies. We optimized directly for open rate lift -- the percentage improvement over a baseline control subject line. This meant building a custom training pipeline that paired generated subject lines with real-world performance data from anonymized client campaigns.

"The model doesn't just write catchy lines. It learns the invisible patterns that make someone stop scrolling and actually tap open -- patterns no human copywriter could articulate."

Arjun Mehta, Head of AI Engineering at Nine Ruby

The pipeline works in three stages. First, Ruby Subject generates a batch of 20 candidate subject lines for a given email brief. Second, a separate scoring model -- trained on our historical performance data -- ranks them by predicted open rate. Third, the top three candidates are surfaced to the client for final selection or sent directly into a multi-armed bandit test that automatically converges on the best performer within the first 10% of sends.

What We Learned Along the Way

The most surprising finding was how often the AI's top picks violated conventional copywriting wisdom. Subject lines that human writers dismissed as "too plain" or "missing a hook" routinely outperformed clever, punchy alternatives. It turns out that in crowded inboxes, clarity beats cleverness. The model learned this on its own -- no one programmed a preference for simplicity. It simply discovered, across millions of data points, that recipients respond to subject lines that immediately communicate value without making them work to decode the message.

We also discovered that personalization tokens ({first_name}, {company}) had diminishing returns. Five years ago, adding a first name to a subject line boosted open rates by 8-12%. Today, that lift has shrunk to under 2%. Everyone does it now, and recipients have become numb to the tactic. Ruby Subject learned to use personalization sparingly and strategically -- sometimes dropping it entirely in favor of a stronger value proposition. This is the kind of nuance that is hard to mandate in a style guide but emerges naturally from data-driven optimization.

"We ran the model against our three best copywriters in a blind test. The AI won 7 out of 10 rounds. After that, we stopped debating whether AI belonged in our creative workflow."

Sarah Lin, VP of Client Strategy

What This Means for Marketing Teams

This is not about replacing writers. Our best results come from a hybrid workflow where Ruby Subject generates candidates and human editors refine tone and brand voice. The AI handles the heavy lifting of pattern matching and optimization; the humans ensure the output sounds like the brand, not a machine. The result is a creative process that is faster, more consistent, and measurably more effective. For our clients, that has translated to an average 18% increase in email-driven revenue within the first quarter of adoption -- a number that continues to climb as the model ingests more campaign data and sharpens its predictions.

← Back to all articles