Brilliant Machines, Human Breakthroughs.

75%

of companies yet to generate meaningful value from AI (WEF)

55%

of biotech AI pilots fail not due to bad models, but bad conditions

~100%

software engineering benchmark score up from ~60% in just one year

Why the most powerful AI in history still can’t replace the people asking the right questions?

Every week, a new AI capability lands on your LinkedIn feed. Autonomous research agents. Agentic workflows that spin up, selfcorrect, and deliver results overnight. Models that ace PhDlevel science benchmarks. The pace is dizzying and deliberately so.

So here is a question worth sitting with: if AI is this good, why do R&D pipelines still stall? Why are threequarters of companies still struggling to generate meaningful value from their AI investments, according to the World Economic Forum? Why are 55% of AI pilots in biotech failing not because the models are bad, but because the conditions for good AI outcomes were not there to begin with?

“AI is sprinting. But the problems worth solving require something it has not learned yet: discernment.”

At NineSigma, we sit at the intersection of that tension every day. We work with companies on their hardest innovation challenges the ones where the answer is not in a database, and where the difference between a promising lead and a genuine breakthrough comes down to human curiosity, crossdomain instinct, and the ability to ask a question no one else thought to ask.

This is a map of where AI ends and where the real work begins.

The state of AI in R&D: a genuinely impressive ledger

Let us be honest about what AI has achieved. According to Stanford’s 2026 AI Index, models now meet or exceed human expert performance on benchmarks designed to test PhDlevel science, mathematics, and language understanding. Software engineering benchmark scores jumped from around 60% to nearly 100% in just one year.

In pharmaceutical R&D, AI is reshaping the early pipeline. Protein structure prediction tools are now used by 73% of leading biotech organisations. Predictive docking models have cut timelines from years to months in certain targetidentification workflows.

Biotech AI adoption 2026 Biotech AI Report

Use protein structure prediction73%

Report faster timetotarget50%

See improved model accuracy42%

AI pilots that fail55%

Source: 2026 Biotech AI Report

Agentic AI systems that do not just answer questions but pursue goals, orchestrate tasks, and loop back for human validation has moved from research labs into production. IBM, Google, Anthropic, and others are racing to standardise multiagent protocols. The automation of complex enterprise workflows is no longer science fiction; it is a procurement question.

In short: AI is a formidable accelerant. For welldefined problems, with clean data, in established domains, it delivers speed and scale that no human team can match.

Where it breaks down

The failure modes are structural and consistent across industries.

⚡

Confident

Wrong answers, at scale

Stanford researchers call it “jagged intelligence” AI that surpasses experts on one task, then fails at something a child could do. In open innovation, a scouting exercise that surfaces wrong signals, confidently and at scale, is worse than no scouting at all.

🗄️

55%

Pilots that went nowhere

Poor data quality is the number one reason AI pilots fail, cited by 55% of organisations. R&D data is messy, siloed, and shaped by organisational memory that never made it into any system.

🔁

Safer ≠

Innovation that converged

Harvard Business School research found that AI recommendation systems risk homogenising creative output. When evaluators rely on AI first, they converge on higheraverage solutions but miss the outliers. In innovation, it is the tails that matter.

🏢

75%

Value that never materialised

The WEF found that around threequarters of companies have yet to generate meaningful value from AI not because the models are bad, but because organisational processes were not built to host them.

“The most important innovations often look like noise to a pattern recognition system.”

What this means for innovation

NineSigma’s work is fundamentally about brokering knowledge across boundaries between industries, disciplines, geographies, and organisational contexts that would never otherwise connect. This is precisely the kind of work that AI, for all its power, cannot yet do alone.

Consider what a highquality innovation facilitation process actually requires:

🎯

Problem framing

Capturing what the client actually needs, not just what they have articulated. The right question is usually not the one that was asked.

🔀

Crossdomain translation

Recognising that a materials science solution might answer a food industry challenge, or that a defence technology might unlock a sustainability application.

🌐

Ecosystem sensing

Knowing which experts, research groups, and startups are doing quiet work that has not yet made it into the literature AI is trained on.

⚖️

Synthesis and judgment

Not just summarising what exists, but making a call on what matters and why. This requires context no dataset fully captures.

🤝

Relationship intelligence

Understanding who in a network is the right person to speak to, and how to earn their genuine engagement.

None of this is beyond augmentation by AI. All of it still requires a human at the centre.

The right frame: augmentation, not replacement

The most important shift happening in serious AI conversations right now is the move from replacement thinking to augmentation thinking. IBM’s 2026 tech outlook describes this well: the goal is not to replace human roles, but to decompose them automating the highvolume, repetitive tasks so that the humans in the room can focus entirely on judgmentintensive work.

AI productivity gains where they are (and are not) significant

Software development+26%

Customer service+14%

Judgmentintensive tasks~0%

Source: Stanford AI Index 2026. Gains of this magnitude are not observed in tasks requiring complex contextual judgment.

CapTech’s 2026 trends report puts it plainly: AI systems excel at speed, scale, and automation, but they still lack the nuanced judgment, ethical reasoning, and contextual understanding that humans provide. The failure mode is not AI replacing humans it is humans being removed from the loop too early, replaced not by AI but by overconfidence in AI.

The effective model: humanAI iterative loop

🧭

Frame directionDefine the right questionHuman

⚡

Search & synthesiseScale across sourcesAI

🔍

Interrogate outputSpot what’s missingHuman

🔄

Refine & iterateTighten the signalAI

✅

Judge & decideApply domain wisdomHuman

What smart companies are doing differently

The organisations pulling ahead in 2026 are not the ones with the best models. They are the ones who pair model capabilities with human depth. Three patterns stand out.

They invest in problem quality before solution speed. The fastest AI pipeline will not compensate for a poorly framed challenge. Leading organisations spend more time not less on understanding the root question. Domain expertise has become infrastructure: in a world where AI can produce a plausiblesounding analysis of almost any topic, the people who know a field deeply enough to spot when an output is technically correct but practically wrong become more valuable, not less.

They build for humanAI loops, not handoffs. The most effective deployments are not “AI does it, human approves.” They are iterative: human sets direction, AI searches and synthesises, human interrogates the output, AI refines. This loop structure is where the real gains live. And they stay curious about the edges the unexpected disciplines, the emerging researchers, the quiet laboratories working on something that has not been named yet.

Conclusion

AI is not the end of expertise. It is a formidable accelerant for problems that are welldefined, datarich, and bounded. But the evidence from failed pilots, stalled portfolios, and unrealised value is clear: when human judgment is removed too early, AI does not just underperform it can actively mislead.

The companies that will lead in innovation are not the ones who automate the most. They are the ones who stay relentlessly focused on the human insight at the centre of every breakthrough: the sharp question, the unexpected connection, the judgment call that no dataset could have predicted.

“Speed is AI’s gift. Discernment is yours. The organisations that understand this will define what innovation looks like in the next decade.”

That is what open innovation has always been about and why the most important thing a brilliant machine can do is help a brilliant person think better.

Work with us

At NineSigma, we combine the best of human expertise and AI capability to deliver innovation outcomes that neither can achieve alone. If you are facing an innovation challenge that has resisted easy solutions, we would love to talk.

Get in touch →

Get in touch

NineSigma Europe BV

Koning Leopold I straat 3
B-3000 Leuven
Belgium

+32 16 24 42 80

europe@ninesigma.com

NEWS

Cookie	Duration	Description
__cfduid	1 month	The cookie is used by cdn services like CloudFare to identify individual clients behind a shared IP address and apply security settings on a per-client basis. It does not correspond to any user ID in the web application and does not store any personally identifiable information.
_GRECAPTCHA	5 months 27 days	This cookie is set by Google. In addition to certain standard Google cookies, reCAPTCHA sets a necessary cookie (_GRECAPTCHA) when executed for the purpose of providing its risk analysis.
cookielawinfo-checbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-advertisement	1 year	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Advertisement".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
_ga	2 years	This cookie is installed by Google Analytics. The cookie is used to calculate visitor, session, campaign data and keep track of site usage for the site's analytics report. The cookies store information anonymously and assign a randomly generated number to identify unique visitors.
_gcl_au	3 months	This cookie is used by Google Analytics to understand user interaction with the website.
_gid	1 day	This cookie is installed by Google Analytics. The cookie is used to store information of how visitors use a website and helps in creating an analytics report of how the website is doing. The data collected including the number visitors, the source where they have come from, and the pages visted in an anonymous form.

Cookie	Duration	Description
IDE	1 year 24 days	Used by Google DoubleClick and stores information about how the user uses the website and any other advertisement before visiting the website. This is used to present users with ads that are relevant to them according to the user profile.
test_cookie	15 minutes	This cookie is set by doubleclick.net. The purpose of the cookie is to determine if the user's browser supports cookies.
VISITOR_INFO1_LIVE	5 months 27 days	This cookie is set by Youtube. Used to track the information of the embedded YouTube videos on a website.

Cookie	Duration	Description
_gat_gtag_UA_6914197_1	1 minute	No description
_gat_UA-138161758-1	1 minute	No description