Articles

How to control AI agents before they control you

ByRicardo July 3, 2026

AI agents are genuinely spectacular. They can plan, cause, search the net, write code, ship emails, and execute multi-step duties with minimal human enter.

We need your suggestions (and there is a reward in it for you)

We’re evolving AIAI to higher serve you, and we want your enter to get it proper. Our short survey covers how you use your membership and what you’d like to see extra of.

Complete it, and you’ll be entered right into a draw for considered one of 5 £50/$65 Amazon vouchers.

Complete our survey

If you’ve spent any time working with them, you already know the sensation of watching one full in minutes what would have taken an individual hours.

But here is the factor no person talks about sufficient: they additionally fail in methods which might be deeply unsettling. And the failures do not all the time seem like failures at first. Sometimes they look fully cheap, proper up till the second they completely aren’t.

When lowering hallucinations is not sufficient

Let’s begin with one thing which may appear unrelated however really units the stage for the whole lot else: hallucinations.

You can scale back hallucinations considerably with the correct methods. Retrieval-augmented technology, grounding responses in verified data, tightening prompts. All of that helps. But it would not deliver the quantity down to zero. There’s all the time a residual danger, and that residual danger compounds when agents are making choices autonomously.

So what do you do about it? A couple of issues, actually. You can add a verification layer before any output will get introduced to a person. That layer could be deterministic and rules-based, or it may be a second LLM checking the work of the primary. The “LLM as decide” sample has grow to be well-known for a cause. It works moderately nicely.

For high-stakes queries, although, you want one thing extra. If an agent is negotiating a contract or dealing with massive monetary figures, the agent can do hours of labor behind the scenes, however a human ought to overview and ensure the ultimate output before any motion is taken.

The agent does the heavy lifting. The human does the ultimate test. That division of labor issues.

The inbox incident that went viral for all of the incorrect causes

Most of you have most likely heard concerning the OpenClaw incident. If you have not, here is what occurred.

Samar Yu is Meta’s AI alignment director. Her complete job is ensuring AI programs do what people inform them to do. She arrange an OpenClaw agent on a Mac Mini to assist handle her electronic mail inbox. She gave it clear directions: test the inbox, recommend what to archive or delete, however take no motion till I say so.

As quickly as she linked it to her actual inbox, it began bulk deleting her emails.

She panicked. She despatched messages from her cellphone: “Don’t try this. Stop. STOP OPENCLAW.” Nothing labored. The agent stored going. She finally had to bodily run to her desk and manually kill all of the processes on the machine to get it to cease. Her phrases had been that it felt like diffusing a bomb.

The put up she shared on X acquired 9.6 million views. And sure, the agent later apologized. It stated it remembered violating the constraint and acknowledged she was proper to be upset. Which is, truthfully, one of many stranger issues you’ll encounter on this area.

So, what really went incorrect?

The core problem was context compaction. Agents have a restricted reminiscence window. When the actual inbox linked and the amount of information exploded, the agent had to compact what it had processed up to now. Her unique directions acquired compacted away. The agent now not had them.

💡

When she despatched cease instructions from her cellphone, these messages acquired queued on the similar precedence degree as the whole lot else. The agent was making an attempt to end its present process before taking up new directions. It wasn’t ignoring her precisely. It simply hadn’t gotten there but.

Articles Artificial Intelligence

The hidden risk of one-size-fits-all AI advice
ByRicardo December 22, 2025

You’ve probably asked ChatGPT for advice at some point. Maybe about investing that bonus check, or how to finally tackle your credit card debt. Here’s what you might not realize: the same financial advice that’s perfectly safe for someone earning six figures could be catastrophic for a gig worker drowning in high-interest debt. A new…

Read More The hidden risk of one-size-fits-all AI advice
Articles Artificial Intelligence

Fast-track product validation using AI
ByRicardo January 11, 2026

A key challenge of product management is reducing the time between idea generation and gaining validation to move forward (or kill it). What used to take months of building, testing, gathering feedback, and iterating (often with high costs) can now be compressed dramatically using AI tools. Here’s a breakdown of actionable steps so you can fast‑track your product…

Read More Fast-track product validation using AI
Articles Case Studies

Case study: OpenAI
ByRicardo November 27, 2025November 27, 2025

OpenAI: The $300 billion frontier, anchored within the UK The world AI panorama in 2025 is outlined by an intense, multi-front conflict for supremacy amongst foundational mannequin builders: OpenAI, Google DeepMind, Anthropic, and Mistral AI. While OpenAI continues to steer in general scale, person base, and income (projected close to $12 billion in 2025), its…

Read More Case study: OpenAI
Agentic AI Articles

Is multi-turn reasoning broken?
ByRicardo June 2, 2026June 2, 2026

Everybody assumed reasoning fashions would fail the apparent means. A mannequin commits to one thing in flip two, contradicts it in flip 9, and also you catch it. Clean, detectable, patchable. Grab a consistency checker, add some grounding, and transfer on. A paper offered on the Why your present verification stack is flying blind The…

Read More Is multi-turn reasoning broken?
Articles Artificial Intelligence

Marketers are adopting AI. So why aren’t organizations embracing it?
ByRicardo November 19, 2025

Earlier this yr, we started engaged on a report exploring the way forward for advertising and marketing. There’s a whole lot of noise on the subject – a lot of it conjecture – so we needed to listen to from precise entrepreneurs on what they assume the occupation will appear like in 1, 5, 10 years’ time….

Read More Marketers are adopting AI. So why aren’t organizations embracing it?
Articles Artificial Intelligence

Why your AI investments are falling short
ByRicardo October 9, 2025October 9, 2025

Let me share one thing that may sound acquainted: Your group has poured vital assets into AI initiatives, but the returns stay disappointingly linear. If you are nodding alongside, you are not alone. In reality, 85% of AI tasks fail to succeed in manufacturing in 2024. That’s a staggering statistic that retains me up at…

Read More Why your AI investments are falling short