Amazon arrives at the AI Agent party with Nova Act, a browsing agent SDK

Ewan MacLeod

01 Apr 2025 — 2 min read

Stop the clocks once more.

The AI world has changed, again.

This time it's the 800lb gorilla in the form of Amazon arriving at the party with a series of fascinating developer previews based on enabling a highly focused, intensively trained web browsing agent.

It's called Nova Act and it's Amazon's offering to enable you to build systems, processes and capabilities to help people get things done, accurately and effectively.

If you've been watching and playing with the likes of OpenAI's Operator, I'd suggest that this is similar in context, although it's a very different approach.

The Amazon team have really, really focused the models' approach to making sure it doesn't get stuck on some of the more complicated aspects of the browser experience, including for example, date picking, drop-downs, pop-ups and so on.

The team points out:

Nova Act is focused on reliable building blocks that can be composed into more complex workflows. Many agent benchmarks measure model performance on high-level tasks, where state-of-the-art models achieve 30% to 60% accuracy on completing tasks in web browsers. But agents must be reliable to be truly useful — we’ve focused on scoring >90% on internal evals of capabilities that trip up other models, like date picking, drop downs, and popups, and achieve best-in-class performance on benchmarks like ScreenSpot and GroundUI Web which most directly measure the ability for our model to actuate the web.

What this means is that you can essentially string together a series of tasks – including adding in a bit of Python code as needed – to get something done. One of the team highlights Nova Act ordering him the same salad every week, for example.

You can sit and watch the agent do its thing, or you can opt to have it run in 'headless' mode – whereby it just executes the task you've defined without any need to watch the browser activities.

Nova Act is the first step in our vision for building the key capabilities that will enable useful agents at scale. This is an early checkpoint from a much larger training curriculum we are pursuing with Nova models. To truly make agents smart and reliable for increasingly complex multi-step tasks, we think agents need to be trained via reinforcement learning on a wide range of useful environments, not just via supervised fine-tuning with simple demonstrations into an LLM.

From these statements and the surrounding commentary across the various videos and releases, you can begin to get a picture of what the Amazon team is thinking in terms of scale and capability. It's impressive, compelling and exciting.

If you're in the United States, you can go and find out a lot more at https://nova.amazon.com – otherwise, if you're external (like me) you'll need to sit and wait before you can start playing with it.

One to watch!

Skyfire: Building out payment rails for AI Agents

I wanted to make sure I documented skyfire.xyz here on Conversational AI News. The company's mission is one that is absolutely fascinating. And here is the Overview from their about page: To create the world’s first fully autonomous, open economy for Agentic AI. AI Agents shouldn’

Sky cuts 2,000 call centre jobs to "focus on AI and online"

From an article in The Times of London today: About 2,000 jobs will be cut at Sky as it replaces more of its customer call centre positions with online and artificial intelligence. Three of Sky’s UK call centres, in Stockport, Sheffield and Leeds, will be closed and another

Got an AI project coming up? Try Ankit Chhajer's PRIMED™ AI Framework

My good colleague Ankit Chhajer, the top man in AI at Barclays UK has published his personal PRIMED™ AI framework for managing AI project success and I think it's worth a read. He's given a whole load of detail on how to apply it on his

Bloomreach's Clarity AI increases conversion by 9% and order value by 20%

Here's a fascinating press release from the team at agentic commerce experts Bloomreach: Bloomreach, the agentic platform for personalization, today announced new data from its conversational shopping agent, Clarity, highlighting the powerful impact of conversational AI and the new consumer behaviors it is unveiling. Live on a number

Read more

Skyfire: Building out payment rails for AI Agents

Sky cuts 2,000 call centre jobs to "focus on AI and online"

Got an AI project coming up? Try Ankit Chhajer's PRIMED™ AI Framework

Bloomreach's Clarity AI increases conversion by 9% and order value by 20%