We let OpenAI’s “Agent Mode” surf the web for us—here’s what happened

We let OpenAI’s “Agent Mode” surf the web for us—here’s what happened

On Tuesday, OpenAI announced Atlas, a new web browser with ChatGPT integration, to let you “chat with a page,” as the company puts it. But Atlas also goes beyond the usual LLM back-and-forth with Agent Mode, a “preview mode” feature the company says can “get work done for you” by clicking, scrolling, and reading through various tabs.

“Agentic” AI is far from new, of course; OpenAI itself rolled out a preview of the web browsing Operator agent in January and introduced the more generalized “ChatGPT agent” in July. Still, prominently featuring this capability in a major product release like this—even in “preview mode”—signals a clear push to get this kind of system in front of end users.

I wanted to put Atlas’ Agent Mode through its paces to see if it could really save me time in doing the kinds of tedious online tasks I plod through every day. In each case, I’ll outline a web-based problem, lay out the Agent Mode prompt I devised to try to solve it, and describe the results. My final evaluation will rank each task on a 10-point scale, with 10 being “did exactly what I wanted with no problems” and one being “complete failure.”

Read full article

Comments

1 Comment

  1. conroy.audie

    This is an interesting development! The integration of ChatGPT with a web browser could really enhance the browsing experience. I’m curious to see how it performs in real-world scenarios.

Leave a Reply to conroy.audie Cancel reply

Your email address will not be published. Required fields are marked *