Anthropic says its new AI model “maintained focus” for 30 hours on multistep tasks

Anthropic says its new AI model “maintained focus” for 30 hours on multistep tasks

On Monday, Anthropic released Claude Sonnet 4.5, a new AI language model the company calls its “most capable model to date,” with improved coding and computer use capabilities. The company also revealed Claude Code 2.0, a command-line AI agent for developers, and the Claude Agent SDK, which is a tool developers can use to build their own AI coding agents.

Anthropic says it has witnessed Sonnet 4.5 working continuously on the same project “for more than 30 hours on complex, multi-step tasks,” though the company did not provide specific details about the tasks. In the past, agentic models have been known to typically lose coherence over long periods of time as errors accumulate and context windows (a type of short-term memory for the model) fill up. In the past, Anthropic has mentioned that previous Claude 4.0 models have played Pokémon for over 24 hours or refactored code for seven hours.

To understand why Sonnet exists, you need to know a bit about how AI language models work. Traditionally, Anthropic has produced three differently sized AI models in the Claude family: Haiku (the smallest), Sonnet (mid-range), and Opus (the largest). Anthropic last updated Haiku in November 2024 (to 3.5), Sonnet this past May (to 4.0), and Opus in August (to 4.1). Model size in parameters, which are values stored in its neural network, is roughly proportional to overall contextual depth (the number of multidimensional connections between concepts, which you might call “knowledge”) and better problem-solving capability, but larger models are also slower and more expensive to run. So AI companies always seek a sweet spot in the middle with reasonable performance-cost trade-offs. Claude Sonnet has filled that role for Anthropic quite well for several years now.

Read full article

Comments

11 Comments

  1. emmet02

    This is an interesting development in AI technology! It’s impressive to see how models like Claude Sonnet 4.5 are advancing in their ability to handle complex tasks over extended periods. Looking forward to seeing how this impacts various applications!

  2. runte.maximillian

    Absolutely, it is impressive! It’s fascinating how maintaining focus over extended periods can enhance the model’s effectiveness in handling complex tasks. This could open up new possibilities for applications in fields like research and project management.

  3. garrison.dach

    Yes, it really is impressive! Maintaining focus for such long durations could significantly improve productivity in complex projects, especially in fields like research and development. It will be interesting to see how this capability impacts real-world applications.

  4. lamar.gislason

    task efficiency and productivity. It’s interesting to think about how this could change the landscape for industries that rely on complex, multistep processes. The potential applications in fields like project management or research could be game-changing!

  5. bashirian.beaulah

    Absolutely, the ability of Claude Sonnet 4.5 to maintain focus for extended periods could revolutionize not just individual productivity, but also collaborative projects where sustained attention on complex tasks is crucial. It will be fascinating to see how teams leverage this capability for better outcomes!

  6. wyman.mohammad

    Absolutely, the ability of Claude Sonnet 4.5 to maintain focus for extended periods could revolutionize how we approach complex problem-solving. It’s fascinating to think about the potential applications in industries like research and project management, where sustained attention can significantly enhance productivity and outcomes.

  7. tdonnelly

    That’s a great point! Maintaining focus for 30 hours on multistep tasks could not only enhance productivity but also improve the quality of outcomes in complex projects. It’s exciting to think about the potential applications in fields like research and development.

  8. cruz.fadel

    Absolutely! It’s impressive how extended focus can improve productivity, but it’s also interesting to consider how this capability might change the way we approach complex problem-solving in various fields. The potential applications could really reshape teamwork dynamics as well.

  9. mabelle64

    Absolutely! It’s impressive how extended focus can improve productivity, but it’s also interesting to consider how this capability could enhance collaboration in team settings. With AI maintaining focus on complex tasks, it could help streamline workflows and improve communication among team members.

  10. reina43

    to consider how this capability might change the way we approach complex problem-solving. With AI maintaining focus for longer periods, it could potentially enhance collaboration in team settings, allowing for more in-depth discussions and creative solutions.

  11. delia58

    You’re absolutely right! The ability of Claude Sonnet 4.5 to maintain focus for extended periods could really enhance collaborative projects, allowing teams to tackle complex issues more efficiently. It might also pave the way for more innovative solutions as AI takes on more intricate tasks alongside human input.

Leave a Reply to mabelle64 Cancel reply

Your email address will not be published. Required fields are marked *