Anthropic unveiled its latest artificial intelligence model last 29 September. Called Claude Sonnet 4.5, it is labelled as "the best coding model in the world."
That AI boast is based on industry benchmarks, including software engineer bench tests that measure an AI system's software coding abilities, the company said in a news release. That includes following instructions more reliably, the company said.
It can operate 30 hours by itself.
"People are just noticing with this model, because it's just smarter and more of a colleague, that it's kind of fun to work with it when encountering problems and fixing them," Jared Kaplan, Anthropic's co-founder and chief science officer, told CNBC in an interview.
Anthropic's coding competitors are OpenAI's GitHub Copilot and Google's Gemini.
"It's the strongest model for building complex agents," the company said. "It's the best model at using computers. And it shows substantial gains in reasoning and math.
"Code is everywhere. It runs every application, spreadsheet, and software tool you use. Being able to use those tools and reason through hard problems is how modern work gets done."
Claude Sonnet 4.5, which is available to all users, is better than other companies' AI products on coding with computers and meeting practical business needs, including cybersecurity, finance and research, the company said.
OSWorld, a benchmark that tests AI models on real-world computer tasks, showed Sonnet 4.5 leads at 61.4 percent. Four months ago, Sonnet 4 held the lead at 42.2 percent.
"Experts in finance, law, medicine, and STEM found Sonnet 4.5 shows dramatically better domain-specific knowledge and reasoning compared to older models, including Opus 4.1," the company said.
No comments:
Post a Comment