News  Sources

Israeli startup tops AI code review benchmark, beating OpenAI and Google

An Israeli startup named Baz has ranked first in a new benchmark evaluating artificial intelligence systems designed for code review, outperforming tools from leading research groups such as OpenAI, Anthropic, Google and Cursor. Baz also placed second in the overall composite score, which measures both precision and recall.

The benchmark, called Code Review Bench, is the first evaluation focused specifically on systems that review code rather than generate it. Unlike earlier coding benchmarks that faced criticism because models were trained to optimize directly for them, this new assessment combines controlled testing with real world behavioral data to better reflect practical value for software developers.

Original article source: https://www.ynetnews.com/tech-and-digital/article/s16gogjyze
Source Id: 9109268816

share this article:  

Our mission is to provide you with up-to-date, concise news from multiple sources in one place, keeping you informed about Israel.
 
Hit 'Subscribe' to get the latest curated news about Israel delivered daily to your inbox