InferenceMax is released under the Apache 2.0 license and measures the performance of hundreds of AI accelerator hardware and ...
On Friday, research organization Epoch AI released FrontierMath, a new mathematics benchmark that has been turning heads in the AI world because it contains hundreds of expert-level problems that ...
Industry Leader Known for Software Development Skills Expertise Introduces Real-World Benchmark of AI Software Development Capabilities CUPERTINO, Calif., Feb. 11, 2025 (GLOBE NEWSWIRE) -- HackerRank, ...
Generative artificial intelligence startup Sierra Technologies Inc. is taking it upon itself to “advance the frontiers of conversational AI agents” with a new benchmark test that evaluates the ...
MONTREAL, QC / ACCESS Newswire / October 13, 2025 / Vision Marine Technologies Inc. (NASDAQ:VMAR) (“Vision Marine” or the “Company”), a global leader in electric marine propulsion, proudly announces ...
Meta released an agentic testing environment, Agents Research Environment, and a new benchmark called Gaia2 to measure agent's real-world adaptability.
Nvidia’s rack-scale Blackwell systems topped a new benchmark of AI inference performance, with the tech giant's networking ...
They could offer a more nuanced way to measure AI’s bias and its understanding of the world. New AI benchmarks could help developers reduce bias in AI models, potentially making them fairer and less ...