News from the AI & ML world
Asif Razzaq@MarkTechPost
OpenAI has unveiled PaperBench, a benchmark that assesses whether AI agents can autonomously replicate cutting-edge machine learning research. The benchmark consists of 20 papers from ICML 2024, spanning areas such as reinforcement learning and probabilistic methods. PaperBench measures whether AI systems can accurately interpret research papers, independently develop codebases, and execute experiments to replicate the papers' empirical results. To ensure genuinely independent replication, agents are prohibited from referencing the original authors' code.
Evaluation relies on systematic tooling and detailed rubrics, co-developed with the original paper authors, that specify 8,316 individually gradable tasks so that agent capabilities can be assessed precisely.
OpenAI is also escalating its competition with Anthropic by offering free ChatGPT Plus subscriptions to college students in the US and Canada through the end of May. The move gives millions of students access to OpenAI’s premium service just as they prepare for final exams, including GPT-4o, image generation, voice interaction, and advanced research tools.
References:
- venturebeat.com: OpenAI just made ChatGPT Plus free for millions of college students — and it’s a brilliant competitive move against Anthropic
- MarkTechPost: Open AI Releases PaperBench: A Challenging Benchmark for Assessing AI Agents’ Abilities to Replicate Cutting-Edge Machine Learning Research
- www.techradar.com: OpenAI is giving away ChatGPT Plus subscriptions to students to help you study for finals – here’s how to apply
- THE DECODER: Anthropic brings AI assistant Claude to university campuses
- www.techradar.com: ChatGPT-5 is on hold as OpenAI changes plans and releases new o3 and o4-mini models
- BleepingComputer: Coverage of OpenAI making ChatGPT Plus free for students
- www.zdnet.com: ChatGPT Plus is free for students now - how to grab this deal before finals
- The Tech Basic: OpenAI and Anthropic are fighting to be students’ favorite AI tools
- THE DECODER: OpenAI plans GPT-5 release in "a few months," shifts strategy on reasoning models