Information
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
Last Updated: 2025-04-15
Detailed Ratings