AIMarch 8, 2025Tufa Labs Introduced LADDER: A Recursive Learning Framework Enabling Large Language Models to Self-Improve without Human Intervention
AIMarch 7, 2025Researchers from AMLab and CuspAI Introduced Erwin: A Tree-based Hierarchical Transformer for Large-scale Physical Systems
AIMarch 5, 2025Researchers from FutureHouse and ScienceMachine Introduce BixBench: A Benchmark Designed to Evaluate AI Agents on Real-World Bioinformatics Task
AIFebruary 24, 2025Meta AI Introduces MLGym: A New AI Framework and Benchmark for Advancing AI Research Agents
AIFebruary 15, 2025Microsoft Research Introduces Data Formulator: An AI Application that Leverages LLMs to Transform Data and Create Rich Visualizations