AI | March 5, 2025 | Researchers from FutureHouse and ScienceMachine Introduce BixBench: A Benchmark Designed to Evaluate AI Agents on Real-World Bioinformatics Task
AI | March 5, 2025 | Project Alexandria: Democratizing Scientific Knowledge Through Structured Fact Extraction with LLMs
AI | March 4, 2025 | Step by Step Guide to Build an AI Research Assistant with Hugging Face SmolAgents: Automating Web Search and Article Summarization Using LLM-Powered Autonomous Agents
AI | February 25, 2025 | Researchers from Moonshot AI Introduce Muon and Moonlight: Optimizing Large-Scale Language Models with Efficient Training Techniques
AI | February 24, 2025 | Meta AI Introduces MLGym: A New AI Framework and Benchmark for Advancing AI Research Agents