AI | March 13, 2025 | Alibaba Researchers Introduce R1-Omni: An Application of Reinforcement Learning with Verifiable Reward (RLVR) to an Omni-Multimodal Large Language Model
AI | March 8, 2025 | CMU Researchers Introduce PAPRIKA: A Fine-Tuning Approach that Enables Language Models to Develop General Decision-Making Capabilities Not Confined to Particular Environment
AI | March 5, 2025 | Researchers from FutureHouse and ScienceMachine Introduce BixBench: A Benchmark Designed to Evaluate AI Agents on Real-World Bioinformatics Task
AI | March 5, 2025 | Project Alexandria: Democratizing Scientific Knowledge Through Structured Fact Extraction with LLMs
AI | February 10, 2025 | Tutorial to Fine-Tuning Mistral 7B with QLoRA Using Axolotl for Efficient LLM Training