AIApril 25, 2025Microsoft Research Introduces MMInference to Accelerate Pre-filling for Long-Context Vision-Language Models
AIApril 25, 2025Meta AI Releases Web-SSL: A Scalable and Language-Free Approach to Visual Representation Learning
AIApril 23, 2025Muon Optimizer Significantly Accelerates Grokking in Transformers: Microsoft Researchers Explore Optimizer Influence on Delayed Generalization
AIApril 22, 2025LLMs Can Now Retain High Accuracy at 2-Bit Precision: Researchers from UNC Chapel Hill Introduce TACQ, a Task-Aware Quantization Approach that Preserves Critical Weight Circuits for Compression Without Performance Loss
AIApril 20, 2025NVIDIA Introduces CLIMB: A Framework for Iterative Data Mixture Optimization in Language Model Pretraining
AIApril 14, 2025A Coding Implementation for Advanced Multi-Head Latent Attention and Fine-Grained Expert Segmentation