AI | June 22, 2025 | This AI Paper Introduces WINGS: A Dual-Learner Architecture to Prevent Text-Only Forgetting in Multimodal Large Language Models
AI | June 15, 2025 | Internal Coherence Maximization (ICM): A Label-Free, Unsupervised Training Framework for LLMs
AI | June 12, 2025 | CURE: A Reinforcement Learning Framework for Co-Evolving Code and Unit Test Generation in LLMs
AI | June 9, 2025 | High-Entropy Token Selection in Reinforcement Learning with Verifiable Rewards (RLVR) Improves Accuracy and Reduces Training Cost for LLMs