AIApril 15, 2025THUDM Releases GLM 4: A 32B Parameter Model Competing Head-to-Head with GPT-4o and DeepSeek-V3
AIApril 14, 2025A Coding Implementation for Advanced Multi-Head Latent Attention and Fine-Grained Expert Segmentation
AIApril 14, 2025Reasoning Models Know When They’re Right: NYU Researchers Introduce a Hidden-State Probe That Enables Efficient Self-Verification and Reduces Token Usage by 24%
AIApril 13, 2025A Coding Implementation on Introduction to Weight Quantization: Key Aspect in Enhancing Efficiency in Deep Learning and LLMs
AIApril 13, 2025Moonsight AI Released Kimi-VL: A Compact and Powerful Vision-Language Model Series Redefining Multimodal Reasoning, Long-Context Understanding, and High-Resolution Visual Processing
AIApril 12, 2025Step by Step Coding Guide to Build a Neural Collaborative Filtering (NCF) Recommendation System with PyTorch