AIApril 25, 2025Microsoft Research Introduces MMInference to Accelerate Pre-filling for Long-Context Vision-Language Models
AIMarch 11, 2025STORM (Spatiotemporal TOken Reduction for Multimodal LLMs): A Novel AI Architecture Incorporating a Dedicated Temporal Encoder between the Image Encoder and the LLM