Google recently released DiffusionGemma, and it's weird in the best way.
AI-assisted coding, or ‘vibe coding’, is a trend that dominated last year, so much so that it was named word of the year for 2025. Since OpenAI co-founder Andrej Karpathy coined the term, developers ...
Apple researchers have developed an adapted version of the SlowFast-LLaVA model that beats larger models at long-form video analysis and understanding. Here’s what that means. Very basically, when an ...
Apple's researchers continue to focus on multimodal LLMs, with studies exploring their use for image generation, understanding, and multi-turn web searches with cropped images. Now, the company is ...
A new study suggests that traditional learning activities like making notes remain critical for students’ reading comprehension and retention, and that large language models (LLMs), ...
Stanford University’s recent research, conducted in collaboration with Tsinghua University, has revealed a surprising shift in how we evaluate the performance of large language models (LLMs). Rather ...
Swapping large language models (LLMs) is supposed to be easy, isn’t it? After all, if they all speak “natural language,” switching from GPT-4o to Claude or Gemini should be as simple as changing an ...
In the study titled MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer, a team of nearly 30 Apple researchers details a novel unified approach that enables both ...