Tags in this category
ModelsHot
Claude 3.7 Sonnet Sets New Benchmark in Reasoning and Code Generation
Anthropic releases Claude 3.7 Sonnet with extended thinking capabilities, achieving state-of-the-art results on SWE-bench and GPQA Diamond. The model introduces hybrid reasoning that can switch between fast and deliberate thought modes.
Anthropic
5m15.4k4 days ago
ModelsHot
GPT-5 Architecture Leak Reveals Mixture-of-Experts with 1.8 Trillion Parameters
Leaked documents suggest GPT-5 employs a sparse Mixture-of-Experts architecture with 1.8 trillion total parameters, activating only 200B per forward pass. OpenAI has neither confirmed nor denied the reports.
OpenAI
6m28.9k5 days ago
ModelsHot
Gemini Ultra 2.0 Achieves Human-Level Performance on Medical Licensing Exams
Google DeepMind's Gemini Ultra 2.0 scores 90%+ on USMLE Step 1, 2, and 3, demonstrating expert-level medical knowledge. The model also shows strong performance in radiology image interpretation.
Google DeepMind
7m12.3k6 days ago
Models
A New Framework for Evaluating Voice Agents (EVA)
Hugging Face Blog
11m17 days ago
Models
Build a Domain-Specific Embedding Model in Under a Day
Hugging Face Blog
14m10 days ago
Models
What's New in Mellea 0.4.0 + Granite Libraries Release
Hugging Face Blog
3m10 days ago

Models
State of Open Source on Hugging Face: Spring 2026
Hugging Face Blog
14m13 days ago

Models
Holotron-12B - High Throughput Computer Use Agent
Hugging Face Blog
4m13 days ago
Models
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries
Hugging Face Blog
46m21 days ago

Models
Introducing Storage Buckets on the Hugging Face Hub
Hugging Face Blog
7m21 days ago
Models
LeRobot v0.5.0: Scaling Every Dimension
Hugging Face Blog
10m22 days ago
Models
Ulysses Sequence Parallelism: Training with Million-Token Contexts
Hugging Face Blog
14m22 days ago
Models
Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device Optimizations
Hugging Face Blog
10m25 days ago
