Live
🔥 OpenBMB/ChatDevGitHub Trending🔥 microsoft/agent-lightningGitHub Trending🔥 apache/supersetGitHub Trending🔥 shanraisshan/claude-code-best-practiceGitHub TrendingA-SelecT: Automatic Timestep Selection for Diffusion Transformer Representation LearningarXivGUIDE: Resolving Domain Bias in GUI Agents through Real-Time Web Video Retrieval and Plug-and-Play AnnotationarXivSommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language ModelsarXivCANGuard: A Spatio-Temporal CNN-GRU-Attention Hybrid Architecture for Intrusion Detection in In-Vehicle CAN NetworksarXivDesignWeaver: Dimensional Scaffolding for Text-to-Image Product DesignarXivA Lightweight, Transferable, and Self-Adaptive Framework for Intelligent DC Arc-Fault Detection in Photovoltaic SystemsarXivConsistency Amplifies: How Behavioral Variance Shapes Agent AccuracyarXivStabilizing Rubric Integration Training via Decoupled Advantage NormalizationarXivSemi-Automated Knowledge Engineering and Process Mapping for Total Airport ManagementarXivAIRA_2: Overcoming Bottlenecks in AI Research AgentsarXivBeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional EnvironmentsarXiv🔥 OpenBMB/ChatDevGitHub Trending🔥 microsoft/agent-lightningGitHub Trending🔥 apache/supersetGitHub Trending🔥 shanraisshan/claude-code-best-practiceGitHub TrendingA-SelecT: Automatic Timestep Selection for Diffusion Transformer Representation LearningarXivGUIDE: Resolving Domain Bias in GUI Agents through Real-Time Web Video Retrieval and Plug-and-Play AnnotationarXivSommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language ModelsarXivCANGuard: A Spatio-Temporal CNN-GRU-Attention Hybrid Architecture for Intrusion Detection in In-Vehicle CAN NetworksarXivDesignWeaver: Dimensional Scaffolding for Text-to-Image Product DesignarXivA Lightweight, Transferable, and Self-Adaptive Framework for Intelligent DC Arc-Fault Detection in Photovoltaic SystemsarXivConsistency Amplifies: How Behavioral Variance Shapes Agent AccuracyarXivStabilizing Rubric Integration Training via Decoupled Advantage NormalizationarXivSemi-Automated Knowledge Engineering and Process Mapping for Total Airport ManagementarXivAIRA_2: Overcoming Bottlenecks in AI Research AgentsarXivBeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional EnvironmentsarXiv
Models

Models

New AI model releases and updates from leading labs

13 articles

Tags in this category
Claude 3.7 Sonnet Sets New Benchmark in Reasoning and Code Generation
ModelsHot

Claude 3.7 Sonnet Sets New Benchmark in Reasoning and Code Generation

Anthropic releases Claude 3.7 Sonnet with extended thinking capabilities, achieving state-of-the-art results on SWE-bench and GPQA Diamond. The model introduces hybrid reasoning that can switch between fast and deliberate thought modes.

Anthropic
5m15.4k4 days ago
GPT-5 Architecture Leak Reveals Mixture-of-Experts with 1.8 Trillion Parameters
ModelsHot

GPT-5 Architecture Leak Reveals Mixture-of-Experts with 1.8 Trillion Parameters

Leaked documents suggest GPT-5 employs a sparse Mixture-of-Experts architecture with 1.8 trillion total parameters, activating only 200B per forward pass. OpenAI has neither confirmed nor denied the reports.

OpenAI
6m28.9k5 days ago
Gemini Ultra 2.0 Achieves Human-Level Performance on Medical Licensing Exams
ModelsHot

Gemini Ultra 2.0 Achieves Human-Level Performance on Medical Licensing Exams

Google DeepMind's Gemini Ultra 2.0 scores 90%+ on USMLE Step 1, 2, and 3, demonstrating expert-level medical knowledge. The model also shows strong performance in radiology image interpretation.

Google DeepMind
7m12.3k6 days ago
A New Framework for Evaluating Voice Agents (EVA)
Models

A New Framework for Evaluating Voice Agents (EVA)

Hugging Face Blog
11m17 days ago
Build a Domain-Specific Embedding Model in Under a Day
Models

Build a Domain-Specific Embedding Model in Under a Day

Hugging Face Blog
14m10 days ago
What's New in Mellea 0.4.0 + Granite Libraries Release
Models

What's New in Mellea 0.4.0 + Granite Libraries Release

Hugging Face Blog
3m10 days ago
State of Open Source on Hugging Face: Spring 2026
Models

State of Open Source on Hugging Face: Spring 2026

Hugging Face Blog
14m13 days ago
Holotron-12B - High Throughput Computer Use Agent
Models

Holotron-12B - High Throughput Computer Use Agent

Hugging Face Blog
4m13 days ago
Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries
Models

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Hugging Face Blog
46m21 days ago
Introducing Storage Buckets on the Hugging Face Hub
Models

Introducing Storage Buckets on the Hugging Face Hub

Hugging Face Blog
7m21 days ago
LeRobot v0.5.0: Scaling Every Dimension
Models

LeRobot v0.5.0: Scaling Every Dimension

Hugging Face Blog
10m22 days ago
Ulysses Sequence Parallelism: Training with Million-Token Contexts
Models

Ulysses Sequence Parallelism: Training with Million-Token Contexts

Hugging Face Blog
14m22 days ago
Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device Optimizations
Models

Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device Optimizations

Hugging Face Blog
10m25 days ago