Live

•🔥 OpenBMB/ChatDevGitHub Trending •🔥 microsoft/agent-lightningGitHub Trending •🔥 apache/supersetGitHub Trending •🔥 shanraisshan/claude-code-best-practiceGitHub Trending •A-SelecT: Automatic Timestep Selection for Diffusion Transformer Representation LearningarXiv •GUIDE: Resolving Domain Bias in GUI Agents through Real-Time Web Video Retrieval and Plug-and-Play AnnotationarXiv •Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language ModelsarXiv •CANGuard: A Spatio-Temporal CNN-GRU-Attention Hybrid Architecture for Intrusion Detection in In-Vehicle CAN NetworksarXiv •DesignWeaver: Dimensional Scaffolding for Text-to-Image Product DesignarXiv •A Lightweight, Transferable, and Self-Adaptive Framework for Intelligent DC Arc-Fault Detection in Photovoltaic SystemsarXiv •Consistency Amplifies: How Behavioral Variance Shapes Agent AccuracyarXiv •Stabilizing Rubric Integration Training via Decoupled Advantage NormalizationarXiv •Semi-Automated Knowledge Engineering and Process Mapping for Total Airport ManagementarXiv •AIRA_2: Overcoming Bottlenecks in AI Research AgentsarXiv •BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional EnvironmentsarXiv •🔥 OpenBMB/ChatDevGitHub Trending •🔥 microsoft/agent-lightningGitHub Trending •🔥 apache/supersetGitHub Trending •🔥 shanraisshan/claude-code-best-practiceGitHub Trending •A-SelecT: Automatic Timestep Selection for Diffusion Transformer Representation LearningarXiv •GUIDE: Resolving Domain Bias in GUI Agents through Real-Time Web Video Retrieval and Plug-and-Play AnnotationarXiv •Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language ModelsarXiv •CANGuard: A Spatio-Temporal CNN-GRU-Attention Hybrid Architecture for Intrusion Detection in In-Vehicle CAN NetworksarXiv •DesignWeaver: Dimensional Scaffolding for Text-to-Image Product DesignarXiv •A Lightweight, Transferable, and Self-Adaptive Framework for Intelligent DC Arc-Fault Detection in Photovoltaic SystemsarXiv •Consistency Amplifies: How Behavioral Variance Shapes Agent AccuracyarXiv •Stabilizing Rubric Integration Training via Decoupled Advantage NormalizationarXiv •Semi-Automated Knowledge Engineering and Process Mapping for Total Airport ManagementarXiv •AIRA_2: Overcoming Bottlenecks in AI Research AgentsarXiv •BeSafe-Bench: Unveiling Behavioral Safety Risks of Situated Agents in Functional EnvironmentsarXiv

AI NEWS

by techtonicshifts.blog

Models

Models

New AI model releases and updates from leading labs

13 articles

Tags in this category

Claude 3.7 Sonnet Sets New Benchmark in Reasoning and Code Generation

Claude 3.7 Sonnet Sets New Benchmark in Reasoning and Code Generation

Anthropic releases Claude 3.7 Sonnet with extended thinking capabilities, achieving state-of-the-art results on SWE-bench and GPQA Diamond. The model introduces hybrid reasoning that can switch between fast and deliberate thought modes.

5m15.4k4 days ago

GPT-5 Architecture Leak Reveals Mixture-of-Experts with 1.8 Trillion Parameters

GPT-5 Architecture Leak Reveals Mixture-of-Experts with 1.8 Trillion Parameters

Leaked documents suggest GPT-5 employs a sparse Mixture-of-Experts architecture with 1.8 trillion total parameters, activating only 200B per forward pass. OpenAI has neither confirmed nor denied the reports.

6m28.9k5 days ago

Gemini Ultra 2.0 Achieves Human-Level Performance on Medical Licensing Exams

Gemini Ultra 2.0 Achieves Human-Level Performance on Medical Licensing Exams

Google DeepMind's Gemini Ultra 2.0 scores 90%+ on USMLE Step 1, 2, and 3, demonstrating expert-level medical knowledge. The model also shows strong performance in radiology image interpretation.

Google DeepMind

7m12.3k6 days ago

A New Framework for Evaluating Voice Agents (EVA)

A New Framework for Evaluating Voice Agents (EVA)

Hugging Face Blog

Build a Domain-Specific Embedding Model in Under a Day

Build a Domain-Specific Embedding Model in Under a Day

Hugging Face Blog

What's New in Mellea 0.4.0 + Granite Libraries Release

What's New in Mellea 0.4.0 + Granite Libraries Release

Hugging Face Blog

State of Open Source on Hugging Face: Spring 2026

State of Open Source on Hugging Face: Spring 2026

Hugging Face Blog

Holotron-12B - High Throughput Computer Use Agent

Holotron-12B - High Throughput Computer Use Agent

Hugging Face Blog

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Hugging Face Blog

Introducing Storage Buckets on the Hugging Face Hub

Introducing Storage Buckets on the Hugging Face Hub

Hugging Face Blog

LeRobot v0.5.0: Scaling Every Dimension

LeRobot v0.5.0: Scaling Every Dimension

Hugging Face Blog

Ulysses Sequence Parallelism: Training with Million-Token Contexts

Ulysses Sequence Parallelism: Training with Million-Token Contexts

Hugging Face Blog

Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device Optimizations

Bringing Robotics AI to Embedded Platforms: Dataset Recording, VLA Fine‑Tuning, and On‑Device Optimizations

Hugging Face Blog