Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessNvidia Needs to Remind Itself What PC Gamers Actually WantGizmodoAI’s affect on communities, students, staff - USI | student newspaperGoogle News: Generative AI2 Artificial Intelligence (AI) Stocks I'd Buy With $1,000 Before They Rebound From the Tech Sell-Off - The Motley FoolGoogle News: AIGoogle Updates Gemini API Pricing Tiers for Optimization - Intellectia AIGoogle News: GeminiIran Says It Hit Oracle Facilities in UAEGizmodoInside the ethics of artificial intelligence - New Day NW - KING5.comGoogle News: AIAI Needs Memory— And The DRAM ETF Is All In On Micron, Samsung And Sandisk - Roundhill Memory ETF (BATS:D - BenzingaGNews AI SamsungMicrosoft Generative AI Report: The 40 Jobs Most Disrupted Jobs & The 40 Most Secure Jobs - HackerNoonGoogle News: Generative AIGeopolitics, AI, and Cybersecurity: Insights From RSAC 2026Dark ReadingQualcomm joins MassRobotics, to support startups with Dragonwing Robotics HubRobotics Business ReviewDisney, OpenAI Eye Future Deal After Sora Shutdown - lamag.comGoogle News: OpenAIThe BR Privacy, Security & AI Download: April 2026 - The National Law ReviewGNews AI cybersecurityBlack Hat USADark ReadingBlack Hat AsiaAI BusinessNvidia Needs to Remind Itself What PC Gamers Actually WantGizmodoAI’s affect on communities, students, staff - USI | student newspaperGoogle News: Generative AI2 Artificial Intelligence (AI) Stocks I'd Buy With $1,000 Before They Rebound From the Tech Sell-Off - The Motley FoolGoogle News: AIGoogle Updates Gemini API Pricing Tiers for Optimization - Intellectia AIGoogle News: GeminiIran Says It Hit Oracle Facilities in UAEGizmodoInside the ethics of artificial intelligence - New Day NW - KING5.comGoogle News: AIAI Needs Memory— And The DRAM ETF Is All In On Micron, Samsung And Sandisk - Roundhill Memory ETF (BATS:D - BenzingaGNews AI SamsungMicrosoft Generative AI Report: The 40 Jobs Most Disrupted Jobs & The 40 Most Secure Jobs - HackerNoonGoogle News: Generative AIGeopolitics, AI, and Cybersecurity: Insights From RSAC 2026Dark ReadingQualcomm joins MassRobotics, to support startups with Dragonwing Robotics HubRobotics Business ReviewDisney, OpenAI Eye Future Deal After Sora Shutdown - lamag.comGoogle News: OpenAIThe BR Privacy, Security & AI Download: April 2026 - The National Law ReviewGNews AI cybersecurity
AI NEWS HUBbyEIGENVECTOREigenvector

[P] PhAIL (phail.ai) – an open benchmark for robot AI on real hardware. Best model: 5% of human throughput, needs help every 4 minutes.

Reddit r/MachineLearningby /u/svertix https://www.reddit.com/user/svertixApril 2, 20262 min read0 views
Source Quiz

I spent the last year trying to answer a simple question: how good are VLA models on real commercial tasks? Not demos, not simulation, not success rates on 10 tries. Actual production metrics on real hardware. I couldn't find honest numbers anywhere, so I built a benchmark. Setup: DROID platform, bin-to-bin order picking – one of the most common warehouse and industrial operations. Four models fine-tuned on the same real-robot dataset, evaluated blind (the operator doesn't know which model is running). We measure Units Per Hour (UPH) and Mean Time Between Failures (MTBF) – the metrics operations people actually use. Results (full data with video and telemetry for every run at phail.ai ): Model UPH MTBF OpenPI (pi0.5) 65 4.0 min GR00T 60 3.5 min ACT 44 2.8 min SmolVLA 18 1.2 min Teleop / Fi

Could not retrieve the full article text.

Read on Reddit r/MachineLearning →
Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
[P] PhAIL (…modelbenchmarktrainingopen-sourceproductplatformReddit r/Ma…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 168 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!