
From 3D Pose to Prose: Biomechanics-Grounded Vision-Language Coaching

arXiv, March 31, 2026

arXiv:2603.26938v1 Announce Type: new

Authors: Yuyang Ji, Yixuan Shen, Shengjie Zhu, Yu Kong, Feng Liu


Abstract: We present BioCoach, a biomechanics-grounded vision-language framework for fitness coaching from streaming video. BioCoach fuses visual appearance and 3D skeletal kinematics through a novel three-stage pipeline: an exercise-specific degree-of-freedom selector that focuses analysis on salient joints; a structured biomechanical context that pairs individualized morphometrics with cycle and constraint analysis; and a vision-biomechanics conditioned feedback module that applies cross-attention to generate precise, actionable text. Using parameter-efficient training that freezes the vision and language backbones, BioCoach yields transparent, personalized reasoning rather than pattern matching. To enable learning and fair evaluation, we augment QEVD-fit-coach with biomechanics-oriented feedback to create QEVD-bio-fit-coach, and we introduce a biomechanics-aware LLM judge metric. BioCoach delivers clear gains on QEVD-bio-fit-coach across lexical and judgment metrics while maintaining temporal triggering; on the original QEVD-fit-coach, it improves text quality and correctness with near-parity timing, demonstrating that explicit kinematics and constraints are key to accurate, phase-aware coaching.
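The abstract does not say how the degree-of-freedom selector is implemented. As a purely illustrative sketch of the general idea of deriving joint angles from 3D keypoints and focusing on the joints that move most, the Python below computes an angle at a joint and ranks joints by range of motion. The function names, the range-of-motion criterion, and the sample values are all assumptions made for illustration; none of them come from the paper.

```python
import math
from typing import Dict, List, Sequence, Tuple

Point3D = Tuple[float, float, float]


def joint_angle(a: Point3D, b: Point3D, c: Point3D) -> float:
    """Return the angle in degrees at joint b between segments b->a and b->c."""
    u = [a[i] - b[i] for i in range(3)]
    v = [c[i] - b[i] for i in range(3)]
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("degenerate segment: two keypoints coincide")
    cos_theta = sum(x * y for x, y in zip(u, v)) / (norm_u * norm_v)
    # Clamp to guard against floating-point drift outside [-1, 1].
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.degrees(math.acos(cos_theta))


def select_salient_dofs(
    angle_series: Dict[str, Sequence[float]], top_k: int = 2
) -> List[str]:
    """Hypothetical selector: keep the top_k joints with the largest
    range of motion (max angle minus min angle) over a clip."""
    range_of_motion = {
        name: max(values) - min(values) for name, values in angle_series.items()
    }
    ranked = sorted(range_of_motion, key=lambda n: range_of_motion[n], reverse=True)
    return ranked[:top_k]


# Made-up per-frame joint angles (degrees) for a squat-like motion.
example_angles = {
    "knee": [170.0, 90.0, 170.0],
    "elbow": [160.0, 158.0, 161.0],
    "hip": [175.0, 100.0, 170.0],
}
salient = select_salient_dofs(example_angles, top_k=2)
```

With these made-up values the knee and hip are kept and the elbow is dropped, since the elbow barely moves. A learned, exercise-specific selector such as the one the abstract describes may behave quite differently; this heuristic only illustrates the notion of restricting analysis to salient joints.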

Subjects: Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.26938 [cs.CV] (or arXiv:2603.26938v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26938

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Yuyang Ji
[v1] Fri, 27 Mar 2026 19:26:28 UTC (1,051 KB)
