Live
Black Hat USAAI BusinessBlack Hat AsiaAI Business1 Artificial Intelligence (AI) Software Stock to Buy Hand Over Fist Before It Soars 62%, According to Dan Ives - The Motley FoolGoogle News: AIGroup Pushing Age Verification Requirements for AI Turns Out to Be Sneakily Backed by OpenAIGizmodoGroup Pushing Age Verification Requirements for AI Turns Out to Be Sneakily Backed by OpenAI - GizmodoGoogle News: OpenAIInside the race to recreate Claude Code and mine its guts for revelationsBusiness InsiderAnthropic Executive Sees Cowork Agent as Bigger Than Claude Code - Bloomberg.comGoogle News: ClaudeAnthropic Executive Sees Cowork Agent as Bigger Than Claude CodeBloomberg TechnologyABAP OOP Design Patterns — Part 2: Factory, Observer, and Decorator Patterns in Real SAP SystemsDEV CommunityWhy Your AI Agent Health Check Is Lying to YouDEV CommunityDeep Dive: Array Internals & Memory LayoutDEV CommunityIllinois Tech computer science researcher honored by IEEE Chicago Section - EurekAlert!Google News: Machine LearningICE Tells Lawmakers It’s Using Spyware in Fight Against FentanylBloomberg TechnologyAmazon Facilities in Bahrain Hit Again as Iran Follows Through on Threat, Report SaysGizmodoBlack Hat USAAI BusinessBlack Hat AsiaAI Business1 Artificial Intelligence (AI) Software Stock to Buy Hand Over Fist Before It Soars 62%, According to Dan Ives - The Motley FoolGoogle News: AIGroup Pushing Age Verification Requirements for AI Turns Out to Be Sneakily Backed by OpenAIGizmodoGroup Pushing Age Verification Requirements for AI Turns Out to Be Sneakily Backed by OpenAI - GizmodoGoogle News: OpenAIInside the race to recreate Claude Code and mine its guts for revelationsBusiness InsiderAnthropic Executive Sees Cowork Agent as Bigger Than Claude Code - Bloomberg.comGoogle News: ClaudeAnthropic Executive Sees Cowork Agent as Bigger Than Claude CodeBloomberg TechnologyABAP OOP Design Patterns — Part 2: Factory, Observer, and Decorator Patterns in Real SAP SystemsDEV CommunityWhy Your AI Agent Health Check Is Lying to YouDEV CommunityDeep Dive: Array Internals & Memory LayoutDEV CommunityIllinois Tech computer science researcher honored by IEEE Chicago Section - EurekAlert!Google News: Machine LearningICE Tells Lawmakers It’s Using Spyware in Fight Against FentanylBloomberg TechnologyAmazon Facilities in Bahrain Hit Again as Iran Follows Through on Threat, Report SaysGizmodo

Task Tokens: A Flexible Approach to Adapting Behavior Foundation Models

arXivMarch 30, 202610 min read0 views
Source Quiz

arXiv:2503.22886v2 Announce Type: replace Abstract: Recent advancements in imitation learning have led to transformer-based behavior foundation models (BFMs) that enable multi-modal, human-like control for humanoid agents. While excelling at zero-shot generation of robust behaviors, BFMs often require meticulous prompt engineering for specific tasks, potentially yielding suboptimal results. We introduce "Task Tokens", a method to effectively tailor BFMs to specific tasks while preserving their flexibility. Our approach leverages the transformer architecture of BFMs to learn a new task-specific — Ron Vainshtein, Zohar Rimon, Shie Mannor, Chen Tessler

View PDF HTML (experimental)

Abstract:Recent advancements in imitation learning have led to transformer-based behavior foundation models (BFMs) that enable multi-modal, human-like control for humanoid agents. While excelling at zero-shot generation of robust behaviors, BFMs often require meticulous prompt engineering for specific tasks, potentially yielding suboptimal results. We introduce "Task Tokens", a method to effectively tailor BFMs to specific tasks while preserving their flexibility. Our approach leverages the transformer architecture of BFMs to learn a new task-specific encoder through reinforcement learning, keeping the original BFM frozen. This allows incorporation of user-defined priors, balancing reward design and prompt engineering. By training a task encoder to map observations to tokens, used as additional BFM inputs, we guide performance improvement while maintaining the model's diverse control characteristics. We demonstrate Task Tokens' efficacy across various tasks, including out-of-distribution scenarios, and show their compatibility with other prompting modalities. Our results suggest that Task Tokens offer a promising approach for adapting BFMs to specific control tasks while retaining their generalization capabilities.

Subjects:

Machine Learning (cs.LG); Robotics (cs.RO)

Cite as: arXiv:2503.22886 [cs.LG]

(or arXiv:2503.22886v2 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2503.22886

arXiv-issued DOI via DataCite

Submission history

From: Zohar Rimon [view email] [v1] Fri, 28 Mar 2025 21:28:13 UTC (22,373 KB) [v2] Thu, 26 Mar 2026 20:41:07 UTC (4,987 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Task Tokens…researchpaperarxivmachine-lea…deep-learni…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 177 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers