
Aligning LLMs with Biomedical Knowledge using Balanced Fine-Tuning

arXiv · [Submitted on 26 Nov 2025 (v1), last revised 27 Mar 2026 (this version, v2)] · March 30, 2026

Authors: Zhenchao Tang, Fang Wang, Haohuai He, Jiale Zhou, Tianxu Lv, Jun Zhu, Shouzhi Chen, Minghao Yang, Yu Wang, Jiayang Wu, Yidong Song, Yaokun Li, Jiehui Huang, Dawei Huang, Zhi Song, Jianhua Yao

Abstract: Aligning Large Language Models (LLMs) with biomedical knowledge requires understanding both concepts and causal mechanisms in scientific reports. Supervised Fine-Tuning (SFT) often fails to capture these logical structures, while Reinforcement Learning (RL) is limited by sparse reward signals. We propose Balanced Fine-Tuning (BFT), a dual-scale post-training method that stabilizes training via confidence-weighted token-level optimization and adaptively emphasizes knowledge-dense hard samples using minimum group confidence. Experiments on medical and biological reasoning benchmarks show that BFT consistently outperforms SFT and achieves competitive or superior performance to specialized systems such as GeneAgent. Beyond improving generative accuracy, BFT enhances the fidelity of LLM-generated biomedical entity descriptions, such that their embeddings produced by standard encoders outperform those from domain-specific biological foundation models. This enables a single post-trained LLM to support both reasoning generation and representation-based biological analysis. Overall, BFT provides a concise and effective framework for aligning LLMs with biomedical knowledge while bridging generative and representational capabilities.
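The abstract names two ingredients of BFT (confidence-weighted token-level optimization and sample emphasis via minimum group confidence) but does not give the formulas. The sketch below is only one possible reading of that description, not the authors' method. Every specific in it is an assumption made for illustration: the function names, the use of the token probability as the token weight, the fixed-size token groups, and the sample weight of one minus the minimum group confidence.

```python
import math


def token_confidences(token_logprobs):
    """Probability the model assigns to each reference token."""
    return [math.exp(lp) for lp in token_logprobs]


def group_confidences(confs, group_size):
    """Mean confidence over consecutive token groups (assumed grouping)."""
    groups = [confs[i:i + group_size] for i in range(0, len(confs), group_size)]
    return [sum(g) / len(g) for g in groups]


def sample_weight(token_logprobs, group_size=4):
    """Assumed sample-level weight: a lower minimum group confidence
    marks a harder sample and yields a larger weight."""
    confs = token_confidences(token_logprobs)
    return 1.0 - min(group_confidences(confs, group_size))


def token_weighted_nll(token_logprobs):
    """Assumed token-level term: each token's negative log-likelihood
    is scaled by the model's confidence in that token."""
    confs = token_confidences(token_logprobs)
    weighted = [c * -lp for c, lp in zip(confs, token_logprobs)]
    return sum(weighted) / len(weighted)


def balanced_loss(batch_logprobs, group_size=4):
    """Combine both scales: sample-weighted average of token-weighted losses."""
    weights = [sample_weight(lps, group_size) for lps in batch_logprobs]
    losses = [token_weighted_nll(lps) for lps in batch_logprobs]
    total = sum(weights) or 1.0  # guard against an all-zero weight sum
    return sum(w * l for w, l in zip(weights, losses)) / total


# Hypothetical per-token log-probabilities for an easy and a hard sample.
easy = [-0.1, -0.1, -0.1, -0.1]
hard = [-2.0, -2.0, -2.0, -2.0]
print(sample_weight(easy), sample_weight(hard), balanced_loss([easy, hard]))
```

In an actual training loop the confidence weights would typically be treated as constants (detached from the gradient) so that only the log-likelihood term is optimized; that choice is likewise an assumption here rather than something the abstract states.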

Subjects: Machine Learning (cs.LG); Artificial Intelligence (cs.AI)

Cite as: arXiv:2511.21075 [cs.LG]

(or arXiv:2511.21075v2 [cs.LG] for this version)

https://doi.org/10.48550/arXiv.2511.21075

arXiv-issued DOI via DataCite

Submission history

From: Zhenchao Tang
[v1] Wed, 26 Nov 2025 05:34:26 UTC (5,636 KB)
[v2] Fri, 27 Mar 2026 03:36:42 UTC (4,721 KB)
