Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechGoogle releases Gemma 4 under Apache 2.0 — and that license change may matter more than benchmarksVentureBeat AIOpenAI acquires TBPN - OpenAIGoogle News: OpenAIOpenAI just bought TBPNThe Verge AIOpenAI just bought TBPN - The VergeGoogle News: OpenAIExclusive | OpenAI Buys Tech-Industry Talk Show TBPN - WSJGoogle News: OpenAIPrediction: The $700 Billion Artificial Intelligence (AI) Capex Boom Will Create the Best Buying Opportunity of 2026 for These 3 Stocks - The Motley FoolGoogle News: AIp-e-w/gemma-4-E2B-it-heretic-ara: Gemma 4's defenses shredded by Heretic's new ARA method 90 minutes after the official releaseReddit r/LocalLLaMAAI startup trains Chinese humanoid robots on Japanese hospitality - Nikkei AsiaGoogle News - AI roboticsFrom Assistant to Actor: What the Rise of Agentic AI Means for Your Business - Morgan LewisGoogle News: Generative AIIndia AI Startup Sarvam Raises Funds at $1.5 Billion ValuationBloomberg TechnologyBlack Hat USADark ReadingBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechGoogle releases Gemma 4 under Apache 2.0 — and that license change may matter more than benchmarksVentureBeat AIOpenAI acquires TBPN - OpenAIGoogle News: OpenAIOpenAI just bought TBPNThe Verge AIOpenAI just bought TBPN - The VergeGoogle News: OpenAIExclusive | OpenAI Buys Tech-Industry Talk Show TBPN - WSJGoogle News: OpenAIPrediction: The $700 Billion Artificial Intelligence (AI) Capex Boom Will Create the Best Buying Opportunity of 2026 for These 3 Stocks - The Motley FoolGoogle News: AIp-e-w/gemma-4-E2B-it-heretic-ara: Gemma 4's defenses shredded by Heretic's new ARA method 90 minutes after the official releaseReddit r/LocalLLaMAAI startup trains Chinese humanoid robots on Japanese hospitality - Nikkei AsiaGoogle News - AI roboticsFrom Assistant to Actor: What the Rise of Agentic AI Means for Your Business - Morgan LewisGoogle News: Generative AIIndia AI Startup Sarvam Raises Funds at $1.5 Billion ValuationBloomberg Technology
AI NEWS HUBbyEIGENVECTOREigenvector

Clinical named entity recognition in the Portuguese language: a benchmark of modern BERT models and LLMs

arXivMarch 30, 202610 min read0 views
Source Quiz

arXiv:2603.26510v1 Announce Type: new Abstract: Clinical notes contain valuable unstructured information. Named entity recognition (NER) enables the automatic extraction of medical concepts; however, benchmarks for Portuguese remain scarce. In this study, we aimed to evaluate BERT-based models and large language models (LLMs) for clinical NER in Portuguese and to test strategies for addressing multilabel imbalance. We compared BioBERTpt, BERTimbau, ModernBERT, and mmBERT with LLMs such as GPT-5 and Gemini-2.5, using the public SemClinBr corpus and a private breast cancer dataset. Models were t — Vinicius Anjos de Almeida, Sandro Saorin da Silva, Josimar Chire, Leonardo Vicenzi, N\'icolas Henrique Borges, Helena Kociolek, Sarah Miri\~a de Castro Rocha, Frederico Nassif Gomes, J\'ulia Cristina Ferreira, Oge Marques, Lucas Emanuel Silva e Oliveira

Authors:Vinicius Anjos de Almeida, Sandro Saorin da Silva, Josimar Chire, Leonardo Vicenzi, Nícolas Henrique Borges, Helena Kociolek, Sarah Miriã de Castro Rocha, Frederico Nassif Gomes, Júlia Cristina Ferreira, Oge Marques, Lucas Emanuel Silva e Oliveira

View PDF HTML (experimental)

Abstract:Clinical notes contain valuable unstructured information. Named entity recognition (NER) enables the automatic extraction of medical concepts; however, benchmarks for Portuguese remain scarce. In this study, we aimed to evaluate BERT-based models and large language models (LLMs) for clinical NER in Portuguese and to test strategies for addressing multilabel imbalance. We compared BioBERTpt, BERTimbau, ModernBERT, and mmBERT with LLMs such as GPT-5 and Gemini-2.5, using the public SemClinBr corpus and a private breast cancer dataset. Models were trained under identical conditions and evaluated using precision, recall, and F1-score. Iterative stratification, weighted loss, and oversampling were explored to mitigate class imbalance. The mmBERT-base model achieved the best performance (micro F1 = 0.76), outperforming all other models. Iterative stratification improved class balance and overall performance. Multilingual BERT models, particularly mmBERT, perform strongly for Portuguese clinical NER and can run locally with limited computational resources. Balanced data-splitting strategies further enhance performance.

Comments: Under peer review. GitHub: this https URL

Subjects:

Computation and Language (cs.CL)

Cite as: arXiv:2603.26510 [cs.CL]

(or arXiv:2603.26510v1 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2603.26510

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Vinicius Anjos De Almeida [view email] [v1] Fri, 27 Mar 2026 15:22:07 UTC (163 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Clinical na…researchpaperarxivnlplanguage-mo…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 156 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!