Research Papers research paper arxiv nlp language-models

MiNER: A Two-Stage Pipeline for Metadata Extraction from Municipal Meeting Minutes

arXivMarch 30, 202610 min read0 views

arXiv:2602.00316v3 Announce Type: replace Abstract: Municipal meeting minutes are official documents of local governance, exhibiting heterogeneous formats and writing styles. Effective information retrieval (IR) requires identifying metadata such as meeting number, date, location, participants, and start/end times, elements that are rarely standardized or easy to extract automatically. Existing named entity recognition (NER) models are ill-suited to this task, as they are not adapted to such domain-specific categories. In this paper, we propose a two-stage pipeline for metadata extraction from — Rodrigo Batista, Lu\'is Filipe Cunha, Purifica\c{c}\~ao Silvano, Nuno Guimar\~aes, Al\'ipio Jorge, Evelin Amorim, Ricardo Campos

View PDF HTML (experimental)

Abstract:Municipal meeting minutes are official documents of local governance, exhibiting heterogeneous formats and writing styles. Effective information retrieval (IR) requires identifying metadata such as meeting number, date, location, participants, and start/end times, elements that are rarely standardized or easy to extract automatically. Existing named entity recognition (NER) models are ill-suited to this task, as they are not adapted to such domain-specific categories. In this paper, we propose a two-stage pipeline for metadata extraction from municipal minutes. First, a question answering (QA) model identifies the opening and closing text segments containing metadata. Transformer-based models (BERTimbau and XLM-RoBERTa with and without a CRF layer) are then applied for fine-grained entity extraction and enhanced through deslexicalization. To evaluate our proposed pipeline, we benchmark both open-weight (Phi) and closed-weight (Gemini) LLMs, assessing predictive performance, inference cost, and carbon footprint. Our results demonstrate strong in-domain performance, better than larger general-purpose LLMs. However, cross-municipality evaluation reveals reduced generalization reflecting the variability and linguistic complexity of municipal records. This work establishes the first benchmark for metadata extraction from municipal meeting minutes, providing a solid foundation for future research in this domain.

Subjects:

Computation and Language (cs.CL)

Cite as: arXiv:2602.00316 [cs.CL]

(or arXiv:2602.00316v3 [cs.CL] for this version)

https://doi.org/10.48550/arXiv.2602.00316

arXiv-issued DOI via DataCite

Journal reference: Advances in Information Retrieval. ECIR 2026. Lecture Notes in Computer Science, vol 16484. Springer, Cham

DOI(s) linking to related resources

Submission history

From: Nuno Guimaraes [view email] [v1] Fri, 30 Jan 2026 21:09:13 UTC (23 KB) [v2] Mon, 9 Feb 2026 10:04:48 UTC (23 KB) [v3] Thu, 26 Mar 2026 19:56:25 UTC (23 KB)

Original source

arXiv

https://arxiv.org/abs/2602.00316

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

ModelsRecent

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models WSJ

Google News: LLM

1m1 day ago

ModelsFresh

When Labels Are Scarce: A Systematic Mapping of Label-Efficient Code Vulnerability Detection

arXiv:2604.00079v1 Announce Type: cross Abstract: Machine-learning-based code vulnerability detection (CVD) has progressed rapidly, from deep program representations to pretrained code models and LLM-centered pipelines. Yet dependable vulnerability labeling remains expensive, noisy, and uneven across projects, languages, and CWE types, motivating approaches that reduce reliance on human labeling. This survey maps these approaches, synthesizing five paradigm families and the mechanisms they use. It connects mechanisms to token, graph, hybrid, and knowledgebased representations, and consolidates evaluation and reporting axes that limit comparison (label-budget specification, compute/cost assumptions, leakage, and granularity mismatches). A Design Map and constraintfirst Decision Guide distil

arXiv cs.SE

1mabout 4 hours ago

ProductsFresh

Harnessing Hype to Teach Empirical Thinking: An Experience With AI Coding Assistants

arXiv:2604.01110v1 Announce Type: new Abstract: Software engineering students often struggle to appreciate empirical methods and hypothesis-driven inquiry, especially when taught in theoretical terms. This experience report explores whether grounding empirical learning in hype-driven technologies can make these concepts more accessible and engaging. We conducted a one-semester seminar framed around the currently popular topic of AI coding assistants, which attracted unusually high student interest. The course combined hands-on sessions using AI coding assistants with small, student-designed empirical studies. Classroom observations and survey responses suggest that the hype topic sparked curiosity and critical thinking. Students engaged with the AI coding assistants while questioning their

arXiv cs.SE

1mabout 4 hours ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 301 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

More in Research Papers

Research PapersFresh

Leveraging Commit Size Context and Hyper Co-Change Graph Centralities for Defect Prediction

arXiv:2604.01132v1 Announce Type: new Abstract: File-level defect prediction models traditionally rely on product and process metrics. While process metrics effectively complement product metrics, they often overlook commit size the number of files changed per commit despite its strong association with software quality. Network centrality measures on dependency graphs have also proven to be valuable product level indicators. Motivated by this, we first redefine process metrics as commit size aware process metric vectors, transforming conventional scalar measures into 100 dimensional profiles that capture the distribution of changes across commit size strata. We then model change history as a hyper co change graph, where hyperedges naturally encode commit-size semantics. Vector centralities

arXiv cs.SE

1mabout 4 hours ago

Research PapersFresh

Detecting Call Graph Unsoundness without Ground Truth

arXiv:2604.00885v1 Announce Type: new Abstract: Java static analysis frameworks are commonly compared under the assumption that analysis algorithms and configurations compose monotonically and yield semantically comparable results across tools. In this work, we show that this assumption is fundamentally flawed. We present a large-scale empirical study of semantic consistency within and across four widely used Java static analysis frameworks: Soot, SootUp, WALA, and Doop. Using precision partial orders over analysis algorithms and configurations, we systematically identify violations where increased precision introduces new call-graph edges or amplifies inconsistencies. Our results reveal three key findings. First, algorithmic precision orders frequently break within frameworks due to moder

arXiv cs.SE

1mabout 4 hours ago

Research PapersFresh

Containing the Reproducibility Gap: Automated Repository-Level Containerization for Scholarly Jupyter Notebooks

arXiv:2604.01072v1 Announce Type: new Abstract: Computational reproducibility is fundamental to trustworthy science, yet remains difficult to achieve in practice across various research workflows, including Jupyter notebooks published alongside scholarly articles. Environment drift, undocumented dependencies and implicit execution assumptions frequently prevent independent re-execution of published research. Despite existing reproducibility guidelines, scalable and systematic infrastructure for automated assessment remains limited. We present an automated, web-oriented reproducibility engineering pipeline that reconstructs and evaluates repository-level execution environments for scholarly notebooks. The system performs dependency inference, automated container generation, and isolated exe

arXiv cs.SE

2mabout 4 hours ago

Research PapersFresh

Single-Waveguide Multiple-Pinching-Antenna Systems: OMA versus NOMA

arXiv:2604.00588v1 Announce Type: new Abstract: This paper investigates the performance of a pinching-antenna (PA) system with a signal waveguide and multiple pinching antennas to serve users distributed across multiple rooms. The performance of the system is evaluated through a comparative analysis under both orthogonal multiple access (OMA) and non-orthogonal multiple access (NOMA) schemes. Specifically, this paper derives closed-form expressions for the outage probability (OP) and ergodic rate (ER) in each scheme. Furthermore, asymptotic analyses are conducted to characterize the system behavior in the high signal-to-noise ratio (SNR) regime. Extensive Monte Carlo simulations are utilized to validate the accuracy of the analytical derivations. The comparative results can be summarized a

arXiv eess.SP

2mabout 4 hours ago