MiNER: A Two-Stage Pipeline for Metadata Extraction from Municipal Meeting Minutes
arXiv:2602.00316v3 Announce Type: replace Abstract: Municipal meeting minutes are official documents of local governance, exhibiting heterogeneous formats and writing styles. Effective information retrieval (IR) requires identifying metadata such as meeting number, date, location, participants, and start/end times, elements that are rarely standardized or easy to extract automatically. Existing named entity recognition (NER) models are ill-suited to this task, as they are not adapted to such domain-specific categories. In this paper, we propose a two-stage pipeline for metadata extraction from — Rodrigo Batista, Lu\'is Filipe Cunha, Purifica\c{c}\~ao Silvano, Nuno Guimar\~aes, Al\'ipio Jorge, Evelin Amorim, Ricardo Campos
View PDF HTML (experimental)
Abstract:Municipal meeting minutes are official documents of local governance, exhibiting heterogeneous formats and writing styles. Effective information retrieval (IR) requires identifying metadata such as meeting number, date, location, participants, and start/end times, elements that are rarely standardized or easy to extract automatically. Existing named entity recognition (NER) models are ill-suited to this task, as they are not adapted to such domain-specific categories. In this paper, we propose a two-stage pipeline for metadata extraction from municipal minutes. First, a question answering (QA) model identifies the opening and closing text segments containing metadata. Transformer-based models (BERTimbau and XLM-RoBERTa with and without a CRF layer) are then applied for fine-grained entity extraction and enhanced through deslexicalization. To evaluate our proposed pipeline, we benchmark both open-weight (Phi) and closed-weight (Gemini) LLMs, assessing predictive performance, inference cost, and carbon footprint. Our results demonstrate strong in-domain performance, better than larger general-purpose LLMs. However, cross-municipality evaluation reveals reduced generalization reflecting the variability and linguistic complexity of municipal records. This work establishes the first benchmark for metadata extraction from municipal meeting minutes, providing a solid foundation for future research in this domain.
Subjects:
Computation and Language (cs.CL)
Cite as: arXiv:2602.00316 [cs.CL]
(or arXiv:2602.00316v3 [cs.CL] for this version)
https://doi.org/10.48550/arXiv.2602.00316
arXiv-issued DOI via DataCite
Journal reference: Advances in Information Retrieval. ECIR 2026. Lecture Notes in Computer Science, vol 16484. Springer, Cham
Related DOI:
https://doi.org/10.1007/978-3-032-21300-6_33
DOI(s) linking to related resources
Submission history
From: Nuno Guimaraes [view email] [v1] Fri, 30 Jan 2026 21:09:13 UTC (23 KB) [v2] Mon, 9 Feb 2026 10:04:48 UTC (23 KB) [v3] Thu, 26 Mar 2026 19:56:25 UTC (23 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
researchpaperarxiv
When Labels Are Scarce: A Systematic Mapping of Label-Efficient Code Vulnerability Detection
arXiv:2604.00079v1 Announce Type: cross Abstract: Machine-learning-based code vulnerability detection (CVD) has progressed rapidly, from deep program representations to pretrained code models and LLM-centered pipelines. Yet dependable vulnerability labeling remains expensive, noisy, and uneven across projects, languages, and CWE types, motivating approaches that reduce reliance on human labeling. This survey maps these approaches, synthesizing five paradigm families and the mechanisms they use. It connects mechanisms to token, graph, hybrid, and knowledgebased representations, and consolidates evaluation and reporting axes that limit comparison (label-budget specification, compute/cost assumptions, leakage, and granularity mismatches). A Design Map and constraintfirst Decision Guide distil

Harnessing Hype to Teach Empirical Thinking: An Experience With AI Coding Assistants
arXiv:2604.01110v1 Announce Type: new Abstract: Software engineering students often struggle to appreciate empirical methods and hypothesis-driven inquiry, especially when taught in theoretical terms. This experience report explores whether grounding empirical learning in hype-driven technologies can make these concepts more accessible and engaging. We conducted a one-semester seminar framed around the currently popular topic of AI coding assistants, which attracted unusually high student interest. The course combined hands-on sessions using AI coding assistants with small, student-designed empirical studies. Classroom observations and survey responses suggest that the hype topic sparked curiosity and critical thinking. Students engaged with the AI coding assistants while questioning their
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Research Papers

Leveraging Commit Size Context and Hyper Co-Change Graph Centralities for Defect Prediction
arXiv:2604.01132v1 Announce Type: new Abstract: File-level defect prediction models traditionally rely on product and process metrics. While process metrics effectively complement product metrics, they often overlook commit size the number of files changed per commit despite its strong association with software quality. Network centrality measures on dependency graphs have also proven to be valuable product level indicators. Motivated by this, we first redefine process metrics as commit size aware process metric vectors, transforming conventional scalar measures into 100 dimensional profiles that capture the distribution of changes across commit size strata. We then model change history as a hyper co change graph, where hyperedges naturally encode commit-size semantics. Vector centralities

Detecting Call Graph Unsoundness without Ground Truth
arXiv:2604.00885v1 Announce Type: new Abstract: Java static analysis frameworks are commonly compared under the assumption that analysis algorithms and configurations compose monotonically and yield semantically comparable results across tools. In this work, we show that this assumption is fundamentally flawed. We present a large-scale empirical study of semantic consistency within and across four widely used Java static analysis frameworks: Soot, SootUp, WALA, and Doop. Using precision partial orders over analysis algorithms and configurations, we systematically identify violations where increased precision introduces new call-graph edges or amplifies inconsistencies. Our results reveal three key findings. First, algorithmic precision orders frequently break within frameworks due to moder

Containing the Reproducibility Gap: Automated Repository-Level Containerization for Scholarly Jupyter Notebooks
arXiv:2604.01072v1 Announce Type: new Abstract: Computational reproducibility is fundamental to trustworthy science, yet remains difficult to achieve in practice across various research workflows, including Jupyter notebooks published alongside scholarly articles. Environment drift, undocumented dependencies and implicit execution assumptions frequently prevent independent re-execution of published research. Despite existing reproducibility guidelines, scalable and systematic infrastructure for automated assessment remains limited. We present an automated, web-oriented reproducibility engineering pipeline that reconstructs and evaluates repository-level execution environments for scholarly notebooks. The system performs dependency inference, automated container generation, and isolated exe
Single-Waveguide Multiple-Pinching-Antenna Systems: OMA versus NOMA
arXiv:2604.00588v1 Announce Type: new Abstract: This paper investigates the performance of a pinching-antenna (PA) system with a signal waveguide and multiple pinching antennas to serve users distributed across multiple rooms. The performance of the system is evaluated through a comparative analysis under both orthogonal multiple access (OMA) and non-orthogonal multiple access (NOMA) schemes. Specifically, this paper derives closed-form expressions for the outage probability (OP) and ergodic rate (ER) in each scheme. Furthermore, asymptotic analyses are conducted to characterize the system behavior in the high signal-to-noise ratio (SNR) regime. Extensive Monte Carlo simulations are utilized to validate the accuracy of the analytical derivations. The comparative results can be summarized a
Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!