Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessApple turns 50: 8 of the company’s biggest tech milestonesSilicon RepublicI Built an AI Agent That Can Write Its Own Tools When It Gets StuckDEV CommunityBuilding a "Soft Sensor" for Cement Kilns: Predicting Control Levers with PythonDEV CommunityWe Traced One Query Through Perplexity’s Entire Stack in Cohort – Here’s What Actually Happens in 3 SecondsDEV CommunityAgent Self-Discovery: How AI Agents Find Their Own WalletsDEV CommunityYour content pipeline is lying to you, and in regulated software, that's a serious problemDEV CommunityDiffusion-based AI model successfully trained in electroplatingPhys.org AIClaude Code hooks: how to intercept every tool call before it runsDEV CommunityHow I built a browser-based video editor with FFmpeg.wasm (no backend, no server costs)DEV CommunityWhy We Built an API for Spanish Fiscal ID Validation Instead of Just Implementing ItDEV CommunityA technical deep-dive into building APEX: an autonomous AI operations system on OpenClawDEV CommunityBest Amazon Spring Sale laptop deals 2026ZDNet AIBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessApple turns 50: 8 of the company’s biggest tech milestonesSilicon RepublicI Built an AI Agent That Can Write Its Own Tools When It Gets StuckDEV CommunityBuilding a "Soft Sensor" for Cement Kilns: Predicting Control Levers with PythonDEV CommunityWe Traced One Query Through Perplexity’s Entire Stack in Cohort – Here’s What Actually Happens in 3 SecondsDEV CommunityAgent Self-Discovery: How AI Agents Find Their Own WalletsDEV CommunityYour content pipeline is lying to you, and in regulated software, that's a serious problemDEV CommunityDiffusion-based AI model successfully trained in electroplatingPhys.org AIClaude Code hooks: how to intercept every tool call before it runsDEV CommunityHow I built a browser-based video editor with FFmpeg.wasm (no backend, no server costs)DEV CommunityWhy We Built an API for Spanish Fiscal ID Validation Instead of Just Implementing ItDEV CommunityA technical deep-dive into building APEX: an autonomous AI operations system on OpenClawDEV CommunityBest Amazon Spring Sale laptop deals 2026ZDNet AI

Semantic Labeling for Third-Party Cybersecurity Risk Assessment: A Semi-Supervised Approach to Intent-Aware Question Retrieval

arXiv cs.CRby Ali Nour Eldin, Mohamed Sellami, Mehdi Acheli, Walid Gaaloul, Julien SteunouApril 1, 20262 min read0 views
Source Quiz

arXiv:2602.10149v3 Announce Type: replace Abstract: Third-Party Risk Assessment (TPRA) relies on large repositories of cybersecurity compliance questions used to assess external suppliers against standards such as ISO/IEC 27001 and NIST. In practice, not all questions are relevant for a specific supplier and selecting questions for a given assessment context remains a manual and time-consuming task. Existing question retrieval approaches based on lexical or semantic similarity can identify topically related questions, but they often fail to capture the underlying assessment intent, including control domain and evaluation scope. To address this limitation, we investigate whether an explicit semantic label space can improve intent-aware TPRA question selection. In particular, we separate lab

View PDF HTML (experimental)

Abstract:Third-Party Risk Assessment (TPRA) relies on large repositories of cybersecurity compliance questions used to assess external suppliers against standards such as ISO/IEC 27001 and NIST. In practice, not all questions are relevant for a specific supplier and selecting questions for a given assessment context remains a manual and time-consuming task. Existing question retrieval approaches based on lexical or semantic similarity can identify topically related questions, but they often fail to capture the underlying assessment intent, including control domain and evaluation scope. To address this limitation, we investigate whether an explicit semantic label space can improve intent-aware TPRA question selection. In particular, we separate label space discovery from large-scale label assignment. We start by discovering overlapping clusters of semantically similar questions and then exploit LLMs to assign unique labels for each cluster. Second, we propagate labels through k-nearest neighbors (kNN) for a larger-scale question annotation. Question retrieval is finally achieved by similarity measure of the query with respect to the extracted labels instead of the questions themselves. This reduces repeated LLM calls while preserving label consistency. Experimental results show that the proposed semi-supervised framework reduces labeling cost and runtime compared with per-question LLM annotation while maintaining label quality and improving efficiency. Furthermore, label-based retrieval achieves better alignment with cybersecurity control domains and assessment scope than similarity-based retrieval, highlighting the value of semantic labels as an intermediate representation.

Subjects:

Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI)

Cite as: arXiv:2602.10149 [cs.CR]

(or arXiv:2602.10149v3 [cs.CR] for this version)

https://doi.org/10.48550/arXiv.2602.10149

arXiv-issued DOI via DataCite

Submission history

From: Ali Nour Eldin [view email] [v1] Mon, 9 Feb 2026 18:36:50 UTC (234 KB) [v2] Wed, 4 Mar 2026 14:54:20 UTC (225 KB) [v3] Tue, 31 Mar 2026 09:06:12 UTC (219 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

announcevaluationcompliance

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Semantic La…announcevaluationcompliancealignmentarxivarXiv cs.CR

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 151 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models