
Performance Evaluation of LLMs in Automated RDF Knowledge Graph Generation

arXiv cs.IR · [Submitted on 6 Feb 2026] · April 1, 2026


Abstract: Cloud systems generate large, heterogeneous log data containing critical infrastructure, application, and security information. Transforming these logs into RDF triples enables their integration into knowledge graphs, improving interpretability, root-cause analysis, and cross-service reasoning beyond what raw logs allow. Large Language Models (LLMs) offer a promising approach to automate RDF knowledge graph generation; however, their effectiveness on complex cloud logs remains largely unexplored. In this paper, we evaluate multiple LLM architectures and prompting strategies for automated RDF extraction using a controlled framework with two pipelines for systematically processing semi-structured log data. The extraction pipeline integrates multiple LLMs to identify relevant entities and relationships, automatically generating subject-predicate-object triples. These outputs are evaluated using a dedicated validation pipeline with both syntactic and semantic metrics to assess accuracy, completeness, and quality. Due to the lack of public ground-truth datasets, we created a reference Log-to-KG dataset from OpenStack logs using manual annotation and ontology-driven methods, enabling an objective baseline. Our analysis shows that Few-Shot learning is the most effective strategy, with Llama achieving a 99.35% F1 score and 100% valid RDF output. Qwen, NuExtract, and Gemma also perform well under Few-Shot prompting, and Chain-of-Thought approaches maintain similar accuracy. One-Shot prompting offers a lighter but effective alternative, while Zero-Shot and advanced strategies such as Tree-of-Thought, Self-Critique, and Generate-Multiple perform substantially worse. These results highlight the importance of contextual examples and prompt design for accurate RDF extraction and reveal model-specific limitations across LLM architectures.
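To make the two pipelines in the abstract more concrete, here is a minimal Python sketch of a few-shot prompt builder and a set-based F1 score over extracted triples. Everything in it is an assumption for illustration: the function names, the prompt wording, the `ex:` identifiers, the sample log lines, and the exact-match scoring are hypothetical and are not taken from the paper, which does not spell out its prompts or metric definitions in the abstract.

```python
from typing import Iterable, List, Tuple

# A triple is (subject, predicate, object); all identifiers below are hypothetical.
Triple = Tuple[str, str, str]


def build_few_shot_prompt(examples: List[Tuple[str, List[Triple]]], log_line: str) -> str:
    """Assemble a prompt: an instruction, worked log-to-triple examples, then the target log."""
    parts = ["Extract RDF triples (subject, predicate, object) from the log line."]
    for example_log, triples in examples:
        parts.append(f"Log: {example_log}")
        for s, p, o in triples:
            parts.append(f"Triple: {s} {p} {o} .")
    parts.append(f"Log: {log_line}")
    parts.append("Triple:")
    return "\n".join(parts)


def triple_f1(predicted: Iterable[Triple], reference: Iterable[Triple]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) using exact matching of triples as sets."""
    pred, ref = set(predicted), set(reference)
    if not pred or not ref:
        return 0.0, 0.0, 0.0
    true_positives = len(pred & ref)
    precision = true_positives / len(pred)
    recall = true_positives / len(ref)
    if true_positives == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Hypothetical example data (not from the paper's dataset).
examples = [(
    "INFO nova.compute instance i-01 started on host node-3",
    [("ex:i-01", "ex:hasState", "ex:Started"), ("ex:i-01", "ex:runsOn", "ex:node-3")],
)]
prompt = build_few_shot_prompt(examples, "INFO nova.compute instance i-02 stopped on host node-5")

reference = [
    ("ex:i-02", "ex:hasState", "ex:Stopped"),
    ("ex:i-02", "ex:runsOn", "ex:node-5"),
    ("ex:i-02", "rdf:type", "ex:Instance"),
]
predicted = [
    ("ex:i-02", "ex:hasState", "ex:Stopped"),
    ("ex:i-02", "ex:runsOn", "ex:node-5"),
    ("ex:i-02", "ex:runsOn", "ex:node-3"),  # a wrong triple the model might emit
]
precision, recall, f1 = triple_f1(predicted, reference)
print(prompt)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Exact set matching is the strictest choice and ignores near-miss triples; a semantic metric like the one the abstract mentions would presumably need some notion of equivalence between IRIs or literals, which this sketch does not attempt.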

Comments: submitted to journal

Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Distributed, Parallel, and Cluster Computing (cs.DC)

Cite as: arXiv:2603.29878 [cs.IR] (or arXiv:2603.29878v1 [cs.IR] for this version)

https://doi.org/10.48550/arXiv.2603.29878

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Ionut Anghel
[v1] Fri, 6 Feb 2026 06:30:35 UTC (1,170 KB)
