Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessHow AI and Alternative Data Are Finally Making Germany's Hidden Champions Accessible to Global InvestorsDev.to AIThe Simple Truth About AI Agent RevenueDev.to AIAI Transformation in German SMEs: McKinsey Data Shows Up to 10x ROI from Strategic AI IntegrationDev.to AIAutomating Your Urban Farm with AI: From Guesswork to PrecisionDev.to AIThe Real Ceiling in Claude Code's Memory System (It’s Not the 200-Line Cap)Dev.to AIThe Invisible Rhythms of the Siuntio FortDev.to AIExploring RAG Embedding Techniques in DepthDev.to AIHow I Built a Multi-Agent Geopolitical Simulator with FastAPI + LiteLLMDev.to AI90% людей используют нейросети как поисковик. И проигрывают.Dev.to AII Let AI Coding Agents Build My Side Projects for a Month — Here's My Honest TakeDev.to AINvidia vs. CoreWeave: The Better AI Supercycle Stock Might Surprise You - AOL.comGNews AI NVIDIANvidia Partner Hon Hai’s Sales Meet Estimates on Solid AI DemandBloomberg TechnologyBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessHow AI and Alternative Data Are Finally Making Germany's Hidden Champions Accessible to Global InvestorsDev.to AIThe Simple Truth About AI Agent RevenueDev.to AIAI Transformation in German SMEs: McKinsey Data Shows Up to 10x ROI from Strategic AI IntegrationDev.to AIAutomating Your Urban Farm with AI: From Guesswork to PrecisionDev.to AIThe Real Ceiling in Claude Code's Memory System (It’s Not the 200-Line Cap)Dev.to AIThe Invisible Rhythms of the Siuntio FortDev.to AIExploring RAG Embedding Techniques in DepthDev.to AIHow I Built a Multi-Agent Geopolitical Simulator with FastAPI + LiteLLMDev.to AI90% людей используют нейросети как поисковик. И проигрывают.Dev.to AII Let AI Coding Agents Build My Side Projects for a Month — Here's My Honest TakeDev.to AINvidia vs. CoreWeave: The Better AI Supercycle Stock Might Surprise You - AOL.comGNews AI NVIDIANvidia Partner Hon Hai’s Sales Meet Estimates on Solid AI DemandBloomberg Technology
AI NEWS HUBbyEIGENVECTOREigenvector

Amalgam: Hybrid LLM-PGM Synthesis Algorithm for Accuracy and Realism

arXivby [Submitted on 28 Mar 2026]March 31, 20261 min read1 views
Source Quiz

arXiv:2603.27254v1 Announce Type: cross Abstract: To generate synthetic datasets, e.g., in domains such as healthcare, the literature proposes approaches of two main types: Probabilistic Graphical Models (PGMs) and Deep Learning models, such as LLMs. While PGMs produce synthetic data that can be used for advanced analytics, they do not support complex schemas and datasets. LLMs on the other hand, support complex schemas but produce skewed dataset distributions, which are less useful for advanced analytics. In this paper, we therefore present Amalgam, a hybrid LLM-PGM data synthesis algorithm s — Antheas Kapenekakis, Bent Thomsen, Katja Hose, Michele Albano

View PDF

Abstract:To generate synthetic datasets, e.g., in domains such as healthcare, the literature proposes approaches of two main types: Probabilistic Graphical Models (PGMs) and Deep Learning models, such as LLMs. While PGMs produce synthetic data that can be used for advanced analytics, they do not support complex schemas and datasets. LLMs on the other hand, support complex schemas but produce skewed dataset distributions, which are less useful for advanced analytics. In this paper, we therefore present Amalgam, a hybrid LLM-PGM data synthesis algorithm supporting both advanced analytics, realism, and tangible privacy properties. We show that Amalgam synthesizes data with an average 91 % $\chi^2 P$ value and scores 3.8/5 for realism using our proposed metric, where state-of-the-art is 3.3 and real data is 4.7.

Subjects:

Databases (cs.DB); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.27254 [cs.DB]

(or arXiv:2603.27254v1 [cs.DB] for this version)

https://doi.org/10.48550/arXiv.2603.27254

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Antheas Kapenekakis [view email] [v1] Sat, 28 Mar 2026 12:10:46 UTC (202 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Amalgam: Hy…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 181 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers