Don't Stop the Multi-Party! On Generating Synthetic Written Multi-Party Conversations with Constraints
Authors: Nicolò Penzo, Marco Guerini, Bruno Lepri, Goran Glavaš, Sara Tonelli
Abstract: Written Multi-Party Conversations (WMPCs) are widely studied across disciplines, with social media as a primary data source due to their accessibility. However, these datasets raise privacy concerns and often reflect platform-specific properties. For example, interactions between speakers may be limited by rigid platform structures (e.g., threads, tree-like discussions), which yield overly simplistic interaction patterns (e.g., one-to-one "reply-to" links). This work explores the feasibility of generating synthetic WMPCs with instruction-tuned Large Language Models (LLMs) by providing deterministic constraints such as dialogue structure and participants' stance. We investigate two complementary strategies for leveraging LLMs in this context: (i) LLMs as WMPC generators, where we task the LLM with generating a whole WMPC at once, and (ii) LLMs as WMPC parties, where the LLM generates one turn of the conversation at a time (consisting of speaker, addressee, and message), given the conversation history. We next introduce an analytical framework to evaluate compliance with the constraints, content quality, and interaction complexity for both strategies. Finally, we assess the quality of the obtained WMPCs via human and LLM-as-a-judge evaluations. We find stark differences among LLMs, with only some being able to generate high-quality WMPCs. We also find that turn-by-turn generation yields better conformance to constraints and higher linguistic variability than generating WMPCs in one pass. Nonetheless, our structural and qualitative evaluation indicates that both generation strategies can yield high-quality WMPCs.
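As a rough illustration of strategy (ii) and of checking compliance with a structural constraint, the following minimal Python sketch may help. It is not the authors' implementation: the `Turn` record, the `TurnModel` callable (a stand-in for an LLM call), the representation of the dialogue structure as a list of (speaker, addressee) pairs, and the compliance score are all assumptions made for this example.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Turn:
    speaker: str
    addressee: str
    message: str


# Hypothetical stand-in for an LLM call: receives the conversation history plus
# the requested speaker and addressee, and returns one proposed turn.
TurnModel = Callable[[List[Turn], str, str], Turn]


def generate_turn_by_turn(
    structure: List[Tuple[str, str]], model: TurnModel
) -> List[Turn]:
    """Ask the model for one turn at a time, passing the history so far."""
    history: List[Turn] = []
    for speaker, addressee in structure:
        history.append(model(list(history), speaker, addressee))
    return history


def structure_compliance(
    structure: List[Tuple[str, str]], conversation: List[Turn]
) -> float:
    """Fraction of constrained turns whose speaker and addressee match."""
    if not structure:
        return 1.0
    matches = sum(
        1
        for (speaker, addressee), turn in zip(structure, conversation)
        if (turn.speaker, turn.addressee) == (speaker, addressee)
    )
    return matches / len(structure)


def echo_model(history: List[Turn], speaker: str, addressee: str) -> Turn:
    """Toy model that always respects the requested speaker and addressee."""
    return Turn(speaker, addressee, f"turn {len(history) + 1}")
```

Calling `generate_turn_by_turn([("A", "B"), ("B", "A")], echo_model)` returns two turns that follow the requested structure. With a real model in place of `echo_model`, `structure_compliance` would show how often the generated speaker and addressee deviate from the constraint.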
Comments: Accepted at AAAI 2026
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2502.13592 [cs.CL]
(or arXiv:2502.13592v2 [cs.CL] for this version)
https://doi.org/10.48550/arXiv.2502.13592
arXiv-issued DOI via DataCite
Related DOI:
https://doi.org/10.1609/aaai.v40i39.40548
DOI(s) linking to related resources
Submission history
From: Nicolò Penzo
[v1] Wed, 19 Feb 2025 10:10:43 UTC (1,028 KB)
[v2] Fri, 27 Mar 2026 12:43:18 UTC (1,211 KB)