
Detecting Call Graph Unsoundness without Ground Truth

arXiv cs.SE · Fangtian Zhong, Ollie Wold, Joseph Windmann · April 2, 2026



Abstract: Java static analysis frameworks are commonly compared under the assumption that analysis algorithms and configurations compose monotonically and yield semantically comparable results across tools. In this work, we show that this assumption is fundamentally flawed. We present a large-scale empirical study of semantic consistency within and across four widely used Java static analysis frameworks: Soot, SootUp, WALA, and Doop. Using precision partial orders over analysis algorithms and configurations, we systematically identify violations where increased precision introduces new call-graph edges or amplifies inconsistencies. Our results reveal three key findings. First, algorithmic precision orders frequently break within frameworks due to modern language features such as lambdas, reflection, and native modeling. Second, configuration choices strongly interact with analysis algorithms, producing synergistic failures that exceed the effects of algorithm or configuration changes alone. Third, cross-framework comparisons expose irreconcilable semantic gaps, demonstrating that different frameworks operate over incompatible notions of call-graph ground truth. These findings challenge prevailing evaluation practices in static analysis and highlight the need to reason jointly about algorithms, configurations, and framework semantics when assessing precision and soundness.
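The precision-order violations the abstract describes can be made concrete with a minimal sketch (not from the paper; all function names and edge sets here are hypothetical). A call graph is modelled as a set of (caller, callee) edges; under an over-approximating semantics, an analysis that is "more precise" in the partial order should never report an edge the less precise analysis lacks, so any such edge is a violation:

```python
# Minimal sketch of a precision partial-order check between two call
# graphs, each modelled as a set of (caller, callee) edges.

def precision_violations(less_precise, more_precise):
    """Edges reported only by the supposedly more precise analysis.

    A non-empty result violates the expected partial order: gaining
    precision introduced new call-graph edges instead of removing
    spurious ones.
    """
    return more_precise - less_precise

# Toy edge sets standing in for a coarse analysis (e.g. CHA) versus a
# context-sensitive one; the method names are illustrative only.
coarse_edges = {("A.main", "B.run"), ("A.main", "C.run"), ("B.run", "D.log")}
precise_edges = {("A.main", "B.run"), ("B.run", "D.log"), ("B.run", "E.init")}

violations = precision_violations(coarse_edges, precise_edges)
print(sorted(violations))  # → [('B.run', 'E.init')]
```

In this toy example the "more precise" analysis drops the spurious edge to `C.run` but also introduces an edge to `E.init` that the coarse analysis never saw; the paper reports that exactly this kind of violation arises in practice, often triggered by lambdas, reflection, or native-method modelling.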

Subjects: Software Engineering (cs.SE)

Cite as: arXiv:2604.00885 [cs.SE]

(or arXiv:2604.00885v1 [cs.SE] for this version)

https://doi.org/10.48550/arXiv.2604.00885

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Fangtian Zhong [v1] Wed, 1 Apr 2026 13:32:53 UTC (209 KB)
