Models model announce integration valuation analysis perspective

From Component Manipulation to System Compromise: Understanding and Detecting Malicious MCP Servers

arXiv cs.SEby [Submitted on 2 Apr 2026]April 3, 20262 min read2 views

🧒Explain Like I'm 5Simple language

Hey there, little explorer! Imagine your favorite robot friend, like a super-smart toy. This toy can talk to other toys to learn new things, right?

Sometimes, naughty people try to trick these talking toys. They send bad instructions, like telling your robot to draw on the walls instead of a nice picture!

Scientists found a new way to understand these tricks better. They looked at all the little parts of the robot's brain that talk to other toys. They even made a special helper named Connor!

Connor is like a super-detective for robots. He watches very carefully to make sure no one is sending bad instructions. If he sees something sneaky, he says, "Uh oh! That's not right!" This helps keep our smart robot friends safe and happy! Yay Connor!

arXiv:2604.01905v1 Announce Type: cross Abstract: The model context protocol (MCP) standardizes how LLMs connect to external tools and data sources, enabling faster integration but introducing new attack vectors. Despite the growing adoption of MCP, existing MCP security studies classify attacks by their observable effects, obscuring how attacks behave across different MCP server components and overlooking multi-component attack chains. Meanwhile, existing defenses are less effective when facing multi-component attacks or previously unknown malicious behaviors. This work presents a component-centric perspective for understanding and detecting malicious MCP servers. First, we build the first component-centric PoC dataset of 114 malicious MCP servers where attacks are achieved as manipulatio

View PDF HTML (experimental)

Abstract:The model context protocol (MCP) standardizes how LLMs connect to external tools and data sources, enabling faster integration but introducing new attack vectors. Despite the growing adoption of MCP, existing MCP security studies classify attacks by their observable effects, obscuring how attacks behave across different MCP server components and overlooking multi-component attack chains. Meanwhile, existing defenses are less effective when facing multi-component attacks or previously unknown malicious behaviors. This work presents a component-centric perspective for understanding and detecting malicious MCP servers. First, we build the first component-centric PoC dataset of 114 malicious MCP servers where attacks are achieved as manipulation over MCP components and their compositions. We evaluate these attacks' effectiveness across two MCP hosts and five LLMs, and uncover that (1) component position shapes attack success rate; and (2) multi-component compositions often outperform single-component attacks by distributing malicious logic. Second, we propose and implement Connor, a two-stage behavioral deviation detector for malicious MCP servers. It first performs pre-execution analysis to detect malicious shell commands and extract each tool's function intent, and then conducts step-wise in-execution analysis to trace each tool's behavioral trajectories and detect deviations from its function intent. Evaluation on our curated dataset indicates that Connor achieves an F1-score of 94.6%, outperforming the state of the art by 8.9% to 59.6%. In real-world detection, Connor identifies two malicious servers.

Subjects:

Cryptography and Security (cs.CR); Software Engineering (cs.SE)

Cite as: arXiv:2604.01905 [cs.CR]

(or arXiv:2604.01905v1 [cs.CR] for this version)

https://doi.org/10.48550/arXiv.2604.01905

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Yiheng Huang [view email] [v1] Thu, 2 Apr 2026 11:22:07 UTC (796 KB)

Original source

arXiv cs.SE

https://arxiv.org/abs/2604.01905

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modelannounceintegration

ReleasesLive

The Spaceballs sequel will be released in April next year

There's finally a release date for the Spaceballs sequel — but before you get too excited, it's a whole year away. As first reported by Deadline , Amazon MGM Studios announced on Friday night that the upcoming Spaceballs movie will hit theaters on April 23, 2027, right around the 40th anniversary of the first film. Several members of the original cast will be reprising their roles, according to Deadline , including Mel Brooks, Rick Moranis, Bill Pullman, George Wynder and Daphne Zuniga. Spaceballs: The Release Date. April 23, 2027. pic.twitter.com/5Xv0BKmf7C — Amazon MGM Studios (@AmazonMGMStudio) April 4, 2026 Whispers of a potential Spaceballs 2 go back a couple of years, but Brooks officially confirmed in an extremely on-brand announcement video last summer that the movie is actually ha

Engadget

1mabout 2 hours ago

Models

Intel Labs Introduces AI Diffusion Model, Generates 360-Degree Images from Text Prompts - intc.com

Intel Labs Introduces AI Diffusion Model, Generates 360-Degree Images from Text Prompts intc.com

GNews AI diffusion

1malmost 3 years ago

ProductsLive

Why I Run 22 Docker Services at Home

Somewhere in my living room, a 2018 gaming PC is running 22 Docker containers, processing 15,000 emails through a local LLM, and managing the finances of a real business. It was never supposed to do any of this. I run a one-person software consultancy in the Netherlands; web development, 3D printing, and consulting. Last year, I started building an AI system to help me manage it all. Eight specialized agents handling email triage, financial tracking, infrastructure monitoring, and scheduling. Every piece of inference runs locally. No cloud APIs touching my private data. This post covers the hardware, what it actually costs, and what I'd do differently if I started over. The Setup: Three Machines, One Mesh Network The entire system runs on three machines connected via Tailscale mesh VPN: do