Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessAn I/O psychologist's rules for stopping AI agents from cutting cornersHacker News AI TopMeshLedger – AI agents hire and pay each other through on-chain escrowHacker News AI TopOpenAI shifts to usage-based pricing for Codex in ChatGPT business plansThe DecoderDenseNet Paper Walkthrough: All ConnectedTowards Data ScienceShow HN: Ckpt – Automatic checkpoints for AI coding sessions with per-step undoHacker News AI TopThe Shell Is the Most Underrated Interface in AIHacker News AI TopOrganization firewall settings for Copilot cloud agentGitHub Copilot Changelogtrunk/9557db4da58adc85b48136aad2e383f7ab6e456e: [xpu] Add deterministic matmul test to test_gemm.py (#179004)PyTorch ReleasesThe Facebook insider building content moderation for the AI eraTechCrunchMemory will consume 30% of hyperscaler data center spending this year, a 4X increase over 2023 — Nvidia gets preferential supply terms well below standard market rates, says analyst firmtomshardware.comQwen3.5 vs Gemma 4: Benchmarks vs real world use?Reddit r/LocalLLaMATop Analyst Hikes Nvidia Target to $220 on AI Chip Demand - GuruFocusGNews AI NVIDIABlack Hat USADark ReadingBlack Hat AsiaAI BusinessAn I/O psychologist's rules for stopping AI agents from cutting cornersHacker News AI TopMeshLedger – AI agents hire and pay each other through on-chain escrowHacker News AI TopOpenAI shifts to usage-based pricing for Codex in ChatGPT business plansThe DecoderDenseNet Paper Walkthrough: All ConnectedTowards Data ScienceShow HN: Ckpt – Automatic checkpoints for AI coding sessions with per-step undoHacker News AI TopThe Shell Is the Most Underrated Interface in AIHacker News AI TopOrganization firewall settings for Copilot cloud agentGitHub Copilot Changelogtrunk/9557db4da58adc85b48136aad2e383f7ab6e456e: [xpu] Add deterministic matmul test to test_gemm.py (#179004)PyTorch ReleasesThe Facebook insider building content moderation for the AI eraTechCrunchMemory will consume 30% of hyperscaler data center spending this year, a 4X increase over 2023 — Nvidia gets preferential supply terms well below standard market rates, says analyst firmtomshardware.comQwen3.5 vs Gemma 4: Benchmarks vs real world use?Reddit r/LocalLLaMATop Analyst Hikes Nvidia Target to $220 on AI Chip Demand - GuruFocusGNews AI NVIDIA
AI NEWS HUBbyEIGENVECTOREigenvector

MIT scientists investigate memorization risk in the age of clinical AI

MIT ML Newsby Alex Ouyang | Abdul Latif Jameel Clinic for Machine Learning in HealthJanuary 5, 20261 min read0 views
Source Quiz

New research demonstrates how AI models can be tested to ensure they don’t cause harm by revealing anonymized patient health data.

What is patient privacy for? The Hippocratic Oath, thought to be one of the earliest and most widely known medical ethics texts in the world, reads: “Whatever I see or hear in the lives of my patients, whether in connection with my professional practice or not, which ought not to be spoken of outside, I will keep secret, as considering all such things to be private.”

As privacy becomes increasingly scarce in the age of data-hungry algorithms and cyberattacks, medicine is one of the few remaining domains where confidentiality remains central to practice, enabling patients to trust their physicians with sensitive information.

But a paper co-authored by MIT researchers investigates how artificial intelligence models trained on de-identified electronic health records (EHRs) can memorize patient-specific information. The work, which was recently presented at the 2025 Conference on Neural Information Processing Systems (NeurIPS), recommends a rigorous testing setup to ensure targeted prompts cannot reveal information, emphasizing that leakage must be evaluated in a health care context to determine whether it meaningfully compromises patient privacy.

Foundation models trained on EHRs should normally generalize knowledge to make better predictions, drawing upon many patient records. But in “memorization,” the model draws upon a singular patient record to deliver its output, potentially violating patient privacy. Notably, foundation models are already known to be prone to data leakage.

“Knowledge in these high-capacity models can be a resource for many communities, but adversarial attackers can prompt a model to extract information on training data,” says Sana Tonekaboni, a postdoc at the Eric and Wendy Schmidt Center at the Broad Institute of MIT and Harvard and first author of the paper. Given the risk that foundation models could also memorize private data, she notes, “this work is a step towards ensuring there are practical evaluation steps our community can take before releasing models.”

To conduct research on the potential risk EHR foundation models could pose in medicine, Tonekaboni approached MIT Associate Professor Marzyeh Ghassemi, who is a principal investigator at the Abdul Latif Jameel Clinic for Machine Learning in Health (Jameel Clinic) and a member of the Computer Science and Artificial Intelligence Lab. Ghassemi, a faculty member in the MIT Department of Electrical Engineering and Computer Science and Institute for Medical Engineering and Science, runs the Healthy ML group, which focuses on robust machine learning in health.

Just how much information does a bad actor need to expose sensitive data, and what are the risks associated with the leaked information? To assess this, the research team developed a series of tests that they hope will lay the groundwork for future privacy evaluations. These tests are designed to measure various types of uncertainty, and assess their practical risk to patients by measuring various tiers of attack possibility.

“We really tried to emphasize practicality here; if an attacker has to know the date and value of a dozen laboratory tests from your record in order to extract information, there is very little risk of harm. If I already have access to that level of protected source data, why would I need to attack a large foundation model for more?” says Ghassemi.

With the inevitable digitization of medical records, data breaches have become more commonplace. In the past 24 months, the U.S. Department of Health and Human Services has recorded 747 data breaches of health information affecting more than 500 individuals, with the majority categorized as hacking/IT incidents.

Patients with unique conditions are especially vulnerable, given how easy it is to pick them out. “Even with de-identified data, it depends on what sort of information you leak about the individual,” Tonekaboni says. “Once you identify them, you know a lot more.”

In their structured tests, the researchers found that the more information the attacker has about a particular patient, the more likely the model is to leak information. They demonstrated how to distinguish model generalization cases from patient-level memorization, to properly assess privacy risk.

The paper also emphasized that some leaks are more harmful than others. For instance, a model revealing a patient’s age or demographics could be characterized as a more benign leakage than the model revealing more sensitive information, like an HIV diagnosis or alcohol abuse.

The researchers note that patients with unique conditions are especially vulnerable given how easy it is to pick them out, which may require higher levels of protection. “Even with de-identified data, it really depends on what sort of information you leak about the individual,” Tonekaboni says. The researchers plan to expand the work to become more interdisciplinary, adding clinicians and privacy experts as well as legal experts.

“There’s a reason our health data is private,” Tonekaboni says. “There’s no reason for others to know about it.”

This work supported by the Eric and Wendy Schmidt Center at the Broad Institute of MIT and Harvard, Wallenberg AI, the Knut and Alice Wallenberg Foundation, the U.S. National Science Foundation (NSF), a Gordon and Betty Moore Foundation award, a Google Research Scholar award, and the AI2050 Program at Schmidt Sciences. Resources used in preparing this research were provided, in part, by the Province of Ontario, the Government of Canada through CIFAR, and companies sponsoring the Vector Institute.

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modelresearch

Knowledge Map

Knowledge Map
TopicsEntitiesSource
MIT scienti…modelresearchMIT ML News

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 132 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models