DinoDental: Benchmarking DINOv3 as a Unified Vision Encoder for Dental Image Analysis

arXiv, March 31, 2026

arXiv:2603.28297v1, Announce Type: new

Authors: Kun Tang, Xinquan Yang, Mianjie Zheng, Xuefen Liu, Xuguang Li, Xiaoqi Guo, Ruihan Chen, Linlin Shen, He Meng

Abstract: The scarcity and high cost of expert annotations in dental imaging present a significant challenge for the development of AI in dentistry. DINOv3, a state-of-the-art, self-supervised vision foundation model pre-trained on 1.7 billion images, offers a promising pathway to mitigate this issue. However, its reliability when transferred to the dental domain, with its unique imaging characteristics and clinical subtleties, remains unclear. To address this, we introduce DinoDental, a unified benchmark designed to systematically evaluate whether DINOv3 can serve as a reliable, off-the-shelf encoder for comprehensive dental image analysis without requiring domain-specific pre-training. Constructed from multiple public datasets, DinoDental covers a wide range of tasks, including classification, detection, and instance segmentation on both panoramic radiographs and intraoral photographs. We further analyze the model's transfer performance by scaling its size and input resolution, and by comparing different adaptation strategies, including frozen features, full fine-tuning, and the parameter-efficient Low-Rank Adaptation (LoRA) method. Our experiments show that DINOv3 can serve as a strong unified encoder for dental image analysis across both panoramic radiographs and intraoral photographs, remaining competitive across tasks while showing particularly clear advantages for intraoral image understanding and boundary-sensitive dense prediction. Collectively, DinoDental provides a systematic framework for comprehensively evaluating DINOv3 in dental analysis, establishing a foundational benchmark to guide efficient and effective model selection and adaptation for the dental AI community.
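For readers unfamiliar with the LoRA adaptation strategy named in the abstract, the following is a minimal, standard-library-only sketch of the general idea: a pre-trained weight matrix stays frozen while a small low-rank update is trained alongside it. It is not the authors' implementation and does not use DINOv3; the class name `LoRALinear`, the layer sizes, the rank, and the `alpha` scaling value are all illustrative assumptions.

```python
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


class LoRALinear:
    """Hypothetical linear layer with a frozen weight W and a low-rank update B @ A.

    Effective weight: W + (alpha / rank) * B @ A.
    Only A and B would be trained; W stays fixed, as with a frozen encoder.
    """

    def __init__(self, d_in, d_out, rank, alpha, seed=0):
        rng = random.Random(seed)
        # Stand-in for a pre-trained weight (random here, frozen during adaptation).
        self.W = [[rng.gauss(0.0, 0.02) for _ in range(d_in)] for _ in range(d_out)]
        # Low-rank factors: A is small random, B starts at zero so the
        # adapted layer initially reproduces the frozen layer exactly.
        self.A = [[rng.gauss(0.0, 0.02) for _ in range(d_in)] for _ in range(rank)]
        self.B = [[0.0] * rank for _ in range(d_out)]
        self.scale = alpha / rank

    def forward(self, x):
        base = matvec(self.W, x)                      # frozen path
        update = matvec(self.B, matvec(self.A, x))    # low-rank path
        return [b + self.scale * u for b, u in zip(base, update)]

    def trainable_params(self):
        # Parameters in A (rank x d_in) plus B (d_out x rank).
        return len(self.A) * len(self.A[0]) + len(self.B) * len(self.B[0])

    def full_params(self):
        # Parameters that full fine-tuning would update in this layer.
        return len(self.W) * len(self.W[0])


layer = LoRALinear(d_in=8, d_out=8, rank=2, alpha=4.0)
x = [1.0] * 8
print(layer.trainable_params(), "trainable vs", layer.full_params(), "in full fine-tuning")
```

In this toy setting the low-rank factors hold 32 parameters against 64 for the full matrix; the gap widens quickly at realistic layer widths, which is the usual motivation for comparing LoRA against frozen features and full fine-tuning.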

Subjects: Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.28297 [cs.CV] (or arXiv:2603.28297v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.28297

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Xinquan Yang
[v1] Mon, 30 Mar 2026 11:23:57 UTC (8,618 KB)
