A Hybrid Machine Learning Approach for Graduate Admission Prediction and Combined University-Program Recommendation
arXiv:2603.29881v1 Announce Type: new Abstract: Graduate admissions have become increasingly competitive. This study highlights the need for a hybrid machine learning framework for graduate admission prediction, focusing on high-quality similar applicants and a recommendation system. The dataset, collected and enriched by the authors, includes 13,000 self-reported GradCafe application records from 2021 to 2025, enriched with features from the OpenAlex API, QS World University Rankings by Subject, and Wikidata SPARQL queries. A hybrid model was developed by combining XGBoost with a residual refinement k-nearest neighbors module, achieving 87\% accuracy on the test set. A recommendation module, then built on the model for rejected applicants, provided targeted university and program alternat
View PDF HTML (experimental)
Abstract:Graduate admissions have become increasingly competitive. This study highlights the need for a hybrid machine learning framework for graduate admission prediction, focusing on high-quality similar applicants and a recommendation system. The dataset, collected and enriched by the authors, includes 13,000 self-reported GradCafe application records from 2021 to 2025, enriched with features from the OpenAlex API, QS World University Rankings by Subject, and Wikidata SPARQL queries. A hybrid model was developed by combining XGBoost with a residual refinement k-nearest neighbors module, achieving 87% accuracy on the test set. A recommendation module, then built on the model for rejected applicants, provided targeted university and program alternatives, resulting in actionable guidance and improving expected acceptance probability by 70%. The results indicate that university quality metrics strongly influence admission decisions in competitive applicant pools. The features used in the study include applicant quality metrics, university quality metrics, program-level metrics, and interaction features.
Subjects:
Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as: arXiv:2603.29881 [cs.IR]
(or arXiv:2603.29881v1 [cs.IR] for this version)
https://doi.org/10.48550/arXiv.2603.29881
arXiv-issued DOI via DataCite (pending registration)
Submission history
From: Elham Tabrizi [view email] [v1] Mon, 9 Feb 2026 17:25:42 UTC (1,037 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
modelannounceapplicationBlazor WASM's Deputy Thread Model Will Break JavaScript Interop - Here's Why That Matters
<h2> The Problem </h2> <p>Microsoft is changing how .NET runs inside WebAssembly. When you enable threading with <code><WasmEnableThreads>true</WasmEnableThreads></code>, the entire .NET runtime moves off the browser's main thread and onto a background Web Worker — what they call the <strong>"Deputy Thread" model</strong>.</p> <p>This sounds like a good idea on paper. The UI stays responsive. .NET gets real threads. Everyone wins.</p> <p>Except it breaks JavaScript interop. Not in a subtle, edge-case way. It breaks it <em>fundamentally</em>.</p> <h2> What Actually Happens </h2> <p>In traditional Blazor WASM (no threading), .NET and JavaScript share the same thread. When JavaScript calls <code>DotNet.invokeMethod</code>, the CPU jumps from the JS stack to the C# stack and back. It's fast. I
The Wrong Way to Use AI for Debugging (And the Mental Model That Actually Works)
<p>Same AI tool. Same codebase access. Same staging incident. Three experienced engineers spent hours and couldn't find the root cause. I found it in about 20 minutes — and I barely knew what half the tools did.</p> <p>This isn't a story about juniors being better than seniors. It's about a mental model for AI collaboration that anyone can use — and why the "obvious" way most people use AI actually holds them back.</p> <h2> The Incident </h2> <p>A message hits the team Slack channel:</p> <blockquote> <p>"Staging is broken. We're investigating."</p> </blockquote> <p>The error:<br> </p> <div class="highlight js-code-highlight"> <pre class="highlight plaintext"><code>alembic current ERROR: Can't locate revision identified by 'ba29cdc8739d' FAILED: Can't locate revision identified by 'ba29cdc8
The hidden cost of GPT-4o: what every SaaS founder should know about per-user LLM spend it
<p>So you're running a SaaS that leans on an LLM. You check your OpenAI bill at the end of the month, it's a few hundred bucks, you shrug and move on. As long as it's not five figures, who cares, right?</p> <p>Wrong. That total is hiding a nasty secret: you're probably losing money on some of your users.</p> <p>I'm not talking about the obvious free-tier leeches. I'm talking about paying customers who are costing you more in API calls than they're giving you in subscription fees. You're literally paying for them to use your product.</p> <p><strong>The problem with averages</strong></p> <p>Let's do some quick, dirty math. GPT-4o pricing settled at around $3/1M tokens for input and $10/1M for output. It's cheap, but it's not free.</p> <p>Say you have a summarization feature. A user pastes in
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Products
Why natural transformations?
This post is aimed primarily at people who know what a category is in the extremely broad strokes, but aren't otherwise familiar or comfortable with category theory. One of mathematicians' favourite activities is to describe compatibility between the structures of mathematical artefacts. Functions translate the structure of one set to another, continuous functions do the same for topological spaces, and so on... Many among these "translations" have the nice property that their character is preserved by composition. At some point, it seems that some mathematicians noticed that they: 1. kept defining intuitively similar properties for these different structures 2. had wayyyyyy too much time on their hands So they generalised this concept into a unified theory. Categories consist of objects m
Blazor WASM's Deputy Thread Model Will Break JavaScript Interop - Here's Why That Matters
<h2> The Problem </h2> <p>Microsoft is changing how .NET runs inside WebAssembly. When you enable threading with <code><WasmEnableThreads>true</WasmEnableThreads></code>, the entire .NET runtime moves off the browser's main thread and onto a background Web Worker — what they call the <strong>"Deputy Thread" model</strong>.</p> <p>This sounds like a good idea on paper. The UI stays responsive. .NET gets real threads. Everyone wins.</p> <p>Except it breaks JavaScript interop. Not in a subtle, edge-case way. It breaks it <em>fundamentally</em>.</p> <h2> What Actually Happens </h2> <p>In traditional Blazor WASM (no threading), .NET and JavaScript share the same thread. When JavaScript calls <code>DotNet.invokeMethod</code>, the CPU jumps from the JS stack to the C# stack and back. It's fast. I
The Wrong Way to Use AI for Debugging (And the Mental Model That Actually Works)
<p>Same AI tool. Same codebase access. Same staging incident. Three experienced engineers spent hours and couldn't find the root cause. I found it in about 20 minutes — and I barely knew what half the tools did.</p> <p>This isn't a story about juniors being better than seniors. It's about a mental model for AI collaboration that anyone can use — and why the "obvious" way most people use AI actually holds them back.</p> <h2> The Incident </h2> <p>A message hits the team Slack channel:</p> <blockquote> <p>"Staging is broken. We're investigating."</p> </blockquote> <p>The error:<br> </p> <div class="highlight js-code-highlight"> <pre class="highlight plaintext"><code>alembic current ERROR: Can't locate revision identified by 'ba29cdc8739d' FAILED: Can't locate revision identified by 'ba29cdc8
Setting Up a Production-Ready Laravel Stack: Nginx, PHP 8.4, MySQL, Valkey & Supervisor
<p>A Laravel application running on your local machine with <code>php artisan serve</code> and a SQLite database is a fundamentally different beast from a Laravel application serving thousands of requests per minute in production. The gap between development and production is not just about code. It is about infrastructure: a properly tuned web server, an optimized PHP runtime, a robust database engine, a fast cache layer, and a process manager that keeps your queue workers alive.</p> <p>Building this stack manually is a rite of passage for many developers, but it is also a minefield of configuration mistakes, security oversights, and wasted hours. Deploynix provisions this entire stack automatically when you create an App Server, configured with production-grade defaults that reflect year
Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!