Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessHow NLP Actually Understands Text?Medium AIXENONOSTRA RESEARCH NOTES ALGEBROS: An Algebraic Meta-Language for Code Structure Extraction and…Medium AI18 Specific Tutorial Ideas for AI Voice Integration Using Vapi and TwilioDev.to AIUI DESIGNERS IN TROUBLEMedium AIMastering Python for Machine Learning: A Practical, No-Nonsense RoadmapMedium AII Audited 13 AI Agent Platforms for Security Misconfigurations — Here's the Open-Source Scanner I BuiltDev.to AIFrom Reality to Writing: Why I Explore Technology, Identity and Human BehaviorMedium AIA Developer's Introduction to Generative AIDEV CommunityOutcome Routing in Autonomous Vehicles: Fleet Intelligence Without Location DataDev.to AIHow I Hunt Security Bounties with Claude Code (Real Workflow, Real Payouts)Dev.to AII'm 해나, Leader 43 of Lawmadi OS — Your AI Industrial Accidents Expert for Korean LawDev.to AIAgentGraph UpdateDev.to AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessHow NLP Actually Understands Text?Medium AIXENONOSTRA RESEARCH NOTES ALGEBROS: An Algebraic Meta-Language for Code Structure Extraction and…Medium AI18 Specific Tutorial Ideas for AI Voice Integration Using Vapi and TwilioDev.to AIUI DESIGNERS IN TROUBLEMedium AIMastering Python for Machine Learning: A Practical, No-Nonsense RoadmapMedium AII Audited 13 AI Agent Platforms for Security Misconfigurations — Here's the Open-Source Scanner I BuiltDev.to AIFrom Reality to Writing: Why I Explore Technology, Identity and Human BehaviorMedium AIA Developer's Introduction to Generative AIDEV CommunityOutcome Routing in Autonomous Vehicles: Fleet Intelligence Without Location DataDev.to AIHow I Hunt Security Bounties with Claude Code (Real Workflow, Real Payouts)Dev.to AII'm 해나, Leader 43 of Lawmadi OS — Your AI Industrial Accidents Expert for Korean LawDev.to AIAgentGraph UpdateDev.to AI
AI NEWS HUBbyEIGENVECTOREigenvector

Prompts you use to test/trip up your LLMs

Reddit r/LocalLLaMAby /u/FenderMoon https://www.reddit.com/user/FenderMoonApril 6, 20264 min read0 views
Source Quiz

I'm obsessed with finding prompts to test the quality of different local models. I've pretty much landed on several that I use across the board. Tell me about the Apple A6 (a pass is if it mentions Apple made their own microarchitecture called swift for the CPU cores, the main thing that the A6 is historically known for as the first Apple SOC to do it. This tests if it is smart enough to mention historically relevant information first) Tell me about the history of Phoenix's freeway network (A pass is if it gives a historical narration instead of just listing freeways. We asked for history, after all. Again, testing for its understanding of putting relevant information first.) Tell me about the Pentium D. Why was it a bad processor ( A pass is it it mentions that it glued two separate penti

Could not retrieve the full article text.

Read on Reddit r/LocalLLaMA →
Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modeltrainingnew model

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Prompts you…modeltrainingnew modelservicereasoningpaperReddit r/Lo…

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 203 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Models

Три месяца я использовал Cursor неправильно. Вот как надо.
ModelsLive

Три месяца я использовал Cursor неправильно. Вот как надо.

14 февраля, 23:40. Я сижу перед ноутбуком, пытаясь закрыть дашборд за $1200 . В панике копирую огромные блоки кода в Cursor и требую "почини это". Ответы - мусор. Пытаюсь снова и снова, но проходит три часа, а проблема не решается. Наутро, случайно выделив маленькую функцию из 12 строк и добавив точный контекст, я получаю решение за 40 секунд . Понимание пришло слишком поздно - Cursor, как и любой AI, требует точности. Почему мне было больно Cursor казался мне волшебной палочкой, но на практике это было как попытка открыть замок набором случайных ключей. Когда я копировал целые файлы по 400 строк , он терялся и выдавал бессмысленный код. Было обидно терять часы на правки, которые никто не оплачивает. Этот опыт знаком многим, кто использует AI для кода. Хотим получить решения мгновенно, но