Reality is not an End-to-End Prediction Problem: Applied NLP in the Age of Generative AI

Explosion AI Blog, by Ines Montani, October 17, 2024

Video: https://www.youtube.com/watch?v=K_Y9wvGjNKw

Large Language Models (LLMs) and in-context learning have introduced a new paradigm for developing natural language understanding systems: prompts are all you need! Prototyping has never been easier, but not all prototypes give a smooth path to production. In this talk, I'll share the most important lessons we've learned from solving real-world information extraction problems in industry, and show you a new approach and mindset for designing robust and modular NLP pipelines in the age of Generative AI.

Breaking down larger business problems into actionable machine learning tasks is one of the central challenges of applied natural language processing. I will walk you through example applications and practical solutions, and show you how to use LLMs to their fullest potential, how and where to integrate your custom business logic and how to maximize efficiency, transparency and data privacy.

Resources

A practical guide to human-in-the-loop distillation

https://explosion.ai/blog/human-in-the-loop-distillation

This blog post presents practical solutions for using the latest state-of-the-art models in real-world applications and distilling their knowledge into smaller and faster components that you can run and maintain in-house.

How S&P Global is making markets more transparent with NLP, spaCy and Prodigy

https://explosion.ai/blog/sp-global-commodities

A case study on S&P Global’s efficient information extraction pipelines for real-time commodities trading insights in a high-security environment using human-in-the-loop distillation.

How GitLab uses spaCy to analyze support tickets and empower their community

https://explosion.ai/blog/gitlab-support-insights

A case study on GitLab’s large-scale NLP pipelines for extracting actionable insights from support tickets and usage questions.

Applied NLP Thinking: How to Translate Problems into Solutions

https://explosion.ai/blog/applied-nlp-thinking

This blog post discusses some of the biggest challenges for applied NLP and translating business problems into machine learning solutions, including the distinction between utility and accuracy.

The Window-Knocking Machine Test

https://ines.io/blog/window-knocking-machine-test/

How will technology shape our world going forward? And what tools and products should we build? When imagining what the future could look like, it helps to look back in time and compare past visions to our reality today.

Using LLMs for human-in-the-loop distillation in Prodigy

https://prodi.gy/docs/large-language-models

Prodigy comes with preconfigured workflows for using LLMs to speed up and automate annotation and create datasets for distilling large generative models into more accurate, smaller, faster and fully private task-specific components.

Transcript

Ines Montani, Explosion

Evolution of definitions

• programming & rules
• machine learning: examples 📝, supervised learning
• in-context learning: rules or instructions ✍, LLM prompt engineering
• instructions ✍: human-shaped, easy for non-experts, risk of data drift
• examples 📝: nuanced and intuitive behaviors, specific to use case, labor-intensive

LLM: Falcon, MIXTRAL, GPT-4

• good contextual results

Human in the loop

explosion.ai/blog/human-in-the-loop-distillation
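
The human-in-the-loop idea on this slide can be sketched as a small loop: a large model proposes labels, a person corrects them, and the corrected data trains a small task-specific component. The following is a minimal, offline sketch under stated assumptions, not the workflow from the talk: `llm_suggest`, `human_review`, `build_dataset` and `train_small_model` are hypothetical names, the "LLM" is a keyword stub, and the "student" is a toy token-count classifier standing in for a real trained component.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Example:
    text: str
    label: str


def llm_suggest(text):
    """Hypothetical stand-in for an LLM call: a keyword stub so the sketch runs offline."""
    return "PRICE" if "$" in text else "OTHER"


def human_review(text, suggestion, corrections):
    """Stand-in for the annotator: accept the suggestion unless a correction exists."""
    return corrections.get(text, suggestion)


def build_dataset(texts, corrections):
    """The LLM pre-annotates every text, the human only fixes its mistakes."""
    return [Example(t, human_review(t, llm_suggest(t), corrections)) for t in texts]


def train_small_model(examples):
    """Toy 'student': per-label token counts (assumption: a real project would
    train a proper task-specific component on the reviewed data instead)."""
    counts = {}
    for ex in examples:
        counts.setdefault(ex.label, Counter()).update(ex.text.lower().split())

    def predict(text):
        tokens = text.lower().split()
        # Pick the label whose training texts share the most tokens with the input.
        return max(counts, key=lambda label: sum(counts[label][t] for t in tokens))

    return predict


texts = [
    "Crude at $80 per barrel",
    "Meeting moved to Tuesday",
    "Offer 75 USD per barrel",
]
# The stub misses the third text (no "$"), so the human corrects it.
corrections = {"Offer 75 USD per barrel": "PRICE"}
dataset = build_dataset(texts, corrections)
predict = train_small_model(dataset)
print(predict("85 USD per barrel"))
```

The point of the design is that the large model is only needed while the data is being created. At runtime, only the small component produced by `train_small_model` has to run.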
Case Study: S&P Global

• real-time commodities trading insights by extracting structured attributes
• high-security environment
• used LLM during annotation
• 10× data development speedup with humans and model in the loop
• 8 market pipelines in production

99% F-score, 6 MB model size, 16k+ words/second

explosion.ai/blog/sp-global-commodities
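
For readers unfamiliar with the F-score quoted on this slide, here is a minimal sketch of how it is commonly computed: as the harmonic mean of precision and recall. The counts in the usage example are illustrative placeholders, not figures from the S&P Global case study, and `f_score` is a hypothetical helper name.

```python
def f_score(tp, fp, fn):
    """F1: harmonic mean of precision and recall.

    tp, fp, fn are counts of true positives, false positives and false negatives.
    """
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Illustrative counts only: 99 correct extractions, 1 spurious, 1 missed.
score = f_score(tp=99, fp=1, fn=1)
print(round(score, 2))
```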

Refactor your code and data.

Software 1.0: 📄 code → compiler → 💾 program
Case Study: GitLab

• extract actionable insights from support tickets and usage questions
• high-security environment
• easy to adapt to new scenarios and business questions
• separated general-purpose features from product-specific logic

1 year of support tickets, 6× speedup

explosion.ai/blog/gitlab-support-insights
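
The last bullet of the GitLab case study, separating general-purpose features from product-specific logic, can be illustrated with a small modular pipeline. This is a sketch under stated assumptions rather than GitLab's implementation: the `Ticket` type, the component names, the regular expressions and the `possible-regression` tag are all hypothetical.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Ticket:
    text: str
    features: dict = field(default_factory=dict)
    tags: list = field(default_factory=list)


def extract_versions(ticket):
    """General-purpose component: finds version-like strings, knows nothing about the product."""
    ticket.features["versions"] = re.findall(r"\b\d+\.\d+(?:\.\d+)?\b", ticket.text)
    return ticket


def extract_error_mentions(ticket):
    """General-purpose component: flags texts that mention an error or failure."""
    pattern = r"\b(?:error|fail(?:ed|s|ure)?)\b"
    ticket.features["mentions_error"] = bool(re.search(pattern, ticket.text, re.IGNORECASE))
    return ticket


def product_rules(ticket):
    """Product-specific logic: the only place that encodes a business question."""
    if ticket.features.get("mentions_error") and ticket.features.get("versions"):
        ticket.tags.append("possible-regression")
    return ticket


# Swapping or extending the last step adapts the pipeline to a new question
# without touching the general-purpose components.
PIPELINE = [extract_versions, extract_error_mentions, product_rules]


def run(text):
    ticket = Ticket(text)
    for component in PIPELINE:
        ticket = component(ticket)
    return ticket


result = run("Pipeline failed after upgrading to 16.4.1")
print(result.features, result.tags)
```

Because the business rule only reads `ticket.features`, the extraction components could later be replaced by trained models without changing `product_rules`.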

Language is just another interface.

The Window-Knocking Machine Test: “knocker-uppers”

ines.io/blog/window-knocking-machine-test

A: Hello, I’m Toni’s virtual assistant and I help schedule appointments. Do you have time at 1pm on Monday?
User: No, but Tuesday would work for me.
Assistant: Okay, please confirm: Tuesday at 1pm?
User: 1pm is unideal but 3pm would work.
Assistant: Toni doesn’t have availability at 3pm but I could offer a slot at 4pm or 5:30pm.
User: Which time zone is this by the way? I’m in CET.

AI still needs product decisions!

Year: 2023, Type: Services

Clients (28)    Revenue
ACME Inc.       432,032
FooBar GmbH      82,000
NLPCorp           1,500
XKCD Ltd.       193,000
Python AG        91,320
             $2,625,032

Kim Miller, Analyst

Q: What’s the total services revenue from 2023?
A: $2,923,531
Q: How many clients is that in total?
A: 29

Retrieval-Augmented Generation: 🔮 LLM, 📚 database, 🤖 agents, ⚙ query

ines.io/blog/window-knocking-machine-test
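
On this slide, the generated answers ($2,923,531 and 29 clients) do not match the table ($2,625,032 and 28 clients). One way to read the "query" element is that the figures should come from a structured query rather than from generated text. The sketch below uses Python's built-in `sqlite3` module and only the five rows visible on the slide, so its total covers those five clients, not all 28. The pairing of client names with amounts, the table schema and the `services_summary` helper are assumptions made for illustration.

```python
import sqlite3

# Only the five rows visible on the slide; the name/amount pairing is assumed
# from the order in which they appear.
ROWS = [
    ("ACME Inc.", "Services", 2023, 432_032),
    ("FooBar GmbH", "Services", 2023, 82_000),
    ("NLPCorp", "Services", 2023, 1_500),
    ("XKCD Ltd.", "Services", 2023, 193_000),
    ("Python AG", "Services", 2023, 91_320),
]


def services_summary(year, rows=ROWS):
    """Return (total revenue, number of distinct clients) for services in a given year."""
    con = sqlite3.connect(":memory:")
    con.execute(
        "CREATE TABLE revenue (client TEXT, type TEXT, year INTEGER, amount INTEGER)"
    )
    con.executemany("INSERT INTO revenue VALUES (?, ?, ?, ?)", rows)
    total, clients = con.execute(
        "SELECT COALESCE(SUM(amount), 0), COUNT(DISTINCT client) "
        "FROM revenue WHERE type = ? AND year = ?",
        ("Services", year),
    ).fetchone()
    con.close()
    return total, clients


total, clients = services_summary(2023)
print(f"${total:,} across {clients} clients")
```

In a setup like this, a language model would at most translate the question into the query parameters. The arithmetic and the client count are computed deterministically from the data.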

Summary: Applied NLP & Gen AI

• Reason and refactor. The key to success lies in your data and may surprise you!
• Stay ambitious. Don’t compromise on best practices, efficiency and privacy.
• Think beyond chat bots. You don’t want to build a “window-knocking machine”.

Original source

Explosion AI Blog

https://speakerdeck.com/inesmontani/reality-is-not-an-end-to-end-prediction-problem-applied-nlp-in-the-age-of-generative-ai
