Production: deploying your AI agent (AI Agents + LangChain + RAG)

Production-ready stack

Backend: FastAPI (Python) or Node.js + LangChain.js
Frontend: Next.js + React + Vercel AI SDK
Vector DB: Pinecone (managed) or Postgres + pgvector (self-hosted)
LLM cache: Redis, so repeated prompts are not billed twice
Monitoring: LangSmith (free tier) or Helicone
Auth: NextAuth or Clerk
Hosting: Vercel (frontend) + Render/Railway (backend)
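
To make the "LLM cache" line concrete, here is a minimal sketch of response caching keyed on the model name and the exact prompt. Everything in it is an illustrative assumption rather than part of the course material: `KeyValueStore`, `InMemoryStore`, `cache_key`, and `cached_completion` are hypothetical names, and the dict-backed store only stands in for Redis so the example runs without a server.

```python
import hashlib
import json
from typing import Callable, Optional, Protocol


class KeyValueStore(Protocol):
    """Minimal interface the cache needs (hypothetical, for illustration)."""

    def get(self, key: str) -> Optional[str]: ...

    def set(self, key: str, value: str) -> None: ...


class InMemoryStore:
    """Dict-backed stand-in for Redis so the sketch runs without a server."""

    def __init__(self) -> None:
        self._data: dict[str, str] = {}

    def get(self, key: str) -> Optional[str]:
        return self._data.get(key)

    def set(self, key: str, value: str) -> None:
        self._data[key] = value


def cache_key(model: str, prompt: str) -> str:
    """Derive a stable key from the model name and the exact prompt text."""
    payload = json.dumps({"model": model, "prompt": prompt}, sort_keys=True)
    return "llm:" + hashlib.sha256(payload.encode("utf-8")).hexdigest()


def cached_completion(
    store: KeyValueStore,
    model: str,
    prompt: str,
    call_llm: Callable[[str, str], str],
) -> str:
    """Return the cached answer if present; otherwise call the LLM and store it."""
    key = cache_key(model, prompt)
    hit = store.get(key)
    if hit is not None:
        return hit  # cache hit: no API call, no token cost
    answer = call_llm(model, prompt)
    store.set(key, answer)
    return answer
```

With the redis-py client, the same two calls exist as `get` and `set`; creating the client with `decode_responses=True` makes `get` return `str` instead of bytes, and passing `ex=<seconds>` to `set` gives entries an expiry. An exact-match cache like this only pays off when users send identical prompts, so it fits FAQ-style traffic better than free-form chat.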

Costs to anticipate

Claude Sonnet 4 API: 3 USD per million input tokens + 15 USD per million output tokens
Embeddings (text-embedding-3-small): 0.02 USD per million tokens
Pinecone serverless: 70 USD/month for 5M vectors
Vercel + Render: 50 USD/month in total
Total for an MVP with 100 active users: ~150-300 USD/month
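
The total above depends heavily on traffic, so it helps to redo the arithmetic with your own numbers. The sketch below uses the unit prices from the list; the traffic profile in `Usage` (5 requests per user per day, 1,500 input and 300 output tokens per request, 10M embedded tokens per month) is purely an assumption chosen for illustration. With those assumed values the estimate comes to about 255 USD/month, inside the quoted range.

```python
from dataclasses import dataclass

# Unit prices quoted in the cost list (USD).
SONNET_INPUT_PER_MTOK = 3.00
SONNET_OUTPUT_PER_MTOK = 15.00
EMBEDDING_PER_MTOK = 0.02
PINECONE_MONTHLY = 70.00
HOSTING_MONTHLY = 50.00


@dataclass
class Usage:
    """Hypothetical traffic profile; replace with your own measurements."""

    active_users: int = 100
    requests_per_user_per_day: int = 5
    input_tokens_per_request: int = 1500
    output_tokens_per_request: int = 300
    embedded_tokens_per_month: int = 10_000_000
    days_per_month: int = 30


def monthly_cost(usage: Usage) -> dict[str, float]:
    """Break the monthly bill down by line item and add a total."""
    requests = (
        usage.active_users
        * usage.requests_per_user_per_day
        * usage.days_per_month
    )
    costs = {
        "llm_input": requests * usage.input_tokens_per_request
        / 1_000_000 * SONNET_INPUT_PER_MTOK,
        "llm_output": requests * usage.output_tokens_per_request
        / 1_000_000 * SONNET_OUTPUT_PER_MTOK,
        "embeddings": usage.embedded_tokens_per_month
        / 1_000_000 * EMBEDDING_PER_MTOK,
        "vector_db": PINECONE_MONTHLY,
        "hosting": HOSTING_MONTHLY,
    }
    costs["total"] = sum(costs.values())
    return costs


if __name__ == "__main__":
    for item, usd in monthly_cost(Usage()).items():
        print(f"{item:>12}: {usd:8.2f} USD")
```

Note how the fixed infrastructure (120 USD) is roughly half of the bill at this scale, and how embeddings are negligible next to LLM output tokens. Doubling the requests per user would push the total past the upper bound of the range.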

💡 ROI tip: start with Claude Haiku (roughly 5x cheaper than Sonnet, depending on the Haiku version) for about 80% of requests, and switch to Sonnet/Opus only for the complex cases.
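
One way to apply this tip is a small router that sends each request to a cheap or a strong tier. The sketch below is only an assumption about how such routing could look: the keyword list, the length threshold, and the names `pick_tier` and `blended_price` are hypothetical, and the tier labels are placeholders rather than exact API model IDs. Taking the 5x ratio at face value, an 80/20 split brings the blended input price to 1.08 USD per million tokens instead of 3.00, about 64% less.

```python
CHEAP_TIER = "haiku"    # placeholder label, not an exact API model ID
STRONG_TIER = "sonnet"  # placeholder label, not an exact API model ID

# Hypothetical hints that a request needs deeper reasoning.
COMPLEX_HINTS = ("step by step", "analyze", "compare", "refactor", "prove")


def pick_tier(prompt: str, max_cheap_chars: int = 2000) -> str:
    """Route long or reasoning-heavy prompts to the strong tier."""
    if len(prompt) > max_cheap_chars:
        return STRONG_TIER
    text = prompt.lower()
    if any(hint in text for hint in COMPLEX_HINTS):
        return STRONG_TIER
    return CHEAP_TIER


def blended_price(cheap_share: float, cheap_price: float, strong_price: float) -> float:
    """Average price per million tokens for a given traffic split."""
    return cheap_share * cheap_price + (1.0 - cheap_share) * strong_price


if __name__ == "__main__":
    print(pick_tier("What are your opening hours?"))
    print(pick_tier("Compare these two contracts step by step"))
    # 80% of traffic on a tier assumed to be 5x cheaper than 3.00 USD.
    print(round(blended_price(0.8, 3.00 / 5, 3.00), 2))
```

A keyword heuristic is crude; in practice you would log which cheap-tier answers users reject and tune the rule, or let the cheap model itself flag requests it cannot handle.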
