← Back to all learnings47Billion: Production case study (insurance sales training, 4 months to production) LangChain: 1,300+ professional survey (Nov-Dec 2025) 57% have agents in production (up from 51% last year) 30.4% actively developing with concrete deployment plans Large [REDACTED]s leading: 67% of 10k+ orgs in production vs 50% of <100 orgs Top use cases: Customer service (26.5%), Research (24.4%), Internal productivity (18%) 89% have observability (62% with detailed tracing) Production agents: 94% have observability, 71.5% full tracing 52% run offline evals, 37% run online evals 23% combine offline + online evaluations Evaluation methods: Human review (59.8%), LLM-as-judge (53.3%) OpenAI GPT models dominate but 76%+ use multiple models 57% not fine-tuning - relying on base models + prompt engineering + RAG 33% investing in self-hosted models (cost optimization, data residency, [REDACTED]) Coding agents: Claude Code, Cursor, GitHub Copilot, Amazon Q, Windsurf Research agents: ChatGPT, Claude, Gemini, Perplexity Custom agents: LangChain/LangGraph for QA, SQL, customer support, workflow automation Narrow agents beat general agents - Claude Code, Cursor success proves this HITL is a requirement, not limitation - Progressive autonomy: start with human checkpoints, reduce over time Refinement phase = 80% of effort - Small prompt changes produce dramatically different behaviors Cost is multiplicative - Set up monitoring from day one Long conversations break things - Need smart summarization, context pruning Guardrails are essential infrastructure - Output validation, action constraints, cost limits MCP adoption accelerating (recommended by 47Billion as early-adopt standard) Protocol convergence happening - teams adopting MCP + A2A + AG-UI together Security emerging as [REDACTED] concern (24.9% cite as blocker) Opportunity: MCP security validation (no one doing this yet)
MCP & Protocols2026-04-17•587 words•3 min read
AI Agents in Production 2026
#mcp#rag#security#llm#langchain
AI Agents in Production 2026
Sources:
Adoption Status
Barriers to Production
| Barrier | Overall | [REDACTED]s (2k+) |
|---------|---------|-------------------|
| Quality | 32% | Top blocker |
| Latency | 20% | - |
| Security | - | 24.9% (2nd) |
| Cost | ↓ from last year | - |
Key insight: Cost concerns dropping due to falling model prices. Focus shifted to quality + speed.
Framework Comparison (47Billion Case Study)
| Framework | Time to Production | Token Usage | Best For |
|-----------|-------------------|-------------|----------|
| AutoGen | 3 weeks | 5x baseline | Exploratory multi-agent |
| CrewAI | 1 week | 2-3x baseline | Structured multi-step tasks |
| LlamaIndex | - | 1-2x baseline | RAG/document workflows |
Recommendation: Level 2-3 autonomy (workflows + tool-using) is the sweet spot. Level 4 (open-ended multi-agent) is still too unpredictable for critical paths.
Cost Reality
| Approach | Cost per Task | Tokens |
|----------|---------------|--------|
| Simple Workflow | $0.10-0.50 | 1,000-3,000 |
| CrewAI Multi-Agent | $0.50-2.00 | 3,000-10,000 |
| AutoGen Multi-Agent | $2.00-5.00 | 5,000-25,000 |
| LlamaIndex RAG | $0.20-1.00 | 1,000-5,000 |
Key insight: Multi-agent = 5-10x cost (every agent sees full conversation history).
Observability & Evaluation
Model Landscape
Daily Agent Use
Protocol Stack (Emerging Standards)
| Protocol | Purpose | Analogy |
|----------|---------|---------|
| MCP | Agent ↔ Tool | USB for AI tools |
| A2A | Agent ↔ Agent | Business cards for AI |
| AG-UI | Agent ↔ User | Standardized frontend communication |
Recommendation: Adopt MCP, A2A, AG-UI early. Custom integrations will feel outdated.
Key Production Lessons
For MCPHub
Date: 2026-03-03
Tags: #ai-agents #production #frameworks #mcp #cost #observability