Real-World Application Patterns

Learn production-grade patterns for building scalable, reliable LLM applications with LLM4S. These guides cover architecture decisions, implementation strategies, and best practices based on real-world deployments.

Multi-Agent Orchestration - Design patterns for agent-to-agent communication, delegation, and failure handling
RAG for Enterprise - Production strategies for document management, hybrid search, and quality assurance
Production Monitoring - Comprehensive observability, alerting, and cost tracking in production
Scaling Strategies - Techniques for handling high throughput, caching, and distributed execution
Error Recovery - Resilience patterns including retries, circuit breakers, and graceful degradation
Security Best Practices - API key management, input validation, and audit logging

Getting Started

Choose a pattern based on your current challenge:

Challenge	Guide	Key Topics
Multiple agents need to work together	Multi-Agent Orchestration	Delegation, handoffs, context passing
Building with documents and retrieval	RAG for Enterprise	Ingestion, search, quality
Need production visibility	Production Monitoring	Metrics, alerts, dashboards
Handling high request volume	Scaling Strategies	Caching, batching, distribution
Dealing with failures	Error Recovery	Retries, fallbacks, circuit breakers
Protecting sensitive data	Security Best Practices	Secrets, validation, audit

Architecture Decision Trees

How Should Agents Communicate?

Need agents to coordinate?
├─ Synchronous response required
│  └─> Direct delegation (see handoff pattern)
├─ Fire-and-forget updates
│  └─> Event-based communication
└─> Complex multi-step workflows
   └─> Orchestrator agent pattern

Which Search Strategy?

Choosing RAG search approach?
├─ Fast, simple queries
│  └─> Vector search only
├─ Mixed structured/unstructured
│  └─> Hybrid search (keyword + vector)
├─ Complex business logic
│  └─> Multi-stage retrieval
└─> Real-time freshness critical
   └─> Time-aware search with reranking

Error Recovery Strategy?

API call failed, what now?
├─ Transient error (timeout, rate limit)
│  └─> Exponential backoff retry
├─ Model unavailable
│  └─> Fallback to alternate model
├─ Entire provider down
│  └─> Circuit breaker + cached response
└─> Recoverable from degraded mode
   └─> Graceful degradation

Pattern Comparison Matrix

Pattern	Complexity	Resilience	Performance	When to Use
Synchronous delegation	Low	Medium	High	Simple sequential workflows
Event-based agents	Medium	High	High	Loosely coupled systems
Vector-only search	Low	High	Very High	Semantic similarity priority
Hybrid search	Medium	High	Medium	Balanced search needs
Exponential backoff	Low	Medium	Good	Transient failures
Circuit breaker	High	Very High	Good	Preventing cascade failures
Caching layer	Medium	Medium	Excellent	High-volume, repeated queries

Common Use Cases

E-Commerce Product Search Agent

Pattern: RAG for Enterprise + Scaling Strategies
Why: Needs fast search over large catalog with cost control
Key: Hybrid search + caching + batch processing

Multi-Department Support System

Pattern: Multi-Agent Orchestration + Error Recovery
Why: Different agents for different departments with fallback
Key: Agent delegation + circuit breaker pattern

Financial Analysis Platform

Pattern: Security Best Practices + Production Monitoring
Why: Sensitive data + audit requirements + high availability
Key: API key vaults + detailed logging + alerts

Document Intelligence System

Pattern: RAG for Enterprise + Production Monitoring
Why: Large documents + need quality metrics
Key: Chunking strategy + grounding scores + observability

Quick Links

Run Samples: sbt "samples/runMain org.llm4s.samples.patterns.MultiAgentExample"
View Sample Code: modules/samples/src/main/scala/org/llm4s/samples/patterns
Report Issues: GitHub Issues
Discuss Patterns: GitHub Discussions

Feedback & Contributions

These patterns evolve based on community experience. If you’ve implemented a pattern not covered here, please share it in Discussions or contribute a guide!

Last Updated: February 2026
Status: Stable - Production Ready