Skip to main content

Agent Examples

Browse 108 demo agents demonstrating waxell-observe SDK patterns across LLM providers, vector databases, agent frameworks, and specialized pipelines. Each agent runs in dry-run mode by default and produces a complete observability trace.

Category

DifficultyPattern

108 agents

LLM Providers

OpenAI Agents SDK

Runner, triage, and handoff patterns with OpenAI Agents SDK.

OpenAI Agents SDK

multi-agenttool-use

Anthropic

Multi-step content analysis pipeline with Claude models.

multi-agentpipeline

Gemini

Multi-agent multi-model pipeline with Google Gemini API.

Groq

Function calling with fast inference using Groq and OpenAI.

multi-agenttool-use

Mistral

Multi-model pipeline with Mistral chat completion API.

Cohere

Multi-model classify + generate pipeline with Cohere V2 API.

Together AI

Multi-model inference with Together AI API.

HuggingFace

Inference API integration with HuggingFace text generation.

HuggingFace Inference API

AI21 Labs

Jamba multi-model inference with AI21 Labs.

Azure OpenAI

Azure-hosted OpenAI models with decorator-based observability.

AWS Bedrock

Bedrock model invocation with Converse API and Nova models.

AWS Bedrock Agents

Bedrock Agents orchestration with action groups and knowledge base retrieval.

AWS Bedrock Agents

multi-agenttool-userag

Vertex AI

Vertex AI model pipeline with generate and chat modes.

Google Cloud Vertex AI

Meta Llama

Llama ecosystem integration with Meta Llama and Llama Stack.

Meta LlamaLlama Stack

multi-agenttool-use

LiteLLM

Multi-provider proxy through LiteLLM unified API.

Ollama

Local inference with Ollama running Llama 3.2.

All Providers

All providers in one trace: OpenAI, Anthropic, and LiteLLM.

OpenAIAnthropicLiteLLM

Cloud LLM Providers

Cloud LLM providers comparison: DashScope, WatsonX, Azure AI.

DashScopeWatsonXAzure AI

multi-agenttool-use

Vector Databases

FAISS RAG

Gold-standard multi-agent RAG with FAISS, 3 LLM providers, and 5 child agents.

FAISSOpenAIAnthropic+1

ragmulti-agenttool-use

ChromaDB

Multi-agent document search pipeline with ChromaDB vector operations.

ragmulti-agenttool-use

Pinecone

Multi-agent vector database pipeline with Pinecone.

ragmulti-agenttool-use

Qdrant

Multi-agent vector database pipeline with Qdrant.

ragmulti-agenttool-use

Weaviate

Semantic search pipeline with Weaviate v4.

Weaviate v4OpenAI

ragmulti-agenttool-use

Milvus

Multi-agent vector search pipeline with Milvus/Zilliz.

ragmulti-agenttool-use

pgvector

PostgreSQL vector search with pgvector extension.

PostgreSQL pgvector

Redis Vector

Redis vector search with HNSW index and KNN search.

LanceDB

Serverless vector search pipeline with LanceDB.

ragmulti-agenttool-use

MongoDB Vector

MongoDB vector search with cosine similarity aggregation.

Elasticsearch

Elasticsearch knn + hybrid search with dense_vector mapping.

Neo4j

Graph DB + vector search with Cypher and Neo4j.

Cloud Vector

Cloud vector platforms comparison: Turbopuffer, Vespa, Marqo, Cassandra, OpenSearch.

TurbopufferVespaMarqo+2

multi-agenttool-userag

Managed Vector

Managed vector DB comparison: Supabase, SingleStore, Vectara.

SupabaseSingleStoreVectara

multi-agenttool-userag

Lightweight Vector

Lightweight vector search comparison: Annoy, hnswlib, USearch, ScaNN, DuckDB.

AnnoyhnswlibUSearch+2

multi-agenttool-userag

Agent Frameworks

LangChain

Multi-agent pipeline with LangChain chains and auto-instrumented LLM.

LangChainOpenAI

multi-agenttool-userag

LangGraph

Stateful graph with conditional edges using LangGraph.

LangGraphOpenAI

multi-agenttool-userag

LlamaIndex

Multi-agent RAG pipeline with LlamaIndex.

LlamaIndexOpenAI

CrewAI

Multi-agent crew execution with researcher and writer agents.

multi-agenttool-use

AutoGen

Multi-agent group chat conversation with AutoGen.

Haystack

RAG pipeline with Haystack components.

multi-agentragtool-use

DSPy

Module execution and optimization with DSPy.

Semantic Kernel

Multi-agent orchestration with Semantic Kernel plugins.

Semantic KernelOpenAI

multi-agenttool-use

PydanticAI

Multi-agent pipeline with type safety using PydanticAI.

PydanticAIOpenAI

multi-agenttool-userag

smolagents

Lightweight multi-agent with HuggingFace smolagents.

smolagentsOpenAI

multi-agenttool-use

Strands Agents

Multi-agent orchestration with AWS Strands.

multi-agenttool-use

Agno

Framework with tool use and reasoning using Agno.

multi-agenttool-use

Letta (MemGPT)

Stateful agent with long-term memory management using Letta.

multi-agenttool-use

Google ADK

Agent Development Kit multi-agent with sub-agents and tools.

Google ADKOpenAI

multi-agenttool-use

Claude Agents

Multi-agent with Claude and Anthropic tool use.

Claude AgentsAnthropic

multi-agenttool-use

RAG & Retrieval

RAG Pipeline

Multi-agent RAG pipeline with retriever and synthesizer.

ragmulti-agenttool-use

Full RAG Pipeline

Full RAG stress test: 12-step pipeline across scrape, embed, index, query, rerank, eval.

FAISSQdrantWeaviate+2

Knowledge Graph RAG

Graph + vector hybrid retrieval stress test with 12-step pipeline.

Neo4jFalkorDBPinecone+6

RAG Frameworks

RAG frameworks comparison: GraphRAG, LightRAG, Pathway, RAGFlow, R2R.

GraphRAGLightRAGPathway+2

ragmulti-agenttool-use

Cohere Rerank

Cohere Embed + Rerank RAG pipeline with multi-agent lineage.

ragmulti-agenttool-use

Voyage Rerank

Voyage AI Reranker RAG pipeline with token tracking.

Voyage AIOpenAI

ragmulti-agenttool-use

Reranker Comparison

Reranking strategies comparison: Cross-encoder, Pinecone, FlashRank, ColBERT.

Cross-encoderPineconeFlashRank+1

ragmulti-agenttool-use

Embeddings

OpenAI Embeddings

Multi-agent OpenAI Embeddings pipeline with batch processing and similarity.

multi-agenttool-use

Sentence Transformers

Local embedding with sentence-transformers, zero-cost attribution.

Sentence Transformers

FastEmbed

Local embedding with FastEmbed ONNX-based inference.

Nomic AI

Multi-agent embedding pipeline with Nomic AI.

multi-agenttool-use

Voyage AI

Multi-agent embedding pipeline with Voyage AI and cost tracking.

Voyage AIOpenAI

multi-agenttool-use

Jina AI

Multi-agent reranking pipeline with Jina AI.

multi-agenttool-userag

Embedding Models

Embedding model comparison across 6 providers: BGE, E5, Instructor, TEI, Mixedbread, Transformers.

BGEE5Instructor+3

multi-agenttool-use

Safety & Governance

Governance

Governance and policy deep dive with record_events, check_policy, and sync wrappers.

Waxell GovernanceOpenAI

multi-agenttool-use

Guardrails AI

Multi-agent guardrails validation with Guardrails AI.

Guardrails AIOpenAI

multi-agenttool-use

LLM Guard

Multi-agent LLM Guard pipeline with input and output scanners.

LLM GuardOpenAI

multi-agenttool-use

NeMo Guardrails

NVIDIA NeMo Guardrails with Colang-based topical and safety rails.

NeMo GuardrailsOpenAI

multi-agenttool-use

Prompt Guard

Prompt guard showcase: block, warn, and redact modes for PII and injection.

Waxell Prompt GuardOpenAI

multi-agenttool-use

Safety Guardrails

Safety and content moderation comparison: Lakera Guard, Presidio, PolyGuard, Azure Content Safety.

Lakera GuardPresidioPolyGuard+1

multi-agenttool-use

Safety Gauntlet

Safety gauntlet stress test: 5 input + 3 output safety systems in a 12-step pipeline.

PresidioLLM GuardOpenAI Moderation+5

OpenAI Moderation

OpenAI Moderation API integration with per-category flagging.

OpenAI ModerationOpenAI

multi-agenttool-use

Evaluation

DeepEval

Evaluation with DeepEval metrics: AnswerRelevancy, Faithfulness.

evaluationmulti-agent

RAGAS

RAG evaluation with RAGAS metrics: faithfulness, answer_relevancy.

evaluationmulti-agent

Eval Battery

Evaluation battery stress test: 6 frameworks, 24 metrics, aggregate verdict.

DeepEvalRAGASBraintrust+3

evaluationpipeline

Eval Frameworks

LLM evaluation framework comparison: Braintrust, TruLens, Giskard, Inspect AI, PromptFoo.

BraintrustTruLensGiskard+2

evaluationmulti-agenttool-use

Multi-Agent Patterns

Multi-Agent Coordinator

Coordinated agents with shared session: planner, researcher, executor.

Multi-Agent Coordination

Coordination stress test: CrewAI + AutoGen + Agno with shared Zep memory.

CrewAIAutoGenAgno+5

multi-agentpipeline

Multi-Agent Swarm

Collaboration frameworks comparison: Agency Swarm, SuperAGI, CAMEL.

Agency SwarmSuperAGICAMEL+1

multi-agenttool-use

Workflow Agents

Workflow-oriented frameworks comparison: Julep, Langroid, ControlFlow.

JulepLangroidControlFlow+1

multi-agenttool-use

Multi-Provider Shootout

Multi-provider shootout stress test: 6 LLMs, 18 eval scores, rerank, winner.

OpenAIAnthropicGroq+4

pipelineevaluation

Specialized Pipelines

Streaming

Streaming capture comparison: OpenAI vs Anthropic.

OpenAIAnthropic

Tool Use

Tool use and inter-agent communication: Computer Use, A2A, Composio.

Computer UseA2AComposio+1

multi-agenttool-use

Code Review

Code review agent pipeline with static analysis and Anthropic.

multi-agenttool-use

Code Sandbox

Sandboxed code execution with E2B Code Interpreter.

multi-agenttool-use

Research

Multi-agent research pipeline with agentic behavior tracking.

multi-agentragtool-use

Customer Support

Customer support agent pipeline: classify, lookup, route, respond.

multi-agenttool-use

Data Ingestion

Data ingestion pipeline stress test: scrape, embed, index, query, rerank, eval across 14 steps.

Crawl4AIScrapeGraphAISentence Transformers+5

Enrichment

SDK enrichment showcase: scores, tags, metadata across multi-agent pipeline.

Waxell SDKOpenAI

Prompt Management

Prompt retrieval, rendering, background collector, and capture_content mode.

Waxell SDKOpenAI

multi-agenttool-use

Sync Pipeline

Batch ticket processing pipeline: classify, extract, route, respond.

multi-agentpipeline

MCP

MCP tool-calling integration with filesystem and search tools.

multi-agenttool-use

Web Scraping

AI-powered web scraping comparison: Crawl4AI, ScrapeGraphAI, Firecrawl.

Crawl4AIScrapeGraphAIFirecrawl+1

multi-agenttool-use

Voice & Speech

Speech-to-Text

Multi-provider STT pipeline: Google Cloud, Azure, AWS, Faster Whisper, whisper.cpp, Deepgram, AssemblyAI.

Google Cloud STTAzure SpeechAWS Transcribe+4

multi-agenttool-usepipeline

Text-to-Speech

Multi-provider TTS pipeline: Google Cloud, Azure, AWS Polly, Cartesia, Coqui, ElevenLabs, PlayHT.

Google Cloud TTSAzure TTSAWS Polly+4

multi-agenttool-usepipeline

Voice AI

Voice AI agent frameworks: LiveKit Agents and Pipecat.

LiveKit AgentsPipecatOpenAI

multi-agenttool-use

Voice Memory

Voice-first AI with long-term memory: STT, memory, graph, LLM, TTS in 12-step pipeline.

DeepgramAssemblyAIZep+5

Voice Platforms

Managed voice AI platforms comparison: Vapi and Retell.

multi-agenttool-use

Structured Generation & Inference

Structured Generation

Structured generation frameworks: Outlines, Guidance, LMQL.

OutlinesGuidanceLMQL+1

multi-agenttool-use

Instructor

Structured extraction with Instructor and Pydantic models.

InstructorOpenAI

multi-agenttool-use

Image Generation

Image generation model comparison: Stable Diffusion, Flux, Fal AI.

Stable DiffusionFluxFal AI+1

multi-agenttool-use

BentoML

Model serving pipeline with BentoML runners.

multi-agenttool-use

Inference Servers

Production inference servers: SGLang, TGI, TensorRT-LLM, Triton.

SGLangTGITensorRT-LLM+1

multi-agenttool-use

Local Inference

Local inference engines comparison: llama.cpp, llamafile, LocalAI, ExLlamaV2.

llama.cppllamafileLocalAI+1

multi-agenttool-use

vLLM

vLLM local inference with PagedAttention optimization tracking.

LLM Wrappers

LLM wrapper libraries: Mirascope, Magentic, Marvin.

MirascopeMagenticMarvin+1

multi-agenttool-use

Memory

Mem0

Memory layer with modern SDK patterns using Mem0.

multi-agenttool-use

Zep Memory

Long-term conversational memory with Zep: add, get, search, delete, graph.

multi-agenttool-use

Observability

Observability Platforms

LLM observability platforms comparison: Arize Phoenix, Opik, LangSmith, Langfuse.

Arize PhoenixOpikLangSmith+2

multi-agenttool-use

Graph Databases

Graph Databases

Graph database comparison: FalkorDB, ArangoDB, Memgraph, Neptune.

FalkorDBArangoDBMemgraph+2

multi-agenttool-userag