Skip to main content
W&B Weave provides a complete toolkit for building, testing, and operating LLM applications. This page summarizes the key features available in Weave.

Tracing and observability

Track every call in your LLM application to understand behavior, debug issues, and optimize performance.
FeatureDescription
Automatic LLM tracingWeave automatically traces calls to supported LLM providers (OpenAI, Anthropic, Google, and others) without code changes.
Custom function tracingUse @weave.op() to trace any function and capture inputs, outputs, timing, and errors.
Nested call trackingSee parent-child relationships between calls to understand your application’s execution flow.
Cost trackingAutomatically calculate token usage and costs for LLM calls based on provider pricing.
Rich media supportLog and display images, audio, video, and other media types in traces.
Trace comparisonCompare traces side-by-side to understand differences between executions.
Thread trackingGroup related calls into threads for multi-turn conversation analysis.
OpenTelemetry supportSend traces from OpenTelemetry-instrumented applications to Weave.

Evaluation

Systematically test your LLM application against curated examples to measure and improve quality.
FeatureDescription
Evaluation pipelinesRun your application against datasets and score outputs with custom or built-in scorers.
DatasetsCreate, version, and manage collections of test examples.
Custom scorersWrite scoring functions tailored to your application’s quality criteria.
Built-in scorersUse pre-built scorers for common tasks like hallucination detection, summarization quality, and more.
LLM judgesUse LLMs to evaluate outputs with customizable judge prompts.
LeaderboardsCompare model performance across evaluations with configurable leaderboard views.
EvaluationLoggerLog evaluation results from external systems or custom evaluation workflows.

Versioning

Track changes to your prompts, models, and data to understand what changed and reproduce results.
FeatureDescription
Object versioningWeave automatically versions any tracked object when it changes.
Model trackingVersion model configurations, parameters, and code together.
Prompt managementVersion prompt templates and track which prompts were used in each call.
Dataset versioningTrack changes to evaluation datasets over time.
Op versioningAutomatically version functions when their code changes.

Production tools

Collect feedback, monitor quality, and protect your application in production.
FeatureDescription
Feedback collectionCapture user feedback, annotations, and corrections on production traces.
GuardrailsRun scorers on production traffic to catch issues before they reach users.
MonitorsTrack quality metrics over time and set up alerts for regressions.
PII redactionAutomatically redact sensitive information from traces.
AttributesTag traces with custom metadata for filtering and analysis.

Developer experience

Tools to help you iterate faster during development.
FeatureDescription
PlaygroundTest prompts and models interactively with the Weave Playground.
Evaluation playgroundRun quick evaluations on traced calls directly from the UI.
Saved viewsSave and share filtered views of your traces and evaluations.
Comparison toolsCompare objects, traces, and evaluation results side-by-side.

Integrations

Connect Weave to your existing tools and infrastructure.

LLM providers

Weave automatically traces calls to these providers:
  • Amazon Bedrock
  • Anthropic
  • Azure OpenAI
  • Cerebras
  • Cohere
  • Google (Gemini, Vertex AI)
  • Groq
  • Hugging Face
  • LiteLLM
  • Mistral
  • NVIDIA NIM
  • OpenAI
  • OpenRouter
  • Together AI
  • Local models (Ollama, vLLM)

Frameworks

Weave integrates with popular LLM frameworks:
  • OpenAI Agents SDK
  • LangChain
  • LlamaIndex
  • DSPy
  • Instructor
  • CrewAI
  • Smolagents
  • Pydantic AI
  • Google ADK
  • Agno
  • Koog
  • AutoGen
  • Verdict
  • Verifiers

Protocols

  • Model Context Protocol (MCP)

Platform and deployment

FeatureDescription
W&B CloudUse Weave on W&B’s managed cloud infrastructure.
Self-managed deploymentDeploy Weave on your own infrastructure for data residency requirements.
Team collaborationShare projects, traces, and evaluations with your team.
Access controlManage permissions with W&B’s identity and access management.

SDKs

Weave provides official SDKs for:
  • Python: Full-featured SDK with all Weave capabilities
  • TypeScript: SDK for Node.js applications with core tracing and evaluation features

Next steps