Blog / Prompt engineering / Per-Trace vs Data-Volume Pricing

Per-Trace vs Data-Volume Pricing

Learn how observability pricing changes coverage, retention, and alerting quality, and choose the model that fits your growth. Read the full guide.

Ilia Ilinskii
Rephrase · June 6, 2026

Prompt engineering8 min read

On this page

Key Takeaways Why does pricing shape observability coverage?What is per-trace pricing good at?What is data-volume pricing good at?How do the two models compare?Why hidden unit economics matter more than you think When should you choose per-trace pricing?When should you choose data-volume pricing?What does a good decision process look like?Final thought References

If you've ever looked at an observability invoice and thought, "Wait, why did coverage drop when traffic barely changed?" you've already met the real problem. Pricing is not just procurement trivia. It quietly decides what you collect, what you sample away, and what you never see in production.

Key Takeaways

Per-trace pricing makes it easier to keep broad coverage, especially when traces are small and errors are rare.
Data-volume pricing pushes teams toward sampling, compression, and aggressive retention controls.
Hidden cost drivers like thinking tokens in AI systems are a good analogy: the visible unit is not always the true cost unit [1].
The best model depends on your failure mode, not just your budget.
Tools like Rephrase can help you turn messy internal notes into sharper prompts for cost analysis and vendor evaluation.

Why does pricing shape observability coverage?

Pricing shapes coverage because every observability team eventually optimizes to the billing metric. If you pay per trace, you tend to keep more traces and sample less. If you pay by data volume, you start trimming attributes, shrinking payloads, and dropping noisy spans. The contract itself becomes a product decision, not just a finance decision.

That's the core lesson from recent work on AI pricing: the unit you see is often not the unit that matters most. Research on reasoning models shows that listed prices can badly mismatch actual cost because hidden consumption dominates the bill [1]. Observability has the same trap.

What is per-trace pricing good at?

Per-trace pricing is best when you want broad, predictable visibility across many services without turning every byte into a budget problem. It works well for developer teams that care about debugging, service maps, and rare request paths. The big advantage is psychological as much as financial: people are less afraid to keep data when the unit cost feels stable.

The downside is obvious. Once trace volume grows, teams may start fighting cardinality, "trace spam," and noisy instrumentation because every extra trace feels expensive.

What is data-volume pricing good at?

Data-volume pricing is best when your telemetry payloads are large, uneven, or heavily enriched. It gives finance a direct line of sight into storage and ingestion cost, which is useful for compliance-heavy environments and logs-at-scale setups. It also rewards disciplined schemas, compression, and targeted sampling.

The catch is that volume pricing often penalizes the exact data you need for root-cause analysis: long stack traces, rich attributes, and high-cardinality context. In practice, teams end up removing fields instead of fixing instrumentation.

How do the two models compare?

The difference is less about accounting and more about behavior. Per-trace pricing encourages completeness. Data-volume pricing encourages minimalism. One model makes you think, "Can we afford another trace?" The other makes you think, "Can we afford another field?"

Pricing model	What you pay for	What it encourages	Common failure mode
Per-trace	Number of traces/events	Wider collection, less sampling	Trace explosion, noisy telemetry
Data-volume	Bytes ingested/stored/queried	Smaller payloads, tighter schemas	Missing context, over-sampling
Hybrid	Traces plus bytes or tiers	Balanced control	Hard-to-predict bills

That tradeoff matters because observability coverage is not binary. You do not just have "monitoring" or "no monitoring." You have a coverage envelope. Pricing determines where that envelope shrinks first.

Why hidden unit economics matter more than you think

This is where the analogy to AI pricing gets useful. In the Portkey analysis, reasoning tokens, cache asymmetry, context thresholds, and new billing dimensions all made the true cost drift away from the visible price [2]. Observability teams face the same kind of drift when vendors add dimensions like spans, retention tiers, query charges, or enrichment fees.

So if you only compare "per trace" versus "per GB," you are probably missing the real bill. What matters is the full cost surface: ingestion, retention, indexing, query load, egress, and the operational tax of the sampling strategy you'll need to survive.

That's also where Rephrase can be handy. When you're drafting vendor questions or internal evaluations, it helps to rewrite a vague ask like "compare pricing models" into a sharper prompt like "compare the effect of pricing units on trace coverage, sampling behavior, and mean time to resolution."

When should you choose per-trace pricing?

Choose per-trace pricing when you care most about diagnostic completeness and your average trace footprint is modest. It tends to fit early-stage products, service-mesh-heavy systems, and teams still learning what "normal" looks like. It is especially attractive when the value of a missed trace is much higher than the cost of extra collection.

If your incident response depends on seeing rare edge cases, per-trace pricing usually aligns better with that goal.

When should you choose data-volume pricing?

Choose data-volume pricing when payload size is the main cost risk and your telemetry is already rich. This usually shows up in enterprise environments, log-heavy products, and systems with huge attribute sets or long retention windows. It gives you a clearer mapping between usage growth and spend.

The tradeoff is coverage discipline. You will almost always need policies for field truncation, tiered retention, and sampling by environment or service.

What does a good decision process look like?

A good decision process starts with failure analysis, not vendor brochures. Ask which blind spots would hurt most: missing rare traces, losing payload context, or underpricing retention. Then model expected traffic under both normal and incident conditions. Finally, test whether your team will actually keep the telemetry you need once the bill lands.

That's the practical part. Cost model selection is really coverage selection in disguise.

Final thought

If I had to compress this into one sentence, it would be: the billing unit you choose becomes the shape of your visibility. Pick trace-based pricing if you want to maximize coverage. Pick volume-based pricing if you want to maximize control. And if you want to turn a messy vendor comparison into a clear decision memo, Rephrase can help you get there faster.

References

Documentation & Research

The Price Reversal Phenomenon: When Cheaper Reasoning Models End Up Costing More - arXiv (link)
Your SaaS Is an Insurance Product: A Modeling Framework - arXiv (link)

Community Examples 3. LLM pricing is 100x Harder than you think - Hacker News (LLM) (link)

Frequently asked

What is per-trace pricing in observability?

Per-trace pricing charges for each trace or event the platform processes. It makes spend predictable when traffic is stable, but it can discourage full-fidelity collection when trace volume spikes.

Which pricing model gives better observability coverage?

Per-trace pricing usually favors wider coverage because teams can keep more spans if payloads are small. Data-volume pricing often pushes teams to sample harder, especially in high-cardinality systems.