Blog / Prompt engineering / OpenLLMetry: Avoid Lock-In From Day One

OpenLLMetry: Avoid Lock-In From Day One

Master OpenLLMetry vendor-neutral instrumentation for LLM apps and avoid lock-in. Learn the setup, pitfalls, and migration-safe patterns. Try free.

Ilia Ilinskii
Rephrase · June 6, 2026

Prompt engineering8 min read

On this page

What does vendor-neutral instrumentation mean for LLM apps?Why does lock-in happen so early?How do you avoid vendor lock-in from day one?What should you instrument first?What does a migration-safe setup look like?What do real-world examples teach us?What are the main trade-offs?A practical before-and-after prompt for teams What should you do next?References

LLM observability gets messy fast. The worst part isn't the tracing itself. It's realizing six months later that your "easy" instrumentation is welded to one vendor's SDK and one backend's data model.

Key Takeaways

OpenLLMetry-style instrumentation keeps your LLM telemetry portable from the start.
OpenTelemetry and OTLP are the backbone of vendor-neutral traces, metrics, and logs [1].
GenAI semantic conventions are still evolving, so you should design for change, not perfection [1].
Research on agent observability shows why structured, runtime traces matter for debugging and governance [2].
You can avoid lock-in by standardizing trace schemas, exporters, and capture policy early.

What does vendor-neutral instrumentation mean for LLM apps?

Vendor-neutral instrumentation means your application emits telemetry in an open format, not a vendor-specific one. In practice, that usually means OpenTelemetry spans, metrics, and logs exported through OTLP, with LLM-specific fields mapped to evolving GenAI conventions. You keep the instrumentation layer stable even if you replace the backend later [1].

The nice thing is that this doesn't have to be theoretical. The OpenTelemetry GenAI work already gives teams a shared vocabulary for prompts, token usage, model metadata, and guardrails. The catch is that the conventions are still developing, so the safest move is to treat them as a contract that will change, not a frozen API [1].

Why does lock-in happen so early?

Lock-in usually starts with the first convenience decision. You install a vendor SDK, log prompts through its wrapper, and rely on its custom trace UI. That feels fine until you need to migrate, self-host, or send data to a different stack. Then your instrumentation, dashboards, and export paths all become hidden dependencies [1][3].

Research on agent observability makes the same point from a different angle: runtime behavior is only useful if traces are structured, interoperable, and linked to actual execution flow. If the schema is proprietary, the telemetry becomes disposable. Open standards are what let traces survive platform changes [2].

How do you avoid vendor lock-in from day one?

You avoid lock-in by making OpenTelemetry the default boundary between your app and your observability backend. That means standardizing on OTLP export, using resource attributes consistently, and keeping LLM-specific metadata in open semantic fields instead of custom vendor-only structures. The instrumentation should describe the app, not the platform.

I'd also separate three concerns immediately: capture, transport, and visualization. Capture should happen in your code or framework layer. Transport should be OTLP or a compatible collector. Visualization should be swappable. That separation is what keeps your stack portable when the market shifts [1][3].

What should you instrument first?

Start with the signals that answer real production questions: which model ran, how long it took, how many tokens it used, whether a guardrail fired, and which tool calls happened along the way. That gives you enough visibility to debug latency, cost, and failure modes without overcomplicating the first implementation [1][2].

Here's the thing: you do not need perfect semantic coverage on day one. You need stable, useful spans. If the GenAI schema changes later, you can map your existing telemetry forward. If you start with a vendor-native wrapper, you'll probably map it backward and hate every minute of it.

What does a migration-safe setup look like?

A migration-safe setup keeps your app code thin and your export path boring. That means one instrumentation layer, one collector, and a backend-agnostic policy for sensitive content. Tools like OpenLLMetry can help teams operationalize this mindset by turning rough telemetry ideas into cleaner, more reusable prompts and workflows. For teams documenting the stack, the Rephrase blog is a useful place to keep pattern notes and examples.

Layer	Bad default	Better default
App instrumentation	Vendor SDK everywhere	OpenTelemetry spans with GenAI attributes
Transport	Direct-to-vendor API	OTLP via collector
Data model	Custom prompt schema	Open, documented semantic conventions
Privacy	Capture everything	Capture only what you need
Portability	One backend assumption	Swappable visualization and storage

This is the difference between "we use a tool" and "we own a system." The first version is fast but brittle. The second version is boring in the best way: it keeps working when your vendor, model, or compliance rules change.

What do real-world examples teach us?

Community writeups consistently point to the same practical lesson: OpenTelemetry is attractive because it preserves choice. In a RubyLLM example, the team configured OpenTelemetry once and got traces for completions, tool calls, and token counts without wrapping every call manually [4]. In a LangWatch writeup, the big selling point was that instrumentation outlived the vendor because the trace path started with OTLP, not a proprietary SDK [3].

That lines up with the research. Structured logging frameworks for agents work best when operational, cognitive, and contextual events can be linked under one trace model [2]. If you're building LLM features that will evolve from prototype to production, portability is not a nice-to-have. It is the design constraint.

What are the main trade-offs?

The trade-off is simple: vendor-neutral design asks for a little more discipline upfront, but it saves you from painful rewrites later. You may spend more time thinking about span names, attributes, and export routes. In return, you get freedom to switch backends, self-host when needed, and keep your observability code alive across product cycles [1][2].

The biggest downside is that the GenAI ecosystem is still maturing. Some libraries lag behind newer APIs, and coverage can be uneven. That's exactly why a vendor-neutral layer matters. If your instrumentation already lives in open standards, gaps in one library are annoying. If you're locked into a vendor SDK, they become architecture problems.

A practical before-and-after prompt for teams

A lot of teams ask for "OpenTelemetry for LLMs" when they really mean "please make this observability plan less confusing." This is where a prompt helper can save time. I've seen teams get much better results by asking for a vendor-neutral design instead of a vendor-specific implementation.

Before:
Set up LLM tracing for our app.

After:
Design a vendor-neutral LLM observability plan using OpenTelemetry.
Include spans, GenAI semantic attributes, OTLP export, privacy defaults,
and a migration path away from any single vendor backend.

That small change matters. It forces the answer to focus on portability, not just implementation. If you're prompting an AI tool for architecture help, that framing is often the difference between a useful plan and a glossy demo.

What should you do next?

If you're starting a new LLM feature, build the observability layer like you expect to migrate it someday. That mindset keeps you honest. It also makes vendor conversations easier, because you're buying capability, not captivity.

My take: OpenLLMetry is less about a specific product and more about a rule. Keep your traces open, your exporter swappable, and your schema documented. If you want more practical prompting and workflow ideas for AI tooling, check Rephrase and browse the Rephrase blog for more examples.

References

Documentation & Research

Observing LLM Applications with OpenTelemetry - Hacker News (LLM) (link)
AgentTrace: A Structured Logging Framework for Agent System Observability - arXiv cs.AI (link)

Community Examples 3. LangWatch: OpenTelemetry-Native LLM Observability Without the Vendor Lock-In - Hacker News (LLM) (link) 4. Observability for your LLM-powered apps: OTel Instrumentation for RubyLLM - Hacker News (LLM) (link)

Frequently asked

What is OpenLLMetry?

OpenLLMetry is a vendor-neutral way to instrument LLM apps on top of OpenTelemetry. The goal is simple: keep your traces portable so you can switch backends without rewriting instrumentation.

How is OpenLLMetry different from proprietary observability SDKs?

Proprietary SDKs often make the backend the center of gravity. OpenLLMetry-style instrumentation makes OpenTelemetry the source of truth, so export stays flexible and backend changes are easier.

Can OpenLLMetry work with existing observability stacks?

Yes. That's the point. You can route OTLP data into an OpenTelemetry-compatible backend and keep the same instrumentation even if you change vendors later.