-
Notifications
You must be signed in to change notification settings - Fork 109
feat: OpenTelemetry context activation #202
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: OpenTelemetry context activation #202
Conversation
I have been passively maintaining tracing-opentelemetry (mostly by reviewing relatively small PRs) for a few months, but I don't currently have a use case for this crate. Unless some funding for doing this work materializes, I'm not motivated to review larger efforts like this, so I suggest the OpenTelemetry and/or tracing organizations find some other people to move this effort forward. |
ce5421a
to
51e471b
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'll take a closer look at everything (including the previous discussions we've had in the other repo) later, but for now, I think the most useful thing would be to get rid of the feature and possibly other unrelated changes (maybe the patched dependencies?) to make it easier to understand which parts here are left because they are needed and which parts are here just for compatibility with the old way.
Generally, I think this would be a step in the right direction, though I'd still want to explore the possibility of providing parents at span creation time and hopefully starting the context then, which would be closer to what opentelemetry specifies.
Thanks for the comments @mladedav. I think that getting rid of the feature is a bit risky, since these changes do have performance implications, and it would be good to allow current users to continue. I'll rebase the PR and try to clean it up a bit. |
68d210f
to
133ab73
Compare
I've updated the PR with the proposed changes except returning the |
Regarding the feature, would it make sense to have it instead as a config on the layer? Features are notoriously problematic for changing behavior due to feature unification, especially turning off default features since crates tend not to opt out of them. I wouldn't expect this to be a transitive dependency though so it might be fine, but I still think normal configuration would suit it more than a feature. |
@mladedav I've change the code from a feature to a configuration. |
@mladedav Would you have bandwidth to continue reviewing this PR? We have delayed cleaning up OTel Tracing API as it'll break tracing-opentelemetry bridge, without this PR merged. |
Sorry, I've created some threads some 23 days ago but did not send the review 🤦 |
It's just some minor things so if @bantonsson doesn't have time right now, I can fix those things. |
np! (has happened to me too!). Thanks for your time. |
@mladedav Just got back from vacation. I'll go through your comments. |
Right now this is completely focused on the Spans but there could be a feature that only propagates the context as well.
Ensure that the we materialize the span and activate the context when accessing the span. This ensures the correct parent child relationships.
5604802
to
7e86613
Compare
Hey @mladedav I forgot to comment that I have addressed the review comments. It would be great if you could take a look when you have the time. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just a small final cleanup and I think we can merge this.
7e86613
to
125e34b
Compare
Changed the test |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for pushing this forward.
TL;DR
This PR is opened as a basis for discussion around providing a more seamless interoperability for users of the OpenTelemetry APIs when using frameworks and libraries that use
tracing
at their core.An OpenTelemetry
Context
that mirrors the activetracing
Span
is now active on the executing thread while in atracing
Span
. Having proper OpenTelemetry context activation will not only make interoperability easier for users of the OpenTelemetry API, but will also open up the possibility for simpler log/trace/baggage correlation and in the future profiling/trace correlation among other things.Motivation
The aim is to provide users of the OpenTelemetry API a more seamless and consistent experience. One such thing is that calling the OpenTelemetry API
Context::current()
did not give you aContext
that contained an OpenTelemetrySpan
that represented the currenttracing
Span
. Another thing is propagation ofBaggage
or user defined types that were not picked up either.A lot of effort has been put into making the existing
OpenTelemetrySpanExt
API work the same way to maintain maximum backwards compatibility and to avoid fragmentation by building a separate OpenTelemetry and Tokio Tracing bridge.Long discussion about OpenTelemetry and Tokio Tracing interoperability, and an issue specifically discussing this POC.
Solution
This solution adds a new feature
activate_context
that will activate/deactivate an OpenTelemetryContext
that mirrors the currenttracing
Span
. This feature is currently on by default, but can be turned off to avoid any performance overhead incurred by the context activation and bookkeeping.The second big change is that the OpenTelemetry
SpanBuilder
is lazily consumed and a real OpenTelemetrySpan
is created when necessary, so there is no longer any need for thePreSampledTracer
and the special code in OpenTelemetry that it relies on. This has some implications, like you can no longer set theparent_context
after you have entered aSpan
or called thecontext
method on theSpan
.Benchmark numbers are available in this comment, and when the
activate_context
feature is turned off, the performance stays the same. (There has been one more optimization since the last run there, fixing the regression in themany_events
benchmarks)