Bugfix/duplicate tags #13166

andrewstuart · 2022-08-18T20:35:06Z

What does this PR do?

This PR demonstrates one possible solution to an issue we've observed in production, where env tags can be merged with the official deployment.environment resource-level "tag," resulting in confusing behavior if that attribute is not also overridden (or otherwise does not match the env tag).

Motivation

This PR is inspired by a particularly nasty bug we have encountered recently with OTLP metrics being sent to DataDog. When setting env attributes via the attributeprocessor, we were inadvertently creating situations where the Resource level tags did not match the metric series level tags. This caused double entries in the JSON being sent to datadog, and very confusing results in dashboards.

Possible Drawbacks / Trade-offs

The current solution is not configurable and may result in tag values that don't make sense, depending on the tag source that created the original tag.

Describe how to test/QA your changes

go test

You could also set up an otel-contrib-collector pipeline with a DD exporter, and different upsert for env in resourceprocessor and attributeprocessor like so:

  exporters:
    datadog:
      api:
        key: SET_VIA_ENV
  processors:
    resource:
      attributes:
        - key: deployment.environment # technically we override this and env on both processors, now, just to be safe, but IIRC this should replicate
          action: upsert
          value: prod
    attributes:
      actions:
        - key: env
          action: upsert
          value: dev
  service:
    pipelines:
      metrics:
        exporters:
          - datadog
        processors:
          - resource
          - attributes
        receivers:
          - prometheus # or whatever

Reviewer's Checklist

bits-bot · 2022-08-18T20:35:10Z

All committers have signed the CLA.

andrewstuart · 2022-08-18T20:38:03Z

I wanted to get this out there to discuss the solution, and whether or not it is viable or if you'd like some more changes or design to happen before proceeding, but we've been bitten by this and have a workaround currently, but it seems ideal that others would not have to figure out the underlying conversions in play, quite to the same level that we've had to debug.

Please let me know if there's anything you'd like to change here, or discuss better alternatives that I may have missed, not knowing the codebase quite as well as I'm sure the team does.

dineshg13 · 2022-08-23T14:37:10Z

@andrewstuart Thanks for finding & fixing this bug. This might an issue with traces or even logs. I will get back to you if we want to fix in all places or there is some central place we can do it .

gbbr · 2022-08-30T13:25:12Z

pkg/otlp/model/translator/dimensions.go

+	for _, tag := range d.tags {
+		sp := strings.Split(tag, ":")
+		if v, ok := collisionCheck[sp[0]]; ok && v != strings.Join(sp[1:], ":") {
+			d2.tags = append(d2.tags, "resource."+tag)


Why are we assuming that any duplicate that is a resource attribute and prefixing it as such? If we want to prefix all resource attributes like this shouldn't it be done at a different level where we are actually aware of this fact? It's a bit of a wild assumption.

I mentioned that explicitly in the tradeoffs above. I just wanted to get this out there, and figured a better solution (at the very least, configurable) would surface as part of the discussions.

I don't think such a tradeoff is acceptable. I think we should instead simply ensure that the environment is taken from the right tag.

Feel free to fix this however you would like. I put a few solid days of discovery into this issue and fix, so I'm tapped out if you decide to go a different direction, but I won't be offended if you do. Either way, just know that the potential for duplicate tags will still be a fundamental issue with the way the DataDog otlp adapter, and thereby the otel DD exporter, is currently designed.

diguardiag · 2022-09-13T22:25:40Z

Any update to this? we are affected by this issue and i'm sure other people as well

andrewstuart added 2 commits August 8, 2022 23:41

Add tests for new collision checking

f703d24

Merge remote-tracking branch 'origin/main' into bugfix/duplicate-tags

09defa1

andrewstuart requested a review from a team as a code owner August 18, 2022 20:35

andrewstuart changed the title ~~[WIP] Bugfix/duplicate tags~~ Bugfix/duplicate tags Aug 18, 2022

gbbr reviewed Aug 30, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bugfix/duplicate tags #13166

Bugfix/duplicate tags #13166

andrewstuart commented Aug 18, 2022 •

edited

Loading

bits-bot commented Aug 18, 2022 •

edited

Loading

andrewstuart commented Aug 18, 2022

dineshg13 commented Aug 23, 2022 •

edited

Loading

gbbr Aug 30, 2022

andrewstuart Aug 30, 2022

gbbr Aug 31, 2022

andrewstuart Aug 31, 2022 •

edited

Loading

diguardiag commented Sep 13, 2022

Bugfix/duplicate tags #13166

Are you sure you want to change the base?

Bugfix/duplicate tags #13166

Conversation

andrewstuart commented Aug 18, 2022 • edited Loading

What does this PR do?

Motivation

Possible Drawbacks / Trade-offs

Describe how to test/QA your changes

Reviewer's Checklist

bits-bot commented Aug 18, 2022 • edited Loading

andrewstuart commented Aug 18, 2022

dineshg13 commented Aug 23, 2022 • edited Loading

gbbr Aug 30, 2022

Choose a reason for hiding this comment

andrewstuart Aug 30, 2022

Choose a reason for hiding this comment

gbbr Aug 31, 2022

Choose a reason for hiding this comment

andrewstuart Aug 31, 2022 • edited Loading

Choose a reason for hiding this comment

diguardiag commented Sep 13, 2022

andrewstuart commented Aug 18, 2022 •

edited

Loading

bits-bot commented Aug 18, 2022 •

edited

Loading

dineshg13 commented Aug 23, 2022 •

edited

Loading

andrewstuart Aug 31, 2022 •

edited

Loading