-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature Request: Blacklist Host Tags #3130
Comments
To give an example, in a cattle-like environment, I don't need Maybe I need 1 of |
We could really use this too. We have high cardinality of tags, but ALL of our metrics are annotated by the various hosts too, multiplying the whole thing enormously. I believe all the instance-related tags are all added by the DataDog agent itself, right? Based on where the data is sourced from. |
👍 on a possibility to remove host tags! I'd also appreaciate a possibility to remove the Maybe one could have a generic blacklist in the agent? |
hmm, the issue was created in 2019, but I assume this exclude option is still not implemented, right? Or did you guys figure out some workaround by any chance? |
I think Metrics Without Limits can work around the whole pricing aspect of this extra cardinality. But the actual remove-tags feature doesn't exist that i know of |
We would like to have this feature as well. This should be configurable through AWS integration configuration OR Datadog agent. |
I will say that I've tried overwriting these tags to a single dummy value in the
But that |
My team could really use this too. |
We require tags to be filtered/removed at source(ingest) also. Metrics without limits only removes tags during indexing not ingest. |
Our company require this features too, manty unused tags cost is high |
Same here - I would appreciate the possibility to create filter mask for tags. If tags cannot be removed - by principle monitoring should not affect infrastructure nor it should impact cost (tags play functional role in some scenarios). |
I wrote a Ruby script that adds custom tag groups that exclude tags you don't want. You'll want to tailor it to your use case, or use it as inspiration. It takes about 8 hours to run against the Datadog account I work on, which has around 10k metrics. My company set up a Datadog monitor for custom-metrics usage, and we re-run this script whenever it alerts. I agree with others that Datadog's tooling for managing tags on large numbers of metrics is poor. Having Terraform configuration for thousands of metrics isn't practical, and neither is manually configuring them through the web UI. All we're asking for is a blacklist, which seems a lot easier to implement than many other parts of Datadog's tooling, which I'm generally impressed by. |
Following-up on this one. There was a question to not only support it for statsd but also for openmetrics. Also as i see we currently have the tag exclusion for EC2 tags. Do we want to make a generic I'm happy to implement this if i got a green flag from maintainer team. tagging @alexb-img and @olivielpeau because you were active on #6526 |
I'm a PM at Datadog and want to chime in here to provide context that we are aware of this feature request and are actively looking for details about the use cases, telemetry pipeline needs and pain-points. We highly encourage customers to reach out to us via our support channel (https://www.datadoghq.com/support/) or your CSM contact about this topic! Thanks! |
@LutaoX Let me just say that it's super awesome to hear from a PM in a public setting, because until now that's definitely not been the norm from my experience with requests on this repo; my company had internally started assuming that filing feature requests in Github to go along with our support cases was pointless. Anyway, I'm pretty sure I've reached out before about this and ended up having a support case where I linked this issue, but I don't want to go dig it back up and cause confusion on the customer support end by necro-ing a 2-3 year old issue. One of the big use cases for us is in Kubernetes, where we don't want the Obviously this would be a different part of the codebase, but there's also the matter of EC2 tags where we want some of those tags (such as I'm only commenting about my use cases here to make sure they get a little bump and to let anyone who's in the same boat as I am just reference my comment when they open their support tickets. Like, if one can summarize one's request by saying "please do what this guy on github said" then hey, I saved them time :) |
I implemented this for k8s workloads, I guess my implementation could be expanded for host tags also. |
I created a support case around the issue not long ago. @LutaoX , I don't know if you can access it, but here's the case: http://help.datadoghq.com/hc/requests/1402253 |
And that's exactly the issue. While you can define which tags you want to have indexed with Metrics Without Limits (and that's good as the indexing is really expensive), you cannot define what is ingested. And though the price per ingested metric is much lower than indexed, if you have many hosts with many custom metrics, you still end up with a super high bill. |
Pragmatically speaking, is there a workaround other than using something other than Datadog to reduce high counts of custom metrics? More precisely, my reading of this GitHub issue and the problem at hand is that the Datadog agent always adds tags, some of which have high cardinalities like IMO ideally the agent would allow for custom tag transformers that receive a tag–value pair from the agent and return a tag–value pair to send to the Datadog service, where returning |
Hi everyone! I think the concern to blacklist HostTags here is super valid despite Datadog-agent being supported or not but currently I have tested with Vector.Dev to disable some unnecessary tags before it was ingested & indexed to Datadog. Our architecture like this Datadog-Agent --> Vector.Dev ---> Datadog Platform Step 1: It is important to note that your DD-Agent version must be higher than 7.45.1
Step 2: Setup vector.dev service (it can be managed by HELM, so please utilise it). The outcome is you have that service up and running and you have a practical endpoint to receive the data Step 3: The rule that I used on vector.dev to discard the unnecessary tags
Finally, you can check your metric after it arrives in Datadog Platform, surely the tag pod_phase / namespace will not appear anymore Hope it helps to rescue everyone (not just short-term but even for a long-term model). |
We have this same issue as well, particularly in a Azure AKS environment. |
For those facing issues with high cardinality due to host tags in Datadog, I’d like to share a solution that worked for me. In the official Datadog documentation on DogStatsD metrics submission - host tags, there is a somewhat vague but crucial statement: “The submitted host tag overrides any hostname collected by or configured in the Agent.” Leveraging this, I discovered that by including an empty My breakthrough came from Datadog Agent release 6.6.0, which introduced an enhancement allowing DogStatsD to support the removal of hostnames on events and services checks, similar to metrics, by adding an empty |
@lifttocode Thank you for sharing that. Specifying Relatedly, I tried doing the same for other tags ( |
THIS |
That's good to know; unfortunately, it seems that autodiscovery-scraped metrics (e.g. |
How is this not an obvious feature to include on the agent. At this point this just feels like Datadog is being overly greedy and not implementing this as a way to extract more money out of it's customers. |
Unfortunately we tried to do multiple options with vector as a proxy but:
Then you can compare the results and check how much your data differs, especially when you sum the series by environment tags. Is worth to mentioned that by default We have a lot of metrics that only need 1-2 tags like When we dropped any other tag everything works as expected and summary data points are almost identical. The only working solution is enabling DD MWL and reduce indexed tags; droping |
It would be great if it was possible to strip off Host tags off of metrics. Tags such as what autoscaling group a metric is coming from is not very valuable and can clutter the tags for a particular metric. Being able to exclude tags based on a regex would enable to strip multiple at a time and allow to strip generated tags.
The text was updated successfully, but these errors were encountered: