-
Notifications
You must be signed in to change notification settings - Fork 335
Skyline analyzer Crashed on 2k Metric's #121
Comments
@mehmood-007 you may be better off posting to https://groups.google.com/forum/#!forum/skyline-dev with any queries as this skyline is an Archived Project. If you do post, add as many details about your setup as you can, it generally makes things quicker to diagnosis. |
Thanks for your response and apologies for the delay following is the setting.py and how i'm starting skyline analyzer . but some times it crashes automatically and no logs in /var/log/analyzer.log root@skyline:/opt/skyline/bin# ./analyzer.d start settings.py #The path for the Redis unix socket #The Skyline logs directory. Do not include a trailing slash. #The Skyline pids directory. Do not include a trailing slash. #Metrics will be prefixed with this value in Redis. #The Horizon agent will make T'd writes to both the full namespace and the #This is the rolling duration that will be stored in Redis. Be sure to pick a #This is the duration of the 'mini' namespace, if you are also using the #If you have a Graphite host set up, set this metric to get graphs on #The Graph url used to link to Graphite (Or another graphite dashboard) #If you have a Graphite host set up, set its Carbon port. #If you have Oculus set up, set this metric to set the clickthrough on the """ #This is the location the Skyline agent will write the anomalies file to disk. #This is the number of processes that the Skyline analyzer will spawn. #This is the duration, in seconds, for a metric to become 'stale' and for #This is the minimum length of a timeseries, in datapoints, for the analyzer #Sometimes a metric will continually transmit the same number. There's no need #By default, the analyzer skips a metric if it it has transmitted a single #The canary metric should be a metric with a very high, reliable resolution #These are the algorithms that the Analyzer will run. To add a new algorithm, #This is the number of algorithms that must return True before a metric is #This is to enable second order anomalies. This is an experimental feature, so #This enables alerting. #This is the config for which metrics to alert on and which strategy to use for each. #Each alert module requires additional information.
} #The IP address for Horizon to listen on. Defaults to gethostname() #This is the port that listens for Graphite pickles over TCP, sent by Graphite's #This is the port that listens for Messagepack-encoded UDP packets. #This is how big a 'chunk' of metrics will be before they are added onto #This is the maximum allowable length of the processing queue before new #This is the number of Roomba processes that will be spawned to trim #Normally Roomba will clean up everything that is older than FULL_DURATION #The Horizon agent will ignore incoming datapoints if their timestamp #These are metrics that, for whatever reason, you do not want to store """ #The IP address for the webapp #The port for the webapp |
@mehmood-007 OK - you did not read the https://groups.google.com/forum/#!forum/skyline-dev :) First two things that really stand out there that may be problem are your Try setting those to lower values similar to the number of CPUs you have, .e.g:
Hopefully you will find that stops the crashing. |
@earthgecko Thanks for your help. I have configured this ( ANALYZER_PROCESSES=2 , WORKER_PROCESSES=2) and currently i'm relaying 2000s metrics/min and I am currently analyzing the stability of the code for the mentioned parameters. Similarly in future I have to pass 25000s metrics/min. Will it be capable to analyze it ? |
@mehmood-007 Do not take this a firm benchmark, however it is definitely possible to push 40000 metrics per minute with 4 CPUs and 4GB RAM on a SSD backed server - running with the following settings:
This allows overhead for redis. |
@earthgecko Thanks i'll implement this and can you please inform me on how many metrics are you passing to skyline |
Skyline is crashing every time it reaches 2k Metrics. Analyzer.log stops recording any metrics. While horizon logs are working fine. Please advise on how can i fix these issues and are there any alternatives available in the market that I can look into.
The text was updated successfully, but these errors were encountered: