Process stuck with growing message queue #5420
Comments
@jamesaimonetti thanks for reaching out. Backups in the couch_server message queue are often due to frequent database opens and closes, especially when the db handle LRU is full and there are not enough idle handles to replace. So a few things to try could be:
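The mechanism described above (a full handle LRU with no idle handles left to replace) can be illustrated with a toy model. This is only a sketch in Python, not CouchDB's Erlang implementation: the `HandleCache` class, its `max_open` parameter, and the boolean return values are all hypothetical names chosen for illustration.

```python
from collections import OrderedDict


class HandleCache:
    """Toy LRU of open database handles; NOT CouchDB's implementation."""

    def __init__(self, max_open):
        self.max_open = max_open
        self.handles = OrderedDict()  # db name -> number of active users
        self.evictions = 0

    def open(self, name):
        """Return True if a handle was obtained, False if every cached handle is busy."""
        if name in self.handles:
            self.handles[name] += 1
            self.handles.move_to_end(name)  # mark as most recently used
            return True
        if len(self.handles) >= self.max_open and not self._evict_idle():
            return False  # cache is full and nothing is idle
        self.handles[name] = 1
        return True

    def close(self, name):
        """Release one user of the handle; it stays cached until evicted."""
        if self.handles.get(name, 0) > 0:
            self.handles[name] -= 1

    def _evict_idle(self):
        # Scan from least to most recently used for a handle with no users.
        for name, users in self.handles.items():
            if users == 0:
                del self.handles[name]
                self.evictions += 1
                return True
        return False


cache = HandleCache(max_open=2)
cache.open("a")
cache.open("b")
print(cache.open("c"))  # both cached handles are busy, so this open fails
cache.close("a")
print(cache.open("c"))  # "a" is now idle and can be replaced
```

In this model, every open that misses a full cache has to scan for an idle handle, so a workload that touches many more databases than `max_open` spends much of its time opening, closing, and evicting. That is the kind of churn the comment above points to.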
@nickva thanks for the pointers! We were able to halt/1 one of the couch VMs, so we'll poke around in the crash dump and see if anything jumps out. I'm also working on a test tool to load couch up like we were seeing and try to make this happen reliably. Will take your points above and incorporate them into our configs.
@nickva thanks again for the pointers. Turns out the customer had 12 virtual CPUs with other CPU-hogging services running at the same time as a data migration was being run in Couch, causing lots of contention. We've moved their Couch instances to separate servers and increased CPU count and Much appreciated!
We are seeing ever-worsening performance in CouchDB 3.3.3.
Description
Over time, queries to couch take longer and eventually start returning 500s, and performance continues to degrade from there.
We've found a process with a growing mailbox:
Looking at the linked processes we see a lot of db updates appearing to be stuck in do_call:
Steps to Reproduce
This develops over time but appears to be correlated with a number of tasks we run at the beginning of the month.
Expected Behaviour
Don't lock up.
Your Environment
Additional Context
It's a 3-node cluster and we see this on all three nodes.