Fix vLLM queue overflow with serialized semaphore release
Multiple workers could acquire the semaphore in rapid succession when the queue
dropped, causing bursts of 1000+ page submissions and vLLM crashes.
The cause was a race condition in the semaphore release logic: multiple workers
could evaluate the release conditions and release simultaneously, before the
queue size was updated.
Add an asyncio.Lock() to serialize release checks, so that each evaluation and
release happens atomically. All condition checks now happen inside the lock.