Blocking on pending requests despite block == false #43

dacorvo · 2024-04-03T14:21:06Z

I am using the litellm client to benchmark a HuggingFace TGI server.

In token_benchmark_ray.py, req_launcher.get_next_ready() is called periodically to fetch pending results, with the block parameter set to False.

However, the call is actually blocking until all pending requests are complete, which can be very long if I set a high number of concurrent requests (typically 128).

The result is that instead of continuously injecting new requests as they complete, the benchmark script instead sends a batch of max_concurrent_requests, waits for them to complete, then sends another batch.

Is this the expected behaviour ? I double-checked why the call is blocking and from the code in request launcher this seems to be the normal behaviour because it only checks if there are still requests in the ray actor pool.

The text was updated successfully, but these errors were encountered:

llsj14 · 2024-06-18T17:36:20Z

I found that I have the same issue as you. (#56)
I think the get_next_ready function should return the result as soon as the request is finished in nonblock mode.

llsj14 mentioned this issue Jun 18, 2024

fix: subsequent requests cannot be sent until 'num_concurrent_requests' requests have all finished in non-block mode #57

Closed

llsj14 mentioned this issue Jul 1, 2024

fix: subsequent requests cannot be sent until 'num_concurrent_requests' requests have all finished in non-block mode #59

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Blocking on pending requests despite block == false #43

Blocking on pending requests despite block == false #43

dacorvo commented Apr 3, 2024 •

edited

Loading

llsj14 commented Jun 18, 2024 •

edited

Loading

Blocking on pending requests despite block == false #43

Blocking on pending requests despite block == false #43

Comments

dacorvo commented Apr 3, 2024 • edited Loading

llsj14 commented Jun 18, 2024 • edited Loading

dacorvo commented Apr 3, 2024 •

edited

Loading

llsj14 commented Jun 18, 2024 •

edited

Loading