[BugFix] Fix recovery logic for sequence group #2186

WoosukKwon · 2023-12-18T21:33:22Z

When a sequence group has N sequences, N - 1 of which are terminated, the remaining sequence can be recovered from preemption using the re-computation mechanism. However, currently the scheduler assumes that every sequence in a waiting sequence group is in the waiting state. This PR fixes this error.

An example test case:

from vllm import LLM, SamplingParams

# Configured for A100-80GB GPU.
llm = LLM("meta-llama/Llama-2-13b-hf", gpu_memory_utilization=0.5, swap_space=20)

num_prompts = 1000
prompt_len = 300
llm.generate(
    prompt_token_ids=[[0] * prompt_len for _ in range(num_prompts)],
    sampling_params=SamplingParams(max_tokens=100, n=2))

WoosukKwon · 2023-12-20T01:04:16Z

@zhuohan123 This PR is ready for review. Please take a look at it!

zhuohan123

LGTM! Thanks for the fix!

Fix recovery logic for sequence group

1f8e16a

WoosukKwon requested a review from zhuohan123 December 18, 2023 21:33

Fix block manager

b084c25

zhuohan123 approved these changes Dec 21, 2023

View reviewed changes

WoosukKwon merged commit a1b9cb2 into main Dec 21, 2023
2 checks passed

WoosukKwon deleted the fix-recovery branch December 21, 2023 05:52

WoosukKwon mentioned this pull request Dec 21, 2023

[FIX] Fix shape mismatch for swapped sequences when logprobs > 0 #1971

Open

hongxiayang pushed a commit to hongxiayang/vllm that referenced this pull request Feb 13, 2024

[BugFix] Fix recovery logic for sequence group (vllm-project#2186)

c40bbd1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BugFix] Fix recovery logic for sequence group #2186

[BugFix] Fix recovery logic for sequence group #2186

WoosukKwon commented Dec 18, 2023

WoosukKwon commented Dec 20, 2023

zhuohan123 left a comment

[BugFix] Fix recovery logic for sequence group #2186

[BugFix] Fix recovery logic for sequence group #2186

Conversation

WoosukKwon commented Dec 18, 2023

WoosukKwon commented Dec 20, 2023

zhuohan123 left a comment

Choose a reason for hiding this comment