Use token_ids
to track the FSM state for each sequence in the vLLM integration
#1582
Job | Run time |
---|---|
7m 37s | |
15s | |
7m 50s | |
0s | |
15m 42s |