Improve Cuda compatibility of vllm-openai image #2393

RonanKMcGovern · 2024-01-09T13:23:29Z

Currently the 'https://hub.docker.com/r/vllm/vllm-openai/' image uses Cuda 12.1 - this runs into a lot of cuda versioning issues depending on the drivers used on the underlying GPU.

This makes the image an inconsistent starting point for running on services like vast ai or runpod.

Could the docker image be updated to more dynamically support cuda versions from 11.8 and up?

RonanKMcGovern · 2024-01-31T14:33:11Z

for comparison, the text-generation-inference docker image doesn't have these issues. See here

LronDC · 2024-05-20T03:20:08Z

really expect an official image that supports cuda 11.8

or please 🙏 provide a guide on how to build cuda 11.8 version vllm-openai image

RonanKMcGovern mentioned this issue Jan 23, 2024

Mixtral Instruct AWQ vLLM API TrelisResearch/one-click-llms#2

Closed

hmellor added the feature request label Apr 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve Cuda compatibility of vllm-openai image #2393

Improve Cuda compatibility of vllm-openai image #2393

RonanKMcGovern commented Jan 9, 2024

RonanKMcGovern commented Jan 31, 2024

LronDC commented May 20, 2024 •

edited

Loading

Improve Cuda compatibility of vllm-openai image #2393

Improve Cuda compatibility of vllm-openai image #2393

Comments

RonanKMcGovern commented Jan 9, 2024

RonanKMcGovern commented Jan 31, 2024

LronDC commented May 20, 2024 • edited Loading

LronDC commented May 20, 2024 •

edited

Loading