GPU memory usage/requirement?

by Bilibili - opened Jun 28, 2023

Thanks for this work!

Since the original StarCoder requires 60+ GB of GPU RAM for inference, I wonder how much the GPTQ version needs. Could the model run inference on a V100-32G?

Bilibili changed discussion title from GPU memory usage peak? to GPU memory usage requirement? Jun 28, 2023

Bilibili changed discussion title from GPU memory usage requirement? to GPU memory usage/requirement? Jun 28, 2023

Njax

Jul 29, 2023

I'm totally new to GPTQ and not sure how to calculate the exact requirements, but it seems happy with 20-30 GB of system RAM, and I have only 12 GB used on my GPU.

TheBloke

Owner Jul 29, 2023

Yes, 32GB is more than enough VRAM for nearly any model in GPTQ. This one needs around 12GB, yeah.
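
For a rough sense of where these numbers come from, here is a back-of-the-envelope sketch of weight memory at different precisions. It assumes a parameter count of about 15.5B (the published size of the original StarCoder) and 4-bit GPTQ quantization; neither figure is stated in this thread, and the helper name `weight_memory_gb` is made up for illustration. It counts the weights only.

```python
def weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: ~15.5B parameters, as published for the original StarCoder.
NUM_PARAMS = 15.5e9

# Assumption: the GPTQ build uses 4-bit weights.
PRECISIONS = [("fp32", 32), ("fp16", 16), ("gptq-4bit", 4)]

estimates = {name: weight_memory_gb(NUM_PARAMS, bits) for name, bits in PRECISIONS}

for name, gb in estimates.items():
    print(f"{name}: ~{gb:.2f} GB for weights only")
```

Under these assumptions the fp32 weights come to about 62 GB, in line with the "60+ GB" figure in the question, and the 4-bit weights to just under 8 GB. The remaining gap up to the roughly 12 GB observed is plausibly quantization metadata, activations, the attention cache, and CUDA runtime overhead, all of which vary with context length and backend.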
