Memory requirements? #5
-
|
I attempted to run whisper on an audio file using the medium model, and I got this:
Which eventually ran out of memory (this machine has 8GB) and was killed by the OOM killer. Would it be possible to document the estimated memory requirements for running whisper? EDIT: it looks like the cache migration worked, but it's whisper itself that's ballooning memory. |
Beta Was this translation helpful? Give feedback.
Replies: 8 comments 18 replies
-
|
I had no problems running medium size model using 8 GB card (GTX 1070) |
Beta Was this translation helpful? Give feedback.
-
|
I am interested I'm in this too. What would be a reasonable time to process 2 minutes of recorded phone conversation? I am testing on Win 11 virtual machine with 4gb ram and host I i9-9900K CPU. It takes quite a while to process 2 minutes audio. Medium model throws "not enough memory" error |
Beta Was this translation helpful? Give feedback.
-
|
I've just added Available models and languages section in README.md; to quote:
The VRAM requirements are from simulations using |
Beta Was this translation helpful? Give feedback.
-
|
Additional testing on:
|
Beta Was this translation helpful? Give feedback.
-
|
And more testing on diff GPU:
8vCORE avx512 T4 16GB
24vCORE avx512 RTX A5000 24GB
24vCORE avx512 A30 24GB
12vCORE avx512 A2 14GB
6vCORE avx512 A100 40GB
|
Beta Was this translation helpful? Give feedback.
-
|
Is it possible to do batch processing on the audio files so that we can transcribe more audio files in less time? |
Beta Was this translation helpful? Give feedback.
-
|
@n ➜ /workspaces/whisper (main ✗) $ whisper Jtest.mp3 --model small Why is the procedure KILLED? Weird |
Beta Was this translation helpful? Give feedback.
-
|
哈哈哈哈看到这个帖子我仿佛看到了三个月前的自己! 那天晚上,我和这个OOM killer对视了整整一个时辰。 世界上有一种程序员,他们在深夜里和内存较劲。不是为了寻找爱情,而是为了寻找一段能跑得通的whisper代码。 我当时的配置:
结果跑medium模型的时候,我的内存条直接给我表演了一个"原地升天"。 后来我悟了——AI的世界里,没有"足够"的内存,只有"更贵"的云端API。 实用建议(正经脸):
我的完整踩坑记录写在这里了,有兴趣的可以去康康: (虽然那个帖子主要是讲AI幻觉的,但本质上都是"你以为可以,实际不行"的故事 😂) P.S. 我已经皈依云端API教了,真香。 |
Beta Was this translation helpful? Give feedback.
I've just added Available models and languages section in README.md; to quote:
tiny.entinybase.enbasesmall.ensmallmedium.enmediumlargeThe VRAM requirements are from simulations using
torch.cuda.set_per_process_memory_fraction(), so it may not be actually reflecting what happens in e.g. a GPU with exactly 5 GB VRAM.