Fixing "CUDA error: no kernel image is available" on 50-series GPUs & installing Flash Attention on the 50 series

Original post, last modified 2025-08-15 10:58:19


This content is licensed under CC 4.0 BY-SA.

Tags

#deep-learning #pytorch #artificial-intelligence


First published 2025-05-27 13:49:56


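On a 50-series GPU, a PyTorch build that contains no kernels compiled for the card's architecture (sm_120) fails with:

CUDA error: no kernel image is available for execution on the device

The fix is to remove the existing PyTorch packages and reinstall them from the CUDA 12.8 (cu128) wheel index: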

pip uninstall torch torchvision torchaudio
pip cache purge
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu128
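To confirm that the reinstalled build actually covers the card, one option is to compare the GPU's compute capability with the architectures the PyTorch binary was compiled for. The following is a minimal sketch and not part of the original steps. `build_supports_device` is a hypothetical helper name, and its compatibility rule is a simplification: an `sm_XY` binary is treated as usable on devices with the same major version and an equal or higher minor version, and `compute_XY` PTX as usable on any equal or newer device. `torch.cuda.get_device_capability` and `torch.cuda.get_arch_list` are existing PyTorch calls.

```python
import re


def build_supports_device(capability, arch_list):
    """Return True if any compiled arch in arch_list can run on the given (major, minor) device.

    Simplified rule (an assumption of this sketch):
      - 'sm_XY' binaries run on devices with the same major and minor >= Y
      - 'compute_XY' PTX can be JIT-compiled for any device >= X.Y
    """
    major, minor = capability
    for name in arch_list:
        match = re.fullmatch(r"(sm|compute)_(\d+)(\d)[a-z]?", name)
        if match is None:
            continue  # skip entries this sketch does not understand
        kind = match.group(1)
        a_major, a_minor = int(match.group(2)), int(match.group(3))
        if kind == "sm" and a_major == major and a_minor <= minor:
            return True
        if kind == "compute" and (a_major, a_minor) <= (major, minor):
            return True
    return False


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed")
    else:
        if torch.cuda.is_available():
            cap = torch.cuda.get_device_capability(0)
            archs = torch.cuda.get_arch_list()
            print("device capability:", cap)
            print("compiled archs:", archs)
            print("supported:", build_supports_device(cap, archs))
        else:
            print("No CUDA device is visible to PyTorch")
```

On a 50-series card the reported capability is (12, 0), so the compiled list needs to include `sm_120` (or suitable PTX) for kernels to launch.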

Clone the Flash Attention source and fetch its submodules:
git clone https://github.com/Dao-AILab/flash-attention
cd flash-attention  # enter the directory
git submodule update --init --recursive

Upgrade the build tooling to the latest version:
pip install --upgrade wheel

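Uninstall ninja, otherwise the build fails! Note that without ninja the build cannot use multiple CPU cores, so expect a long compile.
pip uninstall ninja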

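Set the target architecture and build. "120" is the code for the 50 series (compute capability 12.0); if your GPU uses a different architecture, replace "120" with the code for that architecture.
export FLASH_ATTN_CUDA_ARCHS="120"
python setup.py install  # takes about 1 hour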
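The value "120" follows the usual convention of writing a compute capability (major, minor) as the major digits followed by the minor digit, so 12.0 becomes "120". Below is a small sketch of reading that value from the installed GPU rather than looking it up. `arch_code` is a hypothetical helper name. Whether the `setup.py` of a given flash-attention version accepts a particular code is an assumption to verify against that version.

```python
def arch_code(capability):
    """Format a (major, minor) compute capability as an arch code, e.g. (12, 0) -> '120'."""
    major, minor = capability
    return f"{major}{minor}"


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed; cannot query the GPU")
    else:
        if torch.cuda.is_available():
            code = arch_code(torch.cuda.get_device_capability(0))
            # Print the line to paste into the shell before running setup.py
            print(f'export FLASH_ATTN_CUDA_ARCHS="{code}"')
        else:
            print("No CUDA device is visible to PyTorch")
```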

Update the libstdc++ library inside the conda environment, otherwise an error is raised at runtime:
conda install -c conda-forge libstdcxx-ng

# Verify that flash-attention can be imported and report its version
try:
    import flash_attn
    print("flash-attention is installed, version:", flash_attn.__version__)
except ImportError:
    print("flash-attention is not installed")
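Importing the package only shows that it is installed; a "no kernel image" error surfaces when a kernel is actually launched. The following is a minimal smoke test, offered as a sketch: it runs `flash_attn_func` on small random fp16 tensors and compares the result with PyTorch's `scaled_dot_product_attention`. The function name `smoke_test`, the tensor sizes, and the tolerance are illustrative assumptions.

```python
def smoke_test(atol=1e-2):
    """Launch one flash-attention kernel and compare it with PyTorch's reference attention."""
    try:
        import torch
        import torch.nn.functional as F
        from flash_attn import flash_attn_func
    except (ImportError, OSError) as exc:
        return f"skipped: {exc}"
    if not torch.cuda.is_available():
        return "skipped: no CUDA device is visible"

    torch.manual_seed(0)
    # flash_attn_func expects (batch, seqlen, nheads, headdim) in fp16 or bf16 on the GPU
    q, k, v = (
        torch.randn(2, 128, 4, 64, device="cuda", dtype=torch.float16)
        for _ in range(3)
    )
    out = flash_attn_func(q, k, v, causal=True)

    # scaled_dot_product_attention expects (batch, nheads, seqlen, headdim)
    ref = F.scaled_dot_product_attention(
        q.transpose(1, 2), k.transpose(1, 2), v.transpose(1, 2), is_causal=True
    ).transpose(1, 2)

    max_err = (out.float() - ref.float()).abs().max().item()
    status = "ok" if max_err < atol else "mismatch"
    return f"{status}: max abs error {max_err:.4f}"


print(smoke_test())
```

If the build does not match the GPU architecture, the call to `flash_attn_func` is where a runtime error would be raised, which is the point of running it.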
