安装anaconda之后,创建虚拟环境
conda create -n MinerU python=3.10
conda activate MinerU
pip install -U magic-pdf==1.2.2 --user --extra-index-url https://wheels.myhloli.com
pip install Pillow -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install paddlepaddle -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install opencv-python -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install ultralytics -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install doclayout_yolo -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install pycocotools -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install detectron2 --extra-index-url https://myhloli.github.io/wheels/
pip install timm -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install unimernet -i https://pypi.tuna.tsinghua.edu.cn/
pip install paddleocr -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install rapid_table -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install struct_eqtable -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install openai -i https://pypi.tuna.tsinghua.edu.cn/simple
pip install rapidocr_onnxruntime -i https://pypi.tuna.tsinghua.edu.cn/simple
然后需要安装支撑的AI模型
wget https://github.com/opendatalab/MinerU/raw/master/scripts/download_models_hf.py -O download_models_hf.py
python download_models_hf.py
之后便能运用cpu使用命令转换了。
如果要用GPU则需要修改magic-pdf.json,且安装PaddlePaddle的GPU版本。
命令格式:magic-pdf -p "pdf所在路径" -o "输出到哪个文件夹" -m auto

482

被折叠的 条评论
为什么被折叠?



