logo
0
0
WeChat Login
Forkfromgaildev/autoglm-phone-9B-desktop, ahead:main1 commits

curl -LsSf https://astral.sh/uv/install.sh | sh source $HOME/.local/bin/env uv venv source .venv/bin/activate uv pip install vllm -i https://mirrors.cloud.tencent.com/pypi/simple

python -m vllm.entrypoints.openai.api_server
--model /workspace/AutoGLM-Phone-9B
--tensor-parallel-size 1
--gpu-memory-utilization 0.9

开放端口访问

http://localhost:8000/v1/chat/completions

About

No description, topics, or website provided.
Language
Shell56.1%
Python19.9%
Dockerfile15.8%
Jinja8.3%