Engine Arguments¶

Engine arguments control the behavior of the vLLM engine.

For offline inference, they are part of the arguments to LLM class.
For online serving, they are part of the arguments to vllm serve.

The engine argument classes, EngineArgs and AsyncEngineArgs, are a combination of the configuration classes defined in vllm.config. Therefore, if you are interested in developer documentation, we recommend looking at these configuration classes as they are the source of truth for types, defaults and docstrings.

当传递 JSON 命令行参数时，以下几组参数是等效的

--json-arg '{"key1": "value1", "key2": {"key3": "value2"}}'
--json-arg.key1 value1 --json-arg.key2.key3 value2

此外，列表元素可以使用 + 单独传递

--json-arg '{"key4": ["value3", "value4", "value5"]}'
--json-arg.key4+ value3 --json-arg.key4+='value4,value5'

`EngineArgs`¶

`AsyncEngineArgs`¶