vllm_gaudi.envs ¶
VLLM_USE_HPU_CONTIGUOUS_CACHE_FETCH module-attribute ¶
VLLM_USE_HPU_CONTIGUOUS_CACHE_FETCH: bool = True
environment_variables module-attribute ¶
environment_variables: dict[str, Callable[[], Any]] = {
"VLLM_USE_HPU_CONTIGUOUS_CACHE_FETCH": lambda: lower()
in ("1", "true"),
"VLLM_HPU_FORCE_CHANNEL_FP8": lambda: lower()
in ("1", "true")
and get("QUANT_CONFIG", None) is None,
}