vllm_worker is incompatible with vllm 0.8.x #3704

Open (opened on Mar 24, 2025)

While vllm_worker works on vLLM 0.7.x, it fails at startup on the latest release (0.8.2), which is the one that supports mistral-small and gemma. The engine object there does not seem to have the "engine" attribute:

2025-03-24 22:04:27 | ERROR | stderr | Traceback (most recent call last):
2025-03-24 22:04:27 | ERROR | stderr |   File "/p/haicluster/llama/FastChat/fastchat/serve/vllm_worker.py", line 291, in <module>
2025-03-24 22:04:27 | ERROR | stderr |     worker = VLLMWorker(
2025-03-24 22:04:27 | ERROR | stderr |              ^^^^^^^^^^^
2025-03-24 22:04:27 | ERROR | stderr |   File "/p/haicluster/llama/FastChat/fastchat/serve/vllm_worker.py", line 57, in __init__
2025-03-24 22:04:27 | ERROR | stderr |     self.tokenizer = llm_engine.engine.tokenizer
2025-03-24 22:04:27 | ERROR | stderr |                      ^^^^^^^^^^^^^^^^^
2025-03-24 22:04:27 | ERROR | stderr | AttributeError: 'AsyncLLM' object has no attribute 'engine'
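
A possible workaround, sketched under assumptions: the traceback shows `vllm_worker.py` reading `llm_engine.engine.tokenizer`, which fails because the `AsyncLLM` object has no `engine` attribute. The helper below tries that older path first and then falls back to a `tokenizer` attribute directly on the engine object. That `AsyncLLM` exposes `tokenizer` directly is an assumption and should be checked against the installed vLLM version. `resolve_tokenizer` is a hypothetical name, not part of FastChat or vLLM, and the stand-in objects only mimic the two attribute layouts.

```python
from types import SimpleNamespace


def resolve_tokenizer(llm_engine):
    """Return the tokenizer from either attribute layout.

    Hypothetical helper: not part of FastChat or vLLM.
    """
    # Older layout, as used on line 57 of vllm_worker.py in the traceback:
    # llm_engine.engine.tokenizer
    inner = getattr(llm_engine, "engine", None)
    if inner is not None and hasattr(inner, "tokenizer"):
        return inner.tokenizer

    # ASSUMPTION: the newer engine object exposes `tokenizer` directly.
    if hasattr(llm_engine, "tokenizer"):
        return llm_engine.tokenizer

    raise AttributeError(
        f"{type(llm_engine).__name__!r} exposes neither "
        "'engine.tokenizer' nor 'tokenizer'"
    )


# Stand-ins that only mimic the two layouts; these are not real vLLM objects.
old_style = SimpleNamespace(engine=SimpleNamespace(tokenizer="old-tokenizer"))
new_style = SimpleNamespace(tokenizer="new-tokenizer")

print(resolve_tokenizer(old_style))
print(resolve_tokenizer(new_style))
```

In the worker, the failing assignment would then become `self.tokenizer = resolve_tokenizer(llm_engine)`. This only addresses the attribute lookup shown in the traceback; other parts of the worker may depend on further 0.7.x-specific details.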
