News
This repo is used for vLLM's interative performance spot check. For automated benchmark, please refer to vLLM's nightly set. The goal for this repo is establish a set of commonly used benchmarks and ...
🙏 @deepseek_ai's highly performant inference engine is built on top of vLLM. Now they are open-sourcing the engine the right way: instead of a separate repo, they are bringing changes to the open ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results