- Postgraduate student at Sun Yat-sen University
- LLM Inference, HPC, Simulators, GPU, Architecture
- vLLM [Benchmark] Refactor sample_requests in benchmark_throughput link
- vLLM [Bugfix] fix automatic prefix args and add log info link
- vLLM [Minor Fix] Fix comments in benchmark_serving link
- vLLM [Minor Fix] Remove unused code in benchmark_prefix_caching.py link
- TVM [Doc] Fix minor error in "Expressions in Relay" link
- TVM [Doc] Fix minor error in doc (Add an operator to Relay) link