- 💼 Software Engineer at Meituan, specialized in GPU inference using TensorFlow/TensorRT for CTR and PyTorch for LLMs. Earlier at Tencent
- 📫 Contact: [email protected]
- beijing
-
05:10
(UTC +08:00)
Popular repositories Loading
-
sglang
sglang PublicForked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python 2
-
vllm
vllm PublicForked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
-
torch-int
torch-int PublicForked from Guangxuan-Xiao/torch-int
This repository contains integer operators on GPUs for PyTorch.
Python
-
ControlNet_TensorRT
ControlNet_TensorRT PublicForked from TRT2022/ControlNet_TensorRT
天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案
Python
-
-
lmdeploy
lmdeploy PublicForked from InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Python
If the problem persists, check the GitHub status page or contact support.