gty111

Follow

🎯

Focusing is all you need

Tianyu Guo gty111

🎯

Focusing is all you need

Follow

Ph.D. student of Sun Yat-Sen University, prior intern @Tencent, @rednote-hilab and @MoonshotAI. Simulators, GPU, architecture, AI Infra, MLSys

164 followers · 137 following

Sun Yat-sen University
Guangzhou
00:36 (UTC +08:00)
https://gty111.github.io/info/
https://orcid.org/0009-0005-2979-4486

Achievements

Achievements

Highlights

Pro

gty111/README.md

PH.D. student at Sun Yat-sen university
AI Infra, MLSys, Simulaters, GPU architecture
Visit my personal web

News

[2026/01/13] [LMSYS blog] EPD Disaggregation: Elastic Encoder Scaling for Vision-Language Models in SGLang
[2025/06/27] [arXiv] [Code] gLLM is accepted by SC'25. Congratulations!
[2025/05/28] [arXiv] [Code] EFIM is accepted by Euro-Par'25
[2025/04/27] [arXiv] [Code] We have released gLLM, an efficient pipeline parallelism inference engine for LLM.

PRs for Project

vLLM

SGLang

xDiT

DistVAE

Fix batch dimension

TVM

Pinned Loading

gLLM gLLM Public

An Efficient and Versatile Inference Engine for Distributed LLM Serving

Python 63 5
PTX-EMU PTX-EMU Public

PTX-EMU is a simple emulator for CUDA program.

C++ 40 7
GEMM_MMA GEMM_MMA Public

Optimize GEMM with tensorcore step by step

40 8
SimpleUseGpgpuSim SimpleUseGpgpuSim Public

GPGPU-SIM 使用篇

Shell 14 1
GEMM_WMMA GEMM_WMMA Public

GEMM by WMMA (tensor core)

Cuda 15 9
ConvNN ConvNN Public

A simple CNN training framework support on CPU and GPU(CUDNN)

C++ 3