I am an M.S. student in Computer Engineering at the University of California, San Diego, where I am a research intern in Prof. Yufei Ding’s Lab. My research focuses on efficient AI infrastructure, LLM inference acceleration, hardware-software co-design, and memory-centric architecture.
My recent work explores memory-centric architectures for long-context LLM attention, token-level speculative decoding for diffusion language models, and AI for chip design benchmarks. More broadly, I aim to build efficient and reliable AI infrastructure by co-designing algorithms, runtime systems, memory architectures, and hardware-aware execution strategies for next-generation AI workloads.
Email: h5ye@ucsd.edu
Education
- University of California, San Diego (UCSD), 2025 - 2027 (expected)
M.S. in Computer Engineering
Advisor: Prof. Yufei Ding - Sun Yat-Sen University (SYSU), 2021 - 2025
B.E. in Microelectronic Science and Engineering
Experience
- Prof. Yufei Ding’s Lab, UCSD, 2025.10 - Present
Research Intern - Xuanyuan Investment Co., Ltd., 2025.06 - 2025.09
Quantitative Research Intern - Hichip Semiconductor Co., Ltd., 2025.02 - 2025.05
Digital Designer Intern - Hardware-Software Co-Design & Parallel Computing Group, SYSU, 2022.06 - 2023.07
Researcher, RISC-V GPGPU Design
news
| Apr 28, 2026 | Our paper AMMA: A Multi-Chiplet Memory-Centric Architecture for Low-Latency 1M Context Attention Serving was released on arXiv. |
|---|---|
| Jan 29, 2026 | Our paper ChipBench: A Next-Step Benchmark for Evaluating LLM Performance in AI-Aided Chip Design was released on arXiv. |
| Jan 01, 2026 | Started working on SimSD: Simple Speculative Decoding in Diffusion Language Models. |
| Oct 01, 2025 | Joined Prof. Yufei Ding’s Lab at UCSD as a research intern. |
| Sep 01, 2025 | Started my M.S. in Computer Engineering at UC San Diego. |