Publications

2026

  1. ArXiv
    AMMA: A Multi-Chiplet Memory-Centric Architecture for Low-Latency 1M Context Attention Serving
    Zhongkai Yu, Haotian Ye, Chenyang Zhou, Ohm Rishabh Venkatachalam, Zaifeng Pan, Zhengding Hu, Junsung Kim, Won Woo Ro, Po-An Tsai, Shuyi Pei, Yangwook Kang, and Yufei Ding
    arXiv preprint arXiv:2604.26103, 2026
  2. ArXiv
    ChipBench: A Next-Step Benchmark for Evaluating LLM Performance in AI-Aided Chip Design
    Zhongkai Yu, Chenyang Zhou, Yichen Lin, Hejia Zhang, Haotian Ye, Junxia Cui, Zaifeng Pan, Jishen Zhao, and Yufei Ding
    arXiv preprint arXiv:2601.21448, 2026