Distributed Generative Inference of LLM at Internet Scales with Multi-Dimensional Communication Optimization — Jiu Chen, Shuangyan Yang, Xu Xiong, Hexiao Duan, Xinran Zhang, Jie Ren, Dong Li | Kutubxona