TACO: Efficient Communication Compression of Intermediate Tensors for Scalable Tensor-Parallel LLM Training — Man Liu, Xingchen Liu, Xingjian Tian, Bing Lu, Shengkay Lyu, Shengquan Yin, Wenjing Huang, Zheng Wei, Hairui Zhao, Guangming Tan, Dingwen Tao | Kutubxona