Meta-Aligner: Bidirectional Preference-Policy Optimization for Multi-Objective LLMs Alignment — Wenzhe Xu, Biao Liu, Yiyang Sun, Xin Geng, Ning Xu | Kutubxona