BitRL: Reinforcement Learning with 1-bit Quantized Language Models for Resource-Constrained Edge Deployment — Md. Ashiq Ul Islam Sajid, Mohammad Sakib Mahmood, Md. Tareq Hasan, Md Abdur Rahim, Rafat Ara, Md. Arafat Hossain | Kutubxona