The Recurrent Transformer: Greater Effective Depth and Efficient Decoding
Costin-Andrei Oncescu, Depen Morwani, Samy Jelassi, Alexandru Meterez, Mujin Kwun, Sham Kakade