GPU Server Sizing Methodology For LLM Inference Solutions: A Formalized Approach To Infrastructure Capacity Planning — Bogdan Garbar, Pavel Sozonov, Dmitry Velibekov, Pavel Lukash, Yaroslav Kotov | Kutubxona