CUDA Kernel Optimization and Counter-Free Performance Analysis for Depthwise Convolution in Cloud Environments — Huriyeh Babak, Melanie Schaller | Kutubxona