[Distinguished Lecture] On the Convergence of Stochastic Gradient Descent with Bandwidth-based Step Size