Preliminary kk-strongly convex Lipschitz w.r.t norm ||x||||x|| Gradient descent choose step size tkt_k convergence rate under Lipschitz gradient convergence rate under strong convexity Steepest descent Preliminary -strongly convex is 2-strongly convex. Roughly speaking, the faster a function grows (in either direction), the stronger convexity it has. Lipschitz w.r.t norm A function which satisfies Lipschitz condition cannot grow too fast, precisely, faster than linear. A function cannot be both Lipschitz and strongly convex. http://www.cs.cmu.edu/~ggordon/10725-F12/schedule.html Roughly speaking, if is Lipschitz, . If is Lipschitz, . Gradient descent Consider the expansion (linear approximation plus proximity term) choose step size Backtracking line search. fix with while , update . Exact line search. (usually not possible and it’s not worth it) convergence rate under Lipschitz gradient Assume that is convex and differentiable, ...