Explanation of the paper, along with examples and implementation for momentum optimizer.