Sarina Li

Problem

With the new modifications to VectorAdam, the momentum can be projected out completely at larger learning rates. To avoid this, the momentum vector can instead be moved along the sphere using parallel transport.
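To make the difference concrete, here is a minimal NumPy sketch comparing the two ways of carrying a momentum vector from one point on the unit sphere to another. It is not the VectorAdam code itself: the function names `project_to_tangent` and `parallel_transport` are hypothetical, the sphere is assumed to be the unit sphere centred at the origin, and the transport uses the standard closed form for the shortest geodesic (valid when the two points are not antipodal).

```python
import numpy as np


def project_to_tangent(y, v):
    """Orthogonally project v onto the tangent plane of the unit sphere at y."""
    return v - np.dot(y, v) * y


def parallel_transport(x, y, v):
    """Transport a tangent vector v at x to y along the shortest geodesic.

    Assumes |x| = |y| = 1, v is tangent at x (v . x = 0), and y != -x.
    """
    return v - np.dot(y, v) / (1.0 + np.dot(x, y)) * (x + y)


# Old point x (north pole), new point y (on the equator), and a momentum m
# that is tangent at x and points straight towards y.
x = np.array([0.0, 0.0, 1.0])
y = np.array([1.0, 0.0, 0.0])
m = np.array([1.0, 0.0, 0.0])

projected = project_to_tangent(y, m)
transported = parallel_transport(x, y, m)

print("projected:  ", projected, "norm =", np.linalg.norm(projected))
print("transported:", transported, "norm =", np.linalg.norm(transported))
```

In this example the step is a quarter of a great circle, so the momentum is parallel to the new normal and plain projection removes it entirely. Parallel transport returns a vector that is tangent at `y` and has the same length as `m`.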

Additionally, can this algorithm be applied to more than just spheres? What about ellipsoids? What about manifolds in general?

To address these problems, I’ll be documenting my work here :)