New paper out

“Benchmarking Optimizers for Large Language Model Pretraining”, joint work with Matteo Pagliardini and Martin Jaggi.

Andrii Semenov
ML Researcher

My research interests include Large-Scale and Stochastic Optimization.