Andrii Semenov
Andrii Semenov
Home
Education
Work Experience
Posts
Publications
Talks
Projects
Light
Dark
Automatic
Posts
New paper out
“Adaptive Regularized Newton Method with Inexact Hessian” – joint work with Aleksandr Shestakov, Nail Bashirov, Alexander Gasnikov, Martin Takáč, Aleksandr Beznosikov and Dmitry Kamzolov.
Dec 9, 2025
1 min read
PDF
arXiv
I will present 2 papers at EurIPS 2025
Between December 2 and December 7, I will attend EurIPS 2025 in Copenhagen, an event officially endorsed by NeurIPS. I will present two papers there: “Benchmarking Optimizers for Large Language Model Pretraining” and “Just a Simple Transformation is Enough for Data Protection in Split Learning”.
Nov 24, 2025
1 min read
ICOMP, 1 accepted paper (with Oral Presentation)
“Adaptive Regularized Newton Method with Inexact Hessian” – joint work with Aleksandr Shestakov, Nail Bashirov, Alexander Gasnikov, Martin Takáč, Aleksandr Beznosikov and Dmitry Kamzolov.
Sep 15, 2025
1 min read
PDF
arXiv
New paper out
“Apertus: Democratizing Open and Compliant LLMs for Global Language Environments” – a project by Team Apertus within the Swiss AI Initiative.
Sep 2, 2025
1 min read
arXiv
Technical Report
Hugging Face
Swiss AI
New paper out
“Benchmarking Optimizers for Large Language Model Pretraining” – joint work with Matteo Pagliardini and Martin Jaggi.
Sep 1, 2025
1 min read
PDF
arXiv
New paper out
“Gradient-Normalized Smoothness for Optimization with Approximate Hessians” – joint work with Martin Jaggi and Nikita Doikov.
Jun 16, 2025
1 min read
PDF
arXiv
ICML, 1 accepted paper
“Clipping Improves Adam-Norm and AdaGrad-Norm when the Noise Is Heavy-Tailed” (with Savelii Chezhegov, Yaroslav Klyukin, Aleksandr Beznosikov, Alexander Gasnikov, Samuel Horváth, Martin Takáč and Eduard Gorbunov).
May 1, 2025
1 min read
PDF
arXiv
Visiting Martin Takáč at MBZUAI
Between 15 February and 15 April, I will work as a Visiting Student at the Machine Learning Department of MBZUAI under the supervision of Professor Martin Takáč.
Feb 14, 2025
0 min read
New paper out
“Sign Operator for Coping with Heavy-Tailed Noise: High Probability Convergence Bounds with Extensions to Distributed Optimization and Comparison Oracle” – joint work with Nikita Kornilov, Philip Zmushko, Alexander Gasnikov and Aleksandr Beznosikov.
Feb 11, 2025
1 min read
PDF
arXiv
New paper out
“Just a Simple Transformation is Enough for Data Protection in Vertical Federated Learning” – joint work with Philip Zmushko, Alexander Pichugin and Aleksandr Beznosikov.
Dec 16, 2024
1 min read
PDF
arXiv
»
Cite
×