Publications

(2024). Mixed Newton Method for Optimization in Complex Spaces.

PDF Cite DOI arXiv

(2024). Gradient Clipping Improves AdaGrad when the Noise Is Heavy-Tailed.

PDF Cite Code DOI arXiv

(2024). Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning.

PDF Cite Code DOI arXiv

(2023). Bregman Proximal Method for Efficient Communications under Similarity.

PDF Cite DOI arXiv