publications
publications by categories in reversed chronological order. generated by jekyll-scholar.
2025
2024
- ICLR WS. Bridging Lottery ticket and Grokking: Is Weight Norm Sufficient to Explain Delayed Generalization? In ICLR 2024 Workshop on Bridging the Gap Between Practice and Theory in Deep Learning, 2024
- NeurIPS. ADOPT: Modified Adam Can Converge with Any $\beta_2$ with the Optimal Rate. In The Thirty-eighth Annual Conference on Neural Information Processing Systems, NeurIPS, 2024