publications

Conference

  1. AACL2025
    scaling_vector.png
    Interpreting Multi-Attribute Confounding through Numerical Attributes in Large Language Models
    Hirohane Takagi*, Gouki Minegishi*, Shota Kizawa, Issey Sukeda, and Hitomi Yanaka
    In International Joint Conference on Natural Language Processing & Asia-Pacific Chapter of the Association for Computational Linguistics 2025, 2025
    *Equal contribution
  2. Neurips2025
    Reasoning_Graph.gif
    Topology of Reasoning: Understanding Large Reasoning Models through Reasoning Graph Properties
    Gouki Minegishi, Hiroki Furuta, Takeshi Kojima, Yusuke Iwasawa, and Yutaka Matsuo
    In The Thirty-ninth Annual Conference on Neural Information Processing Systems, Neurips, 2025
  3. ICML2025
    ICL.gif
    Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence
    Gouki Minegishi, Hiroki Furuta, Shohei Taniguchi, Yusuke Iwasawa, and Yutaka Matsuo
    In Forty-second International Conference on Machine Learning, ICML, 2025
  4. ICLR2025
    SAE.png
    Rethinking Evaluation of Sparse Autoencoders through the Representation of Polysemous Words
    Gouki Minegishi, Hiroki Furuta, Yusuke Iwasawa, and Yutaka Matsuo
    In The Thirteenth International Conference on Learning Representations, ICLR, 2025
  5. Neurips2024
    ADOPT: Modified Adam Can Converge with Any {}beta_2 with the Optimal Rate
    Shohei Taniguchi, Keno Harada, Gouki Minegishi, Yuta Oshima, Seong Cheol Jeong, and 5 more authors
    In The Thirty-eighth Annual Conference on Neural Information Processing Systems, Neurips, 2024

Conference Workshop

  1. ICLR2025 WS
    In-Context Meta Learning Induces Multi-Phase Circuit Emergence
    Gouki Minegishi, Hiroki Furuta, Shohei Taniguchi, Yusuke Iwasawa, and Yutaka Matsuo
    In ICLR 2025 Workshop on Building Trust in Language Models and Applications, 2025
  2. ICLR2024 WS
    Interpreting Grokked Transformers in Complex Modular Arithmetic
    Hiroki Furuta, Minegishi Gouki, Yusuke Iwasawa, and Yutaka Matsuo
    In ICLR 2024 Workshop on Bridging the Gap Between Practice and Theory in Deep Learning, 2024
  3. ICLR2024 WS
    Bridging Lottery ticket and Grokking: Is Weight Norm Sufficient to Explain Delayed Generalization?
    Minegishi Gouki, Yusuke Iwasawa, and Yutaka Matsuo
    In ICLR 2024 Workshop on Bridging the Gap Between Practice and Theory in Deep Learning, 2024

Journal

  1. TMLR2025
    Grokking.png
    Bridging Lottery Ticket and Grokking: Understanding Grokking from Inner Structure of Networks
    Gouki Minegishi, Yusuke Iwasawa, and Yutaka Matsuo
    Transactions on Machine Learning Research, TMLR, 2025
  2. TMLR2024
    Towards Empirical Interpretation of Internal Circuits and Properties in Grokked Transformers on Modular Polynomials
    Hiroki Furuta, Gouki Minegishi, Yusuke Iwasawa, and Yutaka Matsuo
    Transactions on Machine Learning Research, TMLR, 2024

Preprint

  1. Preprint
    denoising_head.png
    Mechanism of Task-oriented Information Removal in In-context Learning
    Hakaze Cho, Haolin Yang, Gouki Minegishi, and Naoya Inoue
    2025
  2. Preprint
    RL_SFT.png
    RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs
    Kohsei Matsutani, Shota Takashiro, Gouki Minegishi, Takeshi Kojima, Yusuke Iwasawa, and 1 more author
    2025