Kyunghyun Cho
NYU professor and Genentech / Prescient Design AI lead; co-inventor of GRU and seq2seq with attention.
- Role: professor
- Currently at: New York University
- Twitter: twitter.com/kchonyc
- Scholar: scholar.google.com/citations
- Papers: 30
Authored papers (30)
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning
arXiv 2025
MIST: Mutual Information Via Supervised Training
arXiv 2025
HERITAGE: An End-to-End Web Platform for Processing Korean Historical Documents in Hanja
arXiv 2025
AION-1: Omnimodal Foundation Model for Astronomical Sciences
arXiv 2025
Aioli: A Unified Optimization Framework for Language Model Data Mixing
arXiv 2024
When Does Classical Chinese Help? Quantifying Cross-Lingual Transfer in Hanja and Kanbun
arXiv 2024
Large-Scale Targeted Cause Discovery with Data-Driven Learning
arXiv 2024
Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning
arXiv 2024
Improving Code Generation by Training with Natural Language Feedback
arXiv 2023
Latent State Models of Training Dynamics
arXiv 2023
Regularizing with Pseudo-Negatives for Continual Self-Supervised Learning
arXiv 2023
Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs
arXiv 2023
System-Level Natural Language Feedback
arXiv 2023
AstroCLIP: A Cross-Modal Foundation Model for Galaxies
arXiv 2023
Training Language Models with Language Feedback at Scale
arXiv 2023
Towards Understanding and Improving GFlowNet Training
arXiv 2023
A Non-monotonic Self-terminating Language Model
arXiv 2022
KLUE: Korean Language Understanding Evaluation
arXiv 2021
NaturalProofs: Mathematical Theorem Proving in Natural Language
arXiv 2021
AdapterHub: A Framework for Adapting Transformers
EMNLP 2020
Capacity, Bandwidth, and Compositionality in Emergent Language Learning
arXiv 2019
Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models
ICLR 2020
Passage Re-ranking with BERT
arXiv 2019
BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model
SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine
arXiv 2017
End-to-End Goal-Driven Web Navigation
Convolutional Recurrent Neural Networks for Music Classification
arXiv 2016
Natural Language Understanding with Distributed Representation
arXiv 2015
Describing Videos by Exploiting Temporal Structure
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
arXiv 2014
Frequent co-authors
10 from 30 papers