Kyunghyun Cho

NYU professor and Genentech / Prescient Design AI lead; co-inventor of GRU and seq2seq with attention.

Role
professor
Papers
30

Authored papers

30

RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning

arXiv 2025

2025

MIST: Mutual Information Via Supervised Training

arXiv 2025

2025

HERITAGE: An End-to-End Web Platform for Processing Korean Historical Documents in Hanja

arXiv 2025

2025

AION-1: Omnimodal Foundation Model for Astronomical Sciences

arXiv 2025

2025

Aioli: A Unified Optimization Framework for Language Model Data Mixing

arXiv 2024

2024

When Does Classical Chinese Help? Quantifying Cross-Lingual Transfer in Hanja and Kanbun

arXiv 2024

2024

Large-Scale Targeted Cause Discovery with Data-Driven Learning

arXiv 2024

2024

Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning

arXiv 2024

2024

Improving Code Generation by Training with Natural Language Feedback

arXiv 2023

2023

Latent State Models of Training Dynamics

arXiv 2023

2023

Regularizing with Pseudo-Negatives for Continual Self-Supervised Learning

arXiv 2023

2023

Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs

arXiv 2023

2023

System-Level Natural Language Feedback

arXiv 2023

2023

AstroCLIP: A Cross-Modal Foundation Model for Galaxies

arXiv 2023

2023

Training Language Models with Language Feedback at Scale

arXiv 2023

2023

Towards Understanding and Improving GFlowNet Training

arXiv 2023

2023

A Non-monotonic Self-terminating Language Model

arXiv 2022

2022

KLUE: Korean Language Understanding Evaluation

arXiv 2021

2021

NaturalProofs: Mathematical Theorem Proving in Natural Language

arXiv 2021

2021

AdapterHub: A Framework for Adapting Transformers

EMNLP 2020

2020

Capacity, Bandwidth, and Compositionality in Emergent Language Learning

arXiv 2019

2019

Mixout: Effective Regularization to Finetune Large-scale Pretrained Language Models

ICLR 2020

2019

Passage Re-ranking with BERT

arXiv 2019

2019

BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model

2019

SearchQA: A New Q&A Dataset Augmented with Context from a Search Engine

arXiv 2017

2017

End-to-End Goal-Driven Web Navigation

2016

Convolutional Recurrent Neural Networks for Music Classification

arXiv 2016

2016

Natural Language Understanding with Distributed Representation

arXiv 2015

2015

Describing Videos by Exploiting Temporal Structure

2015

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling

arXiv 2014

2014

Affiliations

Currently at

New York University

professor · university lab

Frequent co-authors

10

from 30 papers