Yoon Kim
- Papers
- 31
Cite
Notes
Only stored in your browser.
Authored papers
31Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models
arXiv 2026
Log-Linear Attention
arXiv 2025
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research
arXiv 2025
Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
arXiv 2025
PaTH Attention: Position Encoding via Accumulating Householder Transformations
arXiv 2025
Data Engineering for Scaling Language Models to 128K Context
arXiv 2024
Training-Free Activation Sparsity in Large Language Models
arXiv 2024
Value Augmented Sampling for Language Model Alignment and Personalization
arXiv 2024
The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities
arXiv 2024
Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization
arXiv 2024
Improving Black-box Robustness with In-Context Rewriting
arXiv 2024
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
arXiv 2024
The Surprising Effectiveness of Test-Time Training for Few-Shot Learning
arXiv 2024
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
arXiv 2024
Learning to Decode Collaboratively with Multiple Language Models
arXiv 2024
In-Context Language Learning: Architectures and Algorithms
arXiv 2024
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
arXiv 2023
Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement
arXiv 2023
Entailment as Robust Self-Learner
arXiv 2023
LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning
arXiv 2023
Gated Linear Attention Transformers with Hardware-Efficient Training
arXiv 2023
Grammar Prompting for Domain-Specific Language Generation with Large Language Models
grammar-prompting-for-domain-specific-1
Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks
arXiv 2023
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
arXiv 2023
Deriving Language Models from Masked Language Models
arXiv 2023
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings
NAACL 2022 7
Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
NeurIPS 2023 11
Parameter-Efficient Transfer Learning with Diff Pruning
parameter-efficient-transfer-learning-with
OpenNMT: Neural Machine Translation Toolkit
opennmt-neural-machine-translation-toolkit-1
Latent Alignment and Variational Attention
latent-alignment-and-variational-attention-1
Sequence-Level Knowledge Distillation
sequence-level-knowledge-distillation-1
Affiliations
Frequent co-authors
10from 31 papers