0

Yoon Kim

Papers
31

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
31papers

Authored papers

31

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

arXiv 2026

2026

Log-Linear Attention

arXiv 2025

2025

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

arXiv 2025

2025

Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping

arXiv 2025

2025

PaTH Attention: Position Encoding via Accumulating Householder Transformations

arXiv 2025

2025

Data Engineering for Scaling Language Models to 128K Context

arXiv 2024

2024

Training-Free Activation Sparsity in Large Language Models

arXiv 2024

2024

Value Augmented Sampling for Language Model Alignment and Personalization

arXiv 2024

2024

The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities

arXiv 2024

2024

Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization

arXiv 2024

2024

Improving Black-box Robustness with In-Context Rewriting

arXiv 2024

2024

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

arXiv 2024

2024

The Surprising Effectiveness of Test-Time Training for Few-Shot Learning

arXiv 2024

2024

Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps

arXiv 2024

2024

Learning to Decode Collaboratively with Multiple Language Models

arXiv 2024

2024

In-Context Language Learning: Architectures and Algorithms

arXiv 2024

2024

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

arXiv 2023

2023

Phenomenal Yet Puzzling: Testing Inductive Reasoning Capabilities of Language Models with Hypothesis Refinement

arXiv 2023

2023

Entailment as Robust Self-Learner

arXiv 2023

2023

LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning

arXiv 2023

2023

Gated Linear Attention Transformers with Hardware-Efficient Training

arXiv 2023

2023

Grammar Prompting for Domain-Specific Language Generation with Large Language Models

grammar-prompting-for-domain-specific-1

2023

Reasoning or Reciting? Exploring the Capabilities and Limitations of Language Models Through Counterfactual Tasks

arXiv 2023

2023

Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning

arXiv 2023

2023

Deriving Language Models from Masked Language Models

arXiv 2023

2023

DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings

NAACL 2022 7

2022

Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors

NeurIPS 2023 11

2022

Parameter-Efficient Transfer Learning with Diff Pruning

parameter-efficient-transfer-learning-with

2020

OpenNMT: Neural Machine Translation Toolkit

opennmt-neural-machine-translation-toolkit-1

2018

Latent Alignment and Variational Attention

latent-alignment-and-variational-attention-1

2018

Sequence-Level Knowledge Distillation

sequence-level-knowledge-distillation-1

2016

Affiliations

No known affiliations.

Frequent co-authors

10

from 31 papers