Minsu Kim
- Papers: 10
Authored papers
STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding
arXiv 2026
Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Speech Representations
arXiv 2025
Large Language Models are Strong Audio-Visual Speech Recognition Learners
arXiv 2024
Learning diverse attacks on large language models for robust red-teaming and safety tuning
arXiv 2024
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing
arXiv 2024
Improved off-policy training of diffusion samplers
arXiv 2024
RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark
arXiv 2023
Local Search GFlowNets
arXiv 2023
Bootstrapped Training of Score-Conditioned Generator for Offline Design of Biological Sequences
DevFormer: A Symmetric Transformer for Context-Aware Device Placement
arXiv 2022