Yingfa Chen
- Papers: 10
Authored papers
Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts
arXiv 2026
BlockFFN: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity
arXiv 2025
StateX: Enhancing RNN Recall via Post-training State Expansion
arXiv 2025
Cost-Optimal Grouped-Query Attention for Long-Context Modeling
arXiv 2025
$\infty$Bench: Extending Long Context Evaluation Beyond 100K Tokens
arXiv 2024
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models
arXiv 2024
Sparsing Law: Towards Large Language Models with Greater Activation Sparsity
arXiv 2024
Robust and Scalable Model Editing for Large Language Models
arXiv 2024
CFDBench: A Large-Scale Benchmark for Machine Learning Methods in Fluid Dynamics
arXiv 2023
Sub-Character Tokenization for Chinese Pretrained Language Models
arXiv 2021