Senjie Jin
- Papers
- 9
Cite
Notes
Only stored in your browser.
Authored papers
9Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models
arXiv 2026
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination
arXiv 2025
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
arXiv 2024
MouSi: Poly-Visual-Expert Vision-Language Models
arXiv 2024
Secrets of RLHF in Large Language Models Part II: Reward Modeling
arXiv 2024
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model
arXiv 2024
The Rise and Potential of Large Language Model Based Agents: A Survey
arXiv 2023
Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement
arXiv 2023
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models
arXiv 2023
Affiliations
Frequent co-authors
10from 9 papers