Senjie Jin

Papers: 9

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

9papers

Authored papers

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

arXiv 2026

2026

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

arXiv 2025

2025

Secrets of RLHF in Large Language Models Part II: Reward Modeling

arXiv 2024

2024

MouSi: Poly-Visual-Expert Vision-Language Models

arXiv 2024

2024

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

arXiv 2024

2024

SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Model

arXiv 2024

2024

The Rise and Potential of Large Language Model Based Agents: A Survey

arXiv 2023

2023

Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement

arXiv 2023

2023

TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 9 papers

Qi Zhang

Tao Gui

Xuanjing Huang

Zhiheng Xi

Rui Zheng

Yuhao Zhou

Xiaoran Fan

Shihan Dou

Xiao Wang

Boyang Hong