Dipendra Misra

Papers: 6

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

6papers

Authored papers

Policy Improvement using Language Feedback Models

arXiv 2024

2024

Dataset Reset Policy Optimization for RLHF

arXiv 2024

2024

Aligning LLM Agents by Learning Latent Preference from User Edits

arXiv 2024

2024

Towards Principled Representation Learning from Videos for Reinforcement Learning

arXiv 2024

2024

The Truth is in There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction

arXiv 2023

2023

Learning to Generate Better Than Your LLM

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

from 6 papers

Jonathan D. Chang

Kianté Brantley

Wen Sun

Akanksha Saran

Alex Lamb

Alexey Taymanov

Eduardo Salinas

Ge Gao

Jason D. Lee

John Langford