0

Jie Fu

Papers
40

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
40papers

Authored papers

40

CodeCircuit: Toward Inferring LLM-Generated Code Correctness via Attribution Graphs

arXiv 2026

2026

Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny

arXiv 2025

2025

Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning

arXiv 2025

2025

Learning from Failures in Multi-Attempt Reinforcement Learning

arXiv 2025

2025

Generating Symbolic World Models via Test-time Scaling of Large Language Models

arXiv 2025

2025

ChatMusician: Understanding and Generating Music Intrinsically with LLM

arXiv 2024

2024

VCR: A Task for Pixel-Level Complex Reasoning in Vision Language Models via Restoring Occluded Text

arXiv 2024

2024

Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training

arXiv 2024

2024

Kun: Answer Polishment for Chinese Self-Alignment with Instruction Back-Translation

arXiv 2024

2024

Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction

arXiv 2024

2024

ComposerX: Multi-Agent Symbolic Music Composition with LLMs

arXiv 2024

2024

MIO: A Foundation Model on Multimodal Tokens

arXiv 2024

2024

Layerwise Recurrent Router for Mixture-of-Experts

arXiv 2024

2024

Unlocking Continual Learning Abilities in Language Models

arXiv 2024

2024

LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters

arXiv 2024

2024

SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval

arXiv 2024

2024

Pixel Sentence Representation Learning

arXiv 2024

2024

MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation

arXiv 2024

2024

LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing

arXiv 2024

2024

GW-MoE: Resolving Uncertainty in MoE Router with Global Workspace Theory

arXiv 2024

2024

MEIT: Multi-Modal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation

arXiv 2024

2024

m2mKD: Module-to-Module Knowledge Distillation for Modular Transformers

arXiv 2024

2024

ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate

arXiv 2023

2023

Huatuo-26M, a Large-scale Chinese Medical QA Dataset

arXiv 2023

2023

A Survey of Reasoning with Foundation Models

arXiv 2023

2023

Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs

arXiv 2023

2023

AutoAgents: A Framework for Automatic Agent Generation

arXiv 2023

2023

Chinese Open Instruction Generalist: A Preliminary Release

arXiv 2023

2023

LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT

arXiv 2023

2023

Align on the Fly: Adapting Chatbot Behavior to Established Norms

arXiv 2023

2023

RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models

arXiv 2023

2023

MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training

arXiv 2023

2023

TACO: Topics in Algorithmic COde generation dataset

arXiv 2023

2023

Unlocking Emergent Modularity in Large Language Models

arXiv 2023

2023

Massive Editing for Large Language Models via Meta Learning

arXiv 2023

2023

SynFundus-1M: A High-quality Million-scale Synthetic fundus images Dataset with Fifteen Types of Annotation

arXiv 2023

2023

Think Before You Act: Decision Transformers with Working Memory

arXiv 2023

2023

Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning

arXiv 2023

2023

Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with $1/n$ Parameters

arXiv 2021

2021

CoCon: A Self-Supervised Approach for Controlled Text Generation

ICLR 2021 1

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 40 papers