Fei Wu
- Papers: 30
Authored papers
Fast-Slow Thinking for Large Vision-Language Model Reasoning
arXiv 2025
InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners
arXiv 2025
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization
arXiv 2025
Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion
arXiv 2025
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection
arXiv 2025
Rewrite to Jailbreak: Discover Learnable and Transferable Implicit Harmfulness Instruction
arXiv 2025
OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use
arXiv 2025
InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models
arXiv 2025
InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion
arXiv 2025
ClaimGen-CN: A Large-scale Chinese Dataset for Legal Claim Generation
arXiv 2025
Fine-tuning Large Language Models for Improving Factuality in Legal Question Answering
arXiv 2025
AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios
arXiv 2025
InfiR: Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning
arXiv 2025
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss
arXiv 2024
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
arXiv 2024
Reinforcement Learning Enhanced LLMs: A Survey
arXiv 2024
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models
arXiv 2024
Causal Agent based on Large Language Model
arXiv 2024
Training-free LLM-generated Text Detection by Mining Token Probability Sequences
arXiv 2024
A Comprehensive Survey of Direct Preference Optimization: Datasets, Theories, Variants, and Applications
arXiv 2024
Mitigating the Backdoor Effect for Multi-Task Model Merging via Safety-Aware Subspace
arXiv 2024
MetaCoCo: A New Few-Shot Classification Benchmark with Spurious Correlation
arXiv 2024
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
Goal-Oriented Prompt Attack and Safety Evaluation for LLMs
arXiv 2023
Instruction Tuning for Large Language Models: A Survey
arXiv 2023
OakInk: A Large-scale Knowledge Repository for Understanding Hand-Object Interaction
CVPR 2022
ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information
ACL 2021
FcaNet: Frequency Channel Attention Networks
ICCV 2021
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
CVPR 2022
Dice Loss for Data-imbalanced NLP Tasks
Frequent co-authors
10 from 30 papers