Qianqian Xie
- Papers
- 22
Cite
Notes
Only stored in your browser.
Authored papers
22DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation
arXiv 2026
FinTagging: An LLM-ready Benchmark for Extracting and Structuring Financial Information
arXiv 2025
MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment
arXiv 2025
DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation
arXiv 2025
From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models
arXiv 2025
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance
arXiv 2025
Enhancing Financial Time-Series Forecasting with Retrieval-Augmented Large Language Models
arXiv 2025
MMAFFBen: A Multilingual and Multimodal Affective Analysis Benchmark for Evaluating LLMs and VLMs
arXiv 2025
IF-VidCap: Can Video Caption Models Follow Instructions?
arXiv 2025
MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues
arXiv 2025
MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation
arXiv 2025
EmoLLMs: A Series of Emotional Large Language Models and Annotation Tools for Comprehensive Affective Analysis
arXiv 2024
No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks
arXiv 2024
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
arXiv 2024
Me LLaMA: Foundation Large Language Models for Medical Applications
arXiv 2024
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models
arXiv 2024
Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams
arXiv 2024
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance
arXiv 2023
Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models
arXiv 2023
Back to the Future: Towards Explainable Temporal Reasoning with Large Language Models
arXiv 2023
Towards Interpretable Mental Health Analysis with Large Language Models
arXiv 2023
Word Grounded Graph Convolutional Network
arXiv 2023
Affiliations
Frequent co-authors
10from 22 papers