Qianqian Xie

MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment

arXiv 2025

IF-VidCap: Can Video Caption Models Follow Instructions?

arXiv 2025

MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues

arXiv 2025

DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation

arXiv 2025

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models

arXiv 2025

MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

arXiv 2025

MMAFFBen: A Multilingual and Multimodal Affective Analysis Benchmark for Evaluating LLMs and VLMs

arXiv 2025

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

arXiv 2025

Enhancing Financial Time-Series Forecasting with Retrieval-Augmented Large Language Models

arXiv 2025

EmoLLMs: A Series of Emotional Large Language Models and Annotation Tools for Comprehensive Affective Analysis

arXiv 2024

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

arXiv 2024

Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams

arXiv 2024

No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks

arXiv 2024

Me LLaMA: Foundation Large Language Models for Medical Applications

arXiv 2024

FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making

arXiv 2024

PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance

arXiv 2023

Word Grounded Graph Convolutional Network

arXiv 2023

Towards Interpretable Mental Health Analysis with Large Language Models

arXiv 2023

Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models

arXiv 2023

Back to the Future: Towards Explainable Temporal Reasoning with Large Language Models

arXiv 2023