Qianqian Xie

Papers
22

Authored papers

22

DR^{3}-Eval: Towards Realistic and Reproducible Deep Research Evaluation

arXiv 2026

FinTagging: An LLM-ready Benchmark for Extracting and Structuring Financial Information

arXiv 2025

MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment

arXiv 2025

DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation

arXiv 2025

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models

arXiv 2025

Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance

arXiv 2025

Enhancing Financial Time-Series Forecasting with Retrieval-Augmented Large Language Models

arXiv 2025

MMAFFBen: A Multilingual and Multimodal Affective Analysis Benchmark for Evaluating LLMs and VLMs

arXiv 2025

IF-VidCap: Can Video Caption Models Follow Instructions?

arXiv 2025

MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues

arXiv 2025

MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

arXiv 2025

EmoLLMs: A Series of Emotional Large Language Models and Annotation Tools for Comprehensive Affective Analysis

arXiv 2024

No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks

arXiv 2024

FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making

arXiv 2024

Me LLaMA: Foundation Large Language Models for Medical Applications

arXiv 2024

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

arXiv 2024

Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams

arXiv 2024

PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance

arXiv 2023

Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models

arXiv 2023

Back to the Future: Towards Explainable Temporal Reasoning with Large Language Models

arXiv 2023

Towards Interpretable Mental Health Analysis with Large Language Models

arXiv 2023

Word Grounded Graph Convolutional Network

arXiv 2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 22 papers