Xiangliang Zhang
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15Visual Aesthetic Benchmark: Can Frontier Models Judge Beauty?
arXiv 2026
Emergent Social Intelligence Risks in Generative Multi-Agent Systems
arXiv 2026
AutoLLMResearch: Training Research Agents for Automating LLM Experiment Configuration -- Learning from Cheap, Optimizing Expensive
arXiv 2026
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective
arXiv 2025
Building a Foundational Guardrail for General Agentic Systems via Synthetic Data
arXiv 2025
Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation
arXiv 2025
MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training
arXiv 2025
Preference Leakage: A Contamination Problem in LLM-as-a-judge
arXiv 2025
TrustLLM: Trustworthiness in Large Language Models
arXiv 2024
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
arXiv 2024
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models
arXiv 2024
HonestLLM: Toward an Honest and Helpful Large Language Model
arXiv 2024
MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions
arXiv 2024
AutoBench-V: Can Large Vision-Language Models Benchmark Themselves?
arXiv 2024
What can Large Language Models do in chemistry? A comprehensive benchmark on eight tasks
NeurIPS 2023 11
Affiliations
Frequent co-authors
10from 15 papers