Guang Yang
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14Multimodal OCR: Parse Anything from Documents
arXiv 2026
dots.ocr: Multilingual Document Layout Parsing in a Single Vision-Language Model
arXiv 2025
PoseX: AI Defeats Physics Approaches on Protein-Ligand Cross Docking
arXiv 2025
Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis
arXiv 2025
MMMG: a Comprehensive and Reliable Evaluation Suite for Multitask Multimodal Generation
arXiv 2025
GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation
arXiv 2025
The state-of-the-art in Cardiac MRI Reconstruction: Results of the CMRxRecon Challenge in MICCAI 2023
arXiv 2024
LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations
arXiv 2024
Distribution Backtracking Builds A Faster Convergence Trajectory for Diffusion Distillation
arXiv 2024
Crafting Customisable Characters with LLMs: Introducing SimsChat, a Persona-Driven Role-Playing Agent Framework
arXiv 2024
Heterogeneous Directed Hypergraph Neural Network over abstract syntax tree (AST) for Code Classification
arXiv 2023
Multi-Modal Experience Inspired AI Creation
arXiv 2022
DRAG: Dynamic Region-Aware GCN for Privacy-Leaking Image Detection
arXiv 2022
ComFormer: Code Comment Generation via Transformer and Fusion Method-based Hybrid Code Representation
arXiv 2021
Affiliations
Frequent co-authors
10from 14 papers