Zhoujun Li
- Papers: 25
Authored papers (25)
InCoder-32B: Code Foundation Model for Industrial Scenarios
arXiv 2026
A Comprehensive Survey on Long Context Language Modeling
arXiv 2025
Redefining Machine Translation on Social Network Services with Large Language Models
arXiv 2025
P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark
arXiv 2025
A Survey on Latent Reasoning
arXiv 2025
Multilingual Multimodal Software Developer for Code Generation
arXiv 2025
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation
arXiv 2025
DependEval: Benchmarking LLMs for Repository Dependency Understanding
arXiv 2025
CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation
arXiv 2025
SVIPTR: Fast and Efficient Scene Text Recognition with Vision Permutable Extractor
arXiv 2024
FuzzCoder: Byte-level Fuzzing Test via Large Language Model
arXiv 2024
McEval: Massively Multilingual Code Evaluation
arXiv 2024
SEvenLLM: Benchmarking, Eliciting, and Enhancing Abilities of Large Language Models in Cyber Threat Intelligence
arXiv 2024
OWL: A Large Language Model for IT Operations
arXiv 2023
Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models
arXiv 2023
MT4CrossOIE: Multi-stage Tuning for Cross-lingual Open Information Extraction
arXiv 2023
Enhancing Large Language Model with Self-Controlled Memory Framework
arXiv 2023
CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation
arXiv 2022
HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation
arXiv 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
arXiv 2022
GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation
arXiv 2022
UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation
arXiv 2022
TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
arXiv 2021
DocBank: A Benchmark Dataset for Document Layout Analysis
COLING 2020 8
TableBank: A Benchmark Dataset for Table Detection and Recognition
LREC 2020 5
Affiliations
Frequent co-authors
10 from 25 papers