Zhoujun Li

Papers
25

Authored papers

25

InCoder-32B: Code Foundation Model for Industrial Scenarios

arXiv 2026

2026

A Comprehensive Survey on Long Context Language Modeling

arXiv 2025

2025

Redefining Machine Translation on Social Network Services with Large Language Models

arXiv 2025

2025

P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark

arXiv 2025

2025

A Survey on Latent Reasoning

arXiv 2025

2025

Multilingual Multimodal Software Developer for Code Generation

arXiv 2025

2025

KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

arXiv 2025

2025

DependEval: Benchmarking LLMs for Repository Dependency Understanding

arXiv 2025

2025

CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation

arXiv 2025

2025

SVIPTR: Fast and Efficient Scene Text Recognition with Vision Permutable Extractor

arXiv 2024

2024

FuzzCoder: Byte-level Fuzzing Test via Large Language Model

arXiv 2024

2024

McEval: Massively Multilingual Code Evaluation

arXiv 2024

2024

SEvenLLM: Benchmarking, Eliciting, and Enhancing Abilities of Large Language Models in Cyber Threat Intelligence

arXiv 2024

2024

OWL: A Large Language Model for IT Operations

arXiv 2023

2023

Character, Word, or Both? Revisiting the Segmentation Granularity for Chinese Pre-trained Language Models

arXiv 2023

2023

MT4CrossOIE: Multi-stage Tuning for Cross-lingual Open Information Extraction

arXiv 2023

2023

Enhancing Large Language Model with Self-Controlled Memory Framework

arXiv 2023

2023

CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation

arXiv 2022

2022

HLT-MT: High-resource Language-specific Training for Multilingual Neural Machine Translation

arXiv 2022

2022

GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator

arXiv 2022

2022

GTrans: Grouping and Fusing Transformer Layers for Neural Machine Translation

arXiv 2022

2022

UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation

arXiv 2022

2022

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models

arXiv 2021

2021

DocBank: A Benchmark Dataset for Document Layout Analysis

COLING 2020 8

2020

TableBank: A Benchmark Dataset for Table Detection and Recognition

LREC 2020 5

2019

Affiliations

No known affiliations.

Frequent co-authors

10 (from 25 papers)