0

LVFace: Large Vision model for Face Recogniton

LVFace, a Vision Transformer-based face recognition model with Progressive Cluster Optimization, achieves state-of-the-art performance across benchmarks and real-world scenarios.

Year
2025
Venue
ICCV 2025
Authors
5
Hosting
Abstract onlyARXIV-DEFAULT

Cite

Notes

Only stored in your browser.

Attribution

Abstract & full text
arxiv.org/abs/2501.13420ARXIV-DEFAULT
TL;DR
Semantic Scholar
Attribution policy →

Abstract

Recently, large vision models have demonstrated powerful representation capabilities in the field of computer vision. However, we unexpectedly found that face recognition research is still mainly focused on CNN-based model architectures, which may lead to suboptimal state-of-the-art (SOTA) performance in face recognition. Therefore, we study how to use various loss functions from historical research orthogonally to train a new state-of-the-art face recognition model based on large vision models, called LVFace. On the largest public face database, WebFace42M, we demonstrated the superiority of LVFace over other advanced face recognition methods and achieved first place in the ICCV21 MFR-Ongoing challenge, until the submission of this work (December 30, 2024, academic track).

Authors

5