Emotion estimation in general is a field that has been studied for a long time, and several approaches exist using machine learning. in this paper, we present an LSTM model, that processes the blend-shapes produced by the library MediaPipe, for a face detected in a live stream of a camera, to estimate the main emotion from the facial expressions, this model is trained on the FER2013 dataset and delivers a result of 71% accuracy and 62% f1-score which meets the accuracy benchmark of the FER2013 dataset, with significantly reduced computation costs. https://github.com/Samir-atra/Emotion_estimation_from_video_footage_with_LSTM_ML_algorithm
Emotion estimation from video footage with LSTM
An LSTM model processes MediaPipe-generated blend-shapes from live video streams to estimate emotions, achieving 71% accuracy and 62% F1-score on the FER2013 dataset with reduced computational costs.
- Year
- 2025
- Venue
- arXiv 2025
- Authors
- 1
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2501.13432v3ARXIV-DEFAULT
- TL;DR
- Semantic Scholar