We present and make available pre-trained language models (Phraser, Word2Vec, Doc2Vec, FastText, and BERT) for the Brazilian legal language, a Python package with functions to facilitate their use, and a set of demonstrations/tutorials containing some applications involving them. Given that our material is built upon legal texts coming from several Brazilian courts, this initiative is extremely helpful for the Brazilian legal field, which lacks other open and specific tools and language models. Our main objective is to catalyze the use of natural language processing tools for legal texts analysis by the Brazilian industry, government, and academia, providing the necessary tools and accessible material.
LegalNLP -- Natural Language Processing methods for the Brazilian Legal Language
Pre-trained language models for Brazilian legal language are provided along with tools and tutorials to facilitate their use in legal text analysis.
- Year
- 2021
- Venue
- arXiv 2021
- Authors
- 9
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2110.15709ARXIV-DEFAULT
- TL;DR
- Semantic Scholar