Portuguese pre-trained BERT models
☆873Jun 17, 2024Updated last year
Alternatives and similar repositories for portuguese-bert
Users that are interested in portuguese-bert are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Portuguese Named Entity Recognition☆61Sep 27, 2023Updated 2 years ago
- Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks☆252Oct 12, 2025Updated 6 months ago
- Code for training and evaluating T5 on Portuguese data.☆91Dec 8, 2022Updated 3 years ago
- A flexible normalizer for user-generated content☆64Feb 5, 2026Updated 3 months ago
- LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language☆188Jun 12, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆58Mar 25, 2023Updated 3 years ago
- This Universal Dependencies (UD) Portuguese treebank.☆53Nov 12, 2025Updated 5 months ago
- Evaluation and baseline scripts for the ASSIN shared task.☆11Oct 12, 2019Updated 6 years ago
- Biomedical and Clinical BERT for Portuguese Language☆67Dec 12, 2024Updated last year
- pt-BR Corpus with the Wikipedia dump☆27Apr 15, 2020Updated 6 years ago
- ☆64Apr 11, 2023Updated 3 years ago
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆34Mar 12, 2024Updated 2 years ago
- pre-trained Language Models☆310May 13, 2025Updated 11 months ago
- Portuguese translation of the SQuAD dataset☆19Oct 22, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis☆50Feb 22, 2025Updated last year
- We introduce the Fake.Br Corpus, which is composed of aligned true and fake news written in Brazilian Portuguese.☆191Oct 30, 2020Updated 5 years ago
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their…☆72Jul 28, 2025Updated 9 months ago
- Transformer model for Portuguese language (Brazil pt_BR)☆16Apr 10, 2026Updated 3 weeks ago
- ☆22Jun 22, 2022Updated 3 years ago
- Brazilian Tertiary Care Dataset☆17Dec 14, 2022Updated 3 years ago
- Portuguese BERT and XLM-R models fine-tuned in semantic role labeling.☆26Feb 11, 2022Updated 4 years ago
- fklearn: Functional Machine Learning☆1,539Apr 27, 2026Updated last week
- Implementação e modelo gerado com o treinamento (trigram) da wikipedia em pt-br☆39Mar 23, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)☆344Oct 30, 2022Updated 3 years ago
- 🕵 Artificial Intelligence for social control of public administration | **This repository does not receive frequent updates. Check out t…☆4,591Jan 31, 2024Updated 2 years ago
- 🍕 Repositório para juntar informações sobre materiais de estudo em análise de dados e áreas afins, empresas que trabalham com dados e di…☆2,432Apr 5, 2024Updated 2 years ago
- Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity Tasks".☆11Dec 8, 2022Updated 3 years ago
- HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for hate speech and offensive language detection…☆47Jan 5, 2026Updated 4 months ago
- Charlson Comorbidity Index Regression using Clinical Notes☆10Jul 26, 2018Updated 7 years ago
- Essay-BR: a corpus of essays for the Brazilian Portuguese language☆23Sep 5, 2022Updated 3 years ago
- ☆14Oct 23, 2025Updated 6 months ago
- Captura os dados de sócios das empresas brasileiras na Receita Federal e exporta para um formato legível por humanos☆607Feb 4, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Tagger treinado para reconhecer palavras do Português☆11Aug 2, 2019Updated 6 years ago
- Classificador de poemas do Fernando Pessoa de acordo com os seus heterônimos☆39Dec 8, 2022Updated 3 years ago
- Code and documentation for the MariTalk API☆317Apr 20, 2026Updated 2 weeks ago
- Set of rules designed to improve the lemmatization process in Spacy for portuguese☆15Nov 17, 2021Updated 4 years ago
- 🧑⚖️ Em nome da LAI! Gerador de petições com base na LAI.☆14Apr 6, 2021Updated 5 years ago
- Python wrapper para o SEI! -Sistema Eletrônico de Informações☆18Mar 6, 2018Updated 8 years ago
- Resources for morphological analysis of Portuguese☆27Apr 19, 2026Updated 2 weeks ago