Portuguese pre-trained BERT models
☆878Jun 17, 2024Updated 2 years ago
Alternatives and similar repositories for portuguese-bert
Users that are interested in portuguese-bert are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Portuguese Named Entity Recognition☆61Sep 27, 2023Updated 2 years ago
- A flexible normalizer for user-generated content☆64Feb 5, 2026Updated 4 months ago
- List of resources and tools developed with focus on Portuguese.☆361Jun 26, 2025Updated 11 months ago
- LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language☆190Jun 12, 2023Updated 3 years ago
- ☆58Mar 25, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This Universal Dependencies (UD) Portuguese treebank.☆53May 6, 2026Updated last month
- Evaluation and baseline scripts for the ASSIN shared task.☆11Oct 12, 2019Updated 6 years ago
- Biomedical and Clinical BERT for Portuguese Language☆67Dec 12, 2024Updated last year
- pt-BR Corpus with the Wikipedia dump☆27Apr 15, 2020Updated 6 years ago
- Finetuning InstructLLaMA with portuguese data☆559Jun 6, 2023Updated 3 years ago
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆35Mar 12, 2024Updated 2 years ago
- pre-trained Language Models☆311May 13, 2025Updated last year
- Portuguese translation of the SQuAD dataset☆19Oct 22, 2020Updated 5 years ago
- Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis☆50Feb 22, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- We introduce the Fake.Br Corpus, which is composed of aligned true and fake news written in Brazilian Portuguese.☆196Oct 30, 2020Updated 5 years ago
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their…☆72Jul 28, 2025Updated 10 months ago
- Transformer model for Portuguese language (Brazil pt_BR)☆16Apr 10, 2026Updated 2 months ago
- ☆22Jun 22, 2022Updated 3 years ago
- Portuguese BERT and XLM-R models fine-tuned in semantic role labeling.☆26Feb 11, 2022Updated 4 years ago
- fklearn: Functional Machine Learning☆1,544Jun 10, 2026Updated last week
- Implementação e modelo gerado com o treinamento (trigram) da wikipedia em pt-br☆39Mar 23, 2017Updated 9 years ago
- Datasets of Neuropsychological Language Tests in Brazilian Portuguese☆14Oct 14, 2025Updated 8 months ago
- Datasets to train supervised classifiers for Named-Entity Recognition in different languages (Portuguese, German, Dutch, French, English)☆344Oct 30, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 🕵 Artificial Intelligence for social control of public administration | **This repository does not receive frequent updates. Check out t…☆4,595Jan 31, 2024Updated 2 years ago
- 🍕 Repositório para juntar informações sobre materiais de estudo em análise de dados e áreas afins, empresas que trabalham com dados e di…☆2,435Apr 5, 2024Updated 2 years ago
- Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity Tasks".☆11Dec 8, 2022Updated 3 years ago
- Dados diários mais recentes do coronavírus por município brasileiro☆534Apr 1, 2022Updated 4 years ago
- Natively pre-trained open-source Portuguese language models.☆86Feb 24, 2026Updated 3 months ago
- HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for hate speech and offensive language detection…☆49Jan 5, 2026Updated 5 months ago
- Charlson Comorbidity Index Regression using Clinical Notes☆10Jul 26, 2018Updated 7 years ago
- Wrapper para API de consulta do acervo do LexML☆53Dec 8, 2022Updated 3 years ago
- Essay-BR: a corpus of essays for the Brazilian Portuguese language☆24Sep 5, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆11Jun 10, 2022Updated 4 years ago
- Captura os dados de sócios das empresas brasileiras na Receita Federal e exporta para um formato legível por humanos☆607Feb 4, 2026Updated 4 months ago
- Tagger treinado para reconhecer palavras do Português☆11Aug 2, 2019Updated 6 years ago
- Classificador de poemas do Fernando Pessoa de acordo com os seus heterônimos☆39Dec 8, 2022Updated 3 years ago
- Base dos discursos dos deputados federais de 2003 a 2017☆13Feb 28, 2018Updated 8 years ago
- Code and documentation for the MariTalk API☆321Updated this week
- Set of rules designed to improve the lemmatization process in Spacy for portuguese☆15Nov 17, 2021Updated 4 years ago