pre-trained Language Models
☆310May 13, 2025Updated 11 months ago
Alternatives and similar repositories for language-models
Users that are interested in language-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep Learning experiments☆19Nov 18, 2019Updated 6 years ago
- ☆21Sep 26, 2018Updated 7 years ago
- Portuguese Named Entity Recognition☆61Sep 27, 2023Updated 2 years ago
- Code for training and evaluating T5 on Portuguese data.☆91Dec 8, 2022Updated 3 years ago
- Portuguese pre-trained BERT models☆873Jun 17, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆58Mar 25, 2023Updated 3 years ago
- Transformer model for Portuguese language (Brazil pt_BR)☆16Apr 10, 2026Updated 3 weeks ago
- ☆17May 27, 2020Updated 5 years ago
- OpenWordnet-PT: an open access wordnet for Portuguese☆160Apr 19, 2026Updated 2 weeks ago
- List of resources and tools developed with focus on Portuguese.☆349Jun 26, 2025Updated 10 months ago
- Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks☆252Oct 12, 2025Updated 6 months ago
- Transformers for Cost-Sensitive BERT for Generalisable Sentence Classification on Imbalanced Data☆18May 28, 2020Updated 5 years ago
- LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language☆188Jun 12, 2023Updated 2 years ago
- Data and Baselines for AStitchInLanguageModels dataset☆12Oct 31, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆34Mar 12, 2024Updated 2 years ago
- DocILE: Document Information Localization and Extraction Benchmark☆146May 15, 2024Updated last year
- ☆16Nov 4, 2019Updated 6 years ago
- A pair of tiny foundational models trained in Brazilian Portuguese.🦙🦙☆45Feb 24, 2026Updated 2 months ago
- Charlson Comorbidity Index Regression using Clinical Notes☆10Jul 26, 2018Updated 7 years ago
- This Universal Dependencies (UD) Portuguese treebank.☆53Nov 12, 2025Updated 5 months ago
- Normalizer tool for user-generated content (Brazilian Portuguese)☆14May 13, 2022Updated 3 years ago
- Simulação do COVID-19 nos municípios brasileiros | Brazilian municipalities COVID-19 simuation☆12Mar 30, 2020Updated 6 years ago
- German Dataset for Legal Information Retrieval☆25Feb 26, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Finetuning InstructLLaMA with portuguese data☆558Jun 6, 2023Updated 2 years ago
- Active Learning for Text Classification in Python☆640Apr 17, 2026Updated 2 weeks ago
- BankSim is a banking agent-based simulation framework developed in Python☆17Feb 21, 2018Updated 8 years ago
- Alexa Video Streaming Skill Template☆18Aug 2, 2021Updated 4 years ago
- Related resources to the paper RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese.☆21Mar 14, 2024Updated 2 years ago
- Base de acórdãos do Tribunal de Contas da União☆28Dec 8, 2022Updated 3 years ago
- Natively pre-trained open-source Portuguese language models.☆86Feb 24, 2026Updated 2 months ago
- A flexible normalizer for user-generated content☆64Feb 5, 2026Updated 3 months ago
- ☆396Jan 7, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Brazilian Legal Text Dataset for pre-trainning transformer based models☆17Jun 30, 2023Updated 2 years ago
- pt-BR Corpus with the Wikipedia dump☆27Apr 15, 2020Updated 6 years ago
- Explorador da Constituição: a Constituição Federal e suas Emendas acessíveis para o mundo da Ciência de Dados☆79Nov 15, 2020Updated 5 years ago
- A Natural Language Processing’s roadmap for begginers☆48Oct 12, 2022Updated 3 years ago
- Software that makes labeling PDFs easy.☆430May 13, 2024Updated last year
- A library that integrates huggingface transformers with the world of fastai, giving fastai devs everything they need to train, evaluate, …☆298Feb 15, 2026Updated 2 months ago
- Use fastai-v2 with HuggingFace's pretrained transformers☆110Sep 25, 2020Updated 5 years ago