pre-trained Language Models
☆310May 13, 2025Updated last year
Alternatives and similar repositories for language-models
Users that are interested in language-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity Tasks".☆11Dec 8, 2022Updated 3 years ago
- ☆21Sep 26, 2018Updated 7 years ago
- Portuguese Named Entity Recognition☆61Sep 27, 2023Updated 2 years ago
- Code for training and evaluating T5 on Portuguese data.☆91Dec 8, 2022Updated 3 years ago
- Portuguese pre-trained BERT models☆875Jun 17, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆58Mar 25, 2023Updated 3 years ago
- Transformer model for Portuguese language (Brazil pt_BR)☆16Apr 10, 2026Updated last month
- ☆17May 27, 2020Updated 6 years ago
- List of resources and tools developed with focus on Portuguese.☆357Jun 26, 2025Updated 11 months ago
- Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks☆252Oct 12, 2025Updated 7 months ago
- LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language☆188Jun 12, 2023Updated 2 years ago
- NLP French language model implementing ULMFiT☆87Mar 18, 2019Updated 7 years ago
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆34Mar 12, 2024Updated 2 years ago
- A pair of tiny foundational models trained in Brazilian Portuguese.🦙🦙☆45Feb 24, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Charlson Comorbidity Index Regression using Clinical Notes☆10Jul 26, 2018Updated 7 years ago
- This Universal Dependencies (UD) Portuguese treebank.☆53May 6, 2026Updated 3 weeks ago
- Finetuning InstructLLaMA with portuguese data☆557Jun 6, 2023Updated 2 years ago
- German Dataset for Legal Information Retrieval☆26Feb 26, 2024Updated 2 years ago
- Active Learning for Text Classification in Python☆642Updated this week
- A list of libraries and NLP projects for Portuguese☆19May 22, 2017Updated 9 years ago
- Related resources to the paper RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese.☆22Mar 14, 2024Updated 2 years ago
- Natively pre-trained open-source Portuguese language models.☆86Feb 24, 2026Updated 3 months ago
- A flexible normalizer for user-generated content☆64Feb 5, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their…☆72Jul 28, 2025Updated 10 months ago
- ☆396Jan 7, 2024Updated 2 years ago
- Brazilian Legal Text Dataset for pre-trainning transformer based models☆19Jun 30, 2023Updated 2 years ago
- pt-BR Corpus with the Wikipedia dump☆27Apr 15, 2020Updated 6 years ago
- Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understan…☆365Oct 31, 2022Updated 3 years ago
- Explorador da Constituição: a Constituição Federal e suas Emendas acessíveis para o mundo da Ciência de Dados☆79Nov 15, 2020Updated 5 years ago
- A Natural Language Processing’s roadmap for begginers☆48Oct 12, 2022Updated 3 years ago
- Software that makes labeling PDFs easy.☆430May 13, 2024Updated 2 years ago
- Use fastai-v2 with HuggingFace's pretrained transformers☆110Sep 25, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Mining Legal Arguments in Court Decisions - Data and software☆76May 15, 2023Updated 3 years ago
- Implementação e modelo gerado com o treinamento (trigram) da wikipedia em pt-br☆39Mar 23, 2017Updated 9 years ago
- The evalution suite for the 🚀 Open Portuguese LLM Leaderboard☆26Aug 31, 2025Updated 8 months ago
- Python scripts to read a Portuguese Wikipedia XML dump file, parse it and generate plain text files.☆14Mar 12, 2014Updated 12 years ago
- ☆13Nov 10, 2024Updated last year
- ☆15Feb 5, 2019Updated 7 years ago
- A curated list of resources for Document Understanding (DU) topic☆1,518Jun 2, 2023Updated 2 years ago