The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their performance across carefully curated Portuguese language tasks.
☆72 · Jul 28, 2025 · Updated 9 months ago
Alternatives and similar repositories for napolab
Users that are interested in napolab are comparing it to the libraries listed below.
- ☆27 · Apr 6, 2026 · Updated last month
- ☆12 · Oct 12, 2023 · Updated 2 years ago
- Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exam. ☆52 · Dec 6, 2024 · Updated last year
- Brazilian Tertiary Care Dataset ☆17 · Dec 14, 2022 · Updated 3 years ago
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks ☆34 · Mar 12, 2024 · Updated 2 years ago
- ☆12 · Apr 29, 2022 · Updated 4 years ago
- Portuguese translation of the GLUE benchmark and Scitail dataset ☆33 · Jun 27, 2022 · Updated 3 years ago
- List of resources and tools developed with focus on Portuguese. ☆349 · Jun 26, 2025 · Updated 10 months ago
- ☆20 · Dec 22, 2023 · Updated 2 years ago
- Self-contained, comprehensive overview of PT-BR-LLMs advancements, architectures, and resources. ☆31 · Dec 31, 2025 · Updated 4 months ago
- ☆27 · Jan 23, 2024 · Updated 2 years ago
- ☆12 · Nov 10, 2024 · Updated last year
- Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity Tasks". ☆11 · Dec 8, 2022 · Updated 3 years ago
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters … ☆77 · Jan 8, 2026 · Updated 4 months ago
- ☆17 · May 27, 2020 · Updated 5 years ago
- ☆47 · Feb 7, 2024 · Updated 2 years ago
- Evaluation and baseline scripts for the ASSIN shared task. ☆11 · Oct 12, 2019 · Updated 6 years ago
- A multilingual version of MS MARCO passage ranking dataset ☆147 · Oct 19, 2023 · Updated 2 years ago
- Charlson Comorbidity Index Regression using Clinical Notes ☆10 · Jul 26, 2018 · Updated 7 years ago
- Portuguese Named Entity Recognition ☆61 · Sep 27, 2023 · Updated 2 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ. ☆24 · Sep 24, 2023 · Updated 2 years ago
- Portuguese voice2json profile based on Pocketsphinx ☆11 · Jul 15, 2020 · Updated 5 years ago
- Code for training and evaluating T5 on Portuguese data. ☆91 · Dec 8, 2022 · Updated 3 years ago
- ☆29 · Feb 2, 2024 · Updated 2 years ago
- Related resources to the paper RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese. ☆21 · Mar 14, 2024 · Updated 2 years ago
- HashtagMaster: Segmentation tool for hashtags ☆12 · Oct 27, 2020 · Updated 5 years ago
- Using questions to summarize large amounts of textual data. ☆25 · Sep 23, 2020 · Updated 5 years ago
- NoHarm Discharge Summary - Improving Care Transition with LLM ☆25 · Feb 23, 2026 · Updated 2 months ago
- ☆13 · Jan 8, 2024 · Updated 2 years ago
- LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language ☆188 · Jun 12, 2023 · Updated 2 years ago
- ☆40 · May 13, 2023 · Updated 2 years ago
- ☆22 · Jul 22, 2025 · Updated 9 months ago
- Finetuning InstructLLaMA with Portuguese data ☆558 · Jun 6, 2023 · Updated 2 years ago
- The accompanying code for "Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understandin… ☆21 · Sep 4, 2019 · Updated 6 years ago
- Instruct-tuning LLaMA on consumer hardware with machine-translated data ☆19 · Apr 17, 2023 · Updated 3 years ago
- Extractor of entities mentioned in news media articles ☆15 · May 25, 2021 · Updated 4 years ago
- Simple and scalable tools for data-driven pretraining data selection. ☆29 · Jun 9, 2025 · Updated 10 months ago
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best … ☆10 · Nov 3, 2023 · Updated 2 years ago
- Portuguese pre-trained BERT models ☆873 · Jun 17, 2024 · Updated last year