Portuguese translation of the GLUE benchmark and Scitail dataset
☆33Jun 27, 2022Updated 3 years ago
Alternatives and similar repositories for PLUE
Users that are interested in PLUE are comparing it to the libraries listed below.
- PorSimplesSent - A Portuguese corpus of aligned sentence pairs to investigate sentence readability assessment☆14Jan 15, 2020Updated 6 years ago
- PyTorch code for NAACL 2022 paper: DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation (https://aclanthology.org/2022.fi…☆16Apr 21, 2026Updated 2 weeks ago
- Stanford Question Answering Dataset (SQuAD) 2.0 translated to Brazilian Portuguese (PT-BR) language.☆12Nov 14, 2020Updated 5 years ago
- Python Library for Natural Language Processing for Portuguese Language☆17Mar 2, 2016Updated 10 years ago
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their…☆72Jul 28, 2025Updated 9 months ago
- List of resources and tools developed with focus on Portuguese.☆349Jun 26, 2025Updated 10 months ago
- ☆12Apr 29, 2022Updated 4 years ago
- ☆27Jan 23, 2024Updated 2 years ago
- Pretrained segmenter models for Portuguese legislative text.☆14Oct 13, 2024Updated last year
- HashtagMaster: Segmentation tool for hashtags☆12Oct 27, 2020Updated 5 years ago
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆34Mar 12, 2024Updated 2 years ago
- Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exams.☆52Dec 6, 2024Updated last year
- FactNews is the first dataset to predict sentence-level factuality of news reporting. Furthermore, we provide baseline results for sentenc…☆11Jun 12, 2025Updated 10 months ago
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 11 months ago
- Automatically retry Claude Code sessions when hitting Anthropic subscription rate limits☆47Mar 31, 2026Updated last month
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆22Feb 14, 2024Updated 2 years ago
- ☆10Jan 31, 2022Updated 4 years ago
- Repository for the paper: "Using deep learning to predict outcomes of legal appeals better than human experts"☆10Aug 1, 2022Updated 3 years ago
- ☆14Feb 29, 2024Updated 2 years ago
- Fine tuning of the Retrieval-Augmented Generation (RAG) with a custom knowledge source.☆13Feb 10, 2021Updated 5 years ago
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- COVID-19 Related NLP Papers☆30Jan 20, 2022Updated 4 years ago
- Re-implementation of Bi- (or, Dual-) encoder for Entity Linking. You can run experiments in only 3 seconds.☆11Jun 12, 2023Updated 2 years ago
- Spotify and Last.fm tool that shows how internationally diverse your music taste is☆13Mar 14, 2024Updated 2 years ago
- ☆10Dec 14, 2020Updated 5 years ago
- Datasets of Neuropsychological Language Tests in Brazilian Portuguese☆13Oct 14, 2025Updated 6 months ago
- Related resources to the paper RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese.☆21Mar 14, 2024Updated 2 years ago
- Precedents from Brazilian High Courts (STF and STJ)☆12Aug 23, 2018Updated 7 years ago
- Subset selection / data pruning for weak supervision☆16Jun 21, 2023Updated 2 years ago
- ☆13Apr 17, 2026Updated 2 weeks ago
- A Python Library for Biquality Learning☆16Mar 20, 2026Updated last month
- Inquisitive Parrots for Search☆199Jun 5, 2025Updated 11 months ago
- Using questions to summarize large amounts of textual data.☆25Sep 23, 2020Updated 5 years ago
- Lightweight implementations of generative label models for weakly supervised machine learning☆24Apr 4, 2026Updated last month
- Portuguese pre-trained BERT models☆873Jun 17, 2024Updated last year
- TransformerDB☆19Apr 22, 2021Updated 5 years ago
- Manifold-Mixup implementation for fastai V1☆19Oct 1, 2020Updated 5 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- A clone of indri-5.12 with minor customizations.☆25Sep 23, 2024Updated last year