gustrd / cabra
Fine-tuning OpenLlama-Instruct with portuguese data, for commercial use.
☆19Updated last year
Alternatives and similar repositories for cabra:
Users that are interested in cabra are comparing it to the libraries listed below
- Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exams.☆46Updated 4 months ago
- Natively pre-trained open-source Portuguese language models.☆60Updated last week
- The evalution suite for the 🚀 Open Portuguese LLM Leaderboard☆20Updated 2 weeks ago
- Extrator de entidades mencionadas em notícias da mídia☆15Updated 3 years ago
- A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.☆67Updated last month
- Automated Deep Research with LLMs, web search, paper parsing, and didactic summarization.☆47Updated 2 weeks ago
- HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for hate speech and offensive language detection…☆33Updated 2 months ago
- Text processing repository to free brazilian municipal gazettes from closed file formats for the Querido Diário project.☆23Updated 2 weeks ago
- Este repositório não está recebendo atualizações | A platform for profiling public figures in Brazilian politics☆163Updated 2 years ago
- Related resources to the paper RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese.☆20Updated last year
- Curadoria dos melhores links compartilhados no grupo https://t.me/nlpbr no Telegram.☆12Updated last year
- Dataset para análise de sentimentos na língua portuguesa com dados coletados do Twitter.☆66Updated 7 years ago
- data about OAB Exams☆12Updated 6 years ago
- ☆91Updated 2 years ago
- ☆49Updated 2 years ago
- Espaço para compartilhamento de empresas de tecnologia na cidade de Goiânia e região e suas fontes de vagas.☆31Updated 3 years ago
- Scripts para capturar dados do Repositório de Dados Eleitorais do TSE, limpá-los, normalizá-los e agrupá-los☆153Updated 2 weeks ago
- Wrapper para API de consulta do acervo do LexML☆45Updated 2 years ago
- O VEÍCULO COLABORATIVO SOBRE TRANSPARÊNCIA E OPEN DATA NO BRASIL.☆4Updated last year
- ☆137Updated last year
- A flexible normalizer for user-generated content☆62Updated 3 weeks ago
- O site de jobs Python☆119Updated 2 years ago
- A pair of tiny foundational models trained in Brazilian Portuguese.🦙🦙☆34Updated 3 months ago
- A list of libraries and NLP projects for Portuguese☆19Updated 7 years ago
- Brazilian city names and official codes, IBGE, LexML and others☆53Updated 4 years ago
- ☆115Updated 5 years ago
- Simplify your video editing workflow with Python 📹☆123Updated 2 months ago
- Code and documentation for the MariTalk API☆278Updated this week
- Linguistic Datasets for Portuguese: Lista de conjuntos de dados linguísticos para língua portuguesa com licença flexíveis: banco de dados…☆72Updated 4 years ago
- Notebooks from Operação Serenata de Amor | ** Este repositório não recebe atualizações frequentes **☆52Updated 4 years ago