22-hours/cabrita

Finetuning InstructLLaMA with Portuguese data

☆559

Alternatives and similar repositories for cabrita

Users who are interested in cabrita are comparing it to the repositories listed below.

DominguesM / alpaca-lora-ptbr-7b
Finetuning Stanford Alpaca (LLaMA) with Brazilian Portuguese data
☆39 · Apr 10, 2023 · Updated 3 years ago
ajdavidl / Portuguese-NLP
List of resources and tools developed with focus on Portuguese.
☆364 · Jun 25, 2026 · Updated last month
unicamp-dl / quati
☆13 · Nov 10, 2024 · Updated last year
SecexSaudeTCU / noticias_ner
Extractor of entities mentioned in media news
☆15 · May 25, 2021 · Updated 5 years ago
jneto04 / ner-pt
Portuguese Named Entity Recognition
☆61 · Sep 27, 2023 · Updated 2 years ago
neuralmind-ai / portuguese-bert
Portuguese pre-trained BERT models
☆880 · Jun 17, 2024 · Updated 2 years ago
maritaca-ai / maritalk-api
Code and documentation for the MariTalk API
☆328 · Jul 17, 2026 · Updated last week
ruanchaves / napolab
The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their…
☆72 · Jul 28, 2025 · Updated last year
Nkluge-correa / Tucano
Natively pre-trained open-source Portuguese language models.
☆86 · Feb 24, 2026 · Updated 5 months ago
noharm-ai / brateca
Brazilian Tertiary Care Dataset
☆19 · Dec 14, 2022 · Updated 3 years ago
Portuguese-Benchmark-Datasets / BLUEX
☆20 · Dec 22, 2023 · Updated 2 years ago
gustrd / cabra
Fine-tuning OpenLlama-Instruct with Portuguese data, for commercial use.
☆20 · Aug 8, 2023 · Updated 2 years ago
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,913 · Jul 29, 2024 · Updated 2 years ago
felipemaiapolo / legalnlp
LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language
☆191 · Jun 12, 2023 · Updated 3 years ago
fabioacl / PortugueseClinicalNER
☆17 · May 27, 2020 · Updated 6 years ago
nathanshartmann / portuguese_word_embeddings
Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks
☆252 · Oct 12, 2025 · Updated 9 months ago
HAILab-PUCPR / SemClinBr
SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks
☆37 · Mar 12, 2024 · Updated 2 years ago
eduagarcia / lm-evaluation-harness-pt
The evaluation suite for the 🚀 Open Portuguese LLM Leaderboard
☆25 · Aug 31, 2025 · Updated 10 months ago
gururise / AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
☆1,602 · Mar 7, 2026 · Updated 4 months ago
gabinete-compartilhado-acredito / DOUTOR
DOU Tracker, Obtainer & Reporter
☆26 · May 22, 2023 · Updated 3 years ago
tatsu-lab / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,246 · Jul 17, 2024 · Updated 2 years ago
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14 · May 6, 2023 · Updated 3 years ago
lachlansneff / sparsellama
☆40 · Mar 25, 2023 · Updated 3 years ago
sagui-nlp / DeBERTinha
☆12 · Oct 12, 2023 · Updated 2 years ago
synesthesiam / pt-br_pocketsphinx-cmu
Portuguese voice2json profile based on Pocketsphinx
☆11 · Jul 15, 2020 · Updated 6 years ago
maritaca-ai / oab-bench
☆28 · Apr 6, 2026 · Updated 3 months ago
pedropaiola / ptt5-summ
☆10 · Nov 30, 2022 · Updated 3 years ago
nilc-nlp / DNLT-BP
Datasets of Neuropsychological Language Tests in Brazilian Portuguese
☆14 · Oct 14, 2025 · Updated 9 months ago
LibreOffice / OmegaT
Repository to manage document-based translation with OmegaT
☆18 · Nov 1, 2024 · Updated last year
noharm-ai / summary
NoHarm Discharge Summary - Improving Care Transition with LLM
☆25 · Feb 23, 2026 · Updated 5 months ago
piegu / language-models
Pre-trained Language Models
☆310 · May 13, 2025 · Updated last year
unicamp-dl / PTT5
Code for training and evaluating T5 on Portuguese data.
☆91 · Dec 8, 2022 · Updated 3 years ago
PicoCreator / RWKV-LM-LoRA
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …
☆10 · Nov 3, 2023 · Updated 2 years ago
lucianosb / awesome-nlpbr
Curated list of the best links shared in the https://t.me/nlpbr group on Telegram.
☆12 · Apr 10, 2024 · Updated 2 years ago
thalesbertaglia / enelvo
A flexible normalizer for user-generated content
☆64 · Jul 22, 2026 · Updated last week
ruanchaves / elmo
Supporting code for the paper "Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity Tasks".
☆11 · Dec 8, 2022 · Updated 3 years ago
masa3141 / japanese-alpaca-lora
A Japanese fine-tuned instruction LLaMA
☆128 · Mar 20, 2023 · Updated 3 years ago
rodrigokrosa / tacotron2-GL-brazillian-portuguese
Repository to document results of a Tacotron 2 adaptation for Brazilian Portuguese.
☆17 · Sep 8, 2022 · Updated 3 years ago
eduagarcia / roberta-legal-portuguese
Related resources to the paper RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese.
☆22 · Mar 14, 2024 · Updated 2 years ago