22-hours / cabrita
Finetuning InstructLLaMA with Portuguese data
☆561 Updated last year
Alternatives and similar repositories for cabrita:
Users who are interested in cabrita are comparing it to the libraries listed below
- Code and documentation for the MariTalk API ☆275 Updated last week
- Finetuning Stanford Alpaca (LLaMA) with Brazilian Portuguese data ☆39 Updated last year
- Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exam. ☆45 Updated 3 months ago
- List of resources and tools developed with focus on Portuguese. ☆267 Updated last month
- A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models. ☆67 Updated 3 weeks ago
- ☆16 Updated last year
- LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language ☆167 Updated last year
- Natively pre-trained open-source Portuguese language models. ☆58 Updated last month
- Portuguese pre-trained BERT models ☆834 Updated 9 months ago
- We introduce the Fake.Br Corpus, which is composed of aligned true and fake news written in Brazilian Portuguese. ☆172 Updated 4 years ago
- Face recognition system using MTCNN, FACENET, SVM and FAST API to track participants of Big Brother Brasil in real time. ☆239 Updated 3 years ago
- ☆48 Updated 2 years ago
- A flexible normalizer for user-generated content ☆62 Updated 2 weeks ago
- pre-trained Language Models ☆301 Updated 6 months ago
- Tune any FALCON in 4-bit ☆466 Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆820 Updated last year
- LeIA (Léxico para Inferência Adaptada) is a fork of the lexicon and sentiment analysis tool VADER (Valence Aware Dictionary and s… ☆123 Updated last year
- DAG generator for Apache Airflow that produces clippings of the Diário Oficial da União. ☆105 Updated 3 weeks ago
- Related resources to the paper RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese. ☆20 Updated last year
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning ☆302 Updated 5 months ago
- Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks ☆246 Updated last year
- C++ implementation for BLOOM ☆809 Updated last year
- Linguistic Datasets for Portuguese: a list of linguistic datasets for the Portuguese language with flexible licenses: database… ☆70 Updated 4 years ago
- LLM that combines the principles of wizardLM and vicunaLM ☆715 Updated last year
- Study and implementation of the main Machine Learning algorithms in Jupyter Notebooks. ☆221 Updated 2 years ago
- HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for hate speech and offensive language detection… ☆32 Updated 2 months ago
- fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backe… ☆408 Updated last year
- Contains decisions from Supremo Tribunal Federal ☆20 Updated 3 years ago
- ☆535 Updated last year
- review dataset ☆10 Updated 3 years ago