22-hours / cabrita
Finetuning InstructLLaMA with portuguese data
☆554Updated last year
Related projects: ⓘ
- List of resources and tools developed with focus on Portuguese.☆226Updated 2 months ago
- Finetuning Stanford Alpaca (LLaMA) with Brazilian Portuguese data☆39Updated last year
- Code and documentation for the MariTalk API☆245Updated this week
- A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.☆63Updated 3 weeks ago
- LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language☆163Updated last year
- ☆15Updated 8 months ago
- Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exams.☆34Updated 9 months ago
- NLPortuguês - Aprenda PLN em português! Esse repositório contem os materiais e exercícios do curso NLPortuguês, hospedado tambem no cours…☆89Updated 6 months ago
- LeIA (Léxico para Inferência Adaptada) é um fork do léxico e ferramenta para análise de sentimentos VADER (Valence Aware Dictionary and s…☆119Updated last year
- pre-trained Language Models☆280Updated 2 weeks ago
- Portuguese Named Entity Recognition☆59Updated 11 months ago
- Portuguese pre-trained BERT models☆793Updated 3 months ago
- ☆44Updated last year
- Fine-tuning OpenLlama-Instruct with portuguese data, for commercial use.☆18Updated last year
- LLM that combines the principles of wizardLM and vicunaLM☆712Updated last year
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning☆295Updated last year
- A flexible normalizer for user-generated content☆57Updated 2 weeks ago
- review dataset☆9Updated 3 years ago
- Code for training and evaluating T5 on Portuguese data.☆84Updated last year
- Alpaca dataset from Stanford, cleaned and curated☆1,494Updated last year
- Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks☆236Updated last year
- We introduce the Fake.Br Corpus, which is composed of aligned true and fake news written in Brazilian Portuguese.☆165Updated 3 years ago
- Conjunto de POS-taggers treinados para classificação gramatical de sentenças em português.☆57Updated 5 years ago
- NoHarm Discharge Summary - Improving Care Transition with LLM☆16Updated this week
- C++ implementation for BLOOM☆813Updated last year
- An open-source implementation of Google's PaLM models☆804Updated 3 months ago
- Dataset para análise de sentimentos na língua portuguesa com dados coletados do Twitter.☆67Updated 6 years ago
- Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Tra…☆1,290Updated 7 months ago
- Contains decisions from Supremo Tribunal Federal☆17Updated 3 years ago
- Inference code and configs for the ReplitLM model family☆925Updated 11 months ago