Nkluge-correa / TeenyTinyLlama
A pair of tiny foundational models trained in Brazilian Portuguese.π¦π¦
β34Updated 3 months ago
Alternatives and similar repositories for TeenyTinyLlama:
Users that are interested in TeenyTinyLlama are comparing it to the libraries listed below
- Natively pre-trained open-source Portuguese language models.β60Updated last week
- Portuguese translation of the GLUE benchmark and Scitail datasetβ31Updated 2 years ago
- A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.β67Updated last month
- Fine-tuning OpenLlama-Instruct with portuguese data, for commercial use.β19Updated last year
- Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exams.β46Updated 4 months ago
- β47Updated last year
- The evalution suite for the π Open Portuguese LLM Leaderboardβ20Updated 2 weeks ago
- HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for hate speech and offensive language detectionβ¦β33Updated 2 months ago
- β29Updated last year
- π₯ Use Hugging Face text and token classification pipelines directly in spaCyβ63Updated last year
- FaQuAD reading comprehension dataset and related code to reproduce experiments from Sayama et al. (BRACIS 2019).β8Updated 2 years ago
- Finetuning Stanford Alpaca (LLaMA) with Brazilian Portuguese dataβ39Updated 2 years ago
- Pre-train Static Word Embeddingsβ56Updated 2 weeks ago
- Code for training and evaluating T5 on Portuguese data.β86Updated 2 years ago
- Transformer model for Portuguese language (Brazil pt_BR)β16Updated last year
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.β79Updated last year
- Unofficial python bindings for the rust llm library. πβ€οΈπ¦β75Updated last year
- β49Updated 2 years ago
- Repository for deepdoctection tutorial notebooksβ44Updated 5 months ago
- β16Updated last year
- Contains decisions from Supremo Tribunal Federalβ21Updated 3 years ago
- Efficient few-shot learning with cross-encoders.β51Updated last year
- β20Updated last year
- Curadoria dos melhores links compartilhados no grupo https://t.me/nlpbr no Telegram.β12Updated last year
- Generalist and Lightweight Model for Text Classificationβ121Updated 2 weeks ago
- pre-trained Language Modelsβ302Updated 7 months ago
- pt-BR Corpus with the Wikipedia dumpβ26Updated 5 years ago
- OpenWordnet-PT: an open access wordnet for Portugueseβ156Updated last month
- Trully flash implementation of DeBERTa disentangled attention mechanism.β45Updated 2 weeks ago
- β11Updated 2 years ago