Nkluge-correa / TucanoLinks
Natively pre-trained open-source Portuguese language models.
☆66Updated last month
Alternatives and similar repositories for Tucano
Users that are interested in Tucano are comparing it to the libraries listed below
Sorting:
- List of resources and tools developed with focus on Portuguese.☆283Updated 3 weeks ago
- Code and documentation for the MariTalk API☆288Updated last week
- LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language☆172Updated 2 years ago
- ☆53Updated 2 years ago
- Baixa processos e decisões do Tribunal de Justiça de São Paulo☆93Updated 2 weeks ago
- ☆35Updated last week
- A Natural Portuguese Language Benchmark (Napolab) for the evaluation of language models.☆68Updated last week
- Code and data to evaluate LLMs on the ENEM, the main standardized Brazilian university admission exams.☆47Updated 7 months ago
- Finetuning InstructLLaMA with portuguese data☆563Updated 2 years ago
- NLPortuguês - Aprenda PLN em português! Esse repositório contem os materiais e exercícios do curso NLPortuguês, hospedado tambem no cours…☆106Updated last year
- The evalution suite for the 🚀 Open Portuguese LLM Leaderboard☆20Updated 3 months ago
- Repositório minimalista para criação de agentes de IA inteligentes e versáteis com protocolos A2A (Agent-to-Agent) e MCP (Model Context P…☆128Updated last week
- Portuguese Named Entity Recognition☆59Updated last year
- LeIA (Léxico para Inferência Adaptada) é um fork do léxico e ferramenta para análise de sentimentos VADER (Valence Aware Dictionary and s…☆127Updated 2 years ago
- We introduce the Fake.Br Corpus, which is composed of aligned true and fake news written in Brazilian Portuguese.☆179Updated 4 years ago
- HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for hate speech and offensive language detection…☆33Updated 3 weeks ago
- Portuguese pre-trained BERT models☆839Updated last year
- A flexible normalizer for user-generated content☆63Updated 2 months ago
- Scripts para capturar dados do Repositório de Dados Eleitorais do TSE, limpá-los, normalizá-los e agrupá-los☆153Updated 3 months ago
- Explorador da Constituição: a Constituição Federal e suas Emendas acessíveis para o mundo da Ciência de Dados☆71Updated 4 years ago
- Linguistic Datasets for Portuguese: Lista de conjuntos de dados linguísticos para língua portuguesa com licença flexíveis: banco de dados…☆75Updated 4 years ago
- Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks☆249Updated last year
- Scraper do Portal da Transparência do Governo Federal, em Python 3☆55Updated 3 months ago
- ⚙️ Código de manutenção do datalake (metadados e pacotes de acesso) | 📖 Docs: https://basedosdados.github.io/sdk/☆403Updated last month
- Curso de data science - Diversidados☆22Updated last year
- ☆42Updated last week
- Automated Deep Research with LLMs, web search, paper parsing, and didactic summarization.☆54Updated 3 months ago
- Gerador de DAGs no Apache Airflow para fazer clipping do Diário Oficial da União.☆142Updated 2 weeks ago
- Brazilian city names and official codes, IBGE, LexML and others☆54Updated 4 years ago
- Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis☆43Updated 4 months ago