pre-trained Language Models
☆310May 13, 2025Updated 9 months ago
Alternatives and similar repositories for language-models
Users that are interested in language-models are comparing it to the libraries listed below
Sorting:
- ☆21Sep 26, 2018Updated 7 years ago
- Code for training and evaluating T5 on Portuguese data.☆90Dec 8, 2022Updated 3 years ago
- ☆16May 27, 2020Updated 5 years ago
- Resources for morphological analysis of Portuguese☆26Apr 1, 2025Updated 11 months ago
- ☆56Mar 25, 2023Updated 2 years ago
- Portuguese pre-trained BERT models☆861Jun 17, 2024Updated last year
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆31Mar 12, 2024Updated last year
- Charlson Comorbidity Index Regression using Clinical Notes☆10Jul 26, 2018Updated 7 years ago
- A library for Time-Series exploration, analysis & modelling.☆17Dec 10, 2020Updated 5 years ago
- LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language☆179Jun 12, 2023Updated 2 years ago
- NLP French language model implementing ULMFiT☆87Mar 18, 2019Updated 6 years ago
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their…☆72Jul 28, 2025Updated 7 months ago
- DocILE: Document Information Localization and Extraction Benchmark☆142May 15, 2024Updated last year
- Related resources to the paper RoBERTaLexPT: A Legal RoBERTa Model pretrained with deduplication for Portuguese.☆20Mar 14, 2024Updated last year
- ☆23Feb 11, 2026Updated 3 weeks ago
- Active Learning for Text Classification in Python☆639Feb 1, 2026Updated last month
- Streamlit apps on Cloud Run with Identity-Aware Proxy (IAP).☆10Mar 5, 2022Updated 4 years ago
- A list of libraries and NLP projects for Portuguese☆19May 22, 2017Updated 8 years ago
- Code used for the VICTOR dataset paper☆20Jun 17, 2020Updated 5 years ago
- A pair of tiny foundational models trained in Brazilian Portuguese.🦙🦙☆44Feb 24, 2026Updated last week
- Educational tools for AI Ethics and Safety research 🛠️🔬☆25Jan 7, 2025Updated last year
- Use fastai-v2 with HuggingFace's pretrained transformers☆110Sep 25, 2020Updated 5 years ago
- Transformers for Cost-Sensitive BERT for Generalisable Sentence Classification on Imbalanced Data☆18May 28, 2020Updated 5 years ago
- Open source software for machine learning production monitoring : maintain control over production models, detect bias, explain your resu…☆21Mar 3, 2023Updated 3 years ago
- French Jurisprudences at your fingertips @ every 72h☆15Nov 18, 2025Updated 3 months ago
- A library that integrates huggingface transformers with the world of fastai, giving fastai devs everything they need to train, evaluate, …☆297Feb 15, 2026Updated 2 weeks ago
- ☆12Nov 10, 2024Updated last year
- Repositório para recursos☆12Jun 5, 2023Updated 2 years ago
- Self-contained, comprehensive overview of PT-BR-LLMs advancements, architectures, and resources.☆28Dec 31, 2025Updated 2 months ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 4 years ago
- Pretrained segmenter models for Portuguese legislative text.☆13Oct 13, 2024Updated last year
- Python client library for the ClamAV antivirus.☆12May 15, 2025Updated 9 months ago
- Mining Legal Arguments in Court Decisions - Data and software☆74May 15, 2023Updated 2 years ago
- The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761☆284Jan 22, 2026Updated last month
- The evalution suite for the 🚀 Open Portuguese LLM Leaderboard☆26Aug 31, 2025Updated 6 months ago
- Software that makes labeling PDFs easy.☆427May 13, 2024Updated last year
- A curated list of resources for Document Understanding (DU) topic☆1,503Jun 2, 2023Updated 2 years ago
- Extrator de entidades mencionadas em notícias da mídia☆15May 25, 2021Updated 4 years ago
- A tool for correcting misspellings in textual input using the Noisy Channel Model.☆11Sep 26, 2020Updated 5 years ago