Linguistic Datasets for Portuguese: a list of linguistic datasets for the Portuguese language with flexible licenses: databases, word lists, synonyms, antonyms, thematic dictionaries, thesauri, linked data, semantics, ontologies, and knowledge representation
☆82Nov 21, 2020Updated 5 years ago
Alternatives and similar repositories for linguistic-datasets-portuguese
Users that are interested in linguistic-datasets-portuguese are comparing it to the libraries listed below.
- DicSin - Brazilian Portuguese Synonym Dictionary☆23May 21, 2018Updated 8 years ago
- The Brazilian Portuguese language, Unitex primary sources for the vocabulary and dictionary definitions☆25Jan 14, 2018Updated 8 years ago
- Dicionário Histórico Biográfico Brasileiro☆13Apr 1, 2025Updated last year
- Named entity extraction from Portuguese web text☆71Aug 16, 2017Updated 8 years ago
- Meetups held periodically by the DevFSA community☆15Dec 4, 2018Updated 7 years ago
- Deep troll uses a deep learning model that identifies whether an audio contains the Gemidao troll (AAAWN OOOWN NHAAA AWWWWN AAAAAH).☆19Dec 8, 2022Updated 3 years ago
- OpenWordnet-PT: an open access wordnet for Portuguese☆160Apr 19, 2026Updated last month
- Resources for morphological analysis of Portuguese☆28Apr 19, 2026Updated last month
- Generates the most important key-phrase/key-words from a document based on a corpus☆10Jun 17, 2024Updated last year
- R package: Lexicons for Portuguese Text Analysis☆59Jan 9, 2018Updated 8 years ago
- Slides and scripts from Data in Bahia meetups☆13Dec 8, 2022Updated 3 years ago
- List of resources and tools developed with focus on Portuguese.☆357Jun 26, 2025Updated 10 months ago
- Universal Dependencies (UD) Portuguese treebank.☆53May 6, 2026Updated 2 weeks ago
- Using Python + MongoDB to download legislative bills, scan the PDFs, and read the texts☆22Mar 5, 2018Updated 8 years ago
- 🐝 Tiny CLI to post simultaneously to Mastodon and Bluesky☆17Apr 3, 2026Updated last month
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆34Mar 12, 2024Updated 2 years ago
- Public procurement bids of Feira de Santana, made easily accessible to citizens 🏦☆18Mar 2, 2020Updated 6 years ago
- Template for the final course project (Trabalho de Conclusão de Curso) of the Computer Engineering program at the Universidade Estadual de Feira de Santana, based on the standards NBR…☆16Dec 12, 2013Updated 12 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Notebooks and other course materials for Emory QTM 340 (Fall 2022)☆12Dec 13, 2022Updated 3 years ago
- Source code for ACL-IJCNLP 2021 findings paper: SIRE: Separate Intra- and Inter-sentential Reasoning for Document-level Relation Extracti…☆21Aug 4, 2022Updated 3 years ago
- ☆12Apr 29, 2022Updated 4 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- The UOL essay bank (http://educacao.uol.com.br/bancoderedacoes/) in XML, as a model for testing and validating PLN techniques (Pro…