Linguistic Datasets for Portuguese: Lista de conjuntos de dados linguísticos para língua portuguesa com licença flexíveis: banco de dados, lista de palavras, sinônimos, antônimos, dicionário temático, tesauro, linked data, semântica, ontologia e representação de conhecimento
☆82Nov 21, 2020Updated 5 years ago
Alternatives and similar repositories for linguistic-datasets-portuguese
Users that are interested in linguistic-datasets-portuguese are comparing it to the libraries listed below
Sorting:
- DicSin - Dicionário de Sinônimos Português Brasil☆23May 21, 2018Updated 7 years ago
- To help search, filter, and download papers from 'acl anthology' (https://aclanthology.org/).☆18Sep 12, 2024Updated last year
- Named entity extraction from Portuguese web text☆71Aug 16, 2017Updated 8 years ago
- Deep troll uses a deep learning model that identifies whether an audio contains the Gemidao troll (AAAWN OOOWN NHAAA AWWWWN AAAAAH).☆19Dec 8, 2022Updated 3 years ago
- OpenWordnet-PT: an open access wordnet for Portuguese☆160Jul 24, 2025Updated 7 months ago
- An editor for EBNF grammars, used by Lark – parsing library for Python☆12Apr 25, 2019Updated 6 years ago
- A flexible normalizer for user-generated content☆64Feb 5, 2026Updated last month
- Resources for morphological analysis of Portuguese☆26Apr 1, 2025Updated 11 months ago
- Generates the most important key-phrase/key-words from a document based on a corpus☆10Jun 17, 2024Updated last year
- List of resources and tools developed with focus on Portuguese.☆310Jun 26, 2025Updated 8 months ago
- PHP low-level client for Vespa. https://vespa.ai/☆17Jan 22, 2026Updated last month
- This Universal Dependencies (UD) Portuguese treebank.☆53Nov 12, 2025Updated 4 months ago
- Usando Python + MongoDB para baixar projetos de leis, escanear os PDFs e ler os textos☆22Mar 5, 2018Updated 8 years ago
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆31Mar 12, 2024Updated 2 years ago
- Character Based Named Entity Recognition.☆40Apr 3, 2018Updated 7 years ago
- Licitações de Feira de Santana de fácil acesso aos cidadãos 🏦☆18Mar 2, 2020Updated 6 years ago
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- pyHDB - Ferramenta de auxílio metodológico para pesquisas na interface da Hemeroteca Digital Brasileira da Biblioteca Nacional. Desenvo…☆11Oct 3, 2025Updated 5 months ago
- Source code for ACL-IJCNLP 2021 findings paper: SIRE: Separate Intra- and Inter-sentential Reasoning for Document-level Relation Extracti…☆21Aug 4, 2022Updated 3 years ago
- Implementation of a dependency parser using neural networks☆11Mar 7, 2017Updated 9 years ago
- ☆12Feb 9, 2021Updated 5 years ago
- ☆12Apr 29, 2022Updated 3 years ago
- Trying to deconstruct RWKV in understandable terms☆14May 6, 2023Updated 2 years ago
- A Lemmatizer for Portuguese☆32Mar 6, 2019Updated 7 years ago
- Source code of ACL2022 "Headed-Span-Based Projective Dependency Parsing" and "Combining (second-order) graph-based and headed-span-based …☆16Jan 12, 2023Updated 3 years ago
- Entrega de dados dinâmicos para cecm.usp.br utilizando GitHub CDN Pages.☆12Dec 5, 2025Updated 3 months ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 7 years ago
- ☆13Jun 3, 2023Updated 2 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- Portuguese voice2json profile based on Pocketsphinx☆11Jul 15, 2020Updated 5 years ago
- WordPress Plugin to include Material Design Icons☆10Aug 28, 2023Updated 2 years ago
- Tensorflow implementation of the Skipgram model with different scripts to train Portuguese word embeddings.☆18Aug 26, 2017Updated 8 years ago
- An R package for reading data in the DBC (compressed DBF) format used by DATASUS.☆83Nov 20, 2025Updated 4 months ago
- Portuguese pre-trained BERT models☆865Jun 17, 2024Updated last year
- Optimized Differentiable Neural Computer In Chainer☆23Jul 12, 2018Updated 7 years ago
- Flask server for RWKV☆10Apr 3, 2023Updated 2 years ago
- Node.js implementation binding for the RWKV.cpp module☆21Aug 2, 2023Updated 2 years ago
- Automatically create YaSnippets in R☆24Sep 2, 2015Updated 10 years ago