EticaAI / linguistic-datasets-portugueseView external linksLinks
Linguistic Datasets for Portuguese: Lista de conjuntos de dados linguísticos para língua portuguesa com licença flexíveis: banco de dados, lista de palavras, sinônimos, antônimos, dicionário temático, tesauro, linked data, semântica, ontologia e representação de conhecimento
☆82Nov 21, 2020Updated 5 years ago
Alternatives and similar repositories for linguistic-datasets-portuguese
Users that are interested in linguistic-datasets-portuguese are comparing it to the libraries listed below
Sorting:
- Named entity extraction from Portuguese web text☆71Aug 16, 2017Updated 8 years ago
- PHP low-level client for Vespa. https://vespa.ai/☆17Jan 22, 2026Updated 3 weeks ago
- 💻 Remoto Brasil - Dicas e jobs remotos aqui :D☆15Apr 21, 2018Updated 7 years ago
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆31Mar 12, 2024Updated last year
- Modelo de Trabalho de Conclusão do Curso de Engenharia de Computação da Universidade Estadual de Feira de Santana, baseado nas normas NBR…☆14Dec 12, 2013Updated 12 years ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆18Jan 15, 2026Updated last month
- Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).☆17Nov 18, 2021Updated 4 years ago
- sequence tagging with spaCy and crfsuite☆20Mar 18, 2023Updated 2 years ago
- SMiLER - Samsung MultiLingual Entity and Relation Extraction dataset☆18Feb 11, 2021Updated 5 years ago
- List of resources and tools developed with focus on Portuguese.☆309Jun 26, 2025Updated 7 months ago
- Character Based Named Entity Recognition.☆40Apr 3, 2018Updated 7 years ago
- A Language-consistent Open Relation Extraction Model.☆16Mar 24, 2023Updated 2 years ago
- Source code of ACL2022 "Headed-Span-Based Projective Dependency Parsing" and "Combining (second-order) graph-based and headed-span-based …☆16Jan 12, 2023Updated 3 years ago
- Source code for ACL-IJCNLP 2021 findings paper: SIRE: Separate Intra- and Inter-sentential Reasoning for Document-level Relation Extracti…☆21Aug 4, 2022Updated 3 years ago
- BERT model fine-tuned for question answering tasks in Portuguese text☆18Aug 3, 2020Updated 5 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- This Universal Dependencies (UD) Portuguese treebank.☆53Nov 12, 2025Updated 3 months ago
- R package: Lexicons for Portuguese Text Analysis☆59Jan 9, 2018Updated 8 years ago
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Oct 4, 2022Updated 3 years ago
- A flexible normalizer for user-generated content☆63Feb 5, 2026Updated last week
- Resources for morphological analysis of Portuguese☆26Apr 1, 2025Updated 10 months ago
- 📑 Python Package to reconstruct the original continuous text from PDFs with language models☆32Sep 8, 2023Updated 2 years ago
- The Natural Portuguese Language Benchmark (Napolab). Stay up to date with the latest advancements in Portuguese language models and their…☆72Jul 28, 2025Updated 6 months ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Apr 24, 2019Updated 6 years ago
- NextAuth + AWS Cognito Email & Google Login Example, also provide blog posts for detail explanation 🐉☆14Oct 15, 2024Updated last year
- Dependency Parsing as Sequence Labeling☆27Jul 25, 2024Updated last year
- ☆26Jan 23, 2024Updated 2 years ago
- Escrevi este roadmap para ajudar amigos próximos, está aberto a sugestões!☆14Sep 9, 2025Updated 5 months ago
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆42Dec 14, 2022Updated 3 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆36Mar 24, 2023Updated 2 years ago
- Search COVID-19 Open Research Dataset (CORD-19) using Vespa - the open source big data serving engine.☆38Nov 11, 2025Updated 3 months ago
- AMALGrAM, an English supersense tagger written in Python☆33May 31, 2017Updated 8 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated 10 months ago
- Chatbot that answers frequently asked questions in French, English, and Tunisian using the Rasa NLU framework and RWKV-4-Raven☆13May 19, 2023Updated 2 years ago
- Handles OpenDocument files and translates them to HTML.☆10Oct 8, 2019Updated 6 years ago
- 🧠 A neovim plugin to handle commit using AI☆16Jun 19, 2025Updated 7 months ago
- PyLadiesCon 2025 Conference website☆16Dec 22, 2025Updated last month
- Redis distributed lock implementation for Python based on Pub/Sub messaging☆11Nov 15, 2025Updated 3 months ago
- Narwhal is a keyword and KEY NARRATIVE manager that creates language-aware classes. Because Narhwal does not use NLP it avoids complexity…☆12Oct 16, 2018Updated 7 years ago