Python Notebook for a workshop at Convercon Ireland 2019. The title is How to Curate and NLP Dataset and is about a process to find errors in a dataset to improve training.
☆13Feb 16, 2020Updated 6 years ago
Alternatives and similar repositories for datacleanup
Users that are interested in datacleanup are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Jan 18, 2020Updated 6 years ago
- Tool for sentiment analysis annotation☆13Mar 26, 2025Updated last year
- Go programming language resources (code snippets, workshops, etc.) #golang☆19Aug 27, 2018Updated 7 years ago
- ☆15Oct 11, 2020Updated 5 years ago
- List of resources to get started with Deep Learning for NLP.☆14Mar 30, 2016Updated 10 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Using Gradio interface to build UI for converting text to speech☆13Jan 26, 2021Updated 5 years ago
- This repository keep my research materials about Named Entity Recognition using Transfer Learning☆10Oct 15, 2020Updated 5 years ago
- VADER Reddit Sentiment analysis using Dash in python☆10Dec 8, 2022Updated 3 years ago
- Multilabel Convolutional Neural Network model to analyze and extract sentiments in texts in Brazilian Portuguese language.☆12Mar 13, 2018Updated 8 years ago
- [NeurIPS 2024] 🕸 GlotCC Dataset and Pipline☆20Apr 6, 2025Updated last year
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆12Sep 26, 2022Updated 3 years ago
- Refining pre-trained word embeddings with supervised word-label embeddings for Text Classification (by topic)☆12Jul 6, 2021Updated 4 years ago
- This is an implementation of electra according to the paper {ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators…☆13Jun 3, 2020Updated 5 years ago
- Multi-Language Sentiment Analysis☆12May 1, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Improving Sentiment Analysis with Multi-task Learning of Negation☆14May 6, 2021Updated 5 years ago
- The Heracles framework for developing and evaluating text mining algorithms☆10Jul 1, 2022Updated 3 years ago
- ☆12Nov 29, 2019Updated 6 years ago
- My PyTorch playground for NLP☆13Sep 20, 2018Updated 7 years ago
- Cluster up to millions of peptide sequences on shared sequence motifs.☆13Oct 1, 2018Updated 7 years ago
- ☆15May 12, 2017Updated 9 years ago
- An interactive Elasticsearch Workshop for beginners to learn about Elasticsearch and Search Engines in a hands-on way.☆17Oct 22, 2017Updated 8 years ago
- The SETimes.HR+ Croatian dependency treebank☆16Dec 27, 2016Updated 9 years ago
- ☆28Apr 19, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A curated list of Neuro-Symbolic Visual Reasoning☆17Jul 23, 2021Updated 4 years ago
- Code for ACL'20 paper "It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations"☆19May 1, 2025Updated last year
- Library for Character/Word n-gram Analysis☆23Mar 2, 2017Updated 9 years ago
- Experiments for my masters thesis☆15Jun 27, 2020Updated 5 years ago
- Dynamic data selection for neural machine translation☆20Jan 28, 2018Updated 8 years ago
- Poly-encoder architecture and pre-training pipeline implementation (pytorch)☆16Jun 29, 2020Updated 5 years ago
- SRL4ORL: Improving Opinion Role Labeling Using Multi-Task Learning With Semantic Role Labeling☆14Oct 10, 2018Updated 7 years ago
- ☆19Sep 29, 2019Updated 6 years ago
- A3C and generic hierarchical RL for sentiment analysis tasks☆15Dec 1, 2019Updated 6 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Full Stack Data Science projects centered around Apache Spark Streaming for educational purpose.☆19May 1, 2023Updated 3 years ago
- Text processing library for sentiment analysis and related tasks☆27Apr 23, 2026Updated 3 weeks ago
- Source-LDA: Enhancing probabilistic topic models using prior knowledge sources (ICDE 2017)☆21May 18, 2017Updated 9 years ago
- SentiSE is a sentiment analysis tool for Software Engineering interactions☆18Oct 21, 2019Updated 6 years ago
- Tools for training pytorch language models☆27Nov 14, 2020Updated 5 years ago
- Machine Learning and Object Oriented Programming with Python, a FAES course at the National Institutes of Health☆20Jun 7, 2016Updated 9 years ago
- Modularizing Unsupervised Sense Embedding☆30Feb 15, 2018Updated 8 years ago