Python notebook for a workshop at Convercon Ireland 2019. The workshop, titled "How to Curate an NLP Dataset", covers a process for finding errors in a dataset to improve training.
☆13Feb 16, 2020Updated 6 years ago
Alternatives and similar repositories for datacleanup
Users who are interested in datacleanup are comparing it to the libraries listed below
- Tool for sentiment analysis annotation☆13Mar 26, 2025Updated 11 months ago
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆11Sep 26, 2022Updated 3 years ago
- VADER Reddit Sentiment analysis using Dash in python☆10Dec 8, 2022Updated 3 years ago
- A modular, scalable, fast and reliable phishing detection framework☆11Dec 1, 2018Updated 7 years ago
- Several algorithms for converting dependency structures into constituency structures☆10Feb 7, 2022Updated 4 years ago
- Twitter meets tik tok☆10Jul 25, 2020Updated 5 years ago
- Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)☆10Nov 4, 2019Updated 6 years ago
- A module for normalising text.☆10Nov 6, 2019Updated 6 years ago
- This repository keeps my research materials on Named Entity Recognition using Transfer Learning☆10Oct 15, 2020Updated 5 years ago
- The Heracles framework for developing and evaluating text mining algorithms☆10Jul 1, 2022Updated 3 years ago
- ☆11Nov 29, 2019Updated 6 years ago
- Enterprise Solution for Text Classification (using BERT)☆10Dec 26, 2022Updated 3 years ago
- ☆12Jun 3, 2016Updated 9 years ago
- Simple tool to import/export Elasticsearch indices into a file, and/or reshard an index☆19Jan 25, 2022Updated 4 years ago
- Code for our work "Read, Highlight and Summarize: A Hierarchical Neural Semantic Encoder-based Approach"☆10Oct 28, 2019Updated 6 years ago
- Using Gradio interface to build UI for converting text to speech☆13Jan 26, 2021Updated 5 years ago
- The History of Speech Recognition to the Year 2030☆13Aug 14, 2021Updated 4 years ago
- Improving Sentiment Analysis with Multi-task Learning of Negation☆14May 6, 2021Updated 4 years ago
- Dead simple cron service for making HTTP calls on a regular schedule.☆14Jul 11, 2020Updated 5 years ago
- Speech Recognition Scoring Toolkit☆13Sep 30, 2015Updated 10 years ago
- ☆15May 12, 2017Updated 8 years ago
- Multi-Language Sentiment Analysis☆12May 1, 2023Updated 2 years ago
- The SETimes.HR+ Croatian dependency treebank☆16Dec 27, 2016Updated 9 years ago
- ☆16Feb 9, 2024Updated 2 years ago
- Cluster up to millions of peptide sequences on shared sequence motifs.☆13Oct 1, 2018Updated 7 years ago
- Using an LSTM and 4d convolutional network for lip reading☆12May 11, 2018Updated 7 years ago
- Reverse engineer patterns for use with SpaCy's DependencyMatcher☆36Feb 8, 2020Updated 6 years ago
- HMM Tutorial☆12Apr 15, 2018Updated 7 years ago
- My PyTorch playground for NLP☆13Sep 20, 2018Updated 7 years ago
- A repository of scripts and files related to the CryptoWall version 3 threat☆12Mar 3, 2016Updated 10 years ago
- End to End Multiview Lip Reading☆10Jan 26, 2018Updated 8 years ago
- 🕸 GlotCC Dataset and Pipeline -- NeurIPS 2024☆20Apr 6, 2025Updated 11 months ago
- An (in-progress) AutoML survey focusing on practical systems.☆16Oct 5, 2021Updated 4 years ago
- Guidelines for the responsible use of explainable AI and machine learning.☆17Jan 30, 2023Updated 3 years ago
- Slides and coding demo for word2vec☆12Nov 14, 2016Updated 9 years ago
- Experiments for my masters thesis☆15Jun 27, 2020Updated 5 years ago
- A curated list of Neuro-Symbolic Visual Reasoning☆16Jul 23, 2021Updated 4 years ago
- gzipstream allows Python to process multi-part gzip files from a streaming source☆23Feb 24, 2017Updated 9 years ago
- Sentiment Lexicon Generation Suite☆15Dec 4, 2017Updated 8 years ago