Dataiku DSS plugin to detect languages, correct misspellings, and clean text data π§Ό
β22Jan 29, 2026Updated 3 months ago
Alternatives and similar repositories for dss-plugin-nlp-preparation
Users that are interested in dss-plugin-nlp-preparation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Stroke-based Character Reconstruction ---> https://arxiv.org/abs/1806.08990β15Dec 6, 2021Updated 4 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]β25Jul 5, 2022Updated 3 years ago
- Self-collected data for Masked Face recognition paper (300+ different participants)β12Jul 13, 2023Updated 2 years ago
- fastlangid, the only language identification package that support cantonese (zh-yue), simplified (zh-hans) and traditional chinese (zh-haβ¦β43Dec 6, 2022Updated 3 years ago
- Post-processing OCR errors with seq2seq modelsβ28Jul 30, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β12Jun 3, 2021Updated 4 years ago
- Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations @ ICCV21β13Jul 15, 2022Updated 3 years ago
- Transform MCR 3.0 data to read with nltk WordNet reader. Use this to load WordNet in Spanish, among other languages, from nltk.β25Oct 10, 2022Updated 3 years ago
- This repo contains my works on the area of NLP, such as Neural Machine Translation, Named Entity Recognition etc,.β13Sep 19, 2020Updated 5 years ago
- Supercharged pandas indexingβ11Mar 28, 2021Updated 5 years ago
- Vietnamese ID information detectionβ19Jun 24, 2022Updated 3 years ago
- β13Nov 30, 2022Updated 3 years ago
- CSS & HTML on Python Easilyβ11Sep 23, 2024Updated last year
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browserβ13Nov 17, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- nocaps: novel object captioning at scaleβ10May 23, 2019Updated 6 years ago
- CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction [ICRA 2025]β18Oct 20, 2025Updated 6 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any otheβ¦β69Apr 14, 2026Updated 3 weeks ago
- Bash script to create an ebook from a list of web articles. Inspired by the now-defunct Readlists.org by Readabilityβ18Oct 13, 2019Updated 6 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languagesβ11Feb 6, 2024Updated 2 years ago
- Resources and documentation for UK Biobank to OMOP CDM v5.3.1 conversionβ10Oct 20, 2020Updated 5 years ago
- Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage paβ¦β39Apr 7, 2026Updated last month
- β15Oct 12, 2015Updated 10 years ago
- Remark plugin for selecting and storing code blocks from markdown.β18Dec 7, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β12Apr 28, 2023Updated 3 years ago
- A MkDocs plugin to add bootstrap classes to plan markdown generated tables.β13Mar 27, 2020Updated 6 years ago
- β10Oct 15, 2020Updated 5 years ago
- Super simple, zero config options, <2kb declarative tooltip library with no dependencies.β17Jun 2, 2023Updated 2 years ago
- A Python app that converts vocal recordings into MIDI files. Transform your singing into digital music!β17Nov 1, 2025Updated 6 months ago
- A Bio2BEL package for DrugBank (https://www.drugbank.ca)β10Dec 14, 2020Updated 5 years ago
- A WordPress plugin to receive movie/series information, including poster and trailer from IMDB.β10May 21, 2017Updated 8 years ago
- Force Users to upload profile photo before they can use the site.β10Dec 17, 2017Updated 8 years ago
- Design your Material-UI buttons, add clickable hyperlinks, integrate them in your Streamlit apps! πβ10Jun 17, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Configuration system geared towards Python ML projectsβ11Apr 30, 2023Updated 3 years ago
- Dark and bright them for Sublimeβ22Sep 15, 2016Updated 9 years ago
- β12Oct 24, 2025Updated 6 months ago
- Neural Sentiment Analyzer for Modern Hebrewβ21Nov 21, 2022Updated 3 years ago
- personal diaryβ14Apr 28, 2026Updated last week
- Python utility to extract differences between two pandas dataframes.β11Apr 4, 2026Updated last month
- pixel-mosaic converts images into pixel art and preserves features while downscalingβ30Mar 7, 2026Updated last month