Dataiku DSS plugin to detect languages, correct misspellings, and clean text data 🧼
★22 · Jan 29, 2026 · Updated 4 months ago
Alternatives and similar repositories for dss-plugin-nlp-preparation
Users who are interested in dss-plugin-nlp-preparation are comparing it to the libraries listed below.
- A python library to generate highly realistic typos (fuzz-testing) ★13 · Mar 16, 2025 · Updated last year
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP] ★25 · Jul 5, 2022 · Updated 3 years ago
- Post-processing OCR errors with seq2seq models ★28 · Jul 30, 2020 · Updated 5 years ago
- BERT Tokenizer with vocabulary tailored for Cantonese ★23 · Oct 27, 2022 · Updated 3 years ago
- Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations @ ICCV21 ★13 · Jul 15, 2022 · Updated 3 years ago
- BERT models pretrained on the CORD-19 Kaggle dataset ★15 · Jun 8, 2020 · Updated 6 years ago
- Omnipy is a high level Python library for type-driven data wrangling and scalable workflow orchestration (under development) ★26 · Updated this week
- An OSINT tool to find data leaks on a targeted website ★18 · Mar 30, 2021 · Updated 5 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ★69 · Apr 14, 2026 · Updated 2 months ago
- Bash script to create an ebook from a list of web articles. Inspired by the now-defunct Readlists.org by Readability ★18 · Oct 13, 2019 · Updated 6 years ago
- My OpenCode and Oh-My-OpenCode configuration files with API proxy setup documentation ★37 · Jan 5, 2026 · Updated 5 months ago
- Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage pa… ★39 · Apr 7, 2026 · Updated 2 months ago
- ★15 · Oct 12, 2015 · Updated 10 years ago
- Website for the KGC 2020 Tutorial: "Building a Knowledge Graph from schema.org annotations" ★10 · Jun 26, 2020 · Updated 5 years ago
- Code and Word2Vec embeddings of LOINC codes for KDD 2019 DSHealth paper "Evaluation of Embeddings of Laboratory Test Codes for Patients a… ★11 · Jun 13, 2024 · Updated 2 years ago
- Remark plugin for selecting and storing code blocks from markdown. ★18 · Dec 7, 2022 · Updated 3 years ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation ★15 · Aug 27, 2024 · Updated last year
- A MkDocs plugin to add bootstrap classes to plan markdown generated tables. ★13 · Mar 27, 2020 · Updated 6 years ago
- ★10 · Oct 15, 2020 · Updated 5 years ago
- A collection of python utility functions ★11 · May 8, 2026 · Updated last month
- The best Python package for comparing two dataframes ★12 · Dec 29, 2021 · Updated 4 years ago
- A persistent datastore backed by RocksDB with fuzzy key lookup using an arbitrary distance function accelerated by the SymSpell algorithm ★14 · May 9, 2024 · Updated 2 years ago
- Super simple, zero config options, <2kb declarative tooltip library with no dependencies. ★17 · Jun 2, 2023 · Updated 3 years ago
- The code for the Sales Dashboard demo ★16 · May 19, 2025 · Updated last year
- A Python app that converts vocal recordings into MIDI files. Transform your singing into digital music! ★17 · Jun 7, 2026 · Updated last week
- A Bio2BEL package for DrugBank (https://www.drugbank.ca)