Dataiku DSS plugin to detect languages, correct misspellings, and clean text data π§Ό
β22Jan 29, 2026Updated 2 months ago
Alternatives and similar repositories for dss-plugin-nlp-preparation
Users that are interested in dss-plugin-nlp-preparation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A python library to generate highly realistic typos (fuzz-testing)β13Mar 16, 2025Updated last year
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]β25Jul 5, 2022Updated 3 years ago
- Self-collected data for Masked Face recognition paper (300+ different participants)β12Jul 13, 2023Updated 2 years ago
- Post-processing OCR errors with seq2seq modelsβ28Jul 30, 2020Updated 5 years ago
- Fullstack machine learning inference templateβ31Nov 24, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Common Lisp implementation of the Zipper data structure first described by GerΓ‘rd Huet.β15Dec 21, 2017Updated 8 years ago
- Chrome Extension. Use bookmarks and bookmarklets from the context menu.β15Nov 4, 2015Updated 10 years ago
- Omnipy is a high level Python library for type-driven data wrangling and scalable workflow orchestration (under development)β26Apr 8, 2026Updated last week
- Vietnamese ID information detectionβ19Jun 24, 2022Updated 3 years ago
- Supercharged pandas indexingβ11Mar 28, 2021Updated 5 years ago
- Libp2p / IPFS terminal-based chatβ14Jan 6, 2023Updated 3 years ago
- CSS & HTML on Python Easilyβ11Sep 23, 2024Updated last year
- Extends gunDB with the ability to chain into most.js observables.β14Jan 11, 2017Updated 9 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browserβ13Nov 17, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction [ICRA 2025]β18Oct 20, 2025Updated 5 months ago
- This is a simple paper trading cryptocurrency trading bot that's using Coingecko data to buy and sell coins based on price movement. It cβ¦β12Dec 12, 2024Updated last year
- An OSINT tool to find data leaks on a targeted websiteβ17Mar 30, 2021Updated 5 years ago
- Bash script to create an ebook from a list of web articles. Inspired by the now-defunct Readlists.org by Readabilityβ18Oct 13, 2019Updated 6 years ago
- Bayesian Optimization Meets Self-Distillation, ICCV 2023β10Aug 28, 2023Updated 2 years ago
- 20 python libs and more: read me first!β12Apr 11, 2024Updated 2 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languagesβ11Feb 6, 2024Updated 2 years ago
- Proof of concept for Nuxt.js Documentation with MDX + Vue live editorβ15Dec 12, 2022Updated 3 years ago
- Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage paβ¦β39Apr 7, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Persistent Gun super peer supporting HTTP and HTTPS/SSL.β13Oct 9, 2024Updated last year
- Website for the KGC 2020 Tutorial: "Building a Knowledge Graph from schema.org annotations"β10Jun 26, 2020Updated 5 years ago
- A parser class for simple formulae.β12Feb 21, 2017Updated 9 years ago
- Simple web code editor build with web components librariesβ15Oct 12, 2023Updated 2 years ago
- Remark plugin for selecting and storing code blocks from markdown.β18Dec 7, 2022Updated 3 years ago
- β12Apr 28, 2023Updated 2 years ago
- A collection of python utility functionsβ11Mar 30, 2026Updated 2 weeks ago
- Automatically perform exploratory data analysis, and generate a report in Word '.docx' format.β10Feb 11, 2026Updated 2 months ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).β14Jun 15, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- β11Apr 8, 2022Updated 4 years ago
- A simple way to copy a frontmatter key in obsidian, and create an url from it !β19May 25, 2024Updated last year
- A Bio2BEL package for DrugBank (https://www.drugbank.ca)β10Dec 14, 2020Updated 5 years ago
- Force Users to upload profile photo before they can use the site.β10Dec 17, 2017Updated 8 years ago
- Library and examples to interface a HPGL plotter such as HP7550a to processing.β10Jan 15, 2015Updated 11 years ago
- Change the structure of an existing JSON object with this mapping moduleβ13Jan 6, 2023Updated 3 years ago
- personal diaryβ14Updated this week