Easily clean text with spaCy!
☆34Apr 26, 2026Updated last month
Alternatives and similar repositories for spacy-cleaner
Users that are interested in spacy-cleaner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Aug 24, 2023Updated 2 years ago
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆75Dec 16, 2023Updated 2 years ago
- RDF Community Discussions. Ask anything here!☆13Apr 11, 2024Updated 2 years ago
- Neue Scraper☆10May 6, 2026Updated last month
- Code and data for the CIKM2021 paper "Learning Ideological Embeddings From Information Cascades"☆10Sep 8, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Apr 17, 2023Updated 3 years ago
- ZH-color-scheme & theme_stat template for ggplot2.☆17Updated this week
- Deep Neural Networks for audio classification☆11Apr 11, 2024Updated 2 years ago
- Python Script to access ATT&CK content available in STIX via a public TAXII server☆13Dec 21, 2024Updated last year
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)☆14Nov 25, 2024Updated last year
- ☆17Jan 5, 2023Updated 3 years ago
- https://adventofcode.com/2024☆12Dec 25, 2024Updated last year
- 🚀GUI for training spaCy models☆55May 18, 2021Updated 5 years ago
- Training and Inference Notebooks for the RedPajama (OpenLlama) models☆19May 18, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Deep Learning framework for fast and clean research with Pytorch☆13Oct 9, 2020Updated 5 years ago
- Projekt «Named Entity Recognition für die zentralen Serien des Staatsarchivs Kanton Zürich»☆10Jul 14, 2025Updated 11 months ago
- Named Entity Recognition☆19Feb 13, 2026Updated 4 months ago
- A simple command-line tool to calculate importance of tokens in prompts sent to an LLM.☆19Apr 3, 2026Updated 2 months ago
- ☆20Mar 10, 2025Updated last year
- Python library to convert EEG datasets to a BIDS compatible dataset☆18Apr 10, 2026Updated 2 months ago
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆12Apr 8, 2022Updated 4 years ago
- Decorators for logging purposes for all your dataframes☆15Jan 31, 2025Updated last year
- Benchmark for learning stiff problems using physics-informed machine learning☆13Dec 15, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A different, but useful, textcat approach.☆18Jul 15, 2024Updated last year
- Geographic Data Science in Python - UFMG'19☆12Mar 26, 2019Updated 7 years ago
- ☆16Jan 23, 2025Updated last year
- Eine kuratierte Liste hilfreicher Informationen zu Offenen Daten☆20Jun 12, 2022Updated 4 years ago
- Measure how understandable a German text is.☆12May 31, 2026Updated 2 weeks ago
- Starter Code (R and Python) for all CSV data sets of Team Data Shop, Statistical Office, Canton Zurich☆14May 31, 2026Updated 2 weeks ago
- Demo repository for running eBPF in GitHub Actions☆23Mar 27, 2025Updated last year
- Code to create the dataset from "A New Aligned Simple German Corpus☆11Jan 8, 2024Updated 2 years ago
- Easily make interactive plots of player-tracking data☆11Sep 20, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆15Jul 25, 2023Updated 2 years ago
- ☆13Jun 7, 2024Updated 2 years ago
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- Improving neural network representations using human similarity judgments☆13Nov 22, 2024Updated last year
- Implementing Visual Saliency Models☆13Jan 10, 2018Updated 8 years ago
- Homepage of Software Engineering for Machine Learning☆17May 25, 2026Updated 3 weeks ago
- GPU (CUDA) accelerated filters using 2D convolution for high resolution images.☆15Feb 1, 2025Updated last year