Easily clean text with spaCy!
☆34Apr 26, 2026Updated last week
Alternatives and similar repositories for spacy-cleaner
Users that are interested in spacy-cleaner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Aug 24, 2023Updated 2 years ago
- Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.☆45May 13, 2024Updated last year
- Neue Scraper☆10Apr 22, 2026Updated 2 weeks ago
- ☆11Apr 17, 2023Updated 3 years ago
- ZH-color-scheme & theme_stat template for ggplot2.☆18Apr 15, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Simplify German language! Leichte Sprache Tool.☆12Oct 20, 2025Updated 6 months ago
- Python Script to access ATT&CK content available in STIX via a public TAXII server☆13Dec 21, 2024Updated last year
- Deep Neural Networks for audio classification☆11Apr 11, 2024Updated 2 years ago
- https://adventofcode.com/2024☆12Dec 25, 2024Updated last year
- 🚀GUI for training spaCy models☆55May 18, 2021Updated 4 years ago
- Training and Inference Notebooks for the RedPajama (OpenLlama) models☆19May 18, 2023Updated 2 years ago
- Fast domain-aware neural network emulation of a planetary boundary layer parameterization in a numerical weather forecast model☆12Mar 26, 2019Updated 7 years ago
- Presentation material for my talk at Pycon DE 2023: Intro on synthetic tabular data including synthetic data generation, evaluation metri…☆13Jan 26, 2026Updated 3 months ago
- Deep Learning framework for fast and clean research with Pytorch☆13Oct 9, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Projekt «Named Entity Recognition für die zentralen Serien des Staatsarchivs Kanton Zürich»☆10Jul 14, 2025Updated 9 months ago
- A simple command-line tool to calculate importance of tokens in prompts sent to an LLM.☆19Apr 3, 2026Updated last month
- ☆20Mar 10, 2025Updated last year
- Python library to convert EEG datasets to a BIDS compatible dataset☆18Apr 10, 2026Updated 3 weeks ago
- A different, but useful, textcat approach.☆18Jul 15, 2024Updated last year
- ☆16Jan 23, 2025Updated last year
- Eine kuratierte Liste hilfreicher Informationen zu Offenen Daten☆20Jun 12, 2022Updated 3 years ago
- Starter Code (R and Python) for all CSV data sets of Team Data Shop, Statistical Office, Canton Zurich☆13Apr 27, 2026Updated last week
- Intuitive graphical representation of source code☆14Mar 15, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Code to create the dataset from "A New Aligned Simple German Corpus☆11Jan 8, 2024Updated 2 years ago
- Command line client for GIN☆14Feb 25, 2023Updated 3 years ago
- Distances between N-dimensional images☆14Jul 20, 2023Updated 2 years ago
- Cosine Similary Search in ElasticSearch + FAISS GPU☆12Mar 24, 2022Updated 4 years ago
- This repository represents a basic implementation of the paper "Riemannian Geometry of Deep Generative Models", along with the results on…☆12Oct 23, 2019Updated 6 years ago
- Official code for the paper "Compositional Generalization from First Principles" (NeurIPS 2023)☆15Jul 25, 2023Updated 2 years ago
- ☆13Jun 7, 2024Updated last year
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- A github action for detecting a "trigger" in a pull request description or comment☆13Jun 13, 2025Updated 10 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Asent is a python library for performing efficient and transparent sentiment analysis using spaCy.☆117Oct 20, 2025Updated 6 months ago
- Improving neural network representations using human similarity judgments☆13Nov 22, 2024Updated last year
- Confusion Matrix in Python: plot a pretty confusion matrix (like Matlab) in python using seaborn and matplotlib☆19Nov 19, 2021Updated 4 years ago
- Examples for how to use DSPY and GEPA☆48Apr 6, 2026Updated last month
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆12Jun 15, 2023Updated 2 years ago
- Wavelet phase harmonic scattering transform☆14Jul 5, 2022Updated 3 years ago
- Homepage of Software Engineering for Machine Learning☆17Feb 4, 2026Updated 3 months ago