๐ Machine learning dataset loaders for testing and example scripts
โ47Mar 26, 2026Updated 2 weeks ago
Alternatives and similar repositories for ml-datasets
Users that are interested in ml-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CMU Linguistic Annotation Backendโ15Sep 22, 2025Updated 6 months ago
- A crowd-sourcing project by Cambridge Digital Library undertaken during the University of Cambridge's closure period due to the Coronavirโฆโ11Nov 1, 2024Updated last year
- Modular Rust transformer/LLM library using Candleโ38May 5, 2024Updated last year
- Generate a SQLite database from Wikipedia & Wikidata dumps.โ36Mar 27, 2024Updated 2 years ago
- ๐ Additional lookup tables and data resources for spaCyโ115Jun 4, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling on Cloudways โข AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Lightweight piece tokenization libraryโ12Apr 15, 2024Updated last year
- ๐ธ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCyโ1,404Mar 27, 2026Updated 2 weeks ago
- SpacyV3 Text Categorizer Tutorialโ17Nov 15, 2020Updated 5 years ago
- A repository to store all of my cheat sheets for various things.โ12Nov 21, 2025Updated 4 months ago
- Wrapper for the macOS signpost APIโ16Apr 24, 2023Updated 2 years ago
- English-Korean dictionary for Kindleโ12Jun 2, 2018Updated 7 years ago
- Benchmark Datasets for BioNLP Tasksโ17May 7, 2025Updated 11 months ago
- ๐ฅ Fast matrix-multiplication as a self-contained Python library โ no system dependencies!โ236Apr 1, 2026Updated last week
- ๐งฌ A VS Code extension for annotating data with Prodigyโ30Nov 25, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI โข AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Inter-annotator agreement for Doccanoโ28May 3, 2020Updated 5 years ago
- Markdown extension to expand directives to include source example files to also include their variants. Only useful to tiangolo's projetsโฆโ15Apr 3, 2026Updated last week
- Python tool for batch visual question answering (BVQA).โ14Sep 18, 2025Updated 6 months ago
- Python library for Bayesian Autoencodersโ13Jun 10, 2022Updated 3 years ago
- โ๏ธ Parallel and distributed training with spaCy and Rayโ56Jul 31, 2023Updated 2 years ago
- ๐ซ Jupyter notebooks for spaCy examples and tutorialsโ287Feb 1, 2019Updated 7 years ago
- Information Extraction Dataset Zoo.โ30Apr 9, 2022Updated 4 years ago
- Introduction to Programming using Pythonโ19Jul 12, 2020Updated 5 years ago
- Python koans for beginner programmersโ18Mar 28, 2015Updated 11 years ago
- Simple, predictable pricing with DigitalOcean hosting โข AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Bag of, not words, but tricks!โ68Oct 31, 2023Updated 2 years ago
- CFFI-based pango-cairo bindings for Python.โ16Jun 25, 2024Updated last year
- Quantifying interactions with government services to support delivery teams to improve their own products and servicesโ10Sep 5, 2022Updated 3 years ago
- ๐ฅ Browser-based slides or PDFs of our talks and presentationsโ94Jan 26, 2019Updated 7 years ago
- Code for "All-In-1: Short Text Classification with One Model for All Languages" - Plank (2017), IJCNLP 2017 shared task 4โ16Oct 26, 2017Updated 8 years ago
- An implementation of BERT using PyTorch's TransformerEncoderโ32Dec 15, 2019Updated 6 years ago
- Book: Practical Probabilistic Machine Learning in Pythonโ10Apr 3, 2021Updated 5 years ago
- Software Engineering Repositoryโ13Aug 30, 2017Updated 8 years ago
- ConceptNet to neo4j 2.2โ10Nov 6, 2015Updated 10 years ago
- End-to-end encrypted cloud storage - Proton Drive โข AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Sentiment Corpus for Swedish ๐ธ๐ช Norwegian ๐ณ๐ด Danish ๐ฉ๐ฐ Finnish ๐ซ๐ฎ (and English ๐ด๓ ง๓ ข๓ ฅ๓ ฎ๓ ง๓ ฟ)โ15May 3, 2021Updated 4 years ago
- This is a complete stack for running Symfony 4 into Docker containers using docker-compose tool.โ10Feb 13, 2018Updated 8 years ago
- โ19Mar 26, 2026Updated 2 weeks ago
- โ12Apr 13, 2018Updated 7 years ago
- โ11May 26, 2020Updated 5 years ago
- 20 python libs and more: read me first!โ12Apr 11, 2024Updated last year
- TextGraphs-13 Shared Task on Multi-Hop Inference Explanation Regenerationโ44Feb 24, 2020Updated 6 years ago