Downloads 2020 English Wikipedia articles as plaintext
☆27 · Mar 25, 2023 · Updated 3 years ago
Alternatives and similar repositories for wikipedia-downloader
Users that are interested in wikipedia-downloader are comparing it to the libraries listed below.
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile. ☆15 · Jun 3, 2023 · Updated 2 years ago
- Script for downloading GitHub. ☆13 · Sep 24, 2020 · Updated 5 years ago
- ☆33 · May 23, 2023 · Updated 2 years ago
- ☆95 · Jul 16, 2022 · Updated 3 years ago
- URL downloader supporting checkpointing and continuous checksumming. ☆19 · Nov 29, 2023 · Updated 2 years ago
- Script for downloading GitHub. ☆99 · Jul 1, 2024 · Updated last year
- Downloads and parses subtitle dataset from opensubtitles.org ☆15 · Apr 19, 2024 · Updated 2 years ago
- Towards Memorization-Free Diffusion Models (CVPR2024) Codebase ☆11 · Jun 2, 2024 · Updated last year
- [ICML 2023] Are Diffusion Models Vulnerable to Membership Inference Attacks? ☆43 · Sep 4, 2024 · Updated last year
- Simple migration engine for Peewee ☆19 · Updated this week
- ☆1,644 · Apr 27, 2023 · Updated 2 years ago
- The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters" ☆32 · Apr 13, 2026 · Updated last week
- Collection of footnotes from the book Linux 多线程服务端编程 (Linux Multi-Threaded Server-Side Programming) ☆21 · Nov 5, 2021 · Updated 4 years ago
- Not financial advice. ☆28 · Mar 18, 2023 · Updated 3 years ago
- Tool to generate documentation for Nelua source files. ☆10 · Dec 11, 2021 · Updated 4 years ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Models ☆86 · Dec 6, 2023 · Updated 2 years ago
- A Python script that generates Snort IDS rules from network packets ☆24 · Oct 30, 2017 · Updated 8 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆45 · Oct 1, 2025 · Updated 6 months ago
- ☆15 · Sep 24, 2023 · Updated 2 years ago
- codesearch.ai semantic code search engine ☆42 · Mar 24, 2023 · Updated 3 years ago
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆16 · Mar 30, 2026 · Updated 2 weeks ago
- A dataset of alignment research and code to reproduce it ☆78 · Jun 22, 2023 · Updated 2 years ago
- Dataset of Canada goose images with annotations of bounding boxes with object classes, suitable for testing object detection algorithms. ☆40 · Aug 2, 2018 · Updated 7 years ago
- A syntax highlighter for the web using tree-sitter. ☆16 · Sep 9, 2022 · Updated 3 years ago
- Implementation of stop sequencer for Huggingface Transformers