roll a wikipedia dump into mongo
☆253 · Apr 2, 2026 · Updated last week
Alternatives and similar repositories for dumpster-dive
Users that are interested in dumpster-dive are comparing it to the libraries listed below.
- a pretty-committed wikipedia markup parser ☆851 · Dec 12, 2025 · Updated 3 months ago
- Svelte infographics component ☆27 · Sep 7, 2020 · Updated 5 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump ☆256 · Dec 5, 2023 · Updated 2 years ago
- ☆14 · Dec 24, 2016 · Updated 9 years ago
- ☆11 · Dec 2, 2016 · Updated 9 years ago
- Python tools for parsing Wikipedia/MediaWiki database dumps ☆23 · Feb 28, 2013 · Updated 13 years ago
- Simple btc trend following algorithm, using cbpro api. ☆13 · Nov 15, 2020 · Updated 5 years ago
- A tool for extracting plain text from Wikipedia dumps ☆3,976 · May 23, 2024 · Updated last year
- distributed shell job control with role based configuration for Node.js ☆15 · Apr 22, 2014 · Updated 11 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆35 · May 24, 2024 · Updated last year
- Python wrapper for ClausIE. ☆27 · Aug 27, 2021 · Updated 4 years ago
- A node base class for Javascript and Coffeescript (logging, options, defaults and EventEmitter) ☆13 · Mar 23, 2011 · Updated 15 years ago
- Personal configuration files for unix-like environments. ☆22 · Jul 26, 2022 · Updated 3 years ago
- Allows search engines to crawl and index your single page app! ☆15 · Apr 13, 2016 · Updated 9 years ago
- Nodejs wrapper for Stanford Classifier. ☆47 · Feb 13, 2021 · Updated 5 years ago
- RactiveJS components & AmpersandJS models ☆12 · Sep 25, 2015 · Updated 10 years ago
- modest natural-language processing ☆12,064 · Feb 25, 2026 · Updated last month
- Standalone Semanticizer ☆32 · Mar 4, 2015 · Updated 11 years ago
- Copy engineering rules! ☆29 · Feb 26, 2015 · Updated 11 years ago
- node.js interface to the ConceptNet semantic network API [DEPRECATED; ConceptNet API has changed] ☆30 · Oct 5, 2017 · Updated 8 years ago
- display urls being tweeted with an event hashtag ☆18 · Apr 16, 2016 · Updated 9 years ago
- ☆15 · Aug 15, 2012 · Updated 13 years ago
- Yagnus Javascript Libraries ☆22 · Jul 22, 2013 · Updated 12 years ago
- Discover, analyze and present data from the web and mobile in meaningful ways ☆83 · Jul 16, 2013 · Updated 12 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags" ☆27 · Dec 6, 2013 · Updated 12 years ago
- Example os studio microservices ☆11 · May 12, 2016 · Updated 9 years ago
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance ☆17 · Sep 23, 2022 · Updated 3 years ago
- An Abstractive summarizer for online news articles. ☆18 · Mar 25, 2015 · Updated 11 years ago
- Little list of happy places ☆17 · Jun 16, 2020 · Updated 5 years ago
- Node.js module for the aREST framework ☆11 · Sep 25, 2018 · Updated 7 years ago
- ☆13 · Mar 1, 2024 · Updated 2 years ago
- Parse wikipedia dumps and index (some) page data to elasticsearch ☆50 · Jul 21, 2015 · Updated 10 years ago
- Redux and RactiveJS example ☆10 · Mar 8, 2016 · Updated 10 years ago
- npm module for flickr api ☆14 · Jun 24, 2015 · Updated 10 years ago
- Graphviz -> Sketchy PNG in one image, for automation ☆25 · Feb 24, 2021 · Updated 5 years ago
- A datepicker component for RactiveJs ☆10 · Jun 26, 2018 · Updated 7 years ago
- JavaScript library for getting geojson from the Wikipedia API ☆22 · Sep 25, 2015 · Updated 10 years ago
- One trick pony NLP library for extracting keywords from HTML documents ☆18 · Jan 6, 2016 · Updated 10 years ago
- JSON-LD parser that implements the RDFJS Sink interface using jsonld.js ☆13 · Mar 2, 2026 · Updated last month