roll a wikipedia dump into mongo
☆252Apr 2, 2026Updated last month
Alternatives and similar repositories for dumpster-dive
Users that are interested in dumpster-dive are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a pretty-committed wikipedia markup parser☆852Dec 12, 2025Updated 5 months ago
- Svelte infographics component☆27Sep 7, 2020Updated 5 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump☆256Dec 5, 2023Updated 2 years ago
- plugin to add part-of-speech (POS) tags☆22Oct 23, 2024Updated last year
- A tool for extracting plain text from Wikipedia dumps☆3,985May 23, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Make sure the last sync call to an async function is executed after all previous ones have finished☆31Jan 30, 2016Updated 10 years ago
- Personal configuration files for unix-like environments.☆22Jul 26, 2022Updated 3 years ago
- Allows search engines to crawl and index your single page app!☆15Apr 13, 2016Updated 10 years ago
- Administrative tool for your ipfs.pics server☆13Aug 16, 2016Updated 9 years ago
- 🕸️ Get all DNS Records for any domain☆33Aug 14, 2025Updated 9 months ago
- Nodejs wrapper for Stanford Classifier.☆47Feb 13, 2021Updated 5 years ago
- RactiveJS components & AmpersandJS models☆12Sep 25, 2015Updated 10 years ago
- modest natural-language processing☆12,093Feb 25, 2026Updated 2 months ago
- Standalone Semanticizer☆32Mar 4, 2015Updated 11 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Bypass CORS (Cross-Origin Resource Sharing) get HTML from external domains and make your own API☆15Oct 21, 2017Updated 8 years ago
- Source real estate prices from the Common Crawl.☆27Oct 22, 2018Updated 7 years ago
- an example app☆16May 2, 2015Updated 11 years ago
- node.js interface to the ConceptNet semantic network API [DEPRECATED; ConceptNet API has changed]☆30Oct 5, 2017Updated 8 years ago
- display urls being tweeted with an event hashtag☆18Apr 16, 2016Updated 10 years ago
- Event sourcing JavaScript entity class☆11Apr 24, 2017Updated 9 years ago
- ☆15Aug 15, 2012Updated 13 years ago
- Discover, analyze and present data from the web and mobile in meaninful ways☆83Jul 16, 2013Updated 12 years ago
- A clone of indri-5.12 with minor customizations.☆25Sep 23, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Example os studio microservices☆11May 12, 2016Updated 10 years ago
- An Abstractive summarizer for online news articles.☆18Mar 25, 2015Updated 11 years ago
- Code for co-training large language models (e.g. T0) with smaller ones (e.g. BERT) to boost few-shot performance☆17Sep 23, 2022Updated 3 years ago
- Node.js module for the aREST framework☆11Sep 25, 2018Updated 7 years ago
- ☆13Mar 1, 2024Updated 2 years ago
- Auto naming of objects for easier debugging☆19Jan 22, 2021Updated 5 years ago
- npm module for flickr api☆14Jun 24, 2015Updated 10 years ago
- Graphviz -> Sketchy PNG in one image, for automation☆25Feb 24, 2021Updated 5 years ago
- JavaScript library for getting geojson from the Wikipedia API☆22Sep 25, 2015Updated 10 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- JSON-LD parser that implements the RDFJS Sink interface using jsonld.js☆13Mar 2, 2026Updated 2 months ago
- Wikipedia Live Monitor☆22Dec 21, 2024Updated last year
- Chef cookbook for the http://druid.io/☆10Apr 25, 2016Updated 10 years ago
- Ground - MV(C/VM) Javascript Framework☆100Mar 12, 2018Updated 8 years ago
- fasttag part of speech tagger javascript implementation☆280Apr 27, 2020Updated 6 years ago
- Simple spatio-temporal windowing in Kafka Streams☆13Jul 14, 2016Updated 9 years ago
- thingSoC - Open Source Sockets for the Internet of Things☆16Oct 29, 2016Updated 9 years ago