roll a wikipedia dump into mongo
☆253 · Apr 2, 2026 · Updated 2 months ago
Alternatives and similar repositories for dumpster-dive
Users that are interested in dumpster-dive are comparing it to the libraries listed below.
- a pretty-committed wikipedia markup parser ☆851 · May 27, 2026 · Updated 2 weeks ago
- Svelte infographics component ☆27 · Sep 7, 2020 · Updated 5 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump ☆256 · Dec 5, 2023 · Updated 2 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json dump. Questions? https://gitter.im/idio-opensource/Lobby ☆17 · May 20, 2022 · Updated 4 years ago
- Chambua is an open-source semantic tagging application that analyses text and extracts names of people, places (& geocodes them), organis… ☆33 · Nov 12, 2021 · Updated 4 years ago
- Python tools for parsing Wikipedia/MediaWiki database dumps ☆23 · Feb 28, 2013 · Updated 13 years ago
- Simple btc trend following algorithm, using cbpro api. ☆13 · Nov 15, 2020 · Updated 5 years ago
- A tool for extracting plain text from Wikipedia dumps ☆3,988 · May 23, 2024 · Updated 2 years ago
- Make sure the last sync call to an async function is executed after all previous ones have finished ☆31 · Jan 30, 2016 · Updated 10 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆35 · May 24, 2024 · Updated 2 years ago
- Python wrapper for ClausIE. ☆28 · Aug 27, 2021 · Updated 4 years ago
- Single file RL example with Keras ☆13 · Aug 25, 2017 · Updated 8 years ago
- Allows search engines to crawl and index your single page app! ☆15 · Apr 13, 2016 · Updated 10 years ago
- ☆25 · Apr 28, 2020 · Updated 6 years ago
- RactiveJS components & AmpersandJS models ☆12 · Sep 25, 2015 · Updated 10 years ago
- modest natural-language processing ☆12,113 · May 27, 2026 · Updated 2 weeks ago
- Standalone Semanticizer ☆32 · Mar 4, 2015 · Updated 11 years ago
- Bypass CORS (Cross-Origin Resource Sharing) get HTML from external domains and make your own API ☆15 · Oct 21, 2017 · Updated 8 years ago
- an example app ☆16 · May 2, 2015 · Updated 11 years ago
- node.js interface to the ConceptNet semantic network API [DEPRECATED; ConceptNet API has changed] ☆30 · Oct 5, 2017 · Updated 8 years ago
- display urls being tweeted with an event hashtag ☆18 · Apr 16, 2016 · Updated 10 years ago
- Event sourcing JavaScript entity class ☆11 · Apr 24, 2017 · Updated 9 years ago
- ☆15 · Aug 15, 2012 · Updated 13 years ago
- Yagnus Javascript Libraries ☆22 · Jul 22, 2013 · Updated 12 years ago
- A clone of indri-5.12 with minor customizations. ☆25 · Sep 23, 2024 · Updated last year
- An Abstractive summarizer for online news articles. ☆18 · Mar 25, 2015 · Updated 11 years ago
- Node.js module for the aREST framework ☆11 · Sep 25, 2018 · Updated 7 years ago
- Parse wikipedia dumps and index (some) page data to elasticsearch ☆50 · Jul 21, 2015 · Updated 10 years ago
- Redux and RactiveJS example ☆10 · Mar 8, 2016 · Updated 10 years ago
- npm module for flickr api ☆14 · Jun 24, 2015 · Updated 10 years ago
- JavaScript library for getting geojson from the Wikipedia API ☆22 · Sep 25, 2015 · Updated 10 years ago
- One trick pony NLP library for extracting keywords from HTML documents ☆18 · Jan 6, 2016 · Updated 10 years ago
- JSON-LD parser that implements the RDFJS Sink interface using jsonld.js ☆13 · Mar 2, 2026 · Updated 3 months ago
- Wikipedia Live Monitor ☆22 · Dec 21, 2024 · Updated last year
- fasttag part of speech tagger javascript implementation ☆280 · Apr 27, 2020 · Updated 6 years ago
- Utilities for manipulating finite state transducers with the OpenFst library. ☆32 · Sep 22, 2017 · Updated 8 years ago
- Resolve data table conflicts ☆17 · Jun 11, 2015 · Updated 11 years ago
- THIS REPO HAS BEEN MOVED TO https://github.com/sockethub/sockethub - a simple tool to facilitate handling and referencing activity stream… ☆11 · Dec 30, 2019 · Updated 6 years ago
- Simple spatio-temporal windowing in Kafka Streams ☆13 · Jul 14, 2016 · Updated 9 years ago