dwdyer / zeitgeist
Intelligent RSS news aggregator.
☆33Updated 11 months ago
Related projects: ⓘ
- ☆36Updated 10 months ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆24Updated 6 years ago
- Internet Archive Data Mining Tools☆44Updated 3 years ago
- FeedCrunch.IO - Take RSS Feeds to the next level with personnalized recommendations☆15Updated 2 years ago
- ClickScript is a visual programming language, a data flow programming language running entirely in a web browser.☆64Updated 12 years ago
- Jupyter notebook + Code for reproducing Reddit Subreddit graphs☆16Updated 8 years ago
- A distributed system for mining common crawl using SQS, AWS-EC2 and S3☆14Updated 10 years ago
- The news homepage archive☆81Updated 2 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated last year
- Algorithmic summarizer for RSS/Atom Feeds, Web Urls and arbitrary text. Codebase for the application deployed at http://tldrzr.herokuapp.…☆53Updated 8 years ago
- Discover, analyze and present data from the web and mobile in meaninful ways☆83Updated 11 years ago
- Sauna - a social news reader and curation tool☆52Updated 9 years ago
- Save a bunch of web pages as a self-contained, compressed archive file for offline storage and sharing.☆34Updated 11 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly☆47Updated 3 years ago
- export data from twitter archive and visualize it☆25Updated last year
- An online sentiment analyzer built with Flask and TextBlob☆15Updated 11 years ago
- Common web archive utility code.☆50Updated last week
- Embed Idyll directly in an HTML page☆13Updated last year
- A place for storing ideas.☆15Updated 8 years ago
- ☆15Updated 5 years ago
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles☆23Updated 5 years ago
- ☆23Updated this week
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated 7 months ago
- Whit is an open source SMS service, which allows you to query CrunchBase, Wikipedia, and several other data APIs.☆200Updated 11 years ago
- Trough: Big data, small databases.☆38Updated last month
- A javascript tool to visualize the diff's in wikipedia☆34Updated last year
- Grabbing all news.☆62Updated 4 years ago
- The "hyp.is" service that takes a user to a URL with Hypothesis activated☆47Updated this week
- ☆21Updated 3 years ago
- Tools for exploring the contents of web archive files.☆39Updated 3 years ago