anvaka / sayit-data
data with similar subreddits graph
☆48 Updated 2 years ago
Alternatives and similar repositories for sayit-data
Users who are interested in sayit-data are comparing it to the repositories listed below.
- Matrix-based News Aggregation to Explore Media Bias ☆20 Updated 7 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.… ☆25 Updated 5 years ago
- A Chrome extension that creates a personalized map of the web based on the user's browsing history. ☆24 Updated 12 years ago
- A Chrome extension that lets you select any text and run it through ChatGPT ☆102 Updated 2 years ago
- reddit discovery ☆127 Updated last year
- Python tool to monitor RSS feeds and download the linked content. ☆15 Updated 8 years ago
- A tool to easily scrape YouTube data using the Google API ☆12 Updated 5 months ago
- This repo includes a collection of JavaScript Userscripts to scrape and download data from multiple websites. This enables scraping from … ☆14 Updated 4 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac… ☆54 Updated 3 weeks ago
- Search the internet from your terminal. Speed read your results. Terminal nirvana. ☆21 Updated 4 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆46 Updated 7 years ago
- Have too many tabs opened on Chrome? This extension helps you organize your tabs on windows per projects. ☆116 Updated 2 years ago
- webapp for unglue.it - A Free Ebook Foundation program ☆18 Updated last month
- A dead simple web-clipper | ✂Capture ⇒ ⊞ Select ⇒ ✔Done ☆32 Updated 7 years ago
- A collection of various reddit bots. ☆22 Updated 9 years ago
- Crawl Wikipedia pages and upload TTS to YouTube. ☆10 Updated 5 months ago
- Provide what you expected from Instagram. ☆22 Updated 4 years ago
- Source for the Github Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution. ☆16 Updated last month
- A scrapy spider to extract post, thread, and user information from a vBulletin forum to a MongoDB database. ☆32 Updated 9 years ago
- Generate the awesome lists in JSON file. ☆29 Updated 9 years ago
- The extension for Amazon Mechanical Turk ☆71 Updated 2 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler. ☆50 Updated this week
- Nondestructive warc-in-tar to warc conversion ☆27 Updated 12 years ago
- ☆19 Updated 4 years ago
- Backend code for GitHub Recommendation Extension ☆28 Updated 3 years ago
- Reverse image search extension for Google Chrome. ☆79 Updated last year
- Official Python package for ArchiveBox, the self-hosted internet archiving solution. ☆13 Updated 11 months ago
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends ☆57 Updated last year
- A super simple and helpful way to add websites to monitor ☆28 Updated 5 months ago
- Real-time insights into the news you read ☆28 Updated 2 years ago