Paul-E / Pushshift-Importer
☆14 · Updated 5 months ago
Alternatives and similar repositories for Pushshift-Importer:
Users who are interested in Pushshift-Importer are comparing it to the libraries listed below.
- Ethical, legal, and effortless extraction of Reddit data in your database ☆64 · Updated 4 months ago
- Example scripts for the pushshift dump files ☆325 · Updated last week
- Read compressed NDJSON .zst files easily ☆32 · Updated 2 years ago
- An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, fo… ☆14 · Updated 3 years ago
- Twitter data around the Ukraine Invasion in February 2022 ☆15 · Updated last year
- A Selenium-driven tool for automated website interaction and scraping. ☆18 · Updated 3 years ago
- An Obsidian plugin to submit file links to an ArchiveBox instance. ☆36 · Updated 2 years ago
- A Python tool that imports annotations made in Hypothesis (https://hypothes.is) to Zotero (https://www.zotero.org). ☆59 · Updated 6 years ago
- Sync all your Diigo bookmarks to a directory as Markdown files. Intended for use with Obsidian ☆22 · Updated 4 years ago
- Tag news stories based on models trained on the NYT corpus. ☆42 · Updated last year
- A multithreaded Pushshift.io API wrapper for reddit.com comment and submission searches. ☆215 · Updated last year
- Scraper for Facebook, Gab, Google, and TikTok ☆22 · Updated 7 months ago
- A set of Jupyter notebooks demonstrating how to use the Media Cloud API. ☆36 · Updated last year
- Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web in… ☆331 · Updated last week
- day one jsons to markdown converter | I'm not supporting it, check forks for better versions ☆29 · Updated 3 years ago
- Command-line utility to help researchers collect video metadata from the YouTube API ☆29 · Updated 5 months ago
- Utilities which support the processing of XML-based USPTO trademark bulk download files ☆29 · Updated 5 years ago
- Tools for Twitter ☆35 · Updated 3 years ago
- A chat bot that pulls from your Readwise highlights, using ChatGPT API. ☆24 · Updated 8 months ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆104 · Updated 6 years ago
- A list of over 5000 US news domains and their social media accounts ☆43 · Updated 2 years ago
- Export/access your Hypothes.is data: annotations and profile info ☆41 · Updated 3 weeks ago
- WordWanderer – take your text for a walk ☆12 · Updated 5 years ago
- Visualise networks of companies, officers and addresses connected through UK Companies House ☆60 · Updated 3 months ago
- Pushshift Telegram Ingest ☆85 · Updated 5 years ago
- UNOFFICIAL Python API to interface with Parler.com ☆53 · Updated 6 months ago
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore] ☆17 · Updated last year
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence ☆62 · Updated last year
- This repo contains a collection of templates for use with the gpt-3 text generation model. ☆19 · Updated last year