Sydius / mbox-to-txt
A simple Python script that takes an mbox file and converts it into a text file.
☆38Updated 6 years ago
Related projects: ⓘ
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆42Updated 5 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated last year
- Tag news stories based on models trained on the NYT corpus.☆39Updated last year
- Scrapers for disaster data - writes to https://github.com/simonw/disaster-data☆49Updated 7 months ago
- Some tools to help analyze the twitter archive☆61Updated last month
- micro-library to produce a couple of basic, attractive, printable plots with matplotlib☆11Updated 6 years ago
- Presentations on Quantified Self and Self-Tracking with Python☆29Updated last year
- Import your genome into a SQLite database☆21Updated 5 years ago
- Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.☆24Updated 2 years ago
- Interactive computing in Markdown☆43Updated last year
- 🐾 PdpCLI is a pandas DataFrame processing CLI tool which enables you to build a pandas pipeline from a configuration file.☆15Updated 11 months ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable☆12Updated last year
- ☆22Updated this week
- A simple Slack message text formatting to HTML code converter.☆27Updated 5 years ago
- Scrape various open data directories to create an index of what's available out there☆29Updated this week
- A git scraper recording the CDC's Covid Data Tracker numbers on number of vaccinations per state.☆24Updated 10 months ago
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 4 years ago
- Python package for converting xml and epubs to text files☆34Updated 4 years ago
- ☆19Updated 5 years ago
- CLI tool for fetching data using HTTP conditional get☆14Updated 3 years ago
- Quantified Self: A Personal Data Aggregator and Dashboard for Self-Trackers and Quantified Self Enthusiasts☆17Updated last year
- Save an RSS or ATOM feed to a SQLite database☆46Updated last year
- Save data from Google Takeout to a SQLite database☆104Updated last year
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.…☆23Updated 4 years ago
- Flenser is a simple, minimal, automated exploratory data analysis tool.☆78Updated 3 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆15Updated last week
- Run Datasette on AWS serverless.☆17Updated 4 years ago
- A Python package that simplifies the use of secrets in a Jupyter notebook☆21Updated 2 years ago
- A maximum-strength name parser for record linkage.☆29Updated last month