mediacloud / feed_seekerLinks
Find rss, atom, xml, and rdf feeds on webpages
☆30Updated 8 months ago
Alternatives and similar repositories for feed_seeker
Users that are interested in feed_seeker are comparing it to the libraries listed below
Sorting:
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- A financial disclosure data extraction tool.☆16Updated last year
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 5 years ago
- ☆11Updated 6 years ago
- Deduplicate and parse list of `dirty names'☆23Updated 4 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated 6 months ago
- A maximum-strength name parser for record linkage.☆37Updated last week
- Presentations on Quantified Self and Self-Tracking with Python☆30Updated 2 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable☆12Updated last year
- A Python library for defining rule-based overrides on messy data☆16Updated 2 months ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated 2 weeks ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- Scrape various open data directories to create an index of what's available out there☆37Updated 4 months ago
- Examples for getting started using https://case.law☆66Updated 2 years ago
- Service for creating Twitter datasets for research and archiving.☆26Updated 2 years ago
- NLRB data scraper by LexPredict☆12Updated 2 years ago
- Thin argparse wrapper for quick, clear and easy declaration of hierarchical console command interfaces☆14Updated last year
- A browser extension providing Open Access bibliographical services☆17Updated 2 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated 3 months ago
- scraper for facebook, gab, google and tiktok☆21Updated this week
- A helper library full of URL-related heuristics.☆69Updated 2 weeks ago
- Interactive and searchable House staffer directory, based on House disbursement data.☆27Updated last year
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myse…☆21Updated last year
- Tools for tracking stories on news homepages☆48Updated 5 years ago
- Named-Entity Recognition extension for OpenRefine☆28Updated 2 years ago
- Personal news feed: search for results on Reddit/Pinboard/Twitter/Hackernews and read as RSS☆31Updated 3 weeks ago