mediacloud / feed_seekerLinks
Find rss, atom, xml, and rdf feeds on webpages
☆30Updated 9 months ago
Alternatives and similar repositories for feed_seeker
Users that are interested in feed_seeker are comparing it to the libraries listed below
Sorting:
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Examples for getting started using https://case.law☆66Updated 2 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated 8 months ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- Deduplicate and parse list of `dirty names'☆23Updated 4 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated 5 months ago
- A Python Client for collect and parse public data from the Youtube Data API☆81Updated 2 years ago
- scraper for facebook, gab, google and tiktok☆21Updated last month
- A maximum-strength name parser for record linkage.☆37Updated last month
- The CorpWatch API uses automated parsers to extract the subsidiary relationship information from Exhibit 21 of companies' 10-K filings wi…☆48Updated 6 months ago
- Some tools to help analyze the twitter archive☆62Updated last month
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 5 years ago
- The documentation and scripts for the Local News Dataset☆25Updated 3 years ago
- Source files for "An Introduction to VisiData"☆74Updated 5 months ago
- Presentations on Quantified Self and Self-Tracking with Python☆30Updated 2 years ago
- A collection of projects I did while at General Assembly Singapore - as part of Data Science Immersive☆11Updated 4 years ago
- Basic cookiecutter template for Python projects☆21Updated 10 months ago
- Automatically exported from code.google.com/p/guess-language☆52Updated last year
- API - extract a list of keywords from a text.☆18Updated 8 years ago
- A browser user interface for manual labeling of record pairs.☆47Updated 2 years ago
- Mecodify tool for twitter data analysis and visualisation☆42Updated 2 years ago
- Personal news feed: search for results on Reddit/Pinboard/Twitter/Hackernews and read as RSS☆33Updated 3 weeks ago
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl…☆38Updated 8 years ago
- Parse government documents into well formed JSON☆70Updated 2 weeks ago
- A collection of regular expressions for matching citations to state, federal, and even international law☆37Updated 4 years ago
- Classifying the content of domains☆56Updated 2 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated last month
- Extract networks of entities from journalistic reporting☆48Updated 2 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- A Python library for defining rule-based overrides on messy data☆15Updated this week