mediacloud / feed_seekerLinks
Find rss, atom, xml, and rdf feeds on webpages
☆30Updated 3 weeks ago
Alternatives and similar repositories for feed_seeker
Users that are interested in feed_seeker are comparing it to the libraries listed below
Sorting:
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 6 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated last month
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- Examples for getting started using https://case.law☆69Updated 3 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated this week
- Public client for consuming content from the Media Cloud Online News Archive & Directory.☆78Updated last month
- Deduplicate and parse list of `dirty names'☆23Updated 5 years ago
- Source files for "An Introduction to VisiData"☆76Updated 9 months ago
- scraper for facebook, gab, google and tiktok☆21Updated 5 months ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆14Updated 9 months ago
- ☆11Updated 6 years ago
- Classifying the content of domains☆57Updated 2 months ago
- Command-line utility to help researchers collect video metadata from Youtube API☆29Updated last year
- Data, analytic code, and findings supporting BuzzFeed News's analysis of fentanyl and cocaine overdose deaths.☆13Updated 3 years ago
- Package for performing Reddit-based text analysis☆21Updated 6 years ago
- A maximum-strength name parser for record linkage.☆39Updated 3 months ago
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl…☆38Updated 9 years ago
- Mecodify tool for twitter data analysis and visualisation☆43Updated 2 years ago
- A collection of regular expressions for matching citations to state, federal, and even international law☆40Updated 4 years ago
- Visual analytics application for qualitative text analysis☆24Updated 2 years ago
- Now included in rigour☆152Updated last week
- An open interface to GDELT APIs☆61Updated 2 years ago
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆66Updated last month
- Automatically exported from code.google.com/p/guess-language☆54Updated last month
- A browser user interface for manual labeling of record pairs.☆48Updated 2 years ago
- Various functions to make bag-of-words approaches to text analysis more user-friendly☆24Updated 8 years ago
- Presentations on Quantified Self and Self-Tracking with Python☆33Updated 2 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- Interactive and searchable House staffer directory, based on House disbursement data.☆30Updated last year
- Extract networks of entities from journalistic reporting☆48Updated 2 years ago