mediacloud / feed_seekerLinks
Find rss, atom, xml, and rdf feeds on webpages
☆30Updated 9 months ago
Alternatives and similar repositories for feed_seeker
Users that are interested in feed_seeker are comparing it to the libraries listed below
Sorting:
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated 7 months ago
- Examples for getting started using https://case.law☆66Updated 2 years ago
- ☆11Updated 6 years ago
- A financial disclosure data extraction tool.☆16Updated last year
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated last week
- A maximum-strength name parser for record linkage.☆37Updated 3 weeks ago
- Deduplicate and parse list of `dirty names'☆23Updated 4 years ago
- A repository demonstrating the use of real-estate-scrape to store the estimated value of a property on Redfin and Zillow every night usin…☆34Updated this week
- Presentations on Quantified Self and Self-Tracking with Python☆30Updated 2 years ago
- NLRB data scraper by LexPredict☆12Updated 2 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated 4 months ago
- A Docker image for the CLIFF geolocation software.☆15Updated 4 years ago
- This project is wraper for Leilex, legal entity identifier API. Includes ISIN-LEI conversion. Search LEI number using company name.☆24Updated 9 months ago
- Scrape various open data directories to create an index of what's available out there☆37Updated 4 months ago
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆56Updated last year
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 5 years ago
- A collection of projects I did while at General Assembly Singapore - as part of Data Science Immersive☆10Updated 4 years ago
- A helper library full of URL-related heuristics.☆69Updated last month
- A place for me to share VisiData plugins I've written.☆37Updated 3 years ago
- The shared repository for Media Cloud web apps (Explorer, Source Manager, Topic Mapper)☆65Updated last year
- Sidewall is a Python library for interacting with the Dimensions search API.☆17Updated 10 months ago
- Some tools to help analyze the twitter archive☆62Updated last month
- Visual analytics application for qualitative text analysis☆24Updated 2 years ago
- Python based Wikidata framework for easy dataframe extraction☆45Updated last year
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- Simple tools for summarizing .mbox email archives.☆11Updated 5 years ago
- The Web Scraping Sandbox☆15Updated 6 months ago
- Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence☆64Updated last year