inferlink / landmark-extractor
☆11Updated 5 years ago
Alternatives and similar repositories for landmark-extractor:
Users that are interested in landmark-extractor are comparing it to the libraries listed below
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆24Updated 7 years ago
- R tools to download, ingest, and analyze the Phoenix dataset from the Open Event Data Alliance☆12Updated 8 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- framework for making streamcorpus data☆11Updated 8 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- scraper for facebook, gab, google and tiktok☆21Updated 10 months ago
- R tools for GDELT and the Global Knowledge Graph☆14Updated 11 years ago
- R code needed to reproduce Relationship between Reddit Comment Score and Comment Length for 1.66 Billion Comments visualization☆19Updated 9 years ago
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.☆79Updated 2 years ago
- Uses NLP methods to parse and classify contracts from The City of New Orleans☆10Updated 10 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Interactive and searchable House staffer directory, based on House disbursement data.☆27Updated last year
- The core of sunlightlabs' Data Commons project. Includes the Transparency Data site and the APIs that power TransparencyData.com and Infl…☆38Updated 8 years ago
- Extract networks of entities from journalistic reporting☆48Updated last year
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 6 months ago
- Jupyter notebook + Code for reproducing Reddit Subreddit graphs☆18Updated 8 years ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 3 years ago
- A maximum-strength name parser for record linkage.☆37Updated this week
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated 2 months ago
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- Inspect a URL and estimate if it contains a news story☆39Updated 5 months ago
- A financial disclosure data extraction tool.☆16Updated last year
- searching large heterogenous data dumps with Universal Sentence Encoder☆62Updated 3 years ago
- Tools for analyzing the Hillary Clinton emails☆13Updated 9 years ago
- Scraping Assisted by Learning☆35Updated 3 weeks ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 6 months ago
- A platform for collecting, analyzing, and visualizing social media data.☆12Updated 4 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly☆47Updated 4 years ago
- R Code + R Notebook on how to process and visualize the official IMDb datasets.☆12Updated 6 years ago
- Machine assisted dossiers☆19Updated 7 years ago