openeventdata/scraper

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/openeventdata/scraper)

openeventdata / scraper

Scrapes sites. Gets news. Eventually events.

☆86

Alternatives and similar repositories for scraper

Users that are interested in scraper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

openeventdata / phoenix_pipeline
View on GitHub
Turning news into events since 2014.
☆52May 1, 2017Updated 9 years ago
openeventdata / petrarch
View on GitHub
The Python-language successor to the TABARI event-data coding software.
☆45Jul 21, 2017Updated 9 years ago
openeventdata / Dictionaries
View on GitHub
PETRARCH actor, agent and verb dictionaries
☆22Aug 3, 2018Updated 7 years ago
00krishna-tools / gdelt_download
View on GitHub
Set of scripts to aid in the download of the GDELT data files from www.gdeltproject.org
☆12May 17, 2014Updated 12 years ago
openeventdata / petrarch2
View on GitHub
Another next-generation event coding platform.
☆77Mar 18, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
GovLab / LegisLetters
View on GitHub
Coding space for the LegisLetters project.
☆11Jun 10, 2015Updated 11 years ago
newsreader / eso-and-ceo
View on GitHub
Events and Situations Ontology
☆14Apr 20, 2018Updated 8 years ago
E3-JSI / newsfeed
View on GitHub
A pipeline for crawling of RSS feeds and the associated content. Demo at newsfeed.ijs.si.
☆20Nov 12, 2012Updated 13 years ago
johnb30 / gdelt_download
View on GitHub
Set of scripts to aid in the download of the GDELT data files from gdelt.utdallas.edu
☆18May 14, 2014Updated 12 years ago
kevinschaul / llm-fragments-us-legislation
View on GitHub
Load bills from Congress.gov as LLM fragments
☆16Jun 2, 2025Updated last year
StorjOld / bitcointalkbot
View on GitHub
For monitoring keywords on BitcoinTalk, and posting them to Slack.
☆19Jun 2, 2015Updated 11 years ago
nytlabs / pageinfo
View on GitHub
Python module for extracting information from web pages
☆41Jun 12, 2014Updated 12 years ago
shasafoster / bitcointalk-ANN
View on GitHub
Scrapes 1000+ thread into a single html document for better reading and analysis
☆10Oct 7, 2018Updated 7 years ago
sim31 / polleos
View on GitHub
Poll system smart contract on EOS
☆13Jan 8, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
anthonydb / data-wrangling-python-nicar-2017
View on GitHub
Materials for the NICAR 2017 Data Wrangling with Python hands-on class
☆14Mar 4, 2017Updated 9 years ago
wirescrossed / panorama-rules-to-excel
View on GitHub
Create an Excel Spreadsheet from your firewall rules in Palo Alto Networks Panorama
☆13Aug 11, 2016Updated 9 years ago
nchambers / schemas
View on GitHub
Analyzes news stories for event schemas and templates.
☆17Mar 31, 2016Updated 10 years ago
bitslabsyr / stack
View on GitHub
The BITS Lab STACK tool for social media collection and analysis.
☆39Dec 26, 2022Updated 3 years ago
anidata / palantiri
View on GitHub
Web crawler to collect data on ht
☆18Nov 27, 2017Updated 8 years ago
KBNLresearch / KB-python-API
View on GitHub
Python API for KB data-services
☆20Jan 30, 2020Updated 6 years ago
semanticize / st
View on GitHub
Semanticizest: dump parser and client
☆20May 11, 2016Updated 10 years ago
brussell123 / 3dwikipedia
View on GitHub
☆21Feb 7, 2016Updated 10 years ago
paulhoule / telepath
View on GitHub
System for mining Wikipedia Usage data to read our collective mind
☆20Sep 28, 2014Updated 11 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
rossf7 / elasticrawl
View on GitHub
Launch AWS Elastic MapReduce jobs that process Common Crawl data.
☆49Feb 15, 2017Updated 9 years ago
johnb30 / atlas
View on GitHub
Scrapes the web. Gets the news.
☆13Sep 6, 2016Updated 9 years ago
mitll / topic-clustering
View on GitHub
☆44Jan 15, 2016Updated 10 years ago
rodricios / crawl-to-the-future
View on GitHub
An attempt at creating a gold standard dataset for backtesting yesterday & today's content-extractors
☆35Mar 19, 2015Updated 11 years ago
bhavishya235 / Web-Classification
View on GitHub
This project deals with hierarchical classification of web pages based on dmoz dataset.
☆14Apr 10, 2014Updated 12 years ago
adelevie / downlaw
View on GitHub
Write markdown with legal citations on the left, get rendered markdown on the right. Oh, and the legal citations become links.
☆20May 4, 2014Updated 12 years ago
socialsensor / multimedia-geotagging
View on GitHub
Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It …
☆15Oct 15, 2016Updated 9 years ago
socialsensor / storm-focused-crawler
View on GitHub
Collects multimedia content shared through social networks.
☆19Feb 18, 2015Updated 11 years ago
talnsoftware / deepsyntacticparsing
View on GitHub
Automatically exported from code.google.com/p/deepsyntacticparsing
☆23Mar 19, 2015Updated 11 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
microsoft / Computational-Use-of-Data-Agreement
View on GitHub
Computational Use of Data Agreement - Removing Barriers to Data Innovation
☆21Jun 12, 2023Updated 3 years ago
Batres35 / binance_coins
View on GitHub
A binance correlated coins finder
☆10Jun 13, 2021Updated 5 years ago
zygmuntz / stardose
View on GitHub
A recommender system for GitHub repositories
☆14Jun 21, 2014Updated 12 years ago
michaeljyeates / eosshop
View on GitHub
Prototype ecommerce contract for EOS
☆22Dec 27, 2017Updated 8 years ago
mille856 / CMU_memex
View on GitHub
☆20Nov 1, 2017Updated 8 years ago
juliemkauffman / DyCoNet
View on GitHub
A Gephi plugin for community detection in dynamic networks
☆12Jan 14, 2014Updated 12 years ago
sujitpal / reuters-docsim
View on GitHub
Different approaches to computing document similarity
☆28Jan 14, 2017Updated 9 years ago