fhamborg / NewsBirdServer
Matrix-based News Aggregation to Explore Media Bias
☆20Updated 6 years ago
Alternatives and similar repositories for NewsBirdServer
Users that are interested in NewsBirdServer are comparing it to the libraries listed below
Sorting:
- how hard is it to get a list of all local news sites in the United States (LOL)☆8Updated 5 years ago
- A Google Trends Analytics Package☆13Updated 11 months ago
- Meta-repository for the open-source version of the SUMMA Platform☆15Updated last year
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better unders…☆45Updated 3 years ago
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles☆23Updated 6 years ago
- A financial disclosure data extraction tool.☆16Updated last year
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 7 months ago
- Phantombuster's SDK☆14Updated 6 months ago
- A repository of datasets for learning and mastering Gephi☆10Updated 5 months ago
- Source real estate prices from the Common Crawl.☆27Updated 6 years ago
- FeedCrunch.IO - Take RSS Feeds to the next level with personnalized recommendations☆15Updated 2 years ago
- Real-time insights into the news you read☆27Updated 2 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- A text similarity computation using minhashing and Jaccard distance on reuters dataset☆17Updated 6 years ago
- ☆14Updated 3 years ago
- Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop.☆12Updated 4 years ago
- This repository contains code for fine-tuning GPT-2 on 76k quotes, and then make a Twitter bot out of it. Demo: @PeeingThoughts☆12Updated last year
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- LLM Oracle is a GPT-4 powered tool for predicting future events. It's like a Magic 8 Ball that is able to perform basic research, calcula…☆17Updated last year
- Open Access PDF harvester☆40Updated last year
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆24Updated 7 years ago
- ☆15Updated 2 years ago
- wrapper for the crossref events api☆21Updated last year
- CommonCrawl keyword scanner. Time for month of CC data on EC2 c5.18xlarge instance for hundreds of keywords takes about 3 hours. LLM (BER…☆15Updated 2 years ago
- A browser extension providing Open Access bibliographical services☆17Updated 2 years ago
- Amber Heard Social Network Analysis of Disinformation/Influence Operations, Bots, & Crime Across-Platforms. - Twitter, Reddit, YouTube, I…☆57Updated 2 years ago
- This Python package can be used to systematically extract multiple data elements (e.g., title, keywords, text) from news sources around t…☆33Updated 2 years ago
- An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark.☆9Updated 5 months ago