cldellow / real-estate-prices-cc
Source real estate prices from the Common Crawl.
☆27Updated 6 years ago
Alternatives and similar repositories for real-estate-prices-cc
Users who are interested in real-estate-prices-cc are comparing it to the libraries listed below.
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆56Updated last year
- Scraping Assisted by Learning☆35Updated 2 weeks ago
- Classify a job description (or noisy job title) into a ONET job title☆19Updated 8 years ago
- ☆11Updated 6 years ago
- R code needed to reproduce Relationship between Reddit Comment Score and Comment Length for 1.66 Billion Comments visualization☆18Updated 9 years ago
- A library to extract a publication date from a web page, along with a measure of the accuracy.☆41Updated 5 years ago
- Keyword Extraction system using Brown Clustering - (This version is trained to extract keywords from job listings)☆18Updated 10 years ago
- NSS Capstone project to use natural language modeling, classification, and information extraction to get the exact employee count values …☆15Updated 6 years ago
- ☆13Updated 2 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea☆13Updated 8 years ago
- Language-agnostic political event coding using universal dependencies☆18Updated 5 years ago
- Predict age and gender from a first name☆60Updated 6 years ago
- Text analysis for automatic bookmarking/keyword extraction☆18Updated 8 years ago
- Turning news into events since 2014.☆51Updated 8 years ago
- Scrapes Google Trends data over long timescales and stitches together for daily data☆72Updated 5 years ago
- Code and visualizations for related/similar subreddits☆19Updated 8 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated 3 months ago
- API - extract a list of keywords from a text.☆18Updated 7 years ago
- Identifying bias in the media with sentiment analysis: a case study.☆16Updated 8 years ago
- ☆34Updated last year
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated 4 years ago
- Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles☆23Updated 6 years ago
- Scrapers from a project in 2018. Yelp, Spyfu, Similarweb, Morningstar, Linkedin, Instagram, Inside, Glassdoor, Facebook, Eat24, Doordash,…☆97Updated 6 years ago
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Set of scripts to aid in the download of the GDELT data files from www.gdeltproject.org☆12Updated 11 years ago
- A search engine for Open Data☆53Updated 2 years ago
- A toolkit for clustering web pages based on various similarity measures.☆33Updated 3 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆40Updated 5 years ago
- An automated, programming-free web scraper for interactive sites☆111Updated last year
- Pipeline for distributed Natural Language Processing, made in Python☆65Updated 8 years ago