js-enabled-crawling / specificationLinks
A community-formulated contract for crawling JavaScript-heavy websites
☆11Updated 10 years ago
Alternatives and similar repositories for specification
Users that are interested in specification are comparing it to the libraries listed below
Sorting:
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆25Updated 9 years ago
- A monitor and alerting system for your data☆11Updated 8 years ago
- The goal of this experiment is to take articles and certain metadata and group them by topic.☆11Updated 9 years ago
- Parse.ly's open source implementation of time engaged tracking☆21Updated 9 years ago
- A PHP class that examines websites to learn about the software used.☆22Updated 5 years ago
- Focused Crawler for VT's CTRNet☆10Updated 12 years ago
- A sample app that combines geolocated entities from Freebase with Maps API☆42Updated 11 years ago
- Blog crawler for the blogforever project.☆23Updated 11 years ago
- ☆33Updated 7 years ago
- Bicycle Incident reporting☆13Updated 3 years ago
- Newsclipse: The IDE for news production.☆91Updated 10 years ago
- A trip planner and mapper for hiking/biking/riding trail systems.☆30Updated 9 years ago
- Feed discovery to share :)☆41Updated 8 years ago
- Generates visualizations of influential tweets about a given hashtag.☆11Updated 8 years ago
- ☆21Updated 9 years ago
- ☆12Updated 10 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆25Updated 13 years ago
- Simple Framework for LDP architectures☆11Updated 6 years ago
- Redefining your relationship to the web☆13Updated 9 years ago
- NU Infolab news context project☆11Updated 7 years ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors☆35Updated 10 years ago
- This is the open source code of the City72 platform. Fork this code, then deploy your own City72 site.☆29Updated 9 years ago
- JavaScript library for getting geojson from the Wikipedia API☆22Updated 10 years ago
- Digitization information system build on top of Fedora repository☆16Updated 6 years ago
- An idiomatic DSL for SPARQL queries☆42Updated 14 years ago
- ☆14Updated 8 years ago
- XPath extension for extraction from interactive web sites. NOTE: This code is currently out of sync. A more recent, but precompiled versi…☆27Updated 12 years ago
- Client side extractive text summarization using JS, based on TextRank. Since there's no server trip involved, one can can safely use it f…☆16Updated 11 years ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago
- Human-Powered Data Analysis with Mechanical Turk☆300Updated 12 years ago