sbarman / webscriptLinks
A record and replay system for the browser (renamed Ringer)
☆30Updated 8 years ago
Alternatives and similar repositories for webscript
Users that are interested in webscript are comparing it to the libraries listed below
Sorting:
- Advanced similarity and duplicate source code at scale.☆56Updated 6 years ago
- Search for similar short strings☆53Updated 5 years ago
- A Chrome extension for writing custom web scraping programs and web automation programs. Just demonstrate how to collect the first row o…☆260Updated last year
- Implementation of Microsoft Vips algorithm in Python☆19Updated 6 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts.☆52Updated 3 years ago
- Tools to construct and process Common Crawl webgraphs☆103Updated 2 weeks ago
- A Python library for learning from dimensionality reduction, supporting sparse and dense matrices.☆78Updated 8 years ago
- Deployment of pywb as a CommonCrawl Index Server☆21Updated 8 years ago
- Interactive SQL analytics in your browser!☆22Updated 7 years ago
- Run information flow experiments on the Web☆39Updated 4 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons☆59Updated 8 months ago
- Mad (╯°□°)╯'ing☆10Updated 3 years ago
- OOPSLA 2019 Artifact for AutoPandas. Website at https://rbavishi.github.io/autopandas☆31Updated 3 years ago
- DBpedia Distributed Extraction Framework: Extract structured data from Wikipedia in a parallel, distributed manner☆41Updated 3 years ago
- Fixes Java syntax errors with LSTM neural networks! [proof-of-concept]☆18Updated 4 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆77Updated 3 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆47Updated 8 years ago
- A Python implementation of a political forecasting model by Scholz, Calbert & Smith.☆11Updated 9 years ago
- Supervised learning for novelty detection in text☆78Updated 9 years ago
- Ranked Programming Extension for Racket☆54Updated 5 years ago
- An efficient approximation for tree edit-distance.☆45Updated 14 years ago
- Dataset for programming language identification.☆24Updated 2 years ago
- sourced.ml is a library and command line tools to build and apply machine learning models on top of Universal Abstract Syntax Trees☆142Updated 6 years ago
- tool for collectively summarizing large discussions☆145Updated 3 years ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Updated 6 years ago
- A extensible conversational agent for data science tasks☆123Updated 8 years ago
- MetroMaps Release☆16Updated 11 years ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine☆195Updated last week
- A machine learning software for extracting information from scholarly documents☆23Updated 4 years ago
- Natural Language Generation for Gramex applications.☆25Updated 3 years ago