Python library with common functionality for writing web scrapers
☆102Jul 6, 2015Updated 10 years ago
Alternatives and similar repositories for scrapekit
Users who are interested in scrapekit are comparing it to the libraries listed below
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15May 2, 2015Updated 10 years ago
- A pastebin for tables.☆34Sep 30, 2013Updated 12 years ago
- ⛏ a library for scraping unreliable pages☆212Feb 20, 2026Updated 2 weeks ago
- Extends Django Rest Framework to add a Session Authentication Viewpoint.☆21Apr 2, 2015Updated 10 years ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Jul 6, 2022Updated 3 years ago
- A collection of various discourse segmenters☆10Jun 30, 2017Updated 8 years ago
- Document management system. Based on bill tracking needs. Simple model for stages, priorities, authors, content (abstract, tags), releate…☆19Sep 16, 2014Updated 11 years ago
- JavaScript library for getting geojson from the Wikipedia API☆22Sep 25, 2015Updated 10 years ago
- RenRen Python Library☆28Aug 30, 2015Updated 10 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆26Aug 27, 2012Updated 13 years ago
- Send data about Celery events to statsd☆29May 20, 2023Updated 2 years ago
- Memcache administration tools for Django.☆24Apr 13, 2016Updated 9 years ago
- Examples of concurrent programming in Python☆12Jan 27, 2015Updated 11 years ago
- Browser-based annotation tool for Framenet☆16Jan 27, 2015Updated 11 years ago
- R course page☆11Feb 6, 2021Updated 5 years ago
- Various Python scripts to scrape sites that store data about you.☆28Jan 6, 2014Updated 12 years ago
- ☆14Aug 24, 2021Updated 4 years ago
- A text analysis interface for the humanities☆27Jun 23, 2011Updated 14 years ago
- A set of utilities to track and mine Twitter streaming API data☆46Sep 30, 2013Updated 12 years ago
- An implementation of gibbs sampling for Latent Dirichlet Allocation☆30Aug 3, 2011Updated 14 years ago
- Code for the book http://forging-python.com☆16Jan 17, 2019Updated 7 years ago
- Insert matching punctuation for mismatched quotation marks, parentheses, etc. Good postprocessing for N-gram text synthesis.☆15Mar 29, 2016Updated 9 years ago
- Python buildpack fork for GeoDjango☆65Feb 24, 2014Updated 12 years ago
- An interactive map of global infrastructure.☆14Feb 29, 2024Updated 2 years ago
- SHERG rule extraction and parsing tools☆24Oct 9, 2015Updated 10 years ago
- PinPress is a creative Flat Blog/Magazine blogger template and it is the perfect choice for professionals who are looking for a magazine te…☆10Nov 28, 2021Updated 4 years ago
- PyPI package for traversing extremely large FTP directory trees☆19Dec 18, 2017Updated 8 years ago
- Turn your Django project into RESTful APIs in a minute.☆17Dec 8, 2015Updated 10 years ago
- open source, distributed, restful crawler engine☆14Feb 3, 2015Updated 11 years ago
- ☆36Aug 13, 2017Updated 8 years ago
- Django + Pagination made easy.☆36Nov 7, 2016Updated 9 years ago
- Extract, parse and populate templates from strings☆27Apr 4, 2019Updated 6 years ago
- ☆21Sep 9, 2012Updated 13 years ago
- For interacting with nutch via Python☆29Feb 18, 2026Updated 2 weeks ago
- Python HTTP clients for APIs represented by JSON Schema.☆97Jul 13, 2014Updated 11 years ago
- A cookiecutter template for creating Django 1.7+ / Python 3 projects quickly, though optimized for Heroku in the meantime.☆21Dec 2, 2017Updated 8 years ago
- Use Python with the Twitter API and Alchemy API to create personas quickly.☆19Dec 1, 2015Updated 10 years ago
- A dashboard with various internet-y widgets☆18Sep 19, 2017Updated 8 years ago
- Helper methods for generating text that conforms to "The New York Times Manual of Style and Usage"☆27May 13, 2014Updated 11 years ago