Scifabric / pybossaLinks
PYBOSSA is the ultimate crowdsourcing framework (aka microtasking) to analyze or enrich data that can't be processed by machines alone.
☆759Updated last year
Alternatives and similar repositories for pybossa
Users that are interested in pybossa are comparing it to the libraries listed below
Sorting:
- Lightweight web scraping toolkit for documents and structured data.☆314Updated last year
- Media Cloud is an open source, open data platform that allows researchers to answer quantitative questions about the content of online me…☆285Updated last year
- A simple Python library/tool for pulling location information from unstructured text☆186Updated 14 years ago
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h…☆189Updated 4 years ago
- The OpenRefine Python Client Library provides an interface to communicating with an OpenRefine server.☆179Updated 6 years ago
- A cross-platform command line tool for parallelised content extraction and analysis.☆248Updated last month
- Demonstration of using Python to process the Common Crawl dataset with the mrjob framework☆167Updated 3 years ago
- Backend of Common Search. Analyses webpages and sends them to the index.☆122Updated 8 years ago
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆90Updated 3 years ago
- Train NLTK objects with zero code☆743Updated 5 years ago
- Open Data Catalog is an open data catalog based on Django, Python and PostgreSQL. It was originally developed for OpenDataPhilly.org, a …☆249Updated 9 years ago
- Open source large document set visualization platform☆270Updated 2 years ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Updated 3 years ago
- A toolkit for making domain-specific probabilistic parsers☆805Updated 11 months ago
- Python interface to the Stanford Named Entity Recognizer☆292Updated 3 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆390Updated 2 years ago
- Easily crowdsource the analysis of your documents☆102Updated 7 years ago
- NER toolkit for HTML data☆259Updated last year
- Docker-based CKAN environments, with bells and whistles☆71Updated 7 years ago
- The data journalism platform with built in training☆309Updated 8 months ago
- framework for scraping legislative/government data☆86Updated 11 months ago
- python library for extracting html microdata☆167Updated 2 years ago
- The Daemo crowdsourcing platform☆148Updated 3 years ago
- Approve or reject statements from third-party datasets☆146Updated 7 years ago
- a python library for parsing unstructured western names into name components.☆609Updated 3 months ago
- Parse, normalize and render postal addresses.☆185Updated last year
- Named-Entity Recognition extension for Google Refine / OpenRefine☆73Updated 8 years ago
- The Beneficial Ownership Data Standard (BODS) is an open standard providing a specification for modelling and publishing information on t…☆68Updated last month
- A lightweight server to allow HTTP requests to the Stanford Named Entity Recognized and a heavily modified CLAVIN geoparser.☆119Updated 3 years ago
- Mechanical Turk on your own machine.☆207Updated 9 months ago