A data processing pipeline that schedules and runs content harvesters, normalizes their data, and outputs that normalized data to a variety of output streams. This is part of the SHARE project, and will be used to create a free and open dataset of research (meta)data. Data collected can be explored at https://osf.io/share/, and viewed at https:/…
☆42Jun 22, 2016Updated 9 years ago
Alternatives and similar repositories for scrapi
Users that are interested in scrapi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Apr 26, 2016Updated 10 years ago
- WaterButler is a Python web application for interacting with various file storage services via a single RESTful API, developed at Center …☆63Updated this week
- This project is no longer supported. A pre-configured collection of tools including Social Feed Manager and Lentil for easily building Tw…☆16Feb 9, 2018Updated 8 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15May 2, 2015Updated 11 years ago
- Basic linked data fragments endpoint.☆15Apr 20, 2017Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- mltk - Moz Language Tool Kit☆12Mar 6, 2015Updated 11 years ago
- MOVED to https://gitlab.com/crossref/rest_api☆17Apr 25, 2022Updated 4 years ago
- MIRO – Minimal Information for Reporting of an Ontology☆13Feb 6, 2019Updated 7 years ago
- Manage LXD/LXC Containers on a remote Linux Container Host☆13Feb 3, 2018Updated 8 years ago
- Combine /kämˌbīn/ - Metadata Aggregator Platform☆27Apr 23, 2026Updated last week
- Trends in scientific publishing delays☆11Aug 20, 2021Updated 4 years ago
- (Python) Execute tesseract OCR on a multi-page PDF.☆19Jun 30, 2023Updated 2 years ago
- [DEPRECATED] A Rails engine for orcid.org integration.☆14Jul 24, 2017Updated 8 years ago
- A basic editor for samvera objects.☆10Feb 4, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Discover, analyze and present data from the web and mobile in meaninful ways☆83Jul 16, 2013Updated 12 years ago
- ☆21Jan 23, 2016Updated 10 years ago
- Simple FM radio made on Arduino + RRD-102 V2 (RDA5807M)☆18Jan 8, 2015Updated 11 years ago
- A contextual news development environment.☆49Dec 19, 2014Updated 11 years ago
- HTML5 audio/video clipper☆13Mar 7, 2018Updated 8 years ago
- ☆16Jun 24, 2025Updated 10 months ago
- R package for data preprocessing☆13Dec 18, 2019Updated 6 years ago
- Little JSON object want to be graphs, too!☆17Oct 2, 2015Updated 10 years ago
- Linked Data tools for SMEs☆16Oct 3, 2016Updated 9 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆13Jul 2, 2017Updated 8 years ago
- This repository contains the project files for the HSU Library seating application☆11Feb 17, 2022Updated 4 years ago
- Contains the implementation of algorithms that estimate the geographic location of media content based on their content and metadata. It …☆15Oct 15, 2016Updated 9 years ago
- ☆10Apr 22, 2024Updated 2 years ago
- ☆11Nov 4, 2015Updated 10 years ago
- Collects multimedia content shared through social networks.☆19Feb 18, 2015Updated 11 years ago
- Place Pulse code repository☆16Mar 6, 2013Updated 13 years ago
- convert weibo(sina/tencent/netease) data source into an intermediate format supported by citespace☆10Sep 27, 2011Updated 14 years ago
- ☆14Feb 28, 2017Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Docker image for the Archives Unleashed Toolkit☆12Nov 17, 2022Updated 3 years ago
- Python ETL and Data Warehouse☆34Oct 5, 2015Updated 10 years ago
- A crawler, indexer, and query interface all in Python with distributed processing via Pyro4.☆23Mar 16, 2012Updated 14 years ago
- A Python wrapper around the NetSuite OpenAir XML API.☆25Mar 14, 2019Updated 7 years ago
- An OpenSource cheap bathroom scale that connects to Android using Bluetooth and saves/log the weight value, making visualization graph.☆29Mar 3, 2012Updated 14 years ago
- comparative lispology☆42Aug 8, 2013Updated 12 years ago
- An IG focused on improving Islandora as an IR platform☆13Jan 19, 2023Updated 3 years ago