API definition, resources and reference implementation of URL Frontiers
☆60Jan 23, 2026Updated 3 months ago
Alternatives and similar repositories for url-frontier
Users that are interested in url-frontier are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A scalable, mature and versatile web crawler based on Apache Storm☆976Apr 21, 2026Updated last week
- A set of reusable Java components that implement functionality common to any web crawler☆256Updated this week
- JavaFlow reimagines the core ideas of FoundationDB's Flow actor framework in idiomatic Java, leveraging JDK continuations instead of any …☆24Feb 16, 2026Updated 2 months ago
- This project has been archived and is no longer being developed or supported. The Curator's Workbench is an extensible digital collectio…☆24Jun 25, 2020Updated 5 years ago
- Charter proposal for an “RDF Dataset Canonicalization and Hash Working Group”☆11Dec 16, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An evil web server.☆13May 9, 2015Updated 10 years ago
- Easily crawl news portals or blog sites using Storm Crawler.☆21Nov 15, 2022Updated 3 years ago
- Common web archive utility code.☆63Apr 1, 2026Updated last month
- Add editing UI and other power-user features to Datasette.☆14Mar 4, 2023Updated 3 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆47Sep 24, 2021Updated 4 years ago
- Download GitHub repositories☆12May 10, 2025Updated 11 months ago
- gRPC to EPP proxy☆18Apr 15, 2026Updated 2 weeks ago
- Original GOKb repo - Moving to https://github.com/openlibraryenvironment/gokb☆11Jan 23, 2018Updated 8 years ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- demos using the OpenRNDR framework☆13Mar 27, 2020Updated 6 years ago
- visualizations/charts for media collections, based on mediainfo☆14Sep 15, 2022Updated 3 years ago
- Storm / Solr Integration☆19Feb 2, 2024Updated 2 years ago
- HTML parser and tag balancer.☆19Updated this week
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- Mavuno: A Hadoop-Based Text Mining Toolkit☆47Feb 7, 2012Updated 14 years ago
- Parquet IO for Tablesaw☆12Mar 2, 2026Updated last month
- Seeder - Czech webarchive curating tool and public site☆17Feb 12, 2026Updated 2 months ago
- JNumberTools is an open-source Java library for solving complex problems in combinatorics and number theory. Whether you're a researcher,…☆15Mar 23, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repo for Live Coding at Droidcon Berlin☆10Sep 5, 2017Updated 8 years ago
- Hadoop integration code for working with with Apache cTAKES☆10Feb 11, 2014Updated 12 years ago
- My attempt to learn more than one Deep Learning framework☆15Apr 7, 2019Updated 7 years ago
- Contextually-aware notebooks with built-in AI assistant☆20Updated this week
- A bidirectional LSTM example for sequence labeling.☆13May 23, 2018Updated 7 years ago
- The BES framework, which forms the basis for the Hyrax server☆16Apr 25, 2026Updated last week
- Snowball Stemmer for Clojure☆18Jun 7, 2022Updated 3 years ago
- Go OCFL Implementation☆18Apr 23, 2026Updated last week
- Apache Nutch fork tunned for web services and data discovery.☆10May 18, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The CMR Metadata Review tool is used to curate NASA EOSDIS collection and granule level metadata in CMR for correctness, completeness and…☆25Sep 4, 2025Updated 7 months ago
- Repository for revision of PREMIS OWL ontology group☆13May 12, 2022Updated 3 years ago
- Create CovJSON files from common scientific data formats☆14Apr 24, 2018Updated 8 years ago
- Specification for a query language to request Verifiable Presentations from wallets etc.☆10Apr 23, 2026Updated last week
- Takes query parameters from a url to create the first cell of a jupyter notebook.☆17Nov 13, 2024Updated last year
- Code for the paper Faster Phrase-Based Decoding by Refining Feature State☆14Jan 9, 2023Updated 3 years ago
- Citadel: Enterprise Search☆15May 2, 2023Updated 3 years ago