Easily crawl news portals or blog sites using Storm Crawler.
☆21Nov 15, 2022Updated 3 years ago
Alternatives and similar repositories for crawling-framework
Users that are interested in crawling-framework are comparing it to the libraries listed below.
- Integration between Reaction ECommerce and Accelerated Text to provide product descriptions for an e-shop.☆13Feb 22, 2021Updated 5 years ago
- Beagle helps you identify keywords, phrases, regexes, and complex search queries of interest in streams of text documents.☆55Jun 30, 2021Updated 4 years ago
- Leiningen template for AWS Lambda custom runtime with GraalVM native image compiled Clojure projects.☆45Oct 5, 2020Updated 5 years ago
- Accelerated Text is a no-code natural language generation platform. It will help you construct document plans which define how your data …☆808Mar 10, 2023Updated 3 years ago
- Clojure wrapper for `jackson-jq`. Embed `jq` scripts into your app. Compatible with GraalVM native-image.☆21Sep 29, 2023Updated 2 years ago
- Dataset of Lithuania legal entities☆13Nov 21, 2023Updated 2 years ago
- Convert a number to an approximated text expression: from '0.23' to 'less than a quarter'.☆201Jan 20, 2021Updated 5 years ago
- Python wrapper for Accelerated Text☆12Oct 5, 2021Updated 4 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆69Apr 14, 2026Updated 3 weeks ago
- A curated list of resources dedicated to Natural Language Generation (NLG)☆480Sep 3, 2023Updated 2 years ago
- Repeat statsd packets to riemann☆17Jan 1, 2015Updated 11 years ago
- Storm / Solr Integration☆19Feb 2, 2024Updated 2 years ago
- Opinionated command line argument handling, with excellent support for subcommands☆50Apr 10, 2026Updated 3 weeks ago
- A Java library that can do URL normalization, unshorten URL, and URL extraction.☆19Oct 19, 2017Updated 8 years ago
- Clojure LLM - Dataset curation for fine tuning an LLM for Clojure.☆17Jun 12, 2023Updated 2 years ago
- A simple accounting application for Django☆19Sep 13, 2013Updated 12 years ago
- A http proxy demo written with Rust/Tokio☆16Sep 17, 2020Updated 5 years ago
- Run Mattermost on Heroku☆35Sep 19, 2022Updated 3 years ago
- Zulia Search Engine☆36Apr 23, 2026Updated last week
- Snowball Stemmer for Clojure☆18Jun 7, 2022Updated 3 years ago
- Timestone enables you to create deterministic and easy-to-understand unit tests for time-dependent, concurrent Go code.☆16Apr 21, 2025Updated last year
- A package to simplify thread declaration, either by using a decorator or by passing it through a function. It also allows you to stop t…☆14Aug 5, 2025Updated 9 months ago
- StrapiRuby is a Ruby wrapper gem around Strapi REST API. #hacktoberfest☆15Jul 20, 2025Updated 9 months ago
- A robots.txt parser written in Clojure.☆16Dec 15, 2011Updated 14 years ago
- Human Friendly Way to look for Kubernetes Events☆30Mar 24, 2026Updated last month
- w3act is an annotation and curation tool for building web archive collections☆21Jan 30, 2024Updated 2 years ago
- Tooling to build LLM applications: prompt templating and composition, agents, LLM memory, and other instruments for builders of AI applic…☆371Jan 8, 2026Updated 3 months ago
- A tool for learning significant phrase/term models, and efficiently labeling with them.☆34Apr 23, 2025Updated last year
- GraalVM GitHub action☆13Jun 25, 2022Updated 3 years ago
- A Leiningen 2.0 plugin for copying dependencies into a "lib/" folder in your project☆19Feb 24, 2013Updated 13 years ago
- Tools for Lithuanian language processing☆16Jun 15, 2016Updated 9 years ago
- Polyglot workflows without leaving the comfort of your technology stack.☆864Apr 3, 2023Updated 3 years ago
- Helpers to extend Django Admin with data from external service with minimal hacks☆26Aug 25, 2025Updated 8 months ago
- AI + Formula 1 = 🔥🧑💻☆13Feb 25, 2025Updated last year
- Demo using Apache Lucene as a reverse geocoder, running as a CLI app via Graal, AWS Lambda or Google Cloud Run☆12Apr 20, 2021Updated 5 years ago
- The Clojure programming language☆15Apr 21, 2026Updated 2 weeks ago
- The Solr Package Directory and Sanctuary☆13Oct 14, 2025Updated 6 months ago
- Quarkus Lucene Extension☆16Apr 8, 2026Updated 3 weeks ago
- A tool to assign Sustainable Development Goals to a scientific abstract☆18Feb 25, 2021Updated 5 years ago