Backend of Common Search. Analyses webpages and sends them to the index.
☆122May 31, 2017Updated 8 years ago
Alternatives and similar repositories for cosr-back
Users that are interested in cosr-back are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tools for managing deployment & operations of Common Search.☆12Aug 26, 2016Updated 9 years ago
- Set of scripts to aid in the download of the GDELT data files from www.gdeltproject.org☆12May 17, 2014Updated 12 years ago
- ☆18Jun 24, 2017Updated 8 years ago
- Launch AWS Elastic MapReduce jobs that process Common Crawl data.☆49Feb 15, 2017Updated 9 years ago
- ☆22Aug 24, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Index URLs in Common Crawl☆197Sep 19, 2017Updated 8 years ago
- An alternative implementation of the `slice::select_nth_unstable` method with improved speed☆11Jan 1, 2024Updated 2 years ago
- Extract statistics from Wikipedia Dump files.☆26Aug 2, 2021Updated 4 years ago
- Node.js bindings to Tantivy Search☆13Dec 8, 2022Updated 3 years ago
- Common web archive utility code.☆63May 2, 2026Updated 2 weeks ago
- Zign OAuth plugin for HTTPie☆22Dec 14, 2015Updated 10 years ago
- Secret Handshake implementation in Python☆22Jan 6, 2026Updated 4 months ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆194Apr 29, 2022Updated 4 years ago
- Question Parsing module for the PPP using a grammatical approch☆32Nov 28, 2017Updated 8 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Experiments in applying interpretability techniques to learned reward functions.☆10Dec 11, 2020Updated 5 years ago
- Library for Distributed Retrieval☆15Feb 21, 2014Updated 12 years ago
- This is a mirror, development happens on Framagit☆32Apr 20, 2021Updated 5 years ago
- Python wrapper for a C++ Double Metaphone☆15Jan 12, 2026Updated 4 months ago
- Graph Engine for Exploration and Search☆42Jan 26, 2024Updated 2 years ago
- ☆29May 13, 2026Updated last week
- Wikipedia Live Monitor☆22Dec 21, 2024Updated last year
- Nordlys: Toolkit for entity-oriented and semantic search☆31Mar 23, 2021Updated 5 years ago
- Des idées de nouveaux services publics, sur le style de https://beta.gouv.fr/ficheproduit/☆16May 29, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Legi Data☆16May 12, 2026Updated last week
- The development of PeARS has been moved to https://github.com/PeARSearch/PeARS-orchard☆54Jul 22, 2017Updated 8 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆53Jun 12, 2020Updated 5 years ago
- Track public endpoints and connections across AWS accounts using VPC Flow Logs☆12Jun 14, 2016Updated 9 years ago
- Events and Situations Ontology☆14Apr 20, 2018Updated 8 years ago
- STUPS-related agent to retrieve application credentials☆12Jul 17, 2018Updated 7 years ago
- A simple Rust crate to cache data both in-memory and on disk☆11Dec 26, 2021Updated 4 years ago
- Run an ssb-server (scuttlebot server) in a Docker container☆12Sep 16, 2019Updated 6 years ago
- Synthesized models for PHOG to make the results reproducible by the research community☆11Jan 23, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Sort-friendly URI Reordering Transform (SURT) python module☆45Sep 11, 2025Updated 8 months ago
- 💻 Goobox file share desktop app (Moved to https://github.com/storewise/file-share-desktop)☆13Sep 1, 2021Updated 4 years ago
- Module that converts from/to SSB keys and BIP39 mnemonic codes☆15Mar 3, 2022Updated 4 years ago
- vCat Java code☆11Updated this week
- Several scripts to analyse Wikidata dumps☆33Apr 7, 2014Updated 12 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Aug 14, 2015Updated 10 years ago
- Parquet IO for Tablesaw☆13May 12, 2026Updated last week