iscc / iscc-coreLinks
ISCC - Codec & Algorithms
☆22Updated 2 months ago
Alternatives and similar repositories for iscc-core
Users that are interested in iscc-core are comparing it to the libraries listed below
Sorting:
- ISCC - Software Development Kit☆19Updated 4 months ago
- SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions☆65Updated 9 months ago
- Tools to construct and process Common Crawl webgraphs☆105Updated last week
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated 2 years ago
- ISCC: Command Line Tool☆17Updated last year
- The OpenLink Structured Data Sniffer (OSDS) is Web Extensions compliant Browser Extension for Chrome, Firefox and Opera browsers that det…☆51Updated this week
- Add website scraping abilities to Datasette☆66Updated 2 years ago
- Download and attach provenance to public datasets☆37Updated 10 months ago
- A tool for collection archival slivers of the web and web archives☆17Updated 11 months ago
- ISCC: International Standard Content Code☆50Updated last year
- Command line tool for digging into WARC files☆50Updated last week
- CSV on the web☆47Updated 4 months ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Updated 3 years ago
- SOftware Metadata Extraction Framework: A tool for automatically extracting relevant software information from code repositories (using R…☆69Updated 3 weeks ago
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graph☆40Updated last year
- LLM plugin for clustering embeddings☆82Updated last year
- Generate embeddings for images and text using CLIP with LLM☆76Updated last year
- CG Note on JSON-LD*☆29Updated 3 months ago
- LLM plugin for embeddings using sentence-transformers☆74Updated 9 months ago
- Small python package to measure OCR quality and other related metrics.☆26Updated last year
- Extract networks of entities from journalistic reporting☆49Updated 2 years ago
- image-to-text model for PDF.js☆50Updated 10 months ago
- Ergonomic line-by-line transcription of scanned text.☆54Updated last week
- A polite and user-friendly downloader for Common Crawl data☆67Updated 5 months ago
- Metadata management and dissemination system for Open Access books☆54Updated this week
- wabac.js - Web Archive Browsing Augmentation Client☆122Updated last week
- Centralised repository for WARC usage specifications.☆124Updated 3 months ago
- A static site generator for SPARQL backends.☆139Updated 3 weeks ago
- sponge your gmail with artificial intelligence☆22Updated last year
- A Python module to manipulate data on a Wikibase instance (like Wikidata) through the MediaWiki Wikibase API and the Wikibase SPARQL endp…☆87Updated 2 weeks ago