iscc / iscc-coreLinks
ISCC - Codec & Algorithms
☆22Updated 2 months ago
Alternatives and similar repositories for iscc-core
Users that are interested in iscc-core are comparing it to the libraries listed below
Sorting:
- ISCC - Software Development Kit☆19Updated 4 months ago
- ISCC: Command Line Tool☆17Updated last year
- ISCC: International Standard Content Code☆50Updated last year
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated 2 years ago
- Download and attach provenance to public datasets☆37Updated 10 months ago
- Small python package to measure OCR quality and other related metrics.☆26Updated last year
- Command line tool for digging into WARC files☆50Updated last week
- MkDocs plugin to generate semantic reference Markdown pages from a knowledge graph☆40Updated last year
- A polite and user-friendly downloader for Common Crawl data☆67Updated 5 months ago
- A tool for collection archival slivers of the web and web archives☆17Updated 11 months ago
- Centralised repository for WARC usage specifications.☆124Updated 3 months ago
- Makes Wikibase data available in Semantic MediaWiki☆18Updated 10 months ago
- Extract networks of entities from journalistic reporting☆49Updated 2 years ago
- Ergonomic line-by-line transcription of scanned text.☆54Updated last week
- ☆16Updated 4 months ago
- Tracking instruction-tuned LLM openness. Paper: Liesenfeld, Andreas, Alianda Lopez, and Mark Dingemanse. 2023. “Opening up ChatGPT: Track…☆119Updated 11 months ago
- SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions☆65Updated 9 months ago
- Convert ALTO XML to plain text + minimal metadata☆17Updated last year
- Add website scraping abilities to Datasette☆66Updated 2 years ago
- Add DuckDB, Parquet, CSV and JSON lines support to Datasette☆58Updated last year
- A Memento Aggregator CLI and Server in Go☆76Updated 11 months ago
- A tool for detecting viruses and NSFW material in WARC files☆17Updated last month
- Given the URL to a public JSON document in an International Image Interoperability Framework (IIIF) image server, this script will downlo…☆16Updated 3 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Updated 3 years ago
- image-to-text model for PDF.js☆50Updated 10 months ago
- CSV on the web☆47Updated 4 months ago
- LLM plugin for clustering embeddings☆82Updated last year
- Node starter kit for semantic-search. Uses Mighty Inference Server with Qdrant vector search.☆15Updated 2 years ago
- Data cleaning and validation functions for names, languages, identifiers, etc.☆52Updated this week
- Tools to construct and process Common Crawl webgraphs☆105Updated last week