A file utility for accessing both local and remote files through a unified interface.
☆47 · Mar 20, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for cached_path
Users interested in cached_path are comparing it to the libraries listed below.
- A file-backed dictionary for Python ☆12 · Aug 15, 2022 · Updated 3 years ago
- Utilities for batched LLM calls with retries ☆49 · Updated this week
- Drop-in replacements for Python's map function ☆15 · Sep 5, 2023 · Updated 2 years ago
- BPE modification that removes intermediate tokens during tokenizer training. ☆27 · Nov 25, 2024 · Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆35 · May 24, 2024 · Updated last year
- Rust library for indexing and quickly searching large pretraining corpora ☆31 · Oct 30, 2025 · Updated 5 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆30 · Nov 18, 2025 · Updated 4 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 · Feb 18, 2024 · Updated 2 years ago
- Utilities and boilerplate code to use wandb with allennlp ☆21 · May 22, 2023 · Updated 2 years ago
- ☆25 · May 20, 2020 · Updated 5 years ago
- Multidocument Summarization for Literature Review Shared Task 2022 ☆30 · Oct 16, 2022 · Updated 3 years ago
- This repository contains code used for our Multi Sentence Inference NAACL'22 paper. ☆12 · Mar 6, 2023 · Updated 3 years ago
- [WIP] Better (FP8) attention for Hopper ☆32 · Feb 24, 2025 · Updated last year
- ☆10 · Jun 29, 2021 · Updated 4 years ago
- ☆24 · Jan 28, 2025 · Updated last year
- A string tokenizer library for Rust ☆11 · May 16, 2018 · Updated 7 years ago
- A small library for persistent caching of values in VSCode Extensions ☆13 · Dec 23, 2024 · Updated last year
- Application written in Go that creates and stores a directory fingerprint/hash from all the files in its tree ☆12 · Jun 5, 2020 · Updated 5 years ago
- ☆15 · Aug 5, 2023 · Updated 2 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun. ☆18 · Dec 19, 2024 · Updated last year
- ☆33 · Nov 4, 2024 · Updated last year
- CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training ☆32 · Jul 20, 2022 · Updated 3 years ago
- ☆24 · Feb 3, 2019 · Updated 7 years ago
- Map your Python dataclasses to PySpark types ☆10 · Feb 11, 2024 · Updated 2 years ago
- A cookiecutter for linkml projects. An equivalent of `linkml-ws new project-name`. ☆27 · Oct 23, 2025 · Updated 5 months ago
- A library for training crosscoders ☆16 · May 28, 2025 · Updated 10 months ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud. ☆17 · Jun 5, 2024 · Updated last year
- Sparse Embedding Compression for Scalable Retrieval in Recommender Systems ☆35 · Nov 21, 2025 · Updated 4 months ago
- Well-documented examples of deployment-ready FastAPI applications written from scratch. ☆12 · Apr 24, 2021 · Updated 4 years ago
- A pure-Python Beaker client ☆18 · Oct 14, 2025 · Updated 5 months ago
- AFD Dataset Cleaned ☆15 · Apr 9, 2020 · Updated 6 years ago
- A set of procedures to estimate the readability of a text ☆15 · Apr 30, 2018 · Updated 7 years ago
- [NAACL 2018] Robust Sequence Labeling with Adversarial Training ☆10 · Sep 30, 2019 · Updated 6 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆18 · May 23, 2025 · Updated 10 months ago
- Toolkit for building prompt templates for language models ☆12 · Sep 30, 2022 · Updated 3 years ago
- Code and Data for Evaluation WG ☆42 · May 4, 2022 · Updated 3 years ago
- Processed E-MTAB-3610 dataset - Transcriptional Profiling of 1,000 human cancer cell lines ☆11 · Dec 23, 2021 · Updated 4 years ago
- Comparing sequential forecasters via confidence sequences & e-processes ☆10 · Oct 24, 2023 · Updated 2 years ago
- A tool for automated uploading and version management of scientific data to Zenodo ☆45 · Mar 30, 2026 · Updated 2 weeks ago