allenai / cached_pathLinks
A file utility for accessing both local and remote files through a unified interface.
☆46Updated last month
Alternatives and similar repositories for cached_path
Users that are interested in cached_path are comparing it to the libraries listed below
Sorting:
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35Updated last year
- decontamination☆23Updated last month
- Code for SaGe subword tokenizer (EACL 2023)☆27Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated 2 years ago
- **ARCHIVED** Filesystem interface to 🤗 Hub☆58Updated 2 years ago
- ☆13Updated last year
- 🚀🤗 A collection of templates for Hugging Face Spaces☆35Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated 3 months ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆67Updated 2 years ago
- Crispy reranking models by Mixedbread☆45Updated 4 months ago
- Supercharge huggingface transformers with model parallelism.☆77Updated 6 months ago
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆17Updated 2 years ago
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug…☆42Updated 5 years ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆67Updated last week
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆47Updated last year
- Datamodels for hugging face tokenizers☆86Updated 2 weeks ago
- Evaluation framework for document processing models and services.☆62Updated last week
- ☆44Updated 2 years ago
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆14Updated last week
- Library for fast text representation and classification.☆31Updated 2 years ago
- Model implementation for the contextual embeddings project☆40Updated 7 months ago
- ☆90Updated 6 months ago
- Efficiently computing & storing token n-grams from large corpora☆26Updated last year
- ☆59Updated 2 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Updated last year
- ☆53Updated 11 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Updated last year
- Embedding Recycling for Language models☆38Updated 2 years ago
- 🤝 Trade any tensors over the network☆30Updated 2 years ago