castorini / hf-spaceriniView external linksLinks
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆32Aug 5, 2023Updated 2 years ago
Alternatives and similar repositories for hf-spacerini
Users that are interested in hf-spacerini are comparing it to the libraries listed below
Sorting:
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- Hugging Face and Pyserini interoperability☆19May 18, 2023Updated 2 years ago
- Toolkit for domain-specific information retrieval experimentation☆19Updated this week
- User-friendly viewer for Parquet files☆10Jan 10, 2026Updated last month
- QLoRA for Masked Language Modeling☆22Sep 11, 2023Updated 2 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 2 years ago
- Python Module implementing SRP☆12Jul 29, 2022Updated 3 years ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- DALLE-tools provided useful dataset utilities to improve you workflow with WebDatasets.☆15Mar 9, 2022Updated 3 years ago
- Topic Model based on Pretrained Sentence Embeddings (with BERT)☆13Feb 8, 2023Updated 3 years ago
- ☆12Dec 6, 2024Updated last year
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"☆18May 15, 2025Updated 9 months ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 4 months ago
- ☆12Apr 25, 2022Updated 3 years ago
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- “Generate to Understand for Representation”☆14Apr 18, 2024Updated last year
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 2 years ago
- Arabic edition of ALBERT pretrained language models☆16Apr 25, 2021Updated 4 years ago
- https://footprints.baulab.info☆17Oct 4, 2024Updated last year
- DImensionality REduction in JAX☆26Nov 21, 2025Updated 2 months ago
- Example of configuring multiplage apps via a custom config file☆18Nov 14, 2023Updated 2 years ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Jul 27, 2021Updated 4 years ago
- Tokun to can tokens☆18Jun 19, 2025Updated 7 months ago
- Code for our SIGIR 2022 accepted paper : P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based L…☆18Sep 24, 2023Updated 2 years ago
- Self-Supervised Document-to-Document Similarity Ranking via Contextualized Language Models and Hierarchical Inference☆45Nov 28, 2022Updated 3 years ago
- Auxiliary tasks for task-oriented dialogue systems. Published in ICNLSP'22 and indexed in the ACL Anthology.☆17Feb 27, 2023Updated 2 years ago
- ☆16Jun 12, 2023Updated 2 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- A extension of Transformers library to include T5ForSequenceClassification class.☆40Apr 17, 2023Updated 2 years ago
- Multilingual Compositional Wikidata Questions (MCWQ)☆20Jun 12, 2023Updated 2 years ago
- start exploring.☆16Apr 6, 2024Updated last year
- ☆23Oct 30, 2023Updated 2 years ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆17Oct 17, 2023Updated 2 years ago
- Libraries, Archives and Museums (LAM)☆88Oct 4, 2022Updated 3 years ago
- OTTR for making courses! This is a template repo that helps people write 1 course but publish it in three places. Rendered example: https…☆18Apr 1, 2025Updated 10 months ago
- Supporting links for my DjangoCon 2022 talk☆25Nov 27, 2022Updated 3 years ago
- ☆25May 7, 2025Updated 9 months ago
- A large-scale multilingual dataset for Information Retrieval. Thorough human-annotations across 18 diverse languages.☆199Jul 31, 2024Updated last year
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆62Jul 6, 2025Updated 7 months ago