Di-Is / faiss-gpu-wheelsLinks
Unofficial faiss wheel builder for NVIDIA GPU
☆30Updated 2 weeks ago
Alternatives and similar repositories for faiss-gpu-wheels
Users that are interested in faiss-gpu-wheels are comparing it to the libraries listed below
Sorting:
- ☆58Updated 10 months ago
- ☆72Updated last year
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆60Updated 6 months ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…☆61Updated last year
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆37Updated last month
- Code for Zero-Shot Tokenizer Transfer☆142Updated 11 months ago
- Official implementation of "GPT or BERT: why not both?"☆63Updated 5 months ago
- Official Code for Paper: Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation☆132Updated this week
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆73Updated 8 months ago
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆86Updated last year
- E5-V: Universal Embeddings with Multimodal Large Language Models☆273Updated last month
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆51Updated 3 months ago
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"☆38Updated last year
- This is the official repository for Inheritune.☆119Updated 11 months ago
- ☆24Updated 9 months ago
- ☆162Updated last year
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Updated 9 months ago
- Official code for paper "UniIR: Training and Benchmarking Universal Multimodal Information Retrievers" (ECCV 2024)☆175Updated last year
- ☆101Updated 7 months ago
- Code for "SemDeDup", a simple method for identifying and removing semantic duplicates from a dataset (data pairs which are semantically s…☆151Updated 2 years ago
- Code for "Merging Text Transformers from Different Initializations"☆20Updated 11 months ago
- https://footprints.baulab.info☆17Updated last year
- Easy modernBERT fine-tuning and multi-task learning☆63Updated 6 months ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023)☆14Updated 5 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆225Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆213Updated 6 months ago
- [EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs☆58Updated 4 months ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆81Updated 2 years ago
- ☆25Updated last year
- ☆85Updated 2 months ago