ytgui / PilotANN
Memory-Bounded GPU Acceleration for Vector Search
☆23Updated last month
Alternatives and similar repositories for PilotANN
Users that are interested in PilotANN are comparing it to the libraries listed below
Sorting:
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆64Updated last year
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net☆23Updated 3 weeks ago
- Compression for Foundation Models☆31Updated last month
- ☆27Updated 2 weeks ago
- MPI Code Generation through Domain-Specific Language Models☆13Updated 5 months ago
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024☆61Updated 7 months ago
- ☆13Updated this week
- Latent Large Language Models☆18Updated 8 months ago
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆26Updated 5 months ago
- state-of-the-art search over vector embeddings and structured data (SIGMOD '24)☆78Updated 2 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- PipeRAG: Fast Retrieval-Augmented Generation via Algorithm-System Co-design (KDD 2025)☆20Updated 11 months ago
- Lottery Ticket Adaptation☆39Updated 5 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 5 months ago
- ☆21Updated 2 months ago
- BH hackathon☆14Updated last year
- ☆16Updated 2 months ago
- ☆59Updated this week
- ☆15Updated 4 months ago
- Implementation of the paper "Lossless Compression of Vector IDs for Approximate Nearest Neighbor Search" by Severo et al.☆78Updated 3 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C☆47Updated last month
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a…☆144Updated this week
- ☆16Updated 3 weeks ago
- Experiments to assess SPADE on different LLM pipelines.☆16Updated last year
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 6 months ago
- ☆29Updated 4 months ago
- ☆41Updated 5 months ago
- Modified Beam Search with periodical restart☆12Updated 8 months ago
- ☆31Updated 3 weeks ago
- LLM reads a paper and produce a working prototype☆56Updated last month