VectorInstitute / vector-inferenceLinks
Efficient LLM inference on Slurm clusters using vLLM.
☆65Updated this week
Alternatives and similar repositories for vector-inference
Users that are interested in vector-inference are comparing it to the libraries listed below
Sorting:
- LLM finetuning in resource-constrained environments.☆50Updated last year
- PyTorch library for Active Fine-Tuning☆87Updated 5 months ago
- Extract full next-token probabilities via language model APIs☆246Updated last year
- nanoGPT-like codebase for LLM training☆100Updated 2 months ago
- Website for hosting the Open Foundation Models Cheat Sheet.☆267Updated 2 months ago
- Understand and test language model architectures on synthetic tasks.☆219Updated last month
- This repository collects all relevant resources about interpretability in LLMs☆363Updated 8 months ago
- LLM-Merging: Building LLMs Efficiently through Merging☆201Updated 9 months ago
- ☆266Updated this week
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆156Updated 3 weeks ago
- A MAD laboratory to improve AI architecture designs 🧪☆123Updated 7 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆256Updated last year
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆206Updated last month
- Code for Zero-Shot Tokenizer Transfer☆133Updated 6 months ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆127Updated 2 years ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆206Updated 7 months ago
- ☆79Updated 4 months ago
- AI Logging for Interpretability and Explainability🔬☆124Updated last year
- ☆166Updated 2 years ago
- A repository containing the code for translating popular LLM benchmarks to German.☆26Updated last year
- ☆123Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆147Updated 3 weeks ago
- ☆68Updated 2 years ago
- Sparsify transformers with SAEs and transcoders☆584Updated last week
- Steering vectors for transformer language models in Pytorch / Huggingface☆115Updated 4 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆193Updated this week
- ☆101Updated 5 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆265Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated 11 months ago
- A repository for research on medium sized language models.☆504Updated last month