VectorInstitute / vector-inference
Efficient LLM inference on Slurm clusters using vLLM.
☆57Updated this week
Alternatives and similar repositories for vector-inference:
Users that are interested in vector-inference are comparing it to the libraries listed below
- LLM finetuning in resource-constrained environments.☆47Updated 10 months ago
- nanoGPT-like codebase for LLM training☆94Updated 3 weeks ago
- Code for Zero-Shot Tokenizer Transfer☆127Updated 3 months ago
- PyTorch library for Active Fine-Tuning☆64Updated 2 months ago
- ☆13Updated last year
- ☆72Updated 11 months ago
- A MAD laboratory to improve AI architecture designs 🧪☆111Updated 4 months ago
- Understand and test language model architectures on synthetic tasks.☆192Updated last month
- Modalities, a PyTorch-native framework for distributed and reproducible foundation model training.☆75Updated this week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆72Updated 8 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆95Updated 2 months ago
- Extract full next-token probabilities via language model APIs☆241Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆57Updated 10 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆196Updated 2 weeks ago
- Website for hosting the Open Foundation Models Cheat Sheet.☆267Updated last week
- Language models scale reliably with over-training and on downstream tasks☆96Updated last year
- A repository containing the code for translating popular LLM benchmarks to German.☆25Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆71Updated 5 months ago
- A library for efficient patching and automatic circuit discovery.☆63Updated this week
- ☆85Updated 2 weeks ago
- ☆121Updated last year
- ☆38Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆105Updated 5 months ago
- [arXiv] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆17Updated last month
- Discovering Data-driven Hypotheses in the Wild☆76Updated 5 months ago
- ☆91Updated 2 months ago
- Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings☆27Updated 2 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆197Updated 4 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆41Updated 4 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆45Updated last month