VectorInstitute / vector-inference

Efficient LLM inference on Slurm clusters using vLLM.

☆82

Alternatives and similar repositories for vector-inference

Users who are interested in vector-inference are comparing it to the repositories listed below.

justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆247 Updated last year
VectorInstitute / vectorlm
LLM finetuning in resource-constrained environments.
☆53 Updated last year
epfml / llm-baselines
nanoGPT-like codebase for LLM training
☆110 Updated 2 weeks ago
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆130 Updated 3 years ago
google-deepmind / mishax
☆143 Updated 2 months ago
jonhue / activeft
PyTorch library for Active Fine-Tuning
☆95 Updated last month
allenai / fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
☆268 Updated 6 months ago
RulinShao / retrieval-scaling
Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".
☆219 Updated 3 weeks ago
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆133 Updated 11 months ago
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆173 Updated 4 months ago
llm-merging / LLM-Merging
LLM-Merging: Building LLMs Efficiently through Merging
☆205 Updated last year
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆85 Updated last year
hadasah / btm
☆76 Updated last year
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆100 Updated last year
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆240 Updated last month
probabilistic-inference-scaling / probabilistic-inference-scaling
☆52 Updated 8 months ago
allenai / discoverybench
Discovering Data-driven Hypotheses in the Wild
☆118 Updated 5 months ago
callummcdougall / sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆227 Updated 11 months ago
ARBORproject / arborproject.github.io
☆83 Updated 8 months ago
mechanistic-interpretability-grokking / progress-measures-paper
☆70 Updated 3 years ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆277 Updated last year
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al. (NeurIPS 2024)
☆194 Updated last year
stanfordnlp / axbench
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
☆141 Updated 4 months ago
msakarvadia / AttentionLens
Interpreting the latent space representations of attention head outputs for LLMs
☆34 Updated last year
ltgoslo / bert-in-context
Official implementation of "BERTs are Generative In-Context Learners"
☆32 Updated 8 months ago
srush / do-we-need-attention
☆166 Updated 2 years ago
KihoPark / LLM_Categorical_Hierarchical_Representations
☆111 Updated 9 months ago
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256 Updated 2 years ago
hamishivi / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆75 Updated last year
microsoft / mutransformers
some common Huggingface transformers in maximal update parametrization (µP)
☆86 Updated 3 years ago