VectorInstitute / vector-inference
Efficient LLM inference on Slurm clusters using vLLM.
☆62Updated this week
Alternatives and similar repositories for vector-inference
Users that are interested in vector-inference are comparing it to the libraries listed below
Sorting:
- LLM finetuning in resource-constrained environments.☆47Updated 10 months ago
- PyTorch library for Active Fine-Tuning☆72Updated 2 months ago
- nanoGPT-like codebase for LLM training☆94Updated last month
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆72Updated 8 months ago
- Extract full next-token probabilities via language model APIs☆244Updated last year
- ☆73Updated 2 months ago
- Code for Zero-Shot Tokenizer Transfer☆127Updated 4 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆72Updated 6 months ago
- ☆129Updated last month
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆84Updated 5 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆42Updated 5 months ago
- Modalities, a PyTorch-native framework for distributed and reproducible foundation model training.☆77Updated last week
- A library for efficient patching and automatic circuit discovery.☆64Updated 3 weeks ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆55Updated 6 months ago
- AI Logging for Interpretability and Explainability🔬☆116Updated 11 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆256Updated 10 months ago
- A MAD laboratory to improve AI architecture designs 🧪☆115Updated 4 months ago
- Understand and test language model architectures on synthetic tasks.☆195Updated 2 months ago
- [arXiv] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆18Updated 2 months ago
- Discovering Data-driven Hypotheses in the Wild☆80Updated 5 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆79Updated last month
- ☆94Updated 3 months ago
- Simple and scalable tools for data-driven pretraining data selection.☆23Updated 3 months ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆148Updated 9 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆120Updated last week
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆189Updated 11 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆100Updated 2 months ago
- ☆72Updated last year
- ☆161Updated 5 months ago
- Composable inference algorithms with LLMs and programmable logic☆67Updated 5 months ago