VectorInstitute / flex_model
☆13Updated last year
Alternatives and similar repositories for flex_model:
Users that are interested in flex_model are comparing it to the libraries listed below
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆25Updated last year
- LLM finetuning in resource-constrained environments.☆47Updated 10 months ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆37Updated 2 years ago
- PyTorch library for Active Fine-Tuning☆64Updated 2 months ago
- ☆27Updated 9 months ago
- ☆38Updated last year
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Updated 7 months ago
- ☆72Updated 11 months ago
- ☆45Updated last year
- Efficient LLM inference on Slurm clusters using vLLM.☆57Updated this week
- ☆37Updated last year
- ☆16Updated 6 months ago
- ☆47Updated last year
- ☆34Updated last year
- ☆42Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆70Updated 3 weeks ago
- A library to create and manage configuration files, especially for machine learning projects.☆77Updated 3 years ago
- ☆51Updated 11 months ago
- Long Context Extension and Generalization in LLMs☆53Updated 7 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆45Updated last month
- ☆96Updated 9 months ago
- Simple and scalable tools for data-driven pretraining data selection.☆22Updated 2 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆17Updated 4 months ago
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆65Updated 3 months ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆26Updated 11 months ago
- The original Backpack Language Model implementation, a fork of FlashAttention☆67Updated last year
- [arXiv] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆17Updated last month
- NeurIPS 2024 tutorial on LLM Inference☆42Updated 4 months ago
- ☆44Updated 2 weeks ago