nyunAI / PruneGPT
☆53Updated 11 months ago
Alternatives and similar repositories for PruneGPT:
Users that are interested in PruneGPT are comparing it to the libraries listed below
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆83Updated last month
- Easy to use, High Performant Knowledge Distillation for LLMs☆65Updated last week
- ☆48Updated 5 months ago
- ☆66Updated 11 months ago
- ☆115Updated 3 weeks ago
- entropix style sampling + GUI☆26Updated 6 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆121Updated last year
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆11Updated last year
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆22Updated last month
- ☆33Updated 10 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last year
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆150Updated last year
- Simple examples using Argilla tools to build AI☆52Updated 5 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- Data preparation code for Amber 7B LLM☆89Updated 11 months ago
- ☆75Updated last year
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆100Updated this week
- This is the official repository for Inheritune.☆111Updated 2 months ago
- ☆73Updated last year
- Simple GRPO scripts and configurations.☆58Updated 3 months ago
- ☆153Updated 9 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆57Updated 11 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆143Updated 7 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆77Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 11 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- A repository for research on medium sized language models.☆76Updated 11 months ago