satabios / sconce
Model Compression/Inference Made Easy
☆39Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for sconce
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆32Updated 3 weeks ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆86Updated last year
- ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization☆87Updated last month
- ☆39Updated 10 months ago
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆111Updated 7 months ago
- Spyx: Spiking Neural Networks in JAX☆102Updated last month
- A new model for quickly training and simulating adaptive leaky integrate-and-fire spiking neural networks.☆14Updated 7 months ago
- Collection of autoregressive model implementation☆67Updated this week
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated 11 months ago
- ☆43Updated 4 months ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆36Updated 3 weeks ago
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge)☆45Updated 5 months ago
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆46Updated 4 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆52Updated last month
- Repository for CPU Kernel Generation for LLM Inference☆25Updated last year
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆12Updated 11 months ago
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆23Updated 10 months ago
- LLM training in simple, raw C/CUDA☆12Updated last month
- Training Models Daily☆17Updated 11 months ago
- Open Source Projects from Pallas Lab☆20Updated 3 years ago
- Tools for merging pretrained large language models.☆19Updated 5 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆50Updated 7 months ago
- ☆25Updated 7 months ago
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆92Updated last month
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 2 weeks ago
- BH hackathon☆14Updated 7 months ago
- ☆35Updated 3 weeks ago
- This repository contains code for the MicroAdam paper.☆12Updated 4 months ago
- Lottery Ticket Adaptation☆36Updated last month
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Updated last week