Implementation of the BitLinear layer from: The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
☆14Sep 11, 2024Updated last year
Alternatives and similar repositories for bitlinear-pytorch
Users that are interested in bitlinear-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- BitLinear implementation☆36May 4, 2026Updated last month
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- Easy local FLUX.1 Inference☆10Aug 29, 2024Updated last year
- a functional parody of Stack Overflow, using AI☆10Apr 4, 2026Updated 2 months ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors"☆11Apr 30, 2023Updated 3 years ago
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha…☆70Sep 19, 2021Updated 4 years ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated 2 years ago
- Clustered Compositional Embeddings☆13Oct 25, 2023Updated 2 years ago
- A PyTorch native platform for training generative AI models☆17Apr 21, 2026Updated last month
- PegasusX: The Future of Multimodal Embeddings 🦄 🦄☆14Oct 16, 2024Updated last year
- Evaluation of Sentence Representations in Polish☆23Dec 29, 2022Updated 3 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent☆16Sep 8, 2022Updated 3 years ago
- Talking AI Avatar in Realtime☆24Mar 30, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆13Oct 29, 2021Updated 4 years ago
- ☆16Dec 9, 2023Updated 2 years ago
- ☆23Oct 22, 2025Updated 7 months ago
- ☆13Aug 19, 2024Updated last year
- Simple AI chat bubble for your website: Wordpress, React, HTML, Shopify. Answer questions about a website's content using RAG, streaming,…☆22Mar 24, 2025Updated last year
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- This repo contains the official code release of the Neural Experts paper, published in NeurIPS 2024.☆14Dec 3, 2024Updated last year
- Mixture of Lora Experts☆11Apr 7, 2024Updated 2 years ago
- [NeurIPS 2024] Low rank memory efficient optimizer without SVD☆33Jul 1, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Official implementation of Latent-SFT: teaching LLMs to reason with vocabulary-space latent chains.☆51May 18, 2026Updated 3 weeks ago
- MLX implementation of Hierarchical Reasoning Model (HRM) - Adaptive computation for complex reasoning tasks☆28Aug 27, 2025Updated 9 months ago
- ☆55Sep 26, 2025Updated 8 months ago
- ☆13Sep 7, 2024Updated last year
- Temporal Predictive Coding For Model-Based Planning In Latent Space (ICML-2021)☆13Jul 22, 2024Updated last year
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks☆17Jan 15, 2025Updated last year
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Jun 11, 2025Updated last year
- Transformer Doctor: Diagnosing and Treating Vision Transformers☆11Jan 15, 2025Updated last year
- Domain-Agnostic Supervised Learning with Hyperdimensional Computing☆13Jun 14, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This repository contains the official authors implementation associated with the paper "Neural Surface Priors for Editable Gaussian Splat…☆13Dec 6, 2024Updated last year
- ☆20Mar 11, 2025Updated last year
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- CUDA implementation of Wavelet KAN.☆17Jun 8, 2024Updated 2 years ago
- ☆12Jul 30, 2016Updated 9 years ago
- This repository contains code for the paper "Learning Decision Trees as Amortized Structure Inference"☆16Mar 25, 2025Updated last year
- [EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…☆13Dec 15, 2024Updated last year