Implementation of the BitLinear layer from: The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
☆14Sep 11, 2024Updated last year
Alternatives and similar repositories for bitlinear-pytorch
Users that are interested in bitlinear-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- BitLinear implementation☆36May 4, 2026Updated 2 weeks ago
- A Verilog implementation of a hand-written digit recognition Neural Network☆11Nov 16, 2024Updated last year
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- A Chainlit App Used to Showcase: Async, Caching, Additional Chainlit Methods, and more!☆11Oct 1, 2024Updated last year
- ☆16Sep 30, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Jul 13, 2024Updated last year
- C++ code for HLS FPGA implementation of transformer☆23Sep 11, 2024Updated last year
- Apply methods described in "Git Re-basin"-paper [1] to arbitrary models --- [1] Ainsworth et al. (https://arxiv.org/abs/2209.04836)☆15Updated this week
- Unofficial implementation of the paper: "NeRF-In: Free-Form NeRF Inpainting with RGB-D Priors"☆11Apr 30, 2023Updated 3 years ago
- Homebrew formulas for CGMiner and BFGMiner☆36Jan 14, 2018Updated 8 years ago
- The accompanying code for "Memory-efficient Transformers via Top-k Attention" (Ankit Gupta, Guy Dar, Shaya Goodman, David Ciprut, Jonatha…☆70Sep 19, 2021Updated 4 years ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated 2 years ago
- Clustered Compositional Embeddings☆13Oct 25, 2023Updated 2 years ago
- Residual vector quantization for KV cache compression in large language model☆12Oct 22, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Attempt at cog wrapper for segmind/SSD-1B☆10Dec 11, 2023Updated 2 years ago
- A PyTorch native platform for training generative AI models☆17Apr 21, 2026Updated last month
- MuJS Binding for V/Vlang☆14Dec 23, 2022Updated 3 years ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent☆16Sep 8, 2022Updated 3 years ago
- Talking AI Avatar in Realtime☆24Mar 30, 2024Updated 2 years ago
- ☆16Dec 9, 2023Updated 2 years ago
- ☆23Oct 22, 2025Updated 7 months ago
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…☆18Oct 4, 2025Updated 7 months ago
- Differentiable non-uniform interpolation: https://arxiv.org/abs/2012.13257☆11Oct 3, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official repo for BWLer: Barycentric Weight Layer☆30Mar 20, 2026Updated 2 months ago
- ☆17Dec 19, 2024Updated last year
- ☆13Aug 19, 2024Updated last year
- Code accompanying the paper "A contrastive rule for meta-learning"