Notes on quantization in neural networks
☆125 · Dec 14, 2023 · Updated 2 years ago
Alternatives and similar repositories for quantization-notes
Users that are interested in quantization-notes are comparing it to the libraries listed below.
- Memory Compiler Tutorial ☆14 · Aug 2, 2022 · Updated 3 years ago
- Distributed training (multi-node) of a Transformer model ☆96 · Apr 10, 2024 · Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio… ☆49 · Sep 27, 2024 · Updated last year
- Slides for "Retrieval Augmented Generation" video ☆26 · Nov 27, 2023 · Updated 2 years ago
- STSC-SNN: Spatio-Temporal Synaptic Connection with temporal convolution and attention for spiking neural networks ☆24 · Dec 24, 2022 · Updated 3 years ago
- Notes and commented code for RLHF (PPO) ☆129 · Feb 27, 2024 · Updated 2 years ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination ☆27 · Nov 24, 2025 · Updated 5 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch ☆127 · Jul 24, 2023 · Updated 2 years ago
- LLaMA 2 implemented from scratch in PyTorch ☆369 · Sep 25, 2023 · Updated 2 years ago
- ☆16 · Oct 9, 2024 · Updated last year
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022 ☆11 · Apr 13, 2025 · Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀 ☆49 · May 24, 2024 · Updated last year
- End-to-End Gradient Inversion (Gradient Leakage in Federated Learning) 【https://ieeexplore.ieee.org/document/9878027】 ☆11 · Aug 19, 2022 · Updated 3 years ago
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop ☆15 · Oct 29, 2018 · Updated 7 years ago
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces) ☆182 · Jan 7, 2024 · Updated 2 years ago
- BERT explained from scratch ☆17 · Oct 26, 2023 · Updated 2 years ago
- An open source project on estimating train delays in India. ☆11 · Oct 29, 2018 · Updated 7 years ago
- Papers on Search, Recommendations, and Ads (搜广推) ☆32 · Jul 20, 2025 · Updated 9 months ago
- ☆16 · Jul 10, 2024 · Updated last year
- Implementation of BERT-based Language Models ☆26 · Mar 12, 2026 · Updated last month
- PyTorch Static Quantization Example ☆41 · Apr 29, 2021 · Updated 5 years ago
- Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez ☆15 · Apr 2, 2025 · Updated last year
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA) ☆356 · May 28, 2023 · Updated 2 years ago
- [CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps ☆13 · Mar 26, 2025 · Updated last year
- ☆26 · Jan 4, 2025 · Updated last year
- Decoupled Kullback-Leibler Divergence Loss (DKL), NeurIPS 2024 / Generalized Kullback-Leibler Divergence Loss (GKL) ☆50 · Jul 21, 2025 · Updated 9 months ago
- Benchmark tests supporting the TiledCUDA library. ☆18 · Nov 19, 2024 · Updated last year
- ☆20 · Feb 12, 2025 · Updated last year
- ☆10 · Jan 28, 2024 · Updated 2 years ago
- Experimental Cython wrapper around vtzero ☆14 · Jan 24, 2025 · Updated last year
- Simple and efficient memory pool is implemented with C++11. ☆10 · Jun 2, 2022 · Updated 3 years ago
- This repository contains Python code for the paper "Learn What You Want to Unlearn: Unlearning Inversion Attacks against Machine Unlearni… ☆20 · Apr 3, 2024 · Updated 2 years ago
- [CVPR 2025] VidComposition: Can MLLMs Analyze Compositions in Compiled Videos? ☆30 · May 10, 2025 · Updated 11 months ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization ☆27 · Mar 6, 2026 · Updated 2 months ago
- PyTorch implementation of Chinese multi-turn Cdial based on GPT + NEZHA ☆11 · Oct 22, 2022 · Updated 3 years ago
- PyLate efficient inference engine ☆84 · Jan 7, 2026 · Updated 4 months ago
- Unveiling the Layers: Neural Networks from first principles ☆10 · Oct 1, 2025 · Updated 7 months ago
- smoothed box embedding code ☆16 · Apr 5, 2020 · Updated 6 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space" ☆26 · Jul 21, 2025 · Updated 9 months ago