Notes on quantization in neural networks
☆121Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for quantization-notes
Users that are interested in quantization-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Reference implementation of Mistral AI 7B v0.1 model.☆28Dec 25, 2023Updated 2 years ago
- Memory Compiler Tutorial☆13Aug 2, 2022Updated 3 years ago
- Distributed training (multi-node) of a Transformer model☆94Apr 10, 2024Updated last year
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆49Sep 27, 2024Updated last year
- Notes on Direct Preference Optimization☆24Apr 14, 2024Updated last year
- STSC-SNN: Spatio-Temporal Synaptic Connection with temporal convolution and attention for spiking neural networks☆24Dec 24, 2022Updated 3 years ago
- Notes and commented code for RLHF (PPO)☆127Feb 27, 2024Updated 2 years ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 4 months ago
- LORA: Low-Rank Adaptation of Large Language Models implemented using PyTorch☆126Jul 24, 2023Updated 2 years ago
- ☆34Mar 28, 2025Updated 11 months ago
- LLaMA 2 implemented from scratch in PyTorch☆367Sep 25, 2023Updated 2 years ago
- Training models with ternary quantized weights using PyTorch☆15Jun 12, 2019Updated 6 years ago
- Efficient single-pass hyperdimensional classifier. Mirror of https://gitlab.com/biaslab/onlinehd☆11Jan 31, 2021Updated 5 years ago
- A repository with teaching materials for electrochemical impedance spectroscopy.☆10May 28, 2018Updated 7 years ago
- Experiments in machine learning on graph databases☆14Feb 6, 2018Updated 8 years ago
- Code for the ISCAS23 paper "The Hardware Impact of Quantization and Pruning for Weights in Spiking Neural Networks"☆11Apr 20, 2023Updated 2 years ago
- PyTorch implementation of the estimator proposed in the paper "Estimating Differential Entropy under Gaussian Convolutions"☆13Oct 22, 2020Updated 5 years ago
- PaliGemma Inference and Fine Tuning☆13May 15, 2024Updated last year
- ☆20Nov 23, 2022Updated 3 years ago
- ☆41Mar 5, 2024Updated 2 years ago
- Deep Learning Visualization Tools Using PyTorch☆11Feb 2, 2021Updated 5 years ago
- Implementation of BERT-based Language Models☆26Mar 12, 2026Updated last week
- [ICLR 2025] DGQ: Distribution-Aware Group Quantization for Text-to-Image Diffusion Models☆19Mar 25, 2025Updated 11 months ago
- A study for a custom convolution layer in which the x and y components of an image pixel are added to the kernel inputs.☆12Feb 21, 2020Updated 6 years ago
- Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)☆29Jan 1, 2024Updated 2 years ago
- ☆26Jan 4, 2025Updated last year
- ☆17Mar 23, 2023Updated 3 years ago
- A blog for LLVM(v11.0.0) beginner, step by step, with detailed documents and comments. Record the way I learn LLVM.☆14Jun 17, 2022Updated 3 years ago
- How to create, train and quantize network, then integrate it into pre/post image processing and generate CUDA C++ code for targeting Jets…☆12May 7, 2025Updated 10 months ago
- Minimal unofficial implementation of Consistency Trajectory models on a 1D toy task.☆22Mar 11, 2024Updated 2 years ago
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- ☆10Jan 28, 2024Updated 2 years ago
- A Tensorflow implementation of VGG-16 trained on CIFAR-100☆11May 25, 2018Updated 7 years ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 8 months ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆10Jul 27, 2024Updated last year
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆24Mar 6, 2026Updated 2 weeks ago
- pytorch版基于gpt+nezha的中文多轮Cdial☆12Oct 22, 2022Updated 3 years ago
- ☆17Dec 23, 2025Updated 3 months ago
- [ICML2025] LoRA fine-tune directly on the quantized models.☆39Nov 25, 2024Updated last year