Notes on quantization in neural networks
☆124Dec 14, 2023Updated 2 years ago
Alternatives and similar repositories for quantization-notes
Users that are interested in quantization-notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Distributed training (multi-node) of a Transformer model☆96Apr 10, 2024Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆49Sep 27, 2024Updated last year
- Notes on Direct Preference Optimization☆25Apr 14, 2024Updated 2 years ago
- Slides for "Retrieval Augmented Generation" video☆26Nov 27, 2023Updated 2 years ago
- ☆34Mar 28, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- LLaMA 2 implemented from scratch in PyTorch☆369Sep 25, 2023Updated 2 years ago
- Triton Inference Server + TensorRT + метрики☆20Jun 11, 2025Updated 10 months ago
- This is a repository to practice multi-thread programming in C++☆28Feb 21, 2024Updated 2 years ago
- ☆16Jan 14, 2025Updated last year
- PyTorch Static Quantization Example☆41Apr 29, 2021Updated 4 years ago
- A comprehensive Model Context Protocol (MCP) server providing advanced access to the UniProt protein database.☆19Dec 21, 2025Updated 3 months ago
- Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)☆349May 28, 2023Updated 2 years ago
- Persian LicensePlate Recognition System using YOLO11 and OpenCV☆51Jan 1, 2025Updated last year
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆10Jan 28, 2024Updated 2 years ago
- A Tensorflow implementation of VGG-16 trained on CIFAR-100☆11May 25, 2018Updated 7 years ago
- everything about llm based agent☆24Dec 19, 2025Updated 4 months ago
- Attention is all you need implementation☆1,199Jun 8, 2024Updated last year
- Experimental Cython wrapper around vtzero☆14Jan 24, 2025Updated last year
- Simple and efficient memory pool is implemented with C++11.☆10Jun 2, 2022Updated 3 years ago
- [CVPR 2025] VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?☆30May 10, 2025Updated 11 months ago
- ☆14Jul 29, 2021Updated 4 years ago
- Notes on the Mistral AI model☆20Dec 27, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆17Dec 23, 2025Updated 3 months ago
- PyLate efficient inference engine☆81Jan 7, 2026Updated 3 months ago
- [ICML2025] LoRA fine-tune directly on the INT4 models.☆40Nov 25, 2024Updated last year
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆26Jul 21, 2025Updated 8 months ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 7 months ago
- Transformer Network for Time-Series, Sensor and Wearable Data☆27Feb 8, 2021Updated 5 years ago
- 从jieba分词到BERT-wwm,一步步带你进入中文NLP的世界☆15Sep 1, 2022Updated 3 years ago
- Official implementation of our paper "Bidirectional Consistency Models"; and reproduced Improved Consistency Models (iCT).☆27May 10, 2025Updated 11 months ago
- Rebuttal code for SEGS-SLAM ICCV 2025☆17Jun 30, 2025Updated 9 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Vehicle speed estimation using YOLOv8☆32Apr 10, 2024Updated 2 years ago
- ☆22Dec 16, 2025Updated 4 months ago
- GEMV implementation with CUTLASS☆21Aug 21, 2025Updated 7 months ago
- Global Satellite Assessment Tool (GlobalSAT)☆19Feb 1, 2026Updated 2 months ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- Rababa, the diacritization library for Arabic and Hebrew (Abjad scripts in general)☆13May 1, 2025Updated 11 months ago
- ☆11May 2, 2023Updated 2 years ago