[ICLR2026] The first W4A4KV4 quantized + 50% sparse LLMs!
☆24Jan 26, 2026Updated last month
Alternatives and similar repositories for OBR
Users that are interested in OBR are comparing it to the libraries listed below
Sorting:
- [CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…☆33Feb 24, 2026Updated last week
- Minute-long video generation at 24FPS.☆50Feb 2, 2026Updated last month
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 5 months ago
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)☆81Feb 10, 2026Updated 3 weeks ago
- Official implementation for LaCo (EMNLP 2024 Findings)☆21Oct 3, 2024Updated last year
- [ICCV2025]Generate one 2K image on single 24GB 3090 GPU!☆84Sep 8, 2025Updated 5 months ago
- [ICML2025] LoRA fine-tune directly on the quantized models.☆39Nov 25, 2024Updated last year
- The official GitHub page for the survey paper "A Survey of RWKV".☆32Jan 7, 2025Updated last year
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning☆97Feb 21, 2025Updated last year
- ☆10Sep 4, 2025Updated 6 months ago
- [CVPR 2021] FMO Deblurring Benchmark☆13Jan 12, 2022Updated 4 years ago
- Communication Relay by creating a WiFi Mesh Network using ROS, and using that network for Data Telemetry, with Telemetry radios ( Ubiquit…☆11Dec 18, 2018Updated 7 years ago
- ☆54May 19, 2025Updated 9 months ago
- ☆11Aug 20, 2025Updated 6 months ago
- Official implementation of the paper "LTrack: Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Rep…☆12Jul 26, 2023Updated 2 years ago
- Python资源大全中文版,内容包括:Web框架、网络爬虫、网络内容提取、模板引擎、数据库、数据可视化、图片处理 、文本处理、自然语言处理、机器学习、日志、代码分析等☆11May 24, 2016Updated 9 years ago
- Project focused on enhancing the quality of low-fidelity endoscopy images using Generative Adversarial Networks (GANs) implemented in PyT…☆17Jun 5, 2025Updated 9 months ago
- Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.☆46Jun 11, 2025Updated 8 months ago
- [NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.☆180Oct 3, 2024Updated last year
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia☆42Mar 13, 2023Updated 2 years ago
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 3 months ago
- A simple, generic, and flexible keyframe animation library for Rust.☆30Dec 30, 2025Updated 2 months ago
- decontamination☆26Dec 3, 2025Updated 3 months ago
- ☆11May 27, 2022Updated 3 years ago
- ☆10May 15, 2021Updated 4 years ago
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆13Jan 2, 2024Updated 2 years ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- This dataset consists of images along with annotations that specify whether two faces in the photo are looking at each other. This datase…☆12Aug 27, 2021Updated 4 years ago
- ☆13May 21, 2023Updated 2 years ago
- Computational Neuroscience stuff☆13Aug 12, 2019Updated 6 years ago
- ☆10Aug 29, 2024Updated last year
- BanglaWriting: A multi-purpose offline Bangla handwriting dataset☆12Nov 18, 2020Updated 5 years ago
- [ICCV' 23] MRM: Masked Relation Modeling for Medical Image Pre-Training with Genetics☆10Oct 28, 2024Updated last year
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago
- Unofficial implementation of SORT, A simple online and real-time tracking algorithm for 2D multiple objects tracking in video sequences, …☆12Jul 1, 2021Updated 4 years ago
- ☆11Nov 30, 2023Updated 2 years ago
- ☆14Mar 5, 2024Updated 2 years ago
- Automatically constructed lexical database for Bangla inspired from Wordnet☆11Jul 12, 2012Updated 13 years ago
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.☆34Dec 16, 2025Updated 2 months ago