AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVFP4 weights and keeps the entire decode path in FP8
☆120Feb 15, 2026Updated 3 months ago
Alternatives and similar repositories for NVFP4-on-4090-vLLM
Users that are interested in NVFP4-on-4090-vLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GANs for Time series analysis (Synthetic data generation, anomaly detection and interpolation), Hypertuning using Optuna, MLFlow and Data…☆70Sep 14, 2022Updated 3 years ago
- Many market analysts believe that predicting market’s stocks fluctuations is nearly impossible to achieve due to the number of variables …☆17Apr 29, 2019Updated 7 years ago
- Examine and discover LoongArch instructions☆23May 12, 2026Updated 2 weeks ago
- A pure Python DB replication engine for Django that supports SQLite and PostgreSQL.☆16May 19, 2026Updated last week
- New Blockchain technology / Multi-Chain Interoperability Network that leverages virtualization and smart contracts to create a cross-chai…☆16Sep 14, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Python app created with the purpose of speeding up and greatly facilitating the task of cleaning and adjusting Booru-style tags, aimed at…☆12Dec 2, 2023Updated 2 years ago
- Circuit-level PDP-11/34 emulator☆67Apr 8, 2026Updated last month
- 100.000 links, 50.000 artworks dataset. Includes source code that used to scrape data.☆10May 29, 2021Updated 5 years ago
- Must-know Cryptography concepts for web developers☆20Sep 1, 2025Updated 8 months ago
- 3D Open World using threejs and ammojs☆17Mar 23, 2024Updated 2 years ago
- A real-time liminal Quake/Hammer-style CSG level editor and game engine. Optimised for mobile CPUs☆123May 23, 2026Updated last week
- ☆72Feb 13, 2026Updated 3 months ago
- ☆14Apr 28, 2026Updated last month
- A multithreaded discrete event simulation library in C, using POSIX pthreads for parallelized replications and stackful asymmetric corout…☆70Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates th…☆12Sep 24, 2024Updated last year
- Rust-native GPU kernel authoring framework: write GPU compute kernels in Rust, compile to PTX. The Triton equivalent for the Rust ecosyst…☆31May 18, 2026Updated last week
- Verilog code of Loongson's GS132 core☆12Dec 19, 2019Updated 6 years ago
- internal/cpu in Go ( add AVX512)☆28Aug 6, 2024Updated last year
- An unscientific benchmark of SQLite vs the file system (btrfs)☆58Feb 12, 2022Updated 4 years ago
- Electron ports for LoongArch☆17Feb 16, 2026Updated 3 months ago
- Dev workspace is an advanced context management system for Claude Code.☆39Apr 4, 2026Updated last month
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆60Feb 25, 2026Updated 3 months ago
- A simple MIPS CPU for BUAA CO course (and now NSCSCC).☆10May 15, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Apr 10, 2026Updated last month
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …☆13Jul 28, 2025Updated 10 months ago
- RTL implementation of a ray-tracing GPU☆16Dec 18, 2012Updated 13 years ago
- ☆17May 22, 2026Updated last week
- ☆24May 26, 2023Updated 3 years ago
- ☆38Dec 30, 2025Updated 5 months ago
- Paper Tape is All You Need☆99Mar 30, 2026Updated 2 months ago
- AirLLM 70B inference with single 4GB GPU☆20Jun 27, 2025Updated 11 months ago
- A PDF unredactor.☆42Feb 7, 2026Updated 3 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆12Aug 16, 2023Updated 2 years ago
- BitTorrent DHT Protocol && DHT Spider,faster than shiyanhui/dht☆11Aug 30, 2023Updated 2 years ago
- ☆24Nov 8, 2025Updated 6 months ago
- Lower chisel memories to SRAM macros☆13Mar 25, 2024Updated 2 years ago
- Awesome AI Benchmarks☆32Jan 16, 2026Updated 4 months ago
- Code snippets and reproductions from JustAByte☆46Apr 6, 2026Updated last month
- Easy Images captioning under a good pyqt GUI☆21Jun 18, 2023Updated 2 years ago