AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVFP4 weights and keeps the entire decode path in FP8
☆129Feb 15, 2026Updated 4 months ago
Alternatives and similar repositories for NVFP4-on-4090-vLLM
Users that are interested in NVFP4-on-4090-vLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GANs for Time series analysis (Synthetic data generation, anomaly detection and interpolation), Hypertuning using Optuna, MLFlow and Data…☆70Sep 14, 2022Updated 3 years ago
- Many market analysts believe that predicting market’s stocks fluctuations is nearly impossible to achieve due to the number of variables …☆17Apr 29, 2019Updated 7 years ago
- Examine and discover LoongArch instructions☆25Jun 4, 2026Updated 2 weeks ago
- A pure Python DB replication engine for Django that supports SQLite and PostgreSQL.☆16Updated this week
- New Blockchain technology / Multi-Chain Interoperability Network that leverages virtualization and smart contracts to create a cross-chai…☆16Sep 14, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Service implementing some parts of OAuth 2.0 Token Exchange (https://www.rfc-editor.org/rfc/rfc8693.html)☆20Jun 8, 2026Updated last week
- Overlook is a MacOS-native remote console for GL.iNet GLKVM / Comet-style KVM devices.☆42Jun 7, 2026Updated last week
- Python app created with the purpose of speeding up and greatly facilitating the task of cleaning and adjusting Booru-style tags, aimed at…☆12Dec 2, 2023Updated 2 years ago
- Circuit-level PDP-11/34 emulator☆70Apr 8, 2026Updated 2 months ago
- 🚀 Piri is a high-performance Niri extension tool built with Rust. It leverages efficient Niri IPC interaction and a unified event distri…☆74Jun 5, 2026Updated 2 weeks ago
- 100.000 links, 50.000 artworks dataset. Includes source code that used to scrape data.☆10May 29, 2021Updated 5 years ago
- Must-know Cryptography concepts for web developers☆20Sep 1, 2025Updated 9 months ago
- ☆20May 15, 2020Updated 6 years ago
- Hyprworm is a custom window switcher for the Hyprland Wayland compositor. Built in C, it provides a fast and efficient way to switch betw…☆42Sep 15, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 3D Open World using threejs and ammojs☆16Mar 23, 2024Updated 2 years ago
- Make Weird Worlds: A real-time liminal CSG level editor and game engine. Optimised for mobile CPUs☆125Updated this week
- ☆72Feb 13, 2026Updated 4 months ago
- ☆14Apr 28, 2026Updated last month
- A multithreaded discrete event simulation library in C, using POSIX pthreads for parallelized trials and replications, stackful asymmetri…☆73Updated this week
- OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates th…☆12Sep 24, 2024Updated last year
- Rust-native GPU kernel authoring framework: write GPU compute kernels in Rust, compile to PTX. The Triton equivalent for the Rust ecosyst…☆33Jun 12, 2026Updated last week
- internal/cpu in Go ( add AVX512)☆28Aug 6, 2024Updated last year
- Verilog code of Loongson's GS132 core☆12Dec 19, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An unscientific benchmark of SQLite vs the file system (btrfs)☆58Feb 12, 2022Updated 4 years ago
- Electron ports for LoongArch☆18Feb 16, 2026Updated 4 months ago
- Dev workspace is an advanced context management system for Claude Code.☆39Jun 12, 2026Updated last week
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆60Feb 25, 2026Updated 3 months ago
- A simple MIPS CPU for BUAA CO course (and now NSCSCC).☆10May 15, 2021Updated 5 years ago
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …☆13Jul 28, 2025Updated 10 months ago
- ☆13Apr 10, 2026Updated 2 months ago
- RTL implementation of a ray-tracing GPU☆16Dec 18, 2012Updated 13 years ago
- ☆17Updated this week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆24May 26, 2023Updated 3 years ago
- ☆37Dec 30, 2025Updated 5 months ago
- Paper Tape is All You Need☆101Mar 30, 2026Updated 2 months ago
- AirLLM 70B inference with single 4GB GPU☆20Jun 27, 2025Updated 11 months ago
- A PDF unredactor.☆42Feb 7, 2026Updated 4 months ago
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆13Aug 16, 2023Updated 2 years ago
- BitTorrent DHT Protocol && DHT Spider,faster than shiyanhui/dht☆11Aug 30, 2023Updated 2 years ago