AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVFP4 weights and keeps the entire decode path in FP8.
☆113Feb 15, 2026Updated 2 months ago
Alternatives and similar repositories for NVFP4-on-4090-vLLM
Users who are interested in NVFP4-on-4090-vLLM are comparing it to the libraries listed below.
- GANs for Time series analysis (Synthetic data generation, anomaly detection and interpolation), Hypertuning using Optuna, MLFlow and Data…☆70Sep 14, 2022Updated 3 years ago
- Many market analysts believe that predicting market’s stocks fluctuations is nearly impossible to achieve due to the number of variables …☆17Apr 29, 2019Updated 6 years ago
- Examine and discover LoongArch instructions☆23Updated this week
- A pure Python DB replication engine for Django that supports SQLite and PostgreSQL.☆16Updated this week
- New Blockchain technology / Multi-Chain Interoperability Network that leverages virtualization and smart contracts to create a cross-chai…☆17Sep 14, 2022Updated 3 years ago
- Python app created with the purpose of speeding up and greatly facilitating the task of cleaning and adjusting Booru-style tags, aimed at…☆12Dec 2, 2023Updated 2 years ago
- 100.000 links, 50.000 artworks dataset. Includes the source code used to scrape the data.☆10May 29, 2021Updated 4 years ago
- Must-know Cryptography concepts for web developers☆20Sep 1, 2025Updated 7 months ago
- Automated multi-account farming tool for Kite AI decentralized payment network with faucet claims, token staking, DEX swaps, daily quiz c…☆254Mar 13, 2026Updated last month
- ☆14Apr 1, 2026Updated 2 weeks ago
- OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates th…☆12Sep 24, 2024Updated last year
- Verilog code of Loongson's GS132 core☆12Dec 19, 2019Updated 6 years ago
- internal/cpu in Go (add AVX512)☆29Aug 6, 2024Updated last year
- An unscientific benchmark of SQLite vs the file system (btrfs)☆57Feb 12, 2022Updated 4 years ago
- Electron ports for LoongArch☆17Feb 16, 2026Updated 2 months ago
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser☆58Feb 25, 2026Updated last month
- A simple MIPS CPU for BUAA CO course (and now NSCSCC).☆10May 15, 2021Updated 4 years ago
- ☆13Apr 10, 2026Updated last week
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …☆13Jul 28, 2025Updated 8 months ago
- RTL implementation of a ray-tracing GPU☆15Dec 18, 2012Updated 13 years ago
- ☆17Updated this week
- ☆24May 26, 2023Updated 2 years ago
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR☆11Aug 16, 2023Updated 2 years ago
- AirLLM 70B inference with single 4GB GPU☆20Jun 27, 2025Updated 9 months ago
- BitTorrent DHT Protocol && DHT Spider, faster than shiyanhui/dht☆12Aug 30, 2023Updated 2 years ago
- ☆21Nov 8, 2025Updated 5 months ago
- Code snippets and reproductions from JustAByte☆41Apr 6, 2026Updated last week
- Lower chisel memories to SRAM macros☆13Mar 25, 2024Updated 2 years ago
- Awesome AI Benchmarks☆30Jan 16, 2026Updated 3 months ago
- Easy Images captioning under a good pyqt GUI☆21Jun 18, 2023Updated 2 years ago
- Hoddarla is an OS project in Golang targeting RISC-V 64-bit system.☆12Oct 28, 2021Updated 4 years ago
- AI voice assistant that uses Twilio Voice and ConversationRelay, and the Google Gemini API to engage in two-way conversations over a phon…☆26Feb 19, 2026Updated 2 months ago
- A mod that injects MGL and patches Minecraft to work with it.☆12Apr 10, 2024Updated 2 years ago
- Buildroot with customizations for building the OpenDingux root file system☆19Dec 11, 2012Updated 13 years ago
- xserver ddx driver for loongson's display controller and GPU☆11Mar 14, 2023Updated 3 years ago
- LWJGL is a Java library that enables cross-platform access to popular native APIs useful in the development of graphics (OpenGL, Vulkan),…☆10Jan 27, 2020Updated 6 years ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated last year
- SoC for muntjac☆13Jun 18, 2025Updated 10 months ago
- Bluespec H.264 Decoder☆12Jul 17, 2014Updated 11 years ago