AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVFP4 weights and keeps the entire decode path in FP8
☆101 · Feb 15, 2026 · Updated last month
Alternatives and similar repositories for NVFP4-on-4090-vLLM
Users that are interested in NVFP4-on-4090-vLLM are comparing it to the libraries listed below.
- GANs for Time series analysis (Synthetic data generation, anomaly detection and interpolation), Hypertuning using Optuna, MLFlow and Data… ☆70 · Sep 14, 2022 · Updated 3 years ago
- Many market analysts believe that predicting market’s stocks fluctuations is nearly impossible to achieve due to the number of variables … ☆17 · Apr 29, 2019 · Updated 6 years ago
- New Blockchain technology / Multi-Chain Interoperability Network that leverages virtualization and smart contracts to create a cross-chai… ☆17 · Sep 14, 2022 · Updated 3 years ago
- Python app created with the purpose of speeding up and greatly facilitating the task of cleaning and adjusting Booru-style tags, aimed at… ☆12 · Dec 2, 2023 · Updated 2 years ago
- 100.000 links, 50.000 artworks dataset. Includes source code that used to scrape data. ☆10 · May 29, 2021 · Updated 4 years ago
- Automated multi-account farming tool for Kite AI decentralized payment network with faucet claims, token staking, DEX swaps, daily quiz c… ☆253 · Mar 13, 2026 · Updated 2 weeks ago
- Code snippets and reproductions from JustAByte ☆28 · Jan 25, 2026 · Updated 2 months ago
- OhanashiGPT is an application that generates personalized children's stories based on parameters like age and preferences. It narrates th… ☆12 · Sep 24, 2024 · Updated last year
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser ☆57 · Feb 25, 2026 · Updated last month
- ☆13 · Updated this week
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of … ☆13 · Jul 28, 2025 · Updated 8 months ago
- ☆24 · May 26, 2023 · Updated 2 years ago
- Awesome AI Benchmarks ☆27 · Jan 16, 2026 · Updated 2 months ago
- A list of articles outside of the official MLIR docs that I've found useful for learning MLIR ☆11 · Aug 16, 2023 · Updated 2 years ago
- AirLLM 70B inference with single 4GB GPU ☆20 · Jun 27, 2025 · Updated 9 months ago
- Converting text-LMs into Visual Language Models ☆56 · Jan 31, 2026 · Updated last month
- Easy Images captioning under a good pyqt GUI ☆21 · Jun 18, 2023 · Updated 2 years ago
- AI voice assistant that uses Twilio Voice and ConversationRelay, and the Google Gemini API to engage in two-way conversations over a phon… ☆25 · Feb 19, 2026 · Updated last month
- A mod that injects MGL and patches Minecraft to work with it. ☆12 · Apr 10, 2024 · Updated last year
- Mojo Miji | A guide to Mojo programming language from a Pythonista's perspective | Mojo 秘籍 ☆26 · Mar 7, 2026 · Updated 3 weeks ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str… ☆14 · Mar 12, 2025 · Updated last year
- Exploring how optimizations for GEMMs work ☆29 · Feb 28, 2026 · Updated last month
- The official implementation of "YOND: Practical Blind Raw Image Denoising Free from Camera-Specific Data Dependency" ☆16 · Sep 4, 2025 · Updated 6 months ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices. ☆19 · Jan 10, 2025 · Updated last year
- ECMAScript AST query library. ☆12 · Jul 7, 2020 · Updated 5 years ago
- ☆23 · Dec 8, 2025 · Updated 3 months ago
- Pytorch implementation of the paper: Zero-Reference Deep Curve Estimation for Low-Light Image Enhancement. ☆10 · Oct 17, 2020 · Updated 5 years ago
- Uses AI to pick stocks. ☆12 · Dec 30, 2024 · Updated last year
- “There is no such thing as a moral or an immoral book. Books are well written, or badly written.” I want to find all the well written con… ☆20 · Nov 6, 2024 · Updated last year
- MatIR: A Hybrid Mamba-Transformer Image Restoration Model ☆20 · Feb 6, 2025 · Updated last year
- [ICCV 2023] Efficient Unified Demosaicing for Bayer and Non-Bayer Patterned Image Sensors ☆13 · Oct 12, 2023 · Updated 2 years ago
- ☆16 · May 14, 2025 · Updated 10 months ago
- ☆11 · Nov 10, 2024 · Updated last year
- ☆17 · Dec 16, 2024 · Updated last year
- A multi-step reasoning pipeline powered by the Datarus-R1-14B-Preview model ☆224 · Aug 21, 2025 · Updated 7 months ago
- MVC fastify decorator Dependency injection Inversion of Control Typescript ☆11 · Jan 5, 2023 · Updated 3 years ago
- The nginx module to invalidate complete cache zone ☆11 · Jul 1, 2020 · Updated 5 years ago
- ☆27 · Mar 30, 2023 · Updated 3 years ago
- [ACCV 2024] Official code of our paper "Joint Image Super-resolution and Low-light Enhancement in the Dark" ☆14 · Jun 23, 2025 · Updated 9 months ago