Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs
☆27Dec 17, 2024Updated last year
Alternatives and similar repositories for nitro
Users that are interested in nitro are comparing it to the libraries listed below
Sorting:
- A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20.☆24Jul 18, 2025Updated 7 months ago
- ☆21Aug 14, 2024Updated last year
- Various LLM Benchmarks☆24Feb 20, 2026Updated last week
- ☆52May 19, 2025Updated 9 months ago
- Triton adapter for Ascend. Mirror of https://gitcode.com/ascend/triton-ascend☆110Updated this week
- (WIP) Parallel inference for black-forest-labs' FLUX model.☆19Nov 18, 2024Updated last year
- 使用 cutlass 仓库在 ada 架构上实现 fp8 的 flash attention☆79Aug 12, 2024Updated last year
- Text to audio with Tik-Tok Voices☆13Apr 6, 2023Updated 2 years ago
- Enhanced Explainable Neural Network☆10Dec 25, 2021Updated 4 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- vLLM performance dashboard☆42Apr 26, 2024Updated last year
- Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022☆15Apr 17, 2023Updated 2 years ago
- LangChain + LiteLLM that works☆50Sep 1, 2025Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆114Updated this week
- ☆11Jan 13, 2026Updated last month
- SODEN: A Scalable Continuous-Time Survival Model through Ordinary Differential Equation Networks☆14Mar 2, 2023Updated 3 years ago
- ☆10Aug 16, 2023Updated 2 years ago
- JSSP dataset for LLMs☆17May 29, 2025Updated 9 months ago
- solver for discrete Mixed Observable Markov Decision Processes☆11Oct 30, 2020Updated 5 years ago
- Multi-resource Dynamic Coordinated Planning of Flexible Distribution Network☆15Jun 11, 2024Updated last year
- ☆10Oct 26, 2022Updated 3 years ago
- ☆10Apr 15, 2022Updated 3 years ago
- ☆10Sep 29, 2024Updated last year
- Deep Generative Model (Torch)☆11Apr 19, 2016Updated 9 years ago
- BlockCIrculantRNN (LSTM and GRU) using TensorFlow☆14Oct 30, 2018Updated 7 years ago
- Adaptive and Robust Multi-Task Learning☆10May 19, 2024Updated last year
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo…☆13Nov 1, 2022Updated 3 years ago
- Comparing sequential forecasters via confidence sequences & e-processes☆11Oct 24, 2023Updated 2 years ago
- Repository for code samples.☆11Jul 8, 2016Updated 9 years ago
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?"(published in TMLR)☆12Jul 10, 2022Updated 3 years ago
- Temporal summarization framework☆10Dec 4, 2023Updated 2 years ago
- ☆12Feb 27, 2023Updated 3 years ago
- Home server set up☆13Oct 5, 2025Updated 4 months ago
- Lightweight framework for 3D rendering.☆11Jun 5, 2023Updated 2 years ago
- Shared repo supporting the App Center client apps.☆13Nov 17, 2017Updated 8 years ago
- Simple and powerful extension for searching web and viewing website content.☆11Apr 11, 2025Updated 10 months ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated 11 months ago
- SunFounder PiRobot Car☆11Jan 17, 2018Updated 8 years ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago