High-efficiency LLM inference engine in C++/CUDA. Run Llama 70B on RTX 3090.
☆457Feb 22, 2026Updated 3 months ago
Alternatives and similar repositories for ntransformer
Users that are interested in ntransformer are comparing it to the libraries listed below.
- ☆19Jul 12, 2025Updated 10 months ago
- Experiments with the Mojo 🔥 programming language on macOS arm64 guided by tests☆13Jan 8, 2026Updated 4 months ago
- DeepDream for video with temporal consistency. Features RAFT optical flow estimation and occlusion masking to prevent ghosting. A PyTorch…☆63Jan 2, 2026Updated 4 months ago
- Docker deployment setup for QwenPaw (formerly the CoPaw project), with one-command build and run and a smaller image than the official one.☆52May 19, 2026Updated last week
- a5eq.lv2 - Another 5-Band Equalizer☆34Mar 26, 2025Updated last year
- Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and in…☆18Nov 11, 2024Updated last year
- MoonLang is a lightweight static programming language built with C++ and LLVM, featuring dual syntax styles (`: end` and `{ }`). Supports…☆38Feb 23, 2026Updated 3 months ago
- ☆125May 17, 2026Updated last week
- A lightweight, feature-rich dock for Linux written in Python with GTK 3 and Cairo☆55May 22, 2026Updated last week
- GLOVE (GL Over Vulkan) is a software library that acts as an intermediate layer between an OpenGL application and Vulkan☆17Sep 7, 2018Updated 7 years ago
- Server for Matching Long/Lat to Timezone☆47Feb 21, 2026Updated 3 months ago
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- Tokenflood is a load testing framework for simulating arbitrary loads on instruction-tuned LLMs☆45May 18, 2026Updated last week
- ... because printf doesn't show the binary representation of a number☆17Mar 29, 2017Updated 9 years ago
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆211Sep 12, 2024Updated last year
- new optimizer☆20Aug 4, 2024Updated last year
- A crate built on top of `axum-sessions`, implementing the CSRF Synchronizer Token Pattern☆15Updated this week
- Implementations of the Are-we-fast-yet benchmark suite in Oberon, C++, C, Pascal, Micron and Luon☆70May 17, 2026Updated last week
- Web client for 柠檬网盘 (Lemon Netdisk)☆11Sep 14, 2025Updated 8 months ago
- Docker-based inference engine for AMD GPUs☆233Oct 7, 2024Updated last year
- A webhook that integrates the W&B model registry with Modal Labs☆15Dec 24, 2023Updated 2 years ago
- RISC-V emulator in Rust that boots Linux with JIT on ARM64/x86_64 and Sv39 virtual memory☆102Apr 7, 2026Updated last month
- A straightforward zero-dependency in-memory database☆16Sep 1, 2016Updated 9 years ago
- ☆11Oct 31, 2016Updated 9 years ago
- The typed graph between your code and whichever warehouse, table format, or query engine you've chosen — typed compiler, branches, replay…☆257May 23, 2026Updated last week
- 15-721 Spring 2024 - Cache #1☆12May 2, 2024Updated 2 years ago
- Converts between country names, ISO 3166-1 codes, and Unicode flag emojis.☆17Updated this week
- A hassle-free utility to encrypt error handling strings in public binaries to protect business logic☆26Apr 29, 2022Updated 4 years ago
- Simple tool to communicate between kernel mode kext and user mode daemon on Mac OS X☆13Nov 2, 2016Updated 9 years ago
- ☆16Apr 20, 2024Updated 2 years ago
- "Hello Boing" - executable raytraced graphics in m68k assembly for Commodore Amiga.☆10Sep 21, 2015Updated 10 years ago
- dotfiles: karabiner, starship, tmux, skhd, yabai, alacritty☆18May 10, 2026Updated 2 weeks ago
- 💭 Chat with AI via API☆33Oct 20, 2024Updated last year
- Use Civet in any project.☆15Jun 20, 2024Updated last year
- ☆47Jun 11, 2025Updated 11 months ago
- Open-source CUDA, Triton and HIP compiler targeting multiple GPU architectures.☆1,687Updated this week
- https://ansi.tools☆35May 17, 2026Updated last week
- A simple app for downloading YouTube Shorts transcripts. Built to self-host with Python and Streamlit. Free and open source.☆32Dec 4, 2024Updated last year
- ☆82Mar 21, 2026Updated 2 months ago