slashml / amd_inferenceLinks
Docker-based inference engine for AMD GPUs
☆230Updated last year
Alternatives and similar repositories for amd_inference
Users that are interested in amd_inference are comparing it to the libraries listed below
Sorting:
- Run and explore Llama models locally with minimal dependencies on CPU☆190Updated last year
- ☆198Updated 6 months ago
- ☆161Updated 7 months ago
- Dead Simple LLM Abliteration☆235Updated 8 months ago
- ☆154Updated last year
- A monitoring station for carnivorous flora.☆129Updated 6 months ago
- TUI app- Give it a YouTube URL and you get a transcription with possible speaker identification and optional summary or translation, all …☆324Updated this week
- High-Performance Implementation of OpenAI's TikToken.☆460Updated 4 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆222Updated 10 months ago
- Examples and guides for using the VLM Run API☆297Updated last month
- Documentation and code for Hack the MontyHome device for extended applications.☆233Updated 4 months ago
- Generate Cool-Looking Mazes and Animations Illustrating the A* Pathfinding Algorithm☆177Updated 8 months ago
- ☆126Updated 5 months ago
- This is a python implementation for stitching images.☆233Updated last year
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …☆360Updated 5 months ago
- browser plugin to send youtube, insta (all social videos) to local backend and process audio and video in all sorts of ways.☆247Updated 2 months ago
- vtc: Video Traffic Counter☆60Updated 11 months ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated last year
- A clock that let's you understand if you should have another cup of coffee. It calculates the half-life your caffeine level as you metabo…☆220Updated last year
- Code sample showing how to run and benchmark models on Qualcomm's Window PCs☆103Updated last year
- ai for jq☆245Updated last year
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆254Updated last year
- Sequential Logic☆114Updated last week
- ☆281Updated 5 months ago
- This repo contains a new way to use bloom filters to do lossless video compression☆250Updated 5 months ago
- Ask GPT to run a command☆195Updated 2 months ago
- ☆96Updated last year
- This Windows Forms application in C# allows users to download YouTube videos as MP3 files. Open Source.☆119Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime.☆237Updated 2 years ago
- Optimally allocate poker chips using constrained, nonlinear optimization☆174Updated 11 months ago