slashml / amd_inferenceLinks
Docker-based inference engine for AMD GPUs
☆230Updated last year
Alternatives and similar repositories for amd_inference
Users that are interested in amd_inference are comparing it to the libraries listed below
Sorting:
- Run and explore Llama models locally with minimal dependencies on CPU☆190Updated last year
- ☆199Updated 7 months ago
- This is a python implementation for stitching images.☆233Updated last year
- ☆164Updated 8 months ago
- Dead Simple LLM Abliteration☆243Updated 9 months ago
- A monitoring station for carnivorous flora.☆129Updated 6 months ago
- Generate Cool-Looking Mazes and Animations Illustrating the A* Pathfinding Algorithm☆175Updated 9 months ago
- ☆280Updated 5 months ago
- Documentation and code for Hack the MontyHome device for extended applications.☆233Updated 5 months ago
- TUI app- Give it a YouTube URL and you get a transcription with possible speaker identification and optional summary or translation, all …☆328Updated 2 weeks ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆222Updated 11 months ago
- ai for jq☆246Updated last year
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated 2 years ago
- ☆125Updated 6 months ago
- Sequential Logic☆114Updated 3 weeks ago
- A clock that let's you understand if you should have another cup of coffee. It calculates the half-life your caffeine level as you metabo…☆220Updated last year
- Examples and guides for using the VLM Run API☆297Updated this week
- This repo contains a new way to use bloom filters to do lossless video compression☆249Updated 6 months ago
- ☆191Updated last year
- ☆154Updated last year
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …☆360Updated 6 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆255Updated 2 years ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆238Updated 2 years ago
- Proof of thought : LLM-based reasoning using Z3 theorem proving with multiple backend support (SMT2 and JSON DSL)☆357Updated last month
- High-Performance Implementation of OpenAI's TikToken.☆461Updated 5 months ago
- vtc: Video Traffic Counter☆60Updated last year
- Ask GPT to run a command☆195Updated 2 months ago
- A hub for various industry-specific schemas to be used with VLMs.☆537Updated 6 months ago
- ☆96Updated last year
- browser plugin to send youtube, insta (all social videos) to local backend and process audio and video in all sorts of ways.☆248Updated 3 months ago