slashml / amd_inferenceLinks
Docker-based inference engine for AMD GPUs
☆230Updated 11 months ago
Alternatives and similar repositories for amd_inference
Users that are interested in amd_inference are comparing it to the libraries listed below
Sorting:
- Run and explore Llama models locally with minimal dependencies on CPU☆189Updated 11 months ago
- ☆196Updated 5 months ago
- This is a python implementation for stitching images.☆233Updated last year
- Dead Simple LLM Abliteration☆232Updated 7 months ago
- ☆163Updated 6 months ago
- ☆189Updated last year
- Documentation and code for Hack the MontyHome device for extended applications.☆232Updated 3 months ago
- A monitoring station for carnivorous flora.☆130Updated 4 months ago
- High-Performance Implementation of OpenAI's TikToken.☆455Updated 3 months ago
- Generate Cool-Looking Mazes and Animations Illustrating the A* Pathfinding Algorithm☆177Updated 7 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆219Updated 9 months ago
- A clock that let's you understand if you should have another cup of coffee. It calculates the half-life your caffeine level as you metabo…☆220Updated 11 months ago
- ☆280Updated 3 months ago
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …☆360Updated 4 months ago
- fractal-structure inspired, parent-children orbiting, zooming-elements based interactive graph visualization user interface☆130Updated 7 months ago
- TUI app- Give it a YouTube URL and you get a transcription with possible speaker identification and optional summary or translation, all …☆319Updated 6 months ago
- Examples and guides for using the VLM Run API☆293Updated 2 months ago
- Homemade automated solar concentrator 🔧 ☀️ 🔎☆356Updated last year
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆104Updated last year
- Optimally allocate poker chips using constrained, nonlinear optimization☆174Updated 9 months ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆238Updated 2 years ago
- ☆125Updated 4 months ago
- ☆155Updated 11 months ago
- ai for jq☆244Updated last year
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆253Updated last year
- This repo contains a new way to use bloom filters to do lossless video compression☆251Updated 4 months ago
- Sequential Logic☆112Updated last month
- A hub for various industry-specific schemas to be used with VLMs.☆534Updated 4 months ago
- browser plugin to send youtube, insta (all social videos) to local backend and process audio and video in all sorts of ways.☆241Updated last month
- vtc: Video Traffic Counter☆60Updated 10 months ago