eliranwong / MultiAMDGPU_AIDev_Ubuntu
Multi AMD GPU Setup for AI Development on Ubuntu with ROCM
☆24 Updated this week
Alternatives and similar repositories for MultiAMDGPU_AIDev_Ubuntu:
Users who are interested in MultiAMDGPU_AIDev_Ubuntu are comparing it to the repositories listed below:
- Fully-featured, beautiful web interface for vLLM - built with NextJS. ☆110 Updated 6 months ago
- ☆77 Updated 2 months ago
- EmbeddedLLM: API server for Embedded Device Deployment. Currently supports CUDA/OpenVINO/IpexLLM/DirectML/CPU. ☆32 Updated 4 months ago
- Testing LLM reasoning abilities with family relationship quizzes. ☆58 Updated 3 weeks ago
- Distributed inference for MLX LLMs ☆82 Updated 6 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆88 Updated this week
- Entropix-style sampling + GUI ☆25 Updated 3 months ago
- Port of Facebook's LLaMA model in C/C++ ☆20 Updated last year
- ☆65 Updated 8 months ago
- ☆14 Updated 5 months ago
- Synthify: Seamlessly generate AI datasets with a no-code UI | https://synthify.toolstack.run ☆48 Updated 2 weeks ago
- An OpenAI-compatible API for chatting with image input and asking questions about the images (i.e., multimodal). ☆226 Updated 2 months ago
- Automatically quantize GGUF models ☆155 Updated this week
- Conduct in-depth research with AI-driven insights: DeepDive is a command-line tool that leverages web searches and AI models to generate… ☆36 Updated 5 months ago
- LLM inference in C/C++ ☆14 Updated this week
- LLM as Interpreter for Natural Language Programming, Pseudo-code Programming and Flow Programming of AI Agents ☆35 Updated 6 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆53 Updated this week
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip… ☆58 Updated 7 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆26 Updated last month
- Implementation of nougat that focuses on processing PDFs locally. ☆79 Updated last month
- A pipeline parallel training script for LLMs. ☆124 Updated 3 weeks ago
- ☆28 Updated 4 months ago
- instinct.cpp provides ready-to-use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,… ☆42 Updated 7 months ago
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a… ☆42 Updated 4 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C ☆46 Updated last week
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs. ☆80 Updated 2 weeks ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a… ☆110 Updated 7 months ago
- Very minimal (and stateless) agent framework ☆41 Updated last month
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆89 Updated last month
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆130 Updated this week