EmbeddedLLM: API server for embedded device deployment. Currently supports CUDA/OpenVINO/IpexLLM/DirectML/CPU
☆52Oct 6, 2024Updated last year
Alternatives and similar repositories for embeddedllm
Users who are interested in embeddedllm are comparing it to the libraries listed below.
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆95Jun 5, 2026Updated last week
- A TypeScript library to communicate with the AdGuard Home API☆12Mar 21, 2022Updated 4 years ago
- LLM powered local Search Engine☆31Apr 30, 2026Updated last month
- Low-rank Highway Networks☆13Mar 11, 2016Updated 10 years ago
- ☆14Dec 24, 2024Updated last year
- ☆10Dec 25, 2023Updated 2 years ago
- Rust crate for some audio utilities☆28Mar 8, 2025Updated last year
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- A collection of mixins extending HTMLElement with properties☆16Jan 7, 2023Updated 3 years ago
- Android releases of Clubhouse App☆14Apr 9, 2021Updated 5 years ago
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- CLI that ingests data from a CSV file, transforms it, and updates a Notion database with it. Built using the Notion JS SDK and TypeScript.☆13May 29, 2026Updated 2 weeks ago
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and is inexpensive to use. Specific…☆86Sep 13, 2024Updated last year
- Quickly get custom prompt contexts☆14May 29, 2026Updated 2 weeks ago
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆22Mar 28, 2026Updated 2 months ago
- Efficiently Composable Data Augmentation on the GPU with Jax☆42May 16, 2025Updated last year
- Convert Claude to a ChatGPT-format API through Slack☆15Jun 7, 2023Updated 3 years ago
- ☆16Feb 1, 2025Updated last year
- Chat4GPT Experiments for Security☆11Mar 27, 2023Updated 3 years ago
- A platform for quickly building a Python ChatGPT Telegram bot.☆12Dec 27, 2022Updated 3 years ago
- NVIDIA TensorRT Hackathon 2023 semifinal topic: building and optimizing the Tongyi Qianwen Qwen-7B model with TensorRT-LLM☆43Oct 20, 2023Updated 2 years ago
- ☆15Jun 9, 2023Updated 3 years ago
- The command line interface for windows, mac and linux to run HYCHAIN's guardian node software. Guardian node keys can be purchased at htt…☆11Apr 2, 2024Updated 2 years ago
- German "Who Wants To Be A Millionaire" LLM Benchmarking.☆50Jun 4, 2026Updated last week
- A cross-platform, local-first, open-source alternative to AI recall apps (Windows Recall, rewind.ai)☆15Jul 11, 2024Updated last year
- THIS PROJECT HAS MOVED! Check the link or README!☆14May 9, 2025Updated last year
- AI Assistant☆20Feb 21, 2026Updated 3 months ago
- This is AutoGenDemo☆11Mar 12, 2024Updated 2 years ago
- Optimizing diffusion for production-ready speeds☆40Jan 10, 2026Updated 5 months ago
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆59Feb 23, 2026Updated 3 months ago
- This repository demonstrates how to leverage OpenAI's GPT-4 models with JSON Strict Mode to extract structured data from web pages. It c…☆20Aug 14, 2024Updated last year
- A collection of high-performance, modular utilities for enhancing testing, transactional consistency, efficiency, security and stability …☆28Apr 6, 2026Updated 2 months ago
- ☆17Jan 2, 2026Updated 5 months ago
- Simple Prompt Plugin is a plugin for Obsidian that allows you to generate content in your notes using LLMs.☆14Jun 16, 2024Updated last year
- In-browser semantic search demo using EmbeddingGemma and Transformers.js. No server required.☆36Sep 7, 2025Updated 9 months ago
- 📲 An agent for sourcing, curating, and scheduling social media posts with human-in-the-loop.☆12Apr 18, 2025Updated last year
- Ops files for https://github.com/meta-llama/llama-stack☆17Jun 28, 2025Updated 11 months ago
- An ontology of imaging and related techniques and technologies, image processing and analysis, image data and formats, within bio- and ot…☆12May 4, 2026Updated last month
- Main color theme compatible with "Xi" wiki markup language☆23May 28, 2026Updated 2 weeks ago