☆50Oct 10, 2023Updated 2 years ago
Alternatives and similar repositories for exllama-runpod-serverless
Users that are interested in exllama-runpod-serverless are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Aug 18, 2023Updated 2 years ago
- ☆54Jun 11, 2023Updated 2 years ago
- TheBloke's Dockerfiles☆308Mar 8, 2024Updated 2 years ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆410Mar 18, 2026Updated last week
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆60Jun 11, 2025Updated 9 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Starting point to build your own custom serverless endpoint☆133May 9, 2025Updated 10 months ago
- 🖼️ | RunPod worker for all Stable Diffusion v1 endpoints.☆19May 8, 2025Updated 10 months ago
- A curated list of amazing RunPod projects, libraries, and resources☆127Aug 20, 2024Updated last year
- RunPod Serverless Worker for Real-ESRGAN Restoration and Upscaling☆10Feb 13, 2026Updated last month
- ☆18Apr 3, 2023Updated 2 years ago
- A guidance compatibility layer for llama-cpp-python☆36Sep 11, 2023Updated 2 years ago
- This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function call…☆17Apr 7, 2024Updated last year
- Next Auth IORedisAdapter example☆20Feb 21, 2023Updated 3 years ago
- Lennard-Jones Molecular Dynamics for beginners☆15Sep 20, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆16Feb 15, 2023Updated 3 years ago
- Opinionated Langchain setup with Qdrant vector store and Kong gateway☆32Apr 7, 2023Updated 2 years ago
- ☆22Jul 25, 2023Updated 2 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- An application that helps you summarize your meetings in real time using OpenAI's ChatGPT APIs.☆12Mar 14, 2023Updated 3 years ago
- ☆16Jul 12, 2024Updated last year
- An MCP tool server that provides stateful, TUI-compatible terminal sessions.☆14Feb 3, 2025Updated last year
- ☆14Nov 13, 2023Updated 2 years ago
- sync put.io to a local directory☆19Apr 3, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Nov 27, 2023Updated 2 years ago
- This repository contains the code snippets used in "LLM Prompt Engineering For Developers"☆12Apr 22, 2024Updated last year
- A WebSocket server implementation for running Model Context Protocol (MCP) servers. This application enables MCP servers to be accessed v…☆20Mar 17, 2025Updated last year
- ☆27Jul 25, 2023Updated 2 years ago
- A library of "Micro Agents" that make it easy to add reliable intelligence to any application.☆13Sep 21, 2024Updated last year
- 对话集提取器是一个基于chatglm模型的工具,用于从文本中提取对话集。该工具可以帮助用户从小说、剧本等文本中自动提取出对话,以便进行分析、标注或其他应用。☆12Nov 22, 2024Updated last year
- Iowa House Prices Kaggle (top 5%)☆15Jun 17, 2024Updated last year
- LMQL implementation of tree of thoughts☆36Jan 31, 2024Updated 2 years ago
- Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS☆16Apr 24, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Pytorch directly integrated to the cloud all through Bench AI!☆10Dec 10, 2023Updated 2 years ago
- Source code and input files associated to the paper "Targeted free energy perturbation revisited: Accurate free energies from mapped refe…☆13Sep 14, 2021Updated 4 years ago
- Frontend (and soon also midleware and backend) for a new, opensource image generation platform.☆14Nov 5, 2022Updated 3 years ago
- An efficient LinkedIn Messaging Bot that Messages a group of interests, YouTube link in Documentation for guidance.☆12Aug 25, 2021Updated 4 years ago
- Nim shell scripting library☆11Mar 27, 2019Updated 7 years ago
- agenty☆44Feb 15, 2025Updated last year
- 2D LBM channel flow simulation with particle interaction.☆12Apr 16, 2022Updated 3 years ago