cognitivecomputations / runpod-vllm

☆15

Alternatives and similar repositories for runpod-vllm

Users who are interested in runpod-vllm are comparing it to the repositories listed below.

fsndzomga / baby_agi_dspy
a version of baby agi using dspy and typed predictors
☆17 Updated last year
g-aggarwal / mlx-hub
A python command-line tool to download & manage MLX AI models from Hugging Face.
☆18 Updated 10 months ago
Gogolian / babyagi-js-html
☆19 Updated 2 years ago
omkaark / agenata
Build Web Datasets with Ease
☆33 Updated last year
simonw / llm-embed-jina
Embedding models from Jina AI
☆61 Updated last year
AnswerDotAI / llm-ctx
Create an LLM XML context document from an llms.txt file
☆21 Updated 10 months ago
vintrocode / curation-buddy
Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…
☆22 Updated last year
omkaark / spotty
Simple orchestration for EC2 spot containers
☆19 Updated 9 months ago
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20 Updated 3 months ago
TextGeneratorio / text-generator.io
Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io
☆35 Updated this week
kyegomez / Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…
☆27 Updated 8 months ago
Arbaaz-Mahmood / cognosis-II
☆4 Updated 10 months ago
burningion / pydantic-video-editing-agent
☆18 Updated last week
fxnai / fxn
Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore
☆64 Updated last week
ivanfioravanti / autogram
Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.
☆82 Updated last year
Attunewise / GPT
OpenAI GPT hosted Agent Framework for Windows and MacOS
☆36 Updated last year
aigeek0x0 / radiantloom-email-assist-7b
Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…
☆14 Updated last year
Alignment-Lab-AI / AutoMaticAssistant
☆24 Updated last year
raidendotai / clone-interpreter
Code Interpreter Replica
☆24 Updated 2 years ago
modal-labs / cadre
🛠 Self-hosted, fast, and consistent remote configuration for apps.
☆15 Updated 2 years ago
cartesia-ai / dev-showcase
Developer showcase of projects built on Cartesia
☆17 Updated 10 months ago
jmanhype / dspy-self-discover-framework
Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…
☆62 Updated 11 months ago
simonw / llm-embed-onnx
Run embedding models using ONNX
☆34 Updated last year
highlight-ing / highlight-conversations
Capture your conversations with transcripts and intelligence
☆22 Updated 6 months ago
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66 Updated last year
homanp / nagato
🌸 The open framework for question answering fine-tuning LLMs on private data
☆69 Updated last year
nateraw / modal-examples
Apps that run on modal.com
☆12 Updated last week
simonw / llm-mlx-llama
Run Llama 2 using MLX on macOS
☆34 Updated last year
andreasjansson / AutoCog
☆40 Updated 2 months ago
chimezie / mlx-tuning-fork
Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.
☆42 Updated 3 weeks ago