QuixiAI / runpod-vllmLinks
☆15Updated last year
Alternatives and similar repositories for runpod-vllm
Users that are interested in runpod-vllm are comparing it to the libraries listed below
Sorting:
- Embedding models from Jina AI☆65Updated last year
- a version of baby agi using dspy and typed predictors☆17Updated last year
- Simple orchestration for EC2 spot containers☆19Updated last year
- 🌸 The open framework for question answering fine-tuning LLMs on private data☆69Updated 2 years ago
- A proxy for minimax-m2, enabling interleaved thinking, and tool calls.☆30Updated last week
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 7 months ago
- ☆19Updated 2 years ago
- ☆47Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆71Updated 2 weeks ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆19Updated last year
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆82Updated last year
- ☆38Updated last year
- Vanilla-Python ergonomics on top of DSPy☆38Updated 5 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated last year
- The Swarm Ecosystem☆26Updated last year
- ☆17Updated 4 months ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆135Updated 5 months ago
- Build Web Datasets with Ease☆33Updated last year
- converts url content into JSON with a simple prefix☆71Updated last year
- ☆35Updated 3 months ago
- A couple scripts to grab stats from email☆43Updated last year
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆47Updated last month
- Code Interpreter Replica☆25Updated 2 years ago
- Apps that run on modal.com☆12Updated 2 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆51Updated last year
- ☆42Updated last year
- A function to do all☆35Updated last year