QuixiAI / runpod-vllmLinks
☆15Updated 2 years ago
Alternatives and similar repositories for runpod-vllm
Users that are interested in runpod-vllm are comparing it to the libraries listed below
Sorting:
- Using modal.com to process FineWeb-edu data☆20Updated 10 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated last year
- ☆38Updated last year
- Embedding models from Jina AI☆65Updated 2 years ago
- powerful and fast tool calling agents☆80Updated 10 months ago
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆81Updated 2 years ago
- A proxy for minimax-m2, enabling interleaved thinking, and tool calls.☆38Updated 2 months ago
- Vanilla-Python ergonomics on top of DSPy☆39Updated 8 months ago
- 🌸 The open framework for question answering fine-tuning LLMs on private data☆69Updated 2 years ago
- Apps that run on modal.com☆12Updated 4 months ago
- Code Interpreter Replica☆26Updated 2 years ago
- Simple orchestration for EC2 spot containers☆19Updated last year
- ☆19Updated 2 years ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆73Updated 3 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆19Updated last year
- Build Web Datasets with Ease☆33Updated last year
- The Swarm Ecosystem☆26Updated last year
- a version of baby agi using dspy and typed predictors☆16Updated last year
- An introduction to DSPy☆33Updated 5 months ago
- Run AI models anywhere. https://muna.ai/explore☆83Updated last week
- 🛠 Self-hosted, fast, and consistent remote configuration for apps.☆17Updated 3 years ago
- Developer showcase of projects built on Cartesia☆20Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆50Updated 3 months ago
- ☆17Updated 7 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆53Updated last year
- A couple scripts to grab stats from email☆43Updated last year
- Run large models from the terminal using Apple MLX.☆31Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago