premAI-io / prem-operator
Deploy AI models and apps to Kubernetes without developing a hernia
☆23 Updated 5 months ago
Related projects
Alternatives and complementary repositories for prem-operator
- Quickly and securely turn any Linux box into a build and deployment assistant ☆25 Updated last month
- Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆36 Updated 9 months ago
- ☆116 Updated this week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆59 Updated last week
- ☆31 Updated 2 weeks ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing. ☆27 Updated this week
- ☆25 Updated 2 months ago
- Python module that creates a context map for AI code generation ☆14 Updated 2 months ago
- ☆64 Updated 5 months ago
- A collection of pre-built wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and others! ☆20 Updated this week
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated 5 months ago
- Embed anything. ☆29 Updated 5 months ago
- Self-host LLMs with vLLM and BentoML ☆72 Updated last week
- Estimate Your LLM's Token Toll Across Various Platforms and Configurations ☆27 Updated 3 months ago
- Simple examples using Argilla tools to build AI ☆38 Updated last week
- Using modal.com to process FineWeb-edu data ☆19 Updated 2 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆41 Updated last month
- ☆43 Updated 3 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX. ☆53 Updated this week
- The official Python library for Formulaic ☆14 Updated 6 months ago
- A QT GUI for large language models ☆24 Updated 10 months ago
- Conduct in-depth research with AI-driven insights: DeepDive is a command-line tool that leverages web searches and AI models to generate… ☆36 Updated 2 months ago
- Routing on Random Forest (RoRF) ☆83 Updated last month
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆23 Updated last week
- Distributed Inference for MLX LLM ☆69 Updated 3 months ago
- One Line To Build Zero-Data Classifiers in Minutes ☆30 Updated last month
- Data preparation code for CrystalCoder 7B LLM ☆42 Updated 6 months ago
- ☆12 Updated last month
- ☆40 Updated 6 months ago
- ☆31 Updated 4 months ago