deepinfra / deepctl
Command line tool for Deep Infra cloud ML inference service
☆28Updated 7 months ago
Alternatives and similar repositories for deepctl:
Users that are interested in deepctl are comparing it to the libraries listed below
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆24Updated 2 months ago
- GRDN.AI app for garden optimization☆70Updated 11 months ago
- An LLM playground similar to the OpenAI API playground☆21Updated last year
- ☆30Updated last year
- Easy to use, High Performant Knowledge Distillation for LLMs☆40Updated 2 weeks ago
- ☆38Updated 10 months ago
- ☆109Updated last month
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t…☆29Updated 8 months ago
- ☆21Updated 7 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- ☆24Updated last year
- ☆41Updated 9 months ago
- auto fine tune of models with synthetic data☆74Updated 11 months ago
- Embed anything.☆28Updated 8 months ago
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- Mistral-7B finetuned for function calling☆15Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated last year
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆36Updated 8 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆22Updated 7 months ago
- ☆38Updated last year
- Demo of AI chatbot that predicts user message to generate response quickly.☆100Updated 11 months ago
- ☆31Updated last year
- Tools for formatting large language model prompts.☆12Updated last year
- ☆38Updated 4 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 9 months ago
- ☆65Updated 8 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆28Updated 3 weeks ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆70Updated last month
- ☆29Updated last month
- Embedding models from Jina AI☆57Updated last year