deepinfra / deepctl
Command line tool for Deep Infra cloud ML inference service
☆29Updated 8 months ago
Alternatives and similar repositories for deepctl:
Users that are interested in deepctl are comparing it to the libraries listed below
- GRDN.AI app for garden optimization☆70Updated last year
- LLM finetuning☆42Updated last year
- ☆111Updated 2 months ago
- ☆20Updated last year
- Routing on Random Forest (RoRF)☆114Updated 4 months ago
- auto fine tune of models with synthetic data☆75Updated last year
- ☆65Updated 8 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆64Updated 3 months ago
- Falcon40B and 7B (Instruct) with streaming, top-k, and beam search☆40Updated last year
- ☆30Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆29Updated last month
- Easy to use, High Performant Knowledge Distillation for LLMs☆46Updated last month
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated 7 months ago
- ☆38Updated 11 months ago
- Chat Markup Language conversation library☆55Updated last year
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- A framework for evaluating function calls made by LLMs☆36Updated 6 months ago
- Embed anything.☆29Updated 8 months ago
- Very minimal (and stateless) agent framework☆41Updated last month
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 3 months ago
- ☆24Updated last year
- ☆38Updated 4 months ago
- Easily create LLM automation/agent workflows☆58Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated)☆23Updated last year
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆20Updated 3 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆89Updated 3 weeks ago
- Using modal.com to process FineWeb-edu data☆20Updated 2 months ago