deepinfra / deepctlLinks

Command line tool for Deep Infra cloud ML inference service

☆32

Alternatives and similar repositories for deepctl

Users that are interested in deepctl are comparing it to the libraries listed below

Sorting:

mistralai / vllm-release
A high-throughput and memory-efficient inference and serving engine for LLMs
☆53Updated last year
QuixiAI / kraken
☆66Updated last year
4dh / GRDN
GRDN.AI app for garden optimization
☆70Updated last year
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 9 months ago
kyegomez / Andromeda
An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast
☆151Updated 11 months ago
log10-io / log10
Python client library for improving your LLM app accuracy
☆98Updated 5 months ago
jina-ai / jerboa
LLM finetuning
☆42Updated last year
parea-ai / parea-sdk-py
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
☆78Updated 5 months ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆162Updated last year
DeployQL / LintDB
Vector Database with support for late interaction and token level embeddings.
☆55Updated last month
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 7 months ago
wandb / programmer
☆57Updated last month
bigcode-project / jupytercoder
☆141Updated last year
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆111Updated 3 months ago
yoheinakajima / autofinetune
auto fine tune of models with synthetic data
☆76Updated last year
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆87Updated last month
fw-ai / cookbook
Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.
☆120Updated last week
discus-labs / discus
A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ
☆63Updated last year
Alignment-Lab-AI / AutoMaticAssistant
☆24Updated last year
tg1482 / priomptipy
A python implementation of priompt - a neat way of managing context from diverse sources for LLM applications.
☆112Updated 3 weeks ago
Preemo-Inc / text-generation-inference
☆199Updated last year
seanchatmangpt / dspygen
A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.
☆126Updated 9 months ago
cohere-ai / quick-start-connectors
This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…
☆151Updated 10 months ago
ibm-granite / granite-3.0-language-models
☆261Updated last month
sacha-ichbiah / outlines-mlx
A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX
☆55Updated last year
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆76Updated 9 months ago
Not-Diamond / RoRF
Routing on Random Forest (RoRF)
☆187Updated 10 months ago
QuixiAI / OpenChatML
☆157Updated last year