NumexaHQ / frugal

⚡️ Transform AI/ML operations: Transparency, Control and Cost Optimization. ⚡️

☆22

Alternatives and similar repositories for frugal

Users who are interested in frugal are comparing it to the libraries listed below.

shreyashankar / spade-experiments
Experiments to assess SPADE on different LLM pipelines.
☆17 Updated last year
foundation-model-stack / fms-hf-tuning
🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.
☆47 Updated this week
Trainy-ai / llm-atc
Fine-tuning and serving LLMs on any cloud
☆90 Updated last year
IBM / text-generation-inference
IBM development fork of https://github.com/huggingface/text-generation-inference
☆60 Updated last month
jjleng / paka
Fast-track AI apps to production with LLaMA 3, Mistral, and other top LLMs!
☆19 Updated 11 months ago
axiomic-ai / axiomic
Creating Generative AI Apps which work
☆17 Updated 2 months ago
Arize-ai / open-inference-spec
A specification for OpenInference, a semantic mapping of ML inferences
☆47 Updated last year
kevinwu23 / StanfordFineTuneBench
☆30 Updated 7 months ago
rubra-ai / rubra
Open Weight, tool-calling LLMs
☆152 Updated 8 months ago
cedana / cedana-cli
Cedana: Access and run on compute anywhere in the world, on any provider. Migrate seamlessly between providers, arbitraging price/perform…
☆58 Updated last month
tom-doerr / awesome-dspy
☆18 Updated last year
Beyond-ML-Labs / BeyondML
Software for developing sparse, performant, multitask artificial neural networks
☆32 Updated last year
HammingHQ / bug-in-the-code-stack
A new benchmark for measuring LLMs' ability to detect bugs in large codebases.
☆30 Updated last year
zeno-ml / zeno-hub
AI Evaluation Platform
☆46 Updated last month
amoffat / HeimdaLLM
Constrain LLM output
☆112 Updated 11 months ago
FalkorDB / code-graph-backend
☆19 Updated last month
SDharashivkar / TrojanVectors
This repo contains a demo of adversarial strings poisoning a vector database and forcing specific hallucinations on a RAG chatbot.
☆10 Updated last year
Not-Diamond / RoRF
Routing on Random Forest (RoRF)
☆173 Updated 9 months ago
proxis-dev / vscode-triton
VS Code extension to convert computationally intensive PyTorch kernels to Triton
☆22 Updated 8 months ago
runpod-workers / worker-sglang
SGLang is a fast serving framework for large language models and vision language models.
☆23 Updated 4 months ago
Snowflake-Labs / vllm
☆15 Updated 2 months ago
deepset-ai / haystack-rag-app
An example of a RAG backend plus UI
☆51 Updated 6 months ago
invariantlabs-ai / explorer
A better way of testing, inspecting, and analyzing AI Agent traces.
☆38 Updated 3 weeks ago
haizelabs / sphynx
Sphynx Hallucination Induction
☆54 Updated 5 months ago
prem-research / prem-operator
📡 Deploy AI models and apps to Kubernetes without developing a hernia
☆32 Updated last year
log10-io / log10
Python client library for improving your LLM app accuracy
☆98 Updated 4 months ago
jlewi / foyle
Foyle is a copilot to help developers deploy and operate their applications.
☆130 Updated 3 months ago
DeployQL / LintDB
Vector Database with support for late interaction and token level embeddings.
☆55 Updated last week
mkuchnik / relm
ReLM is a Regular Expression engine for Language Models
☆106 Updated 2 years ago
substratusai / runbooks
Fine-tune LLMs on K8s by using Runbooks
☆170 Updated 10 months ago