NumexaHQ / frugal
⚡️ Transform AI/ML operations: Transparency, Control and Cost Optimization. ⚡️
☆22Updated last year
Alternatives and similar repositories for frugal
Users who are interested in frugal are comparing it to the libraries listed below
- Experiments to assess SPADE on different LLM pipelines.☆17Updated last year
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆47Updated this week
- Fine-tuning and serving LLMs on any cloud☆90Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference☆60Updated last month
- Fast-track AI apps to production with LLaMA 3, Mistral, and other top LLMs!☆19Updated 11 months ago
- Creating Generative AI Apps which work☆17Updated 2 months ago
- A specification for OpenInference, a semantic mapping of ML inferences☆47Updated last year
- ☆30Updated 7 months ago
- Open Weight, tool-calling LLMs☆152Updated 8 months ago
- Cedana: Access and run on compute anywhere in the world, on any provider. Migrate seamlessly between providers, arbitraging price/perform…☆58Updated last month
- ☆18Updated last year
- Software for developing sparse, performant, multitask artificial neural networks☆32Updated last year
- A new benchmark for measuring LLMs' capability to detect bugs in large codebases.☆30Updated last year
- AI Evaluation Platform☆46Updated last month
- Constrain LLM output☆112Updated 11 months ago
- ☆19Updated last month
- This repo contains a demo of adversarial strings poisoning a vector database and forcing specific hallucinations in a RAG chatbot.☆10Updated last year
- Routing on Random Forest (RoRF)☆173Updated 9 months ago
- VS Code extension to convert computationally intensive PyTorch kernels to Triton☆22Updated 8 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆23Updated 4 months ago
- ☆15Updated 2 months ago
- An example of a RAG backend plus UI☆51Updated 6 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆38Updated 3 weeks ago
- Sphynx Hallucination Induction☆54Updated 5 months ago
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia☆32Updated last year
- Python client library for improving your LLM app accuracy☆98Updated 4 months ago
- Foyle is a copilot to help developers deploy and operate their applications.☆130Updated 3 months ago
- Vector Database with support for late interaction and token level embeddings.☆55Updated last week
- ReLM is a Regular Expression engine for Language Models☆106Updated 2 years ago
- Finetune LLMs on K8s by using Runbooks☆170Updated 10 months ago