PrunaAI / awesome-ai-efficiencyLinks
A curated list of materials on AI efficiency
☆206Updated last month
Alternatives and similar repositories for awesome-ai-efficiency
Users that are interested in awesome-ai-efficiency are comparing it to the libraries listed below
Sorting:
- A Deep Research agent from scratch☆214Updated 8 months ago
- Courses on building, compressing, evaluating, and deploying efficient AI models.☆66Updated 2 months ago
- ☆87Updated last year
- ☆169Updated last year
- Learn to build and deploy local Visual Language Models for Edge AI☆374Updated 3 months ago
- ☆101Updated last year
- Model Activity Visualiser☆521Updated 10 months ago
- ☆80Updated 6 months ago
- Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)☆395Updated 2 months ago
- A simple tool that let's you explore different possible paths that an LLM might sample.☆201Updated 9 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆228Updated 3 months ago
- This repository provides a Python script to fetch and summarize research papers from arXiv using the free Gemini API☆258Updated 11 months ago
- A PyTorch implementation of the GPT-OSS-20B architecture. All components are coded from scratch: RoPE with YaRN, RMSNorm, SwiGLU with cla…☆204Updated 2 months ago
- ☆238Updated 2 months ago
- I learn about and explain quantization☆26Updated last year
- Inference, Fine Tuning and many more recipes with Gemma family of models☆279Updated 6 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated last year
- ☆170Updated last year
- Train LLM on Hugging Face infra☆67Updated 2 months ago
- A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes…☆366Updated this week
- A command-line interface tool for serving LLM using vLLM.☆468Updated 2 weeks ago
- ☆209Updated last year
- ☆181Updated 11 months ago
- ☆74Updated last year
- Fetch arxiv data to LLM-friendly text☆128Updated last week
- A CLI to estimate inference memory requirements for Hugging Face models, written in Python.☆683Updated this week
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆278Updated 2 months ago
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https…☆212Updated 3 months ago
- ☆415Updated 9 months ago
- A prompting library☆190Updated 7 months ago