PrunaAI / awesome-ai-efficiencyLinks
A curated list of materials on AI efficiency
☆77Updated last week
Alternatives and similar repositories for awesome-ai-efficiency
Users that are interested in awesome-ai-efficiency are comparing it to the libraries listed below
Sorting:
- I learn about and explain quantization☆26Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆102Updated 7 months ago
- RAG example using DSPy, Gradio, FastAPI☆83Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 9 months ago
- chrome & firefox extension to chat with webpages: local llms☆123Updated 7 months ago
- Courses on building, compressing, evaluating, and deploying efficient AI models.☆24Updated this week
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆82Updated last year
- ☆75Updated 10 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆150Updated last month
- ☆125Updated 3 weeks ago
- Low memory full parameter finetuning of LLMs☆52Updated 3 weeks ago
- Arxflix turns your boring Arxiv research paper into a captivating video.☆52Updated last week
- Solving data for LLMs - Create quality synthetic datasets!☆150Updated 6 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 11 months ago
- This repository's goal is to precompile all past presentations of the Huggingface reading group☆48Updated 11 months ago
- An introduction to LLM Sampling☆79Updated 7 months ago
- ☆86Updated 10 months ago
- ☆134Updated 11 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated last month
- ☆66Updated last year
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆218Updated last week
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models☆74Updated 4 months ago
- ☆80Updated last year
- ☆19Updated last year
- Gradio UI for a Cog API☆69Updated last year
- ☆102Updated 11 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆83Updated 3 months ago
- Self-host LLMs with vLLM and BentoML☆140Updated last week
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆48Updated last year