NVIDIA / GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
☆2,451 Updated this week
Related projects
Alternatives and complementary repositories for GenerativeAIExamples
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆4,653 Updated this week
- PyTorch native finetuning library ☆4,336 Updated this week
- Open-source AI cookbook ☆1,684 Updated this week
- ☆1,271 Updated 2 weeks ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,057 Updated 2 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆2,830 Updated this week
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality! ☆3,256 Updated 3 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆3,322 Updated this week
- ☆2,746 Updated 2 months ago
- This is a Phi-3 book for getting started with Phi-3. Phi-3 is a family of open-source AI models developed by Microsoft. Phi-3 models are t… ☆2,487 Updated last week
- All things prompt engineering ☆5,424 Updated 5 months ago
- Agentic components of the Llama Stack APIs ☆3,894 Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆10,734 Updated last week
- Parse files for optimal RAG ☆3,173 Updated last week
- The easiest way to use Agentic RAG in any enterprise ☆3,866 Updated this week
- High-quality datasets, tools, and concepts for LLM fine-tuning. ☆2,010 Updated 3 weeks ago
- Composable building blocks to build Llama Apps ☆4,594 Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,383 Updated this week
- Tools for merging pretrained large language models. ☆4,816 Updated 2 weeks ago
- Supercharge Your LLM Application Evaluations 🚀 ☆7,261 Updated this week
- The LLM Evaluation Framework ☆3,696 Updated this week
- Build large language model (LLM) apps with Python, ChatGPT and other models. This is the companion repository for the book on generative … ☆637 Updated 2 months ago
- ☆892 Updated last month
- nanoGPT style version of Llama 3.1 ☆1,246 Updated 3 months ago
- A blazing fast inference solution for text embeddings models ☆2,846 Updated 2 weeks ago
- SGLang is a fast serving framework for large language models and vision language models. ☆6,127 Updated this week
- A comprehensive guide to building RAG-based LLM applications for production. ☆1,714 Updated 3 months ago
- LLM Finetuning with peft ☆2,164 Updated 4 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,529 Updated 4 months ago
- Deploy your agentic workflows to production ☆1,834 Updated this week