NVIDIA / GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
☆2,137 · Updated this week
Related projects:
- High-quality datasets, tools, and concepts for LLM fine-tuning. ☆1,664 · Updated last month
- A Native-PyTorch Library for LLM Fine-tuning ☆3,942 · Updated this week
- Build resilient language agents as graphs. ☆5,662 · Updated this week
- ☆1,161 · Updated 3 weeks ago
- Tools for merging pretrained large language models. ☆4,501 · Updated this week
- AIOS: LLM Agent Operating System ☆3,219 · Updated this week
- LLM Finetuning with peft ☆2,058 · Updated 2 months ago
- ☆2,652 · Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆3,155 · Updated this week
- Robust recipes to align language models with human and AI preferences ☆4,481 · Updated 3 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆2,739 · Updated this week
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude. ☆4,725 · Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆2,817 · Updated 2 weeks ago
- To speed up LLM inference and improve LLMs' perception of key information, compress the prompt and KV-Cache, which achieves up to 20x com… ☆4,435 · Updated 3 weeks ago
- Open-source AI cookbook ☆1,605 · Updated this week
- Go ahead and axolotl questions ☆7,554 · Updated this week
- Parse files for optimal RAG ☆2,450 · Updated this week
- Code examples and resources for DBRX, a large language model developed by Databricks ☆2,496 · Updated 4 months ago
- Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale. ☆2,055 · Updated this week
- Run Mixtral-8x7B models in Colab or consumer desktops ☆2,288 · Updated 5 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality! ☆2,884 · Updated last month
- ☆1,517 · Updated this week
- Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models. ☆3,063 · Updated this week
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering" ☆3,393 · Updated last month
- The easiest way to use Agentic RAG in any enterprise ☆3,132 · Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆9,780 · Updated this week
- A code-first agent framework for seamlessly planning and executing data analytics tasks. ☆5,194 · Updated this week
- A comprehensive guide to building RAG-based LLM applications for production. ☆1,671 · Updated last month
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate ☆6,008 · Updated this week
- A framework for prompt tuning using Intent-based Prompt Calibration ☆2,038 · Updated this week