NVIDIA / GenerativeAIExamplesLinks
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
☆3,259Updated this week
Alternatives and similar repositories for GenerativeAIExamples
Users that are interested in GenerativeAIExamples are comparing it to the libraries listed below
Sorting:
- Build large language model (LLM) apps with Python, ChatGPT and other models. This is the companion repository for the book on generative …☆933Updated last week
- ☆1,902Updated last week
- Open-source AI cookbook☆2,153Updated this week
- A set of LangChain Tutorials from my youtube channel☆1,509Updated last year
- Documentation for Google's Gen AI site - including the Gemini API and Gemma☆2,063Updated last week
- LLM Finetuning with peft☆2,565Updated 5 months ago
- A set of LLM Tutorials from my youtube channel☆934Updated 2 years ago
- PyTorch native post-training library☆5,347Updated this week
- A comprehensive guide to building RAG-based LLM applications for production.☆1,805Updated 11 months ago
- LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading a…☆1,193Updated last year
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆5,272Updated 4 months ago
- Knowledge Agents and Management in the Cloud☆4,052Updated this week
- ☆933Updated 7 months ago
- ☆3,851Updated last week
- Agentic components of the Llama Stack APIs☆4,268Updated 2 months ago
- Deploy your agentic worfklows to production☆2,038Updated 2 weeks ago
- Composable building blocks to build Llama Apps☆7,914Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,309Updated last month
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)☆326Updated last month
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,715Updated last year
- A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM☆3,014Updated 3 months ago
- ☆2,984Updated 10 months ago
- ☆988Updated 5 months ago
- A unified evaluation framework for large language models☆2,661Updated last week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,069Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,569Updated 2 months ago
- Curated list of datasets and tools for post-training.☆3,261Updated 5 months ago
- ☆1,272Updated last year
- Training LLMs with QLoRA + FSDP☆1,494Updated 8 months ago
- Tools for merging pretrained large language models.☆6,053Updated this week