NVIDIA / GenerativeAIExamplesLinks
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
☆3,594Updated this week
Alternatives and similar repositories for GenerativeAIExamples
Users that are interested in GenerativeAIExamples are comparing it to the libraries listed below
Sorting:
- ☆2,069Updated last week
- Open-source AI cookbook☆2,495Updated 2 weeks ago
- Documentation for Google's Gen AI site - including the Gemini API and Gemma☆2,195Updated last month
- PyTorch native post-training library☆5,595Updated this week
- Build production-ready LLM applications and advanced agents using Python, LangChain, and LangGraph. This is the companion repository for …☆1,147Updated 3 weeks ago
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,324Updated last year
- Agentic components of the Llama Stack APIs☆4,278Updated 3 months ago
- LangServe 🦜️🏓☆2,199Updated last month
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.☆1,514Updated this week
- A comprehensive guide to building RAG-based LLM applications for production.☆1,840Updated last year
- LLM Finetuning with peft☆2,705Updated 3 months ago
- Composable building blocks to build Llama Apps☆8,156Updated last week
- ☆978Updated 3 months ago
- ☆1,009Updated 9 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,761Updated 6 months ago
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆12,133Updated this week
- A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM☆3,080Updated 7 months ago
- This SDK is now deprecated, use the new unified Google GenAI SDK.☆2,238Updated 2 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆3,146Updated this week
- LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading a…☆1,220Updated last year
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,617Updated 2 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,533Updated 6 months ago
- 🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial …☆3,366Updated 11 months ago
- Build custom inference engines for models, agents, multi-modal systems, RAG, pipelines and more.☆3,711Updated last week
- ☆3,038Updated last year
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,783Updated last year
- Large Language Model Text Generation Inference☆10,656Updated this week
- Knowledge Agents and Management in the Cloud☆4,204Updated this week
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)☆344Updated 3 months ago
- ☆967Updated last year