NVIDIA / GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
☆3,004 Updated last week
Alternatives and similar repositories for GenerativeAIExamples:
Users who are interested in GenerativeAIExamples are comparing it to the libraries listed below:
- PyTorch native post-training library ☆5,103 Updated this week
- LLM Finetuning with peft ☆2,427 Updated 2 months ago
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,404 Updated 2 months ago
- Curated list of datasets and tools for post-training. ☆2,968 Updated 2 months ago
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆5,034 Updated last month
- The easiest way to use Agentic RAG in any enterprise ☆4,201 Updated 3 months ago
- Robust recipes to align language models with human and AI preferences ☆5,138 Updated 5 months ago
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizati… ☆10,294 Updated this week
- ☆1,640 Updated last week
- A comprehensive guide to building RAG-based LLM applications for production. ☆1,785 Updated 8 months ago
- Tools for merging pretrained large language models. ☆5,571 Updated this week
- Set of tools to assess and improve LLM security. ☆3,048 Updated 2 months ago
- Run Mixtral-8x7B models in Colab or consumer desktops ☆2,306 Updated last year
- AIOS: AI Agent Operating System ☆4,060 Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs ☆2,955 Updated this week
- ☆2,915 Updated 7 months ago
- Go ahead and axolotl questions ☆9,165 Updated this week
- Build large language model (LLM) apps with Python, ChatGPT and other models. This is the companion repository for the book on generative … ☆765 Updated last month
- Modeling, training, eval, and inference code for OLMo ☆5,519 Updated this week
- Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python. ☆5,926 Updated last week
- Large Language Model Text Generation Inference ☆10,031 Updated this week
- An awesome & curated list of best LLMOps tools for developers ☆4,742 Updated this week
- 🦖 Learn about LLMs, LLMOps, and vector DBs for free by designing, training, and deploying a real-time financial … ☆3,241 Updated 4 months ago
- Documentation for Google's Gen AI site - including the Gemini API and Gemma ☆1,968 Updated this week
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,662 Updated 9 months ago
- A framework for few-shot evaluation of language models. ☆8,684 Updated last week
- A PyTorch native library for large-scale model training ☆3,627 Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆9,590 Updated 9 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality ☆3,848 Updated 8 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆4,016 Updated 2 months ago