NVIDIA-AI-Blueprints / video-search-and-summarization
Blueprint for ingesting massive volumes of live or archived videos and extracting insights for summarization and interactive Q&A
★266 Updated last week
Alternatives and similar repositories for video-search-and-summarization
Users who are interested in video-search-and-summarization are comparing it to the repositories listed below
- Collection of reference workflows for building intelligent agents with NIMs ★175 Updated 8 months ago
- Inference and fine-tuning examples for vision models from 🤗 Transformers ★162 Updated 2 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models ★269 Updated 2 months ago
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector… ★320 Updated 11 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ★89 Updated last week
- This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline. ★295 Updated 2 weeks ago
- Ultralytics Notebooks ★114 Updated this week
- ★167 Updated this week
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench ★179 Updated 5 months ago
- Multimodal AI agent with Llama 3.2: A Streamlit app that processes text, images, PDFs, and PPTs, integrating NIM microservices, Milvus, a… ★132 Updated last year
- Context-Aware RAG library for Knowledge Graph ingestion and retrieval functions. ★35 Updated last week
- Build computer vision models in a fraction of the time and with less data. ★372 Updated this week
- Take your LLM to the optometrist. ★40 Updated 2 months ago
- Customizable, AI-driven virtual assistant designed to streamline customer service operations, handle common inquiries, and improve overal… ★193 Updated 2 months ago
- Fine tune Gemma 3 on an object detection task ★85 Updated 2 months ago
- ★42 Updated last month
- From scratch implementation of a vision language model in pure PyTorch ★243 Updated last year
- The NVIDIA RTX™ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PC… ★175 Updated 10 months ago
- Quick start scripts and tutorial notebooks to get started with TAO Toolkit ★108 Updated last week
- Implementation of End-to-End YOLO Models for DeepStream ★62 Updated 11 months ago
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG) ★340 Updated last month
- ★106 Updated last month
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models ★128 Updated 2 weeks ago
- This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo] ★787 Updated 3 months ago
- Notebooks for fine-tuning PaliGemma ★117 Updated 5 months ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ… ★34 Updated 9 months ago
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents. ★1,407 Updated this week
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson ★213 Updated last year
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ★456 Updated last month
- Route LLM requests to the best model for the task at hand. ★108 Updated 2 weeks ago