NVIDIA-AI-Blueprints / video-search-and-summarizationLinks
Blueprint for Ingesting massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
β373Updated last month
Alternatives and similar repositories for video-search-and-summarization
Users that are interested in video-search-and-summarization are comparing it to the libraries listed below
Sorting:
- Collection of reference workflows for building intelligent agents with NIMsβ184Updated 11 months ago
- Inference and fine-tuning examples for vision models from π€ Transformersβ163Updated 5 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of modelsβ276Updated 5 months ago
- Context-Aware RAG library for Knowledge Graph ingestion and retrieval functions.β50Updated 2 months ago
- Ultralytics Notebooks πβ176Updated last month
- This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.β431Updated this week
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vectorβ¦β340Updated last year
- Multimodal AI agent with Llama 3.2: A Streamlit app that processes text, images, PDFs, and PPTs, integrating NIM microservices, Milvus, aβ¦β134Updated last year
- Customizable, AI-driven virtual assistant designed to streamline customer service operations, handle common inquiries, and improve overalβ¦β213Updated last month
- The NVIDIA NeMo Agent Toolkit UI streamlines interacting with NeMo Agent Toolkit workflows in an easy-to-use web application.β63Updated this week
- Build computer vision models in a fraction of the time and with less data.β437Updated this week
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.β95Updated 3 weeks ago
- β181Updated last week
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.β1,702Updated this week
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision modelsβ132Updated 3 weeks ago
- β74Updated 5 months ago
- Computer Vision projectsβ29Updated 2 months ago
- Fine tune Gemma 3 on an object detection taskβ95Updated 5 months ago
- β43Updated this week
- This repo has the code of the 3 demos I presented at Google Gemma2 DevDay Tokyo, using Gemma2 on a Jetson Orin Nano device.β60Updated 5 months ago
- The NVIDIA RTXβ’ AI Toolkit is a suite of tools and SDKs for Windows developers to customize, optimize, and deploy AI models across RTX PCβ¦β180Updated last month
- Collection of step-by-step playbooks for setting up AI/ML workloads on NVIDIA DGX Spark devices with Blackwell architecture.β318Updated last week
- A CLI to estimate inference memory requirements for Hugging Face models, written in Python.β261Updated this week
- Developer Asset Hub for NVIDIA Nemotron β A one-stop resource for training recipes, usage cookbooks, and full end-to-end reference examplβ¦β314Updated last week
- β253Updated this week
- The official Roboflow Python package. Manage your datasets, models, and deployments. Roboflow has everything you need to build a computerβ¦β522Updated 3 weeks ago
- A project demonstrating how to make DeepStream docker images.β92Updated 3 months ago
- π¨ NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.β620Updated this week
- About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. π₯ [Paper + Code + Demo]β835Updated 6 months ago
- β254Updated this week