NVIDIA-AI-Blueprints / video-search-and-summarizationLinks
Blueprint for Ingesting massive volumes of live or archived videos and extract insights for summarization and interactive Q&A
☆146Updated last week
Alternatives and similar repositories for video-search-and-summarization
Users that are interested in video-search-and-summarization are comparing it to the libraries listed below
Sorting:
- Inference and fine-tuning examples for vision models from 🤗 Transformers☆154Updated 2 months ago
- Collection of reference workflows for building intelligent agents with NIMs☆165Updated 6 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆242Updated last week
- Multimodal AI agent with Llama 3.2: A Streamlit app that processes text, images, PDFs, and PPTs, integrating NIM microservices, Milvus, a…☆121Updated 9 months ago
- ☆159Updated 3 weeks ago
- This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.☆164Updated last week
- NVIDIA AI Blueprint for multimodal PDF data extraction for enterprise RAG☆335Updated 3 months ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆174Updated 2 months ago
- An NVIDIA AI Workbench example project for Retrieval Augmented Generation (RAG)☆326Updated last month
- Customizable, AI-driven virtual assistant designed to streamline customer service operations, handle common inquiries, and improve overal…☆151Updated last week
- Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector…☆295Updated 8 months ago
- Ultralytics Notebooks 🚀☆90Updated this week
- Fine tune Gemma 3 on an object detection task☆69Updated last week
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models☆120Updated this week
- ☆113Updated 7 months ago
- ☆179Updated 5 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆81Updated this week
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆33Updated 6 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆81Updated last year
- ☆267Updated this week
- Ranking LLMs on agentic tasks☆148Updated 2 weeks ago
- ☆32Updated 3 weeks ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆473Updated 6 months ago
- The NVIDIA NeMo Agent toolkit is an open-source library for efficiently connecting and optimizing teams of AI agents.☆1,110Updated this week
- A tutorial introducing knowledge distillation as an optimization technique for deployment on NVIDIA Jetson☆201Updated last year
- Turn a fresh Linux installation into a fully configured, sleek, and modern on device AI development system by running a single command.☆89Updated last month
- The NVIDIA AIQToolkit UI streamlines interacting with AIQToolkit workflows in an easy-to-use web application.☆33Updated 3 weeks ago
- META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe…☆241Updated this week
- ☆102Updated 4 months ago
- An open-source tool for general prompt optimization.☆557Updated last week