Jpickard1 / BRAD-VideoLinks
Retrieval Augmented Generation for youtube videos with a BRAD agent
☆33Updated 10 months ago
Alternatives and similar repositories for BRAD-Video
Users that are interested in BRAD-Video are comparing it to the libraries listed below
Sorting:
- rmp data ranking☆14Updated 3 weeks ago
- Modify-Anything is based on yolov5,yolov8 for video and image detection. Segment-anything,lama_cleaner is applied to segment, modify, era…☆17Updated 2 years ago
- Nano Banana Studio: AI-Powered Marketing Asset Creator with Real-Time Brand Enhancement☆38Updated 2 months ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆16Updated last month
- ☆16Updated last year
- A high-performance, distributed memory management system for LLM agents built with LangGraph, LangChain, Ray, and vLLM. Features multi-la…☆11Updated 7 months ago
- Multimodal Open Source Framework for Conversational Agent Research and Development.☆21Updated 9 months ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Updated last year
- ☆19Updated last year
- Official code and dataset release for "JAFPro: Joint Appearance Fusion and Propagation for Human Video Motion Transfer from Multiple Refe…☆14Updated 4 years ago
- ☆16Updated last year
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆15Updated this week
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆13Updated 2 years ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- Conversational Speaker Diarization using OpenAI AI Language Models(gpt-4) and OpenAI Whisper.☆14Updated 2 years ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Updated 5 months ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Updated last year
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆14Updated last year
- An experiment to see if we can process G2 reviews to extract topics from reviews☆10Updated last year
- ☆22Updated last year
- ☆13Updated last year
- The UnisonAI Multi-Agent Framework built on custom workflow which allows ai agents to talk together and provides a flexible and extensibl…☆22Updated 2 weeks ago
- ☆12Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆22Updated 5 months ago
- A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization.☆18Updated 6 months ago
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models.☆50Updated 9 months ago
- This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file☆15Updated last year
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆47Updated 9 months ago
- Incredibly descriptive audiovisual summaries for videos☆40Updated last year