Jpickard1 / BRAD-Video
Retrieval Augmented Generation for youtube videos with a BRAD agent
☆33Updated 2 months ago
Alternatives and similar repositories for BRAD-Video:
Users that are interested in BRAD-Video are comparing it to the libraries listed below
- ☆11Updated 8 months ago
- ☆9Updated last year
- The official repository for CVPRW2024 paper "What’s in a Name? Beyond Class Indices for Image Recognition"☆12Updated 7 months ago
- AI_Video_Shorts_Creator is a python-based tool that uses OpenAI's GPT-4 power to automatically analyze videos, extract the most interesti…☆19Updated last year
- ☆16Updated last year
- Official code and dataset release for "JAFPro: Joint Appearance Fusion and Propagation for Human Video Motion Transfer from Multiple Refe…☆14Updated 3 years ago
- ☆12Updated last year
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆12Updated 2 months ago
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆14Updated 2 weeks ago
- A minimal re-implementation of orthogonal fine-tuning (OFT) for LLMs. Based on nanoGPT and minLoRA.☆12Updated last year
- ☆19Updated 4 months ago
- a naive 3d human pose editor GUI.☆19Updated last year
- ☆15Updated last year
- Project Page for VividTalk☆15Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆16Updated last year
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆19Updated 2 months ago
- ☆17Updated 2 years ago
- ☆11Updated last year
- Directed masked autoencoders☆14Updated 2 years ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆23Updated 9 months ago
- Description and applications of OpenAI's paper about DALL-E (2021) and implementation of other (CLIP-guided) zero-shot text-to-image gene…☆32Updated 2 years ago
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆12Updated last year
- ☆12Updated 7 months ago
- ☆16Updated last year
- The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"☆28Updated 2 months ago
- ☆13Updated last year
- FlexiFilm: Long Video Generation with Flexible Conditions☆32Updated 11 months ago
- This is the official code for BMVC 2024 paper, G3FA: Geometry-guided GAN for Face Animation☆19Updated 2 weeks ago
- ☆16Updated last month
- Fine-tune of Florence-2 for shot categorization.☆22Updated 3 weeks ago