mithunparab / text2segment_video
Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2). This project provides a video processing tool that uses the Florence2 and SAM2 models to detect and segment specific objects or activities in a video based on textual descriptions.
☆10Updated 10 months ago
Alternatives and similar repositories for text2segment_video
Users who are interested in text2segment_video are comparing it to the libraries listed below.
- ☆47Updated last year
- Incredibly descriptive audiovisual summaries for videos☆41Updated last year
- ☆29Updated 2 years ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- Playground Web UI using segment-anything-2 models from Meta.☆55Updated last year
- Gradio app to track objects in video and add visual effects☆17Updated 4 months ago
- ☆40Updated 2 years ago
- Video Diffusion WebUI: Text2Video + Image2Video + Video2Video WebUI☆66Updated last year
- wav2lip-api☆11Updated 2 years ago
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆49Updated last year
- ☆12Updated last year
- simple and fast wav2lip using onnx models for face-detection and inference. Easy installation☆28Updated last year
- ☆13Updated last year
- Real-Time Open-Vocabulary Object Detection☆12Updated last year
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆74Updated 6 months ago
- [NOTE] I do not have enough resources to maintain VMS, please use Ostris's AI-Toolkit instead☆42Updated 2 months ago
- ☆78Updated last year
- optimized wav2lip☆18Updated last year
- ☆69Updated 8 months ago
- ☆43Updated last year
- Official Repo For the [AAAI'26 Oral] Paper “StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback”☆24Updated 2 weeks ago
- A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.☆64Updated 2 years ago
- Simple CogVLM client script☆14Updated 2 years ago
- ☆16Updated 4 months ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆68Updated last year
- ☆23Updated last year
- ☆24Updated last year
- ☆55Updated 2 years ago
- ImageSlider custom component for gradio.☆43Updated last year
- FLUX.1-dev LoRA Outfit Generator can create an outfit by detailing the color, pattern, fit, style, material, and type.☆70Updated last year