mithunparab / text2segment_video
Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2) This project provides a video processing tool that utilizes advanced AI models, specifically Florence2 and SAM2, to detect and segment specific objects or activities in a video based on textual descriptions.
☆10Updated 2 months ago
Alternatives and similar repositories for text2segment_video
Users that are interested in text2segment_video are comparing it to the libraries listed below
Sorting:
- ☆29Updated last year
- ☆46Updated last year
- Diffusers Image Fill v3 -- Inpaint or Remove objects from an image - or Outpaint - or Outpaint Video Zoom: 16GB+ GPU | 32GB+ RAM | 20GB+…☆12Updated 6 months ago
- Playground Web UI using segment-anything-2 models from the Meta.☆48Updated 5 months ago
- Gradio app to track objects in video and add visual effects☆16Updated 7 months ago
- Passively collect images for computer vision datasets on the edge.☆33Updated last year
- ☆19Updated last year
- ☆32Updated last year
- Incredibly descriptive audiovisual summaries for videos☆40Updated 9 months ago
- The Facial Landmark Preprocessing Toolkit.☆14Updated this week
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆63Updated 9 months ago
- ☆24Updated 11 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆70Updated 10 months ago
- Simple CogVLM client script☆14Updated last year
- Code for the project: "Audio-Driven Video-Synthesis of Personalised Moderations"☆20Updated last year
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆30Updated last year
- 6D Rotation Representation for Unconstrained Head Pose Estimation☆13Updated last year
- ☆23Updated last year
- ☆70Updated last month
- Talking head animation☆27Updated last year
- ☆13Updated 5 months ago
- ☆11Updated last year
- Python scripts for performing Image Inpainting using the MST model in ONNX☆16Updated 2 years ago
- Project Page for VividTalk☆15Updated last year
- ☆12Updated 7 months ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- ☆8Updated last year
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- ☆12Updated last year
- ☆12Updated last year