An integration of the Segment Anything Model, Molmo, and Whisper to segment objects using voice and natural language.
☆30Feb 28, 2025Updated last year
Alternatives and similar repositories for SAM_Molmo_Whisper
Users who are interested in SAM_Molmo_Whisper are comparing it to the repositories listed below
- POC for creating an AI Video Editor for Content Creators with Various Capabilities.☆12Jan 18, 2025Updated last year
- A powerful video summarization tool that utilizes Moondream alongside multiple AI models to provide comprehensive video understanding thr…☆24Jan 28, 2025Updated last year
- ☆15Apr 28, 2023Updated 2 years ago
- Fisheye lens☆14Feb 5, 2022Updated 4 years ago
- Use AI to personify books, so that you can talk to them 🙊☆18Mar 25, 2023Updated 2 years ago
- A simple slimmed down mono slam implementation☆32Jul 7, 2025Updated 7 months ago
- This repository contains a C++ implementation of the A* search algorithm for finding a path to the goal state of the 8-puzzle problem in AI.☆11Dec 2, 2023Updated 2 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆12Jun 28, 2022Updated 3 years ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆38Jan 5, 2025Updated last year
- A sample REST application to help testers learn to write API automation☆11Jun 12, 2024Updated last year
- ☆10May 15, 2022Updated 3 years ago
- A curated list of overcoming imposter syndrome content (Articles, Books, Courses, Talks) that could help anyone find all the resources at…☆20May 23, 2020Updated 5 years ago
- AI eXplainable Inference & Search. Open Sourcing on-premise, ultra-fast latency intelligence to all.☆36Feb 28, 2025Updated last year
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated 2 months ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆37Oct 18, 2023Updated 2 years ago
- A Python web scraper built on Selenium to gather profile data from okcupid.com☆11Oct 15, 2022Updated 3 years ago
- A control system for use with an agricultural spraying drone☆11Aug 11, 2017Updated 8 years ago
- Detection of prohibited mobile phone use in industrial settings, based on PaddleX object detection.☆11Jun 11, 2022Updated 3 years ago
- gen0 gazebo simulation based on ROS2☆12Nov 13, 2025Updated 3 months ago
- ☆10Jul 29, 2022Updated 3 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- This project predicts wind turbine failure from sensor data by applying classification-based ML models that improve prediction…☆11Mar 20, 2023Updated 2 years ago
- Restricted WebView to one host url for Android☆11Aug 2, 2023Updated 2 years ago
- Generate video with voice narration from PPT/PDF slides☆10Sep 4, 2023Updated 2 years ago
- Exporting YOLOv5 for CPU inference with ONNX and OpenVINO☆37Jul 3, 2025Updated 8 months ago
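- This project provides a Java backend chat server for xiaozhi-esp32, built on the RuoYi-Vue framework. It helps individuals and businesses quickly deploy xiaozhi-esp32 backend services.☆21Jun 19, 2025Updated 8 months ago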
- Official PyTorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- A simple GPT-3 interface to automate core legal writing tasks☆12Mar 8, 2023Updated 2 years ago
- Documentation and code for predictive maintenance data and assess scripts.☆11Jun 8, 2023Updated 2 years ago
- A football video analysis task that uses YOLO to detect people and balls in the video.☆12Sep 5, 2020Updated 5 years ago
- This repo contains a fully functional, API-ready application for delineating fields for a smart farming platform☆15Jan 20, 2023Updated 3 years ago
- PyTorch Implementation of the Explainable Conditional Adversarial Autoencoder using Saliency Maps and SHAP (J. of Imaging - MDPI)☆12Mar 5, 2025Updated last year
- "SSPNet: An interpretable 3D-CNN for classification of schizophrenia using phase maps of resting-state complex-valued fMRI data," publish…☆10May 13, 2022Updated 3 years ago
- A reddit scraping and analysis bot to visualize linguistic and content trends☆11Oct 5, 2021Updated 4 years ago
- Tally Prime MCP (Model Context Protocol) Server implementation to feed Tally ERP data to popular MCP-capable LLMs such as Claude and ChatGPT☆19Nov 11, 2025Updated 3 months ago
- WindTurbineHighSpeedBearingPrognosis-Data☆10Aug 19, 2020Updated 5 years ago
- Citrusbyte vagrant/server provisioning, deployment and management stack☆14Jul 14, 2011Updated 14 years ago
- Deep metric learning: Triplet, Magnet and VMF loss☆11Aug 19, 2022Updated 3 years ago