softbankrobotics-labs / pepper-deep-learning
Object recognition with Pepper using a deep learning model
☆10Updated 4 years ago
Alternatives and similar repositories for pepper-deep-learning
Users that are interested in pepper-deep-learning are comparing it to the libraries listed below
- LoRA fine-tuned Stable Diffusion Deployment☆31Updated 2 years ago
- ☆27Updated 4 years ago
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densi…☆21Updated last year
- Dataset of trash objects for waste classification and detection☆10Updated 3 years ago
- An interactive demo based on Segment-Anything for stroke-based painting which enables human-like painting.☆35Updated 2 years ago
- ☆11Updated 4 years ago
- Vision-based heart rate estimation using OAK-D camera☆33Updated 4 years ago
- This project detects eye blinks in real-time using Eye Aspect Ratio (EAR) and MediaPipe's facial landmark detection. It draws eye landmar…☆25Updated 10 months ago
- YouTube Assistant☆12Updated 2 years ago
- EdgeSAM model for use with Autodistill.☆29Updated last year
- Entity detection / tone detection real time chat app using NLP and web sockets☆11Updated 3 years ago
- PyTorch implementation of paper: Paint Transformer: Feed Forward Neural Painting with Stroke Prediction, ICCV 2021.☆28Updated 4 years ago
- A ROS package for calculating the 3D distance of a face from the sensor☆12Updated 4 years ago
- Aware Driving (AD) is a mobile app that will assist you while you are driving.☆10Updated 6 months ago
- Load any clip model with a standardized interface☆21Updated last week
- Pepper Robot Enhanced Human Interaction☆14Updated 2 years ago
- Video chat apps with computer vision filters built on top of Streamlit☆50Updated 2 years ago
- (WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…☆82Updated 2 months ago
- An official codebase for the paper "CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos" (ICCV 23)☆52Updated 2 years ago
- Segment Anything with Webcam in Real-Time with FastSAM☆10Updated last year
- Official repository for MaGNET, ICLR 2022☆24Updated 3 years ago
- Web-based tool to convert model into MyriadX blob☆17Updated 4 months ago
- image-transfer-with-background-preserved, based on AnimeGANv2 and Mask-RCNN☆16Updated last year
- ☆24Updated 2 years ago
- ☆61Updated 2 years ago
- ☆14Updated 3 years ago
- Multi-model video-to-text by combining embeddings from Flan-T5 + CLIP + Whisper + SceneGraph. The 'backbone LLM' is pre-trained from scra…☆52Updated 2 years ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model…☆37Updated 2 years ago
- This is a vision-based 3d model manipulation and control UI☆49Updated 4 years ago
- The official code for our paper "A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents", published…☆35Updated 4 years ago