Convert the predicted annotated text into voice responses
☆30Nov 16, 2020Updated 5 years ago
Alternatives and similar repositories for Object-Detection-with-Voice-Feedback-YOLO-v3-and-gTTS
Users that are interested in Object-Detection-with-Voice-Feedback-YOLO-v3-and-gTTS are comparing it to the libraries listed below
Sorting:
- ☆10Mar 22, 2020Updated 5 years ago
- ☆11Jan 4, 2023Updated 3 years ago
- ☆18Apr 22, 2025Updated 10 months ago
- ☆17Jun 5, 2021Updated 4 years ago
- Training code for the ACAM action detection model.☆28Feb 2, 2023Updated 3 years ago
- PyTorch-based Driver Posture Classification☆29Jun 22, 2018Updated 7 years ago
- ECCV2022 Towards Hard-Positive Query Mining for DETR-based Human-Object Interaction Detection☆27May 26, 2023Updated 2 years ago
- Real Time Detection of Anomalous Activity From Videos (mainly crime actvity). Images of the video is trained using AutoEncoder to get th…☆27Apr 9, 2023Updated 2 years ago
- Deep Feature Flow for Video Recognition☆10Jun 9, 2017Updated 8 years ago
- Discovering human interaction with novel objects via zero-shot learning, CVPR, 2020☆42Jul 14, 2020Updated 5 years ago
- ☆36Aug 25, 2021Updated 4 years ago
- calvis: Chest, wAist and peLVIS circumference from 3D human Body meshes for Deep Learning.☆11May 15, 2025Updated 10 months ago
- Python scripts form performing stereo depth estimation using the HITNET model in Tensorflow Lite.☆41Sep 7, 2021Updated 4 years ago
- A Resume-Portfolio template for anyone☆36Aug 26, 2021Updated 4 years ago
- (EMBC 2020) Camera-based Hand Tracking using a Mirror-based Multi-view Setup☆15Dec 11, 2021Updated 4 years ago
- Google Colab Demo of CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents☆47Oct 12, 2021Updated 4 years ago
- Python C extension for the eSpeak speech synthesizer☆12Jan 23, 2021Updated 5 years ago
- Code and resources for the NeurIPS 2025 Paper "BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset" by Zhiheng X…☆19Oct 14, 2025Updated 5 months ago
- Code to construct textured deformed SMPL-X meshes for HUMBI data☆14Jun 17, 2022Updated 3 years ago
- Music to Dance for 3D Avatar☆16Nov 15, 2021Updated 4 years ago
- ☆16Jul 21, 2022Updated 3 years ago
- ☆16Jun 14, 2024Updated last year
- FULiveDemoMac是集成了 Faceunity 面部跟踪和虚拟道具及手势识别功能的 Mac 版 Demo。☆14Aug 3, 2020Updated 5 years ago
- Replication of speech to facial landmarks results☆11Jun 17, 2020Updated 5 years ago
- Source code for the paper "On the effect of age perception biases for real age regression", accepted in FG'2019☆12May 22, 2019Updated 6 years ago
- ☆11Aug 4, 2020Updated 5 years ago
- Real-time Index Finger Detection for the ☝️ Gesture Proof-of-Concept☆16Dec 29, 2017Updated 8 years ago
- object detection of tiny ssd☆58Nov 29, 2018Updated 7 years ago
- GFGE☆15Sep 7, 2022Updated 3 years ago
- The ReprGesture entry to the GENEA Challenge 2022 (IMCI 2022)☆16Nov 8, 2022Updated 3 years ago
- C++版本的汉字转拼音 Transfer chinese character to pinyin☆15Aug 31, 2018Updated 7 years ago
- [IJCAI 2025] Optimized View and Geometry Distillation from Multi-view Diffuser☆17May 2, 2025Updated 10 months ago
- Official code release for CVPR2019 Workshop paper "Unpaired Pose Guided Human Image Generation"☆13Aug 17, 2021Updated 4 years ago
- Code for "CycleGAN Face-off" by Shangxuan Wu, Xiaohan Jin and Ye Qi.☆17Dec 15, 2017Updated 8 years ago
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆55Nov 4, 2025Updated 4 months ago
- Official repository for Robust Multimodal Large Language Models Against Modality Conflict☆20Jul 9, 2025Updated 8 months ago
- ☆12Mar 30, 2022Updated 3 years ago
- 《数字媒体(2):多媒体》课程中音频小课堂大作业-人脸美化任务☆15Jul 14, 2022Updated 3 years ago
- LF Aligner helps translators create translation memories from texts and their translations. It relies on Hunalign for automatic sentence …☆15Jan 4, 2016Updated 10 years ago