ArchanaSingandhupe / SpotVision-AI
Our cutting-edge application harnesses the power of deep learning and computer vision to analyze skin images and predict potential diseases with remarkable accuracy of 71%.
☆12Updated last year
Alternatives and similar repositories for SpotVision-AI
Users that are interested in SpotVision-AI are comparing it to the libraries listed below
- We created the Telugu dataset to address the challenge of building Automatic Speech Recognition (ASR) systems for Indian languages, consi…☆21Updated last year
- JSP and MySQL based online system to record feedback of students of all departments of PICT along with Report generation and visualisatio…☆13Updated 4 years ago
- Your AI Stack in Your Editor☆336Updated 5 months ago
- Recognition of handwritten text using CRAFT text detection and TrOCR☆26Updated 2 years ago
- ollama client for Emacs☆56Updated last year
- Thoughts on Faith and Judgement☆82Updated this week
- A curated list of amazing RunPod projects, libraries, and resources☆118Updated 11 months ago
- ☆70Updated 3 months ago
- ☆46Updated 6 months ago
- ☆19Updated 10 months ago
- Checkbox Detection Model for Scanned Documents☆76Updated 4 months ago
- Using a single image and just 10 seconds of sample audio, our project enables you to create a video where it appears as if you're speakin…☆38Updated last year
- Foundational Models for State-of-the-Art Speech and Text Translation☆11Updated last year
- ☆32Updated last year
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆60Updated last year
- (CVPR 2023) SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆31Updated last year
- ☆37Updated last year
- ☆79Updated last month
- Structured information extraction from documents☆316Updated 9 months ago
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait☆269Updated last month
- A modern GUI application that transcribes and translates audio and video files, offering the option to save the subtitles as separate fil…☆15Updated last year
- We run Node.js with Ollama Hosting LLM locally and we use D-ID for Live Avatar☆24Updated last year
- ☆12Updated last year
- Long-Inference, High Quality Synthetic Speaker (AI avatar/ AI presenter)☆261Updated 2 years ago
- This program allows you to extract skin color from a face and classify it into an arbitrary number of clusters☆14Updated 5 years ago
- MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model☆21Updated last year
- ☆56Updated 2 weeks ago
- Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which includ…☆34Updated 6 months ago
- ☆39Updated last year
- End-to-end natural language to SQL system: schema-aware model fine-tuning, retrieval-augmented prompting, and production-grade CLI, power…☆21Updated last month