This repository includes all computer vision, audio, document AI, and multimodal projects.
☆51Jun 7, 2024Updated last year
Alternatives and similar repositories for Vision_Audio_and_Multimodal_Projects
Users that are interested in Vision_Audio_and_Multimodal_Projects are comparing it to the libraries listed below.
- Unofficial implementation of the paper "MetaHTR: Towards Writer-Adaptive Handwritten Text Recognition" by Bhunia et al. (2021).☆13Jun 22, 2022Updated 3 years ago
- The full pipeline for model retraining with FastAPI and GitHub Actions☆16Jul 5, 2024Updated last year
- Bangla Text Augmentation☆11Aug 30, 2023Updated 2 years ago
- Detection of valid SET combinations from images of SET cards☆12Dec 8, 2022Updated 3 years ago
- Repository for Deploying multiple machine learning models for inference on AWS Lambda and Amazon EFS☆16Feb 3, 2022Updated 4 years ago
- Tool to manage ollama model on vast.ai☆19Apr 19, 2024Updated last year
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Aug 9, 2024Updated last year
- A repo for the code for the course "The Complete Nodejs MySQL Login System"☆10Nov 26, 2020Updated 5 years ago
- Nine projects I completed during my apprenticeship at Yandex.Practicum as a data engineer.☆11Dec 16, 2023Updated 2 years ago
- ☆16Nov 18, 2023Updated 2 years ago
- Automatic Context-Sensitive Spelling Correction for Bangla Text Using BERT and Levenshtein Distance☆21Nov 18, 2024Updated last year
- ☆25Feb 1, 2022Updated 4 years ago
- Unofficial implementation of the paper "Full Page Handwriting Recognition via Image to Sequence Extraction" by Singh et al. (2021).☆54Oct 11, 2022Updated 3 years ago
- This project aims to build a digital business card wallet, as a mobile application. This application basically can take a picture of a bu…☆18Jun 25, 2019Updated 6 years ago
- This project focuses on using deep learning to replace text in images while retaining the same font and style.☆10Dec 9, 2019Updated 6 years ago
- ☆18Jul 7, 2025Updated 8 months ago
- A Chrome Extension that detects Phishing Websites and alerts the user regarding the same.☆11Jun 23, 2019Updated 6 years ago
- ☆12Jan 21, 2026Updated 2 months ago
- This repo includes ChatGPT prompt curation to use ChatGPT better.☆13Feb 22, 2024Updated 2 years ago
- Create a Scale-able Full Stack Education Platform with React-Tailwind, MongoDB & Nodejs☆10Nov 23, 2025Updated 4 months ago
- Arabic Online Handwriting Recognition☆10Jul 28, 2016Updated 9 years ago
- ☆12Oct 29, 2024Updated last year
- Config files for my GitHub profile.☆13Jan 6, 2025Updated last year
- CLI app to detect face and predict age tuned on UTKFace with docker and Continuous Integration☆16Jun 19, 2024Updated last year
- N8N workflow for Gmail label creation and AI-based email classification.☆13Dec 5, 2025Updated 3 months ago
- Qaz++ is a programming language syntax based on C++ that allows Kazakh-speaking programmers to write code in their native language.☆19Nov 14, 2023Updated 2 years ago
- ☆21Oct 6, 2024Updated last year
- ☆11Jun 19, 2023Updated 2 years ago
- ☆12Jun 9, 2023Updated 2 years ago
- The official Jamstack site☆14Nov 20, 2023Updated 2 years ago
- A simple tool, using Segment Anything and CloudCompare, to measure stock piles volume from point clouds☆17Jun 18, 2024Updated last year
- Intraformer: A Transformer based Approach to Unsupervised Long-tail Face, Facial Expression and Visual Recognition☆23Sep 2, 2022Updated 3 years ago
- replace any object you want on the image with whatever you want☆14Feb 6, 2024Updated 2 years ago
- Config files for my GitHub profile.☆15Feb 13, 2026Updated last month
- ☆11Feb 1, 2026Updated last month
- Garvis: Realtime AI Voice Assistant☆39May 27, 2024Updated last year
- ☆15Feb 9, 2026Updated last month
- This is a list of projects which have curated tasks specifically for new contributors. These issues are a great way to get started with a…☆12Jun 19, 2023Updated 2 years ago
- Our project is an AR Shoe Try-On System that allows users to virtually try on shoes using their smartphones. The system uses AR technolog…☆13Aug 23, 2024Updated last year