This repo builds an end-to-end deep learning application that supports speech recognition system. It's simple to use and understand
☆39May 23, 2023Updated 2 years ago
Alternatives and similar repositories for ViSR
Users that are interested in ViSR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- In this repo, I developed a step-by-step pipeline for a standard MultiSpeaker Text-to-Speech system In general, I used Portaspeech as an…☆12Nov 24, 2023Updated 2 years ago
- Pushing Deep Learning models into production using torchserve, kubernetes and react web app☆27Jun 15, 2023Updated 2 years ago
- An ECG Foundation Model: Boosting Masked ECG-Text Auto-Encoders as Discriminative Learners (ICML 2025)☆31Mar 7, 2026Updated last month
- I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and…☆17Sep 9, 2022Updated 3 years ago
- This project aims to build a streamlit app which includes face detection, face recognition, face anti-spoofing attacks and sentiment anal…☆33Oct 1, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- VietASR - Vietnamese Automatic Speech Recognition☆166Apr 21, 2026Updated last week
- Reliable Wrist PPG Monitoring by Mitigating Poor Skin Sensor Contact (Scientific Reports)☆20Apr 10, 2026Updated 3 weeks ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- Repository for the paper "ViHOS: Vietnamese Hate and Offensive Spans Detection" (EACL2023)☆37Nov 25, 2023Updated 2 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Jul 16, 2021Updated 4 years ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆58Oct 27, 2023Updated 2 years ago
- RAG for Vietnamese Wikipedia corpus.☆35Nov 30, 2023Updated 2 years ago
- Detecting Omissions in Geographic Maps through Computer Vision (MAPR'24)☆24Jul 31, 2024Updated last year
- Basic Chat Application☆10Jun 23, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Searching a High Performance Feature Extractor for Text Recognition Network. TPAMI 2022☆13Nov 25, 2022Updated 3 years ago
- Python - NSW package for Vietnamese: Normalization system to convert numbers, abbreviations, and words that cannot be pronounced into syl…☆67Jan 1, 2025Updated last year
- [ICLR 2025 - Workshop AgenticAI Oral] Large Language Models powered Neural Solvers for Generalized Vehicle Routing Problems☆27May 29, 2025Updated 11 months ago
- ☆17Mar 20, 2025Updated last year
- Run an open-source data LakeHouse locally using Docker Compose☆12May 31, 2024Updated last year
- [ICME 2023] FlowText: Synthesizing Realistic Scene Text Video with Optical Flow Estimation☆13May 13, 2023Updated 2 years ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Sep 29, 2024Updated last year
- 빅데이터 연합동아리 BOAZ 12기 ADV Vision 팀 [Fight Detection] 레포지토리입니다.☆10Jan 22, 2020Updated 6 years ago
- ☆47Jun 1, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This repo is about implementing pose estimation with HRNet and also, is a sub-task of the smart hospital bed project☆12Jan 21, 2022Updated 4 years ago
- ☆12Dec 15, 2022Updated 3 years ago
- This project aims to generate syntactichandwritten mathematical expression. The dataset is generated from the CROHME 2014 training set.☆14Feb 24, 2022Updated 4 years ago
- Few-shot text classification with meta learning and BERT☆11Jun 14, 2021Updated 4 years ago
- ntc-scv is dataset of blogs on website https://streetcodevn.com☆27Oct 21, 2021Updated 4 years ago
- Analyzing NYC's Stormwater Flood Map - Extreme Flood Scenario☆20Nov 14, 2023Updated 2 years ago
- speech to text with self-supervised learning based on wav2vec 2.0 framework☆380Nov 22, 2021Updated 4 years ago
- ☆14Feb 22, 2022Updated 4 years ago
- The task aims at extracting required fields in receipts captured by mobile devices☆34Nov 4, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repo includes all of the solutions to the Algorithmic Toolbox course from Coursera☆10Oct 10, 2022Updated 3 years ago
- ☆12Feb 15, 2025Updated last year
- Finetune Wa2vec 2.0 For Speech Recognition☆151Feb 6, 2025Updated last year
- Create handwritten word embeddings from a text recognition Seq2Seq system.☆11Dec 1, 2022Updated 3 years ago
- Intuitive interface for fine-tuning and retraining a Tesseract OCR language model☆10Jul 4, 2025Updated 10 months ago
- Make your life easier with Facebook Crawler and don't use Facebook API☆11Jul 1, 2020Updated 5 years ago
- Top-tier conference papers on out-of-distribution detection☆11Jun 22, 2023Updated 2 years ago