anthony-wss/glm-4-voice-finetune

☆14

Alternatives and similar repositories for glm-4-voice-finetune

Users who are interested in glm-4-voice-finetune are comparing it to the libraries listed below.

lkiel / rl-doom
Reinforcement learning with VizDoom platform
☆13 · Apr 18, 2022 · Updated 4 years ago
LeiLiLab / InfiniSST
☆25 · May 27, 2026 · Updated 2 months ago
NKU-HLT / MusicEval-baseline
☆12 · Apr 18, 2025 · Updated last year
lifeiteng / VoiceBox
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆29 · Aug 4, 2023 · Updated 2 years ago
ntucllab / CLImage_Dataset
The dataset repo of the paper "CLCIFAR: CIFAR-Derived Benchmark Datasets with Human Annotated Complementary Labels"
☆17 · May 11, 2026 · Updated 2 months ago
TMMMU-Benchmark / evaluation
Evaluation code for benchmarking VLMs in Traditional Chinese understanding
☆14 · Dec 22, 2025 · Updated 7 months ago
kehanlu / DeSTA2.5-Audio
Code for DeSTA2.5-Audio, general-purpose LALM
☆141 · Feb 4, 2026 · Updated 5 months ago
jonflynng / qwen2-audio-finetune
Colab notebook for fine-tuning Qwen2-Audio with trl's SFT and PPO trainers.
☆24 · Nov 23, 2024 · Updated last year
Jazzcharles / AuroLA
☆28 · Feb 23, 2026 · Updated 5 months ago
EIT-NLP / LLaSO
☆116 · Oct 21, 2025 · Updated 9 months ago
voidful / MMLM
Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra
☆16 · Dec 10, 2024 · Updated last year
NKU-HLT / SpeechLLM-as-Judges
[ACL 2026]
☆25 · Dec 6, 2025 · Updated 7 months ago
gokceuludogan / interactive-music-recommendation
Personalized and interactive music recommendation with a bandit approach
☆11 · Sep 15, 2019 · Updated 6 years ago
anthony-wss / tsmixer-reproduce
The repo for reproducing the main results in TSMixer: An all-MLP Architecture for Time Series Forecasting.
☆11 · Jun 15, 2023 · Updated 3 years ago
gooofy / zbrain
Infrastructure useful to create natural language processing systems based on transformer networks
☆12 · Sep 26, 2019 · Updated 6 years ago
MertKalkanci / Highlights-Maker
A video highlights creator
☆12 · Jun 1, 2024 · Updated 2 years ago
lifeiteng / SoundStorm
☆71 · Jul 13, 2023 · Updated 3 years ago
ChrisIsKing / zero-shot-text-classification
Repository for the Findings of ACL'23 paper "Label Agnostic Pre-training for Zero-shot Text Classification"
☆13 · Aug 10, 2023 · Updated 2 years ago
ar4 / lsrtm_1d
Least-squares Reverse Time Migration using the 1D scalar wave equation. Very simple and for demonstration purposes only.
☆12 · Sep 4, 2017 · Updated 8 years ago
TheUnsolvedDev / CUDA_NN_FS
This repository features a from-scratch implementation of a neural network using CUDA and C. The primary goal of this project is to lever…
☆12 · Mar 20, 2025 · Updated last year
chenchy / D3Net
A PyTorch implementation of D3Net.
☆11 · Aug 8, 2021 · Updated 4 years ago
thuhcsi / Contextual-Biasing-Dataset
Open-source Mandarin biased word dataset
☆14 · Sep 21, 2023 · Updated 2 years ago
NKU-HLT / RAMP_MOS
[IEEE TASLP] Retrieval-Augmented MOS Prediction with Prior Knowledge Integration
☆33 · Mar 23, 2025 · Updated last year
manoskary / SMUG-Explain
A Framework for Symbolic MUsic Graph Explanations
☆11 · Jul 30, 2025 · Updated 11 months ago
juice500ml / xlm_to_xlsr
Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)
☆12 · Mar 12, 2024 · Updated 2 years ago
TaurenMountain / FormalASR
An end-to-end ASR model, transcribing spoken Chinese to formal text.
☆20 · Jun 26, 2026 · Updated last month
diggerdu / AudioMamba
☆12 · Jun 1, 2024 · Updated 2 years ago
echocatzh / Demo-of-DeepComplexAEC
☆11 · Jun 15, 2022 · Updated 4 years ago
microsoft / NoAudioCaptioning
Repository for "Training Audio Captioning Models without Audio"
☆10 · Sep 26, 2023 · Updated 2 years ago
SamLundberg / MillionSongDataset
☆13 · Oct 23, 2018 · Updated 7 years ago
frankenliu / LOAE
☆10 · Sep 25, 2024 · Updated last year
JaesungHuh / ca-subtitle
Implementation of "Look, Listen and Recognise: character-aware audio-visual subtitling"
☆21 · Nov 3, 2025 · Updated 8 months ago
CaA23187 / GCCRN_full
A PyTorch implementation of GCCRN
☆14 · Dec 18, 2021 · Updated 4 years ago
lucadellalib / ts-asr
Target speaker automatic speech recognition (TS-ASR)
☆14 · Oct 14, 2023 · Updated 2 years ago
Wortmeister-HQ / zahlwort2num
A small package for handy conversion of German numerals (also ordinal / signed) written as words to numbers.
☆12 · Jan 22, 2026 · Updated 6 months ago
ckyang1124 / SAKURA
Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Informa…
☆25 · Aug 14, 2025 · Updated 11 months ago
RapidAI / RapidSpeech.cpp
On-device speech AI runtime for ASR, TTS, VAD, and voice cloning. Python-simple, C++-native, GGUF-powered.
☆22 · Jul 15, 2026 · Updated 2 weeks ago
sign-language-processing / sign-gpt
Multi Task GPT Model for Sign Language
☆14 · Feb 16, 2025 · Updated last year
bmilde / german-asr-lm-tools
Crawling and creating a German language model resource
☆18 · Aug 23, 2022 · Updated 3 years ago