xiaomi-research/gemmax

Gemma-based Multilingual Machine Translation Models

☆51

Alternatives and similar repositories for gemmax

Users who are interested in gemmax are comparing it to the repositories listed below.

xiaomi-research / mecat
☆44 · May 12, 2026 · Updated 2 months ago
wangchengzhong / GALDSE
☆15 · Mar 11, 2025 · Updated last year
XL2248 / CPCC
Code and Data for the ACL21 paper "Modeling Bilingual Conversational Characteristics for Neural Chat Translation"
☆12 · Dec 17, 2021 · Updated 4 years ago
xiaomi-research / guievalkit
[ICML 2026] GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents
☆23 · Feb 26, 2026 · Updated 5 months ago
tango4j / llm_speaker_tagging
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆16 · Jun 16, 2024 · Updated 2 years ago
xiaomi-research / q-frame
[ICCV 2025] Implementation of the paper "Q-Frame: Query-aware Frame Selection and Multi-Resolution Adaptation for Video-LLMs"
☆82 · Oct 25, 2025 · Updated 9 months ago
yichen14 / FastAdaSP
Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models" (EMNLP'24, Oral)
☆17 · Nov 14, 2024 · Updated last year
xiaomi-research / dasheng-denoiser
Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…
☆81 · Jun 16, 2025 · Updated last year
nethermanpro / ComSL
☆11 · Oct 14, 2023 · Updated 2 years ago
xiaomi-research / dasheng-glap
Official Implementation of GLAP - General Language Audio Pretraining
☆74 · May 14, 2026 · Updated 2 months ago
zhou-feifei / parakeet-tdt-0.6b-v2-Batch-Transcriber
A high-performance batch audio transcription tool using nvidia/parakeet-tdt-0.6b-v2 to generate accurate, well-segmented SRT subtitles, w…
☆18 · Dec 9, 2025 · Updated 7 months ago
gpengzhi / CrossConST-MT
Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency …
☆10 · Jul 18, 2023 · Updated 3 years ago
XiaoMi / dasheng
Official PyTorch code for Deep Audio-Signal Holistic Embeddings
☆200 · Nov 7, 2025 · Updated 8 months ago
nineninesix-ai / gepard-train
☆24 · Jul 12, 2026 · Updated 2 weeks ago
NieeiM / Dasheng-Audiogen
Generate a complete audio clip with music, intelligible speech, and sound effects from text in one pass.
☆44 · May 27, 2026 · Updated last month
xiaomi-research / controlfoley
[ACM MM 2026] ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling
☆142 · Updated this week
thu-spmi / CTC-TTS
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20 · Jun 9, 2026 · Updated last month
ASLP-lab / FMSU-Bench
Towards Fine-Grained Multi-Dimensional Speech Understanding: Data Pipeline, Benchmark, and Model
☆25 · May 21, 2026 · Updated 2 months ago
wawabinger / Awesome-Audio-Watermarking
🔥UP-TO-DATE audio watermarking techniques🔥
☆15 · Feb 8, 2026 · Updated 5 months ago
fodelf / TensorflowView
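A TensorFlow visualization platform that lowers the barrier to machine learning: you only need to attend to the data source, and configuration alone provides a model-training service.
☆13 · Nov 30, 2020 · Updated 5 years ago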
xiaomi-research / xares-llm
XARES-LLM
☆55 · Mar 26, 2026 · Updated 4 months ago
ddlBoJack / Omni-Captioner
[ICLR 2026] Data Pipeline, Models, and Benchmark for Omni-Captioner.
☆142 · Apr 7, 2026 · Updated 3 months ago
xiaomi-research / dasheng-audiogen
End-to-end text-to-audio scene generation model
☆50 · Jun 16, 2026 · Updated last month
Studio-Intrinsic / benchmarking-ocr-gepa
☆17 · Oct 6, 2025 · Updated 9 months ago
danpovey / conditional-flow-matching
☆29 · Aug 8, 2024 · Updated last year
gpengzhi / Bi-SimCut
Code for NAACL 2022 main conference paper "Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation"
☆12 · May 8, 2023 · Updated 3 years ago
ByteDance-Seed / Seed-X-7B
☆170 · Aug 18, 2025 · Updated 11 months ago
ICDM-UESTC / COSE
Implementation of the paper "Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement".
☆16 · Sep 23, 2025 · Updated 10 months ago
duterscmy / CD-MoE
Official PyTorch implementation of CD-MOE
☆12 · Mar 18, 2026 · Updated 4 months ago
X-LANCE / Xmart
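Repository for the Xmart Youth Forum, storing video recordings and lecture notes from past student forums and frontier lectures. For the latest Xmart announcements, follow the official account XLANCE Lab.
☆54 · Apr 7, 2026 · Updated 3 months ago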
openlanguagedata / seed
Seed Machine Translation Data
☆34 · Nov 12, 2024 · Updated last year
VAGOsolutions / sauerkrautlm-colpali
☆16 · Mar 1, 2026 · Updated 4 months ago
yxduir / m2m-70
☆18 · Jun 25, 2026 · Updated last month
fe1ixxu / ALMA
State-of-the-art LLM-based translation models.
☆590 · Apr 9, 2025 · Updated last year
ImperialNLP / VTLM
Cross-lingual Visual Pre-training for Multimodal Machine Translation
☆18 · Dec 28, 2021 · Updated 4 years ago
egruttadauria98 / SSpaVAlDo
☆37 · Jan 6, 2026 · Updated 6 months ago
shenxiangzhuang / bleuscore
BLEU Score in Rust
☆12 · Jul 16, 2026 · Updated last week
KeiKinn / ParaCLAP
Towards a general language-audio model for computational paralinguistic tasks
☆30 · Dec 14, 2024 · Updated last year
flozi00 / atra
An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …
☆20 · Sep 17, 2024 · Updated last year