☆162Aug 18, 2025Updated 6 months ago
Alternatives and similar repositories for Seed-X-7B
Users that are interested in Seed-X-7B are comparing it to the libraries listed below
- ☆13Aug 23, 2024Updated last year
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆79Jul 4, 2025Updated 8 months ago
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".☆15Feb 23, 2026Updated 2 weeks ago
- ☆11Mar 4, 2025Updated last year
- ☆19Feb 16, 2026Updated 3 weeks ago
- ☆34Jan 25, 2026Updated last month
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆30Jan 18, 2026Updated last month
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆21May 2, 2024Updated last year
- ☆25Jun 10, 2025Updated 9 months ago
- Reading order, Layoutreader☆19May 8, 2025Updated 10 months ago
- ☆62Jul 1, 2025Updated 8 months ago
- ☆14May 26, 2023Updated 2 years ago
- ☆39Sep 25, 2025Updated 5 months ago
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Apr 27, 2024Updated last year
- Pytorch implementation of Exploring Simple Siamese Representation Learning☆22May 19, 2023Updated 2 years ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Aug 30, 2024Updated last year
- To try textin document parsing, visit https://cc.co/16YSIy☆22Jul 9, 2024Updated last year
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆45Jun 11, 2025Updated 8 months ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Apr 27, 2022Updated 3 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- ☆74May 30, 2025Updated 9 months ago
- an easy-to-use knn-mt toolkit☆105Aug 19, 2023Updated 2 years ago
- Official Implementation of "Reasoning Language Models: A Blueprint"☆94Aug 3, 2025Updated 7 months ago
- FLM-Audio is an audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.☆64Dec 9, 2025Updated 3 months ago
- A collection of instruction data and scripts for machine translation.☆20Sep 23, 2023Updated 2 years ago
- ☆83Jan 25, 2026Updated last month
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆28Jun 28, 2023Updated 2 years ago
- ☆50Sep 8, 2025Updated 6 months ago
- Code for paper "Nearest Neighbor Knowledge Distillation for Neural Machine Translation" by Zhixian Yang, Renliang Sun, and Xiaojun Wan. T…☆32Jul 16, 2022Updated 3 years ago
- Codebase for Instruction Following without Instruction Tuning☆36Sep 24, 2024Updated last year
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆37Aug 29, 2025Updated 6 months ago
- ☆31Jul 2, 2023Updated 2 years ago
- ☆13Apr 29, 2023Updated 2 years ago
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆32Jul 17, 2023Updated 2 years ago
- Seed-VC voice or singing conversion.☆55Jun 11, 2025Updated 8 months ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆218Feb 28, 2025Updated last year
- ☆109May 15, 2025Updated 9 months ago