☆172Aug 18, 2025Updated 9 months ago
Alternatives and similar repositories for Seed-X-7B
Users that are interested in Seed-X-7B are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios☆83Jul 4, 2025Updated 11 months ago
- ☆11Mar 4, 2025Updated last year
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".☆18Jun 1, 2026Updated last week
- ☆22May 27, 2026Updated 2 weeks ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆12Aug 31, 2021Updated 4 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- ☆16Jul 29, 2025Updated 10 months ago
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆21May 2, 2024Updated 2 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- Code and pretrained models for "DUB: Discrete Unit Back-translation for Speech Translation" (ACL 2023 Findings)☆27Jun 28, 2023Updated 2 years ago
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆23Apr 27, 2024Updated 2 years ago
- ☆14May 26, 2023Updated 3 years ago
- Compute benchmark of table structure recognition.☆29Dec 2, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.☆69May 15, 2026Updated 3 weeks ago
- ☆26Jun 10, 2025Updated 11 months ago
- ☆35Feb 7, 2026Updated 4 months ago
- This is an implementation of paper "End-to-end Speech Translation via Cross-modal Progressive Training" (Interspeech2021)☆18May 1, 2022Updated 4 years ago
- Towards Efficient and Effective Adversarial Training, NeurIPS 2021☆16Feb 15, 2022Updated 4 years ago
- Experimental pipeline for FedFace.☆10Jul 6, 2021Updated 4 years ago
- A collection of instruction data and scripts for machine translation.☆20Sep 23, 2023Updated 2 years ago
- Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.☆21Mar 31, 2025Updated last year
- Code for EMNLP-2018 paper "Variational Autoregressive Decoder for Neural Response Generation"☆16Oct 11, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆53Nov 14, 2024Updated last year
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- Code repo for the UAI 2023 paper "Learning To Invert: Simple Adaptive Attacks for Gradient Inversion in Federated Learning".☆15Jun 15, 2024Updated last year
- Chinese character recognition☆10Oct 27, 2020Updated 5 years ago
- The implementation of "Mitigating Hallucinations and Off-target Machine Translation with Source-Contrastive and Language-Contrastive Deco…☆38Aug 29, 2025Updated 9 months ago
- LLM-based Multi-dimensional Debate Judge with Iterative Chronological Analysis☆20Oct 1, 2025Updated 8 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆59Mar 29, 2025Updated last year
- ☆63Jul 1, 2025Updated 11 months ago
- Conversational Multimodal Emotion Recognition☆12Dec 7, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆43Sep 1, 2025Updated 9 months ago
- [ICCV 2025 Highlight] "Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis“☆27May 31, 2026Updated last week
- STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation☆82Nov 11, 2025Updated 6 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated 2 years ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆65Mar 19, 2026Updated 2 months ago
- ☆14Apr 4, 2025Updated last year
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Apr 27, 2022Updated 4 years ago