ictnlp/LSG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ictnlp/LSG)

ictnlp / LSG

The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”

☆15

Alternatives and similar repositories for LSG

Users that are interested in LSG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ictnlp / SiLLM
View on GitHub
SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…
☆18Feb 22, 2024Updated 2 years ago
ictnlp / GMA
View on GitHub
Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"
☆11Mar 31, 2022Updated 4 years ago
ictnlp / DST
View on GitHub
DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently
☆11Jun 6, 2024Updated 2 years ago
ictnlp / FastLongSpeech
View on GitHub
FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech process…
☆16Jul 22, 2025Updated 11 months ago
ictnlp / Dual-Path
View on GitHub
Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"
☆12Mar 31, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ictnlp / ITST
View on GitHub
Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"
☆13Nov 3, 2022Updated 3 years ago
ictnlp / StreamUni
View on GitHub
StreamUni is a framework that efficiently enables unified Large Speech-Language Models to accomplish streaming speech translation in a co…
☆22Jul 14, 2025Updated last year
ictnlp / NAST-S2x
View on GitHub
A fast speech-to-speech & speech-to-text translation model that supports simultaneous decoding and offers 28× speedup.
☆78Oct 22, 2024Updated last year
ictnlp / ComSpeech
View on GitHub
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
☆27Jul 2, 2024Updated 2 years ago
Jason-Young-AI / YoungToolkit
View on GitHub
A Toolkit for a series of Young projects.
☆23Apr 30, 2021Updated 5 years ago
ictnlp / DASpeech
View on GitHub
Code for NeurIPS 2023 paper "DASpeech: Directed Acyclic Transformer for Fast and High-quality Speech-to-Speech Translation".
☆63Jul 22, 2024Updated last year
choijeongsoo / utut
View on GitHub
[TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation
☆31Sep 6, 2024Updated last year
ictnlp / Auto-RAG
View on GitHub
This is the official repository for Auto-RAG.
☆234Jul 18, 2025Updated last year
AntXinyuan / sph2pob
View on GitHub
(IJCAI 2023) Sph2Pob: Boosting Object Detection on Spherical Images with Planar Oriented Boxes Methods
☆14Aug 23, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
xl8-ai / WordSiMT
View on GitHub
Official implementation of EMNLP 2023 Findings paper "Enhanced Simultaneous Machine Translation with Word-level Policies"
☆18Apr 10, 2026Updated 3 months ago
avishaiElmakies / unsupervised_speech_segmentation_using_slm
View on GitHub
☆20Jan 8, 2025Updated last year
HDUyiming / SOCCER
View on GitHub
We are very happy that our work has been accepted by ACM Multimedia 2024！🥰
☆12Jan 8, 2025Updated last year
OSU-STARLAB / Simul-LLM
View on GitHub
[ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.
☆18Apr 21, 2025Updated last year
RanaCM / DSU-AVO
View on GitHub
Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023
☆12May 13, 2024Updated 2 years ago
RaidenIV / 3D-Spectrogram
View on GitHub
Audio Processing & Visualization Concepts
☆12Jun 20, 2023Updated 3 years ago
choijeongsoo / av2av
View on GitHub
[CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation
☆48Sep 6, 2024Updated last year
ictnlp / TACS
View on GitHub
Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts
☆17Sep 2, 2024Updated last year
choijeongsoo / lip2speech-unit
View on GitHub
[Interspeech 2023] Intelligible Lip-to-Speech Synthesis with Speech Units
☆47Oct 26, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
YuankaiQi / ORIST
View on GitHub
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
☆16Feb 7, 2022Updated 4 years ago
kaist-ami / voicecraft-dub
View on GitHub
[ICCV'25] Official PyTorch Implementation of "VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models"
☆17Dec 8, 2025Updated 7 months ago
ZZDoog / ProDubber
View on GitHub
[CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…
☆23Jun 6, 2025Updated last year
ictnlp / LNMT-CA
View on GitHub
Code for EMNLP 2022 main conference paper "Low-resource Neural Machine Translation with Cross-modal Alignment".
☆15Apr 25, 2023Updated 3 years ago
ictnlp / TLAT-NMT
View on GitHub
Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.
☆20Oct 28, 2022Updated 3 years ago
ictnlp / LevelRAG
View on GitHub
The official implementation of "LevelRAG: Enhancing Retrieval-Augmented Generation with Multi-hop Logic Planning over Rewriting Augmented…
☆56Apr 12, 2025Updated last year
facebookresearch / SimulEval
View on GitHub
SimulEval: A General Evaluation Toolkit for Simultaneous Translation
☆126Sep 13, 2024Updated last year
YasserdahouML / visper
View on GitHub
ViSpeR: Multilingual Audio-Visual Speech Recognition
☆58Apr 17, 2025Updated last year
ictnlp / SLED-TTS
View on GitHub
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
☆108May 20, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
yili-19 / SSGPA
View on GitHub
☆17Jul 14, 2025Updated last year
kaistmm / AlignDiT
View on GitHub
[ACM MM 2025] AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation
☆24Oct 28, 2025Updated 8 months ago
ZZDoog / Avatar
View on GitHub
Avatar: An easy-to-use digital portrait PPT presentation video generation system based on Gradio
☆20Nov 7, 2023Updated 2 years ago
BayLing-Models / BayLing
View on GitHub
“百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型，具有优越的英语/中文能力，在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced l…
☆315Dec 3, 2024Updated last year
tuyunbin / Review-of-Change-Captioning
View on GitHub
This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.
☆17Sep 2, 2025Updated 10 months ago
ictnlp / HMT
View on GitHub
Source code for ICLR 2023 spotlight paper "Hidden Markov Transformer for Simultaneous Machine Translation"
☆24Dec 11, 2023Updated 2 years ago
LindgeW / MetaAug4NER
View on GitHub
Robust Self-augmentation for NER with Meta-reweighting
☆29Nov 8, 2022Updated 3 years ago