shaochenze/PatchTrain

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shaochenze/PatchTrain)

shaochenze / PatchTrain

Code for paper "Patch-Level Training for Large Language Models"

☆109

Alternatives and similar repositories for PatchTrain

Users that are interested in PatchTrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ictnlp / PCFG-NAT
View on GitHub
Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".
☆12Jan 4, 2024Updated 2 years ago
ictnlp / DST
View on GitHub
DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently
☆11Jun 6, 2024Updated 2 years ago
ictnlp / FA-DAT
View on GitHub
Official Implementation for the ICLR2023 paper "Fuzzy Alignments in Directed Acyclic Graph for Non-autoregressive Machine Translation"
☆14Mar 1, 2023Updated 3 years ago
Adaxry / Post-Instruction
View on GitHub
☆21Sep 5, 2023Updated 2 years ago
ictnlp / Dual-Path
View on GitHub
Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"
☆12Mar 31, 2022Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
ictnlp / ITST
View on GitHub
Code for EMNLP 2022 main conference paper "Information-Transport-based Policy for Simultaneous Translation"
☆13Nov 3, 2022Updated 3 years ago
ictnlp / NMLA-NAT
View on GitHub
Code for NeurIPS 2022 Spotlight paper " Non-Monotonic Latent Alignments for CTC-Based Non-Autoregressive Machine Translation"
☆20Nov 16, 2022Updated 3 years ago
ictnlp / LNMT-CA
View on GitHub
Code for EMNLP 2022 main conference paper "Low-resource Neural Machine Translation with Cross-modal Alignment".
☆15Apr 25, 2023Updated 3 years ago
ictnlp / ComSpeech
View on GitHub
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
☆27Jul 2, 2024Updated 2 years ago
ictnlp / LSG
View on GitHub
The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”
☆15Jan 3, 2025Updated last year
ictnlp / FastLongSpeech
View on GitHub
FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech process…
☆16Jul 22, 2025Updated 11 months ago
ictnlp / SiLLM
View on GitHub
SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…
☆18Feb 22, 2024Updated 2 years ago
songmzhang / CBMI
View on GitHub
The code of ACL2022 paper "Conditional Bilingual Mutual Information based Adaptive Training for Neural Machine Translation"..
☆14Aug 6, 2022Updated 3 years ago
ictnlp / OR-NMT
View on GitHub
Source Code for ACL2019 paper <Bridging the Gap between Training and Inference for Neural Machine Translation>
☆41Nov 10, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
OSU-STARLAB / Simul-LLM
View on GitHub
[ACL 2024] An easily extensible framework for simultaneous, text-to-text neural machine translation (SimulMT) for LLMs.
☆18Apr 21, 2025Updated last year
krystalan / ClidSum
View on GitHub
EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization
☆37Jan 13, 2024Updated 2 years ago
kongds / MoRA
View on GitHub
MoRA: High-Rank Updating for Parameter-Efﬁcient Fine-Tuning
☆362Aug 7, 2024Updated last year
BayLing-Models / BayLing
View on GitHub
“百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型，具有优越的英语/中文能力，在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced l…
☆315Dec 3, 2024Updated last year
wilson1yan / VideoGPT-Paper
View on GitHub
☆18Apr 15, 2021Updated 5 years ago
ictnlp / BoN-NAT
View on GitHub
☆22Dec 31, 2019Updated 6 years ago
hexuandeng / Mono4SiMT
View on GitHub
The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉
☆12Jul 19, 2023Updated 3 years ago
EvanZhuang / AgenticLU
View on GitHub
Official implementation of Self-Taught Agentic Long Context Understanding (ACL 2025).
☆13Sep 22, 2025Updated 9 months ago
studio-dots-ai / TELL
View on GitHub
TELL: Test-time Experiential Lifelong Learning – a single LLM agent that learns from experience at test time, achieving 43.9% on ARC-AGI-…
☆26Apr 29, 2026Updated 2 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
drarijitdas / Natural-GaLore
View on GitHub
An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace
☆19Oct 21, 2024Updated last year
imagination-research / lbt
View on GitHub
[NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study
☆60Nov 24, 2024Updated last year
ictnlp / TLAT-NMT
View on GitHub
Source code for the EMNLP 2020 long paper <Token-level Adaptive Training for Neural Machine Translation>.
☆20Oct 28, 2022Updated 3 years ago
leoShen917 / DreamMover
View on GitHub
The official repository of DreamMover
☆34Sep 20, 2024Updated last year
ictnlp / CRESS
View on GitHub
Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".
☆16Oct 25, 2023Updated 2 years ago
kroggen / tokenformer-minimal
View on GitHub
Minimal implementation of TokenFormer for inference and learning
☆13Nov 6, 2024Updated last year
shaochenze / EAR
View on GitHub
☆42May 15, 2025Updated last year
kkkevinkkkkk / situated_faithfulness
View on GitHub
☆14Oct 17, 2024Updated last year
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ictnlp / SLED-TTS
View on GitHub
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
☆108May 20, 2025Updated last year
zqOuO / GWT
View on GitHub
☆13May 4, 2026Updated 2 months ago
NJUNLP / knn-box
View on GitHub
an easy-to-use knn-mt toolkit
☆104Aug 19, 2023Updated 2 years ago
YuchuanTian / DiJiang
View on GitHub
[ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear at…
☆103Jun 14, 2024Updated 2 years ago
ictnlp / DiverseNMT
View on GitHub
Source code for the AAAI 2020 long paper <Modeling Fluency and Faithfulness for Diverse Neural Machine Translation>.
☆19Mar 10, 2020Updated 6 years ago
atosystem / SSL_Interface
View on GitHub
Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024
☆16Nov 19, 2024Updated last year
microsoft / SparseMixer
View on GitHub
Sparse Backpropagation for Mixture-of-Expert Training
☆30Jul 2, 2024Updated 2 years ago