☆36Nov 15, 2023Updated 2 years ago
Alternatives and similar repositories for SentAlign
Users that are interested in SentAlign are comparing it to the libraries listed below
Sorting:
- NOAH's Corpus: Part-of-Speech Tagging for Swiss German☆12Jan 6, 2023Updated 3 years ago
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Jul 24, 2020Updated 5 years ago
- Overview of Icelandic NLP resources at a glance☆18Jun 20, 2024Updated last year
- ☆133Jan 22, 2026Updated last month
- The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction☆22May 29, 2024Updated last year
- SiLLM is a Simultaneous Machine Translation (SiMT) Framework. It utilizes a Large Language model as the translation model and employs a t…☆18Feb 22, 2024Updated 2 years ago
- Editor for aligned parallel texts (personal desktop application).☆20Jan 15, 2026Updated last month
- A library for minimum Bayes risk (MBR) decoding☆51Nov 2, 2025Updated 3 months ago
- Improved Sentence Alignment in Linear Time and Space☆192Mar 6, 2023Updated 2 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆26Jun 3, 2025Updated 8 months ago
- Library for fast text representation and classification.☆31Jan 9, 2024Updated 2 years ago
- ☆21Feb 13, 2023Updated 3 years ago
- ☆35Jun 15, 2023Updated 2 years ago
- ☆38Jan 17, 2025Updated last year
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆38Jul 31, 2024Updated last year
- Adversarial Training and SFT for Bot Safety Models☆40Apr 18, 2023Updated 2 years ago
- Resources for our IJCAI 2020 paper, TopicKA: Generating Commonsense Knowledge-Aware Dialogue Responses Towards the Recommended Topic Fact☆12Nov 30, 2020Updated 5 years ago
- RÚV-DL (ruv-dl) is terminal line client for downloading content from RÚV☆10Dec 16, 2025Updated 2 months ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Nr. 1 ranked "Pitch Detector" on the web. Implemented with WebAssembly.☆11Mar 24, 2021Updated 4 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- Codebase, data and models for the Re-Thinking the Shuffle Test paper at ACL2021☆10Oct 14, 2022Updated 3 years ago
- Multilingual Quality Estimation and Automatic Post-editing Dataset☆42Mar 24, 2022Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆162Sep 18, 2025Updated 5 months ago
- IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents (NeurIPS 2024)☆14Jul 14, 2025Updated 7 months ago
- ☆11Jul 28, 2021Updated 4 years ago
- 🚀 Sliding Window Attention Training for Efficient Large Language Models☆15Dec 8, 2025Updated 2 months ago
- ☆11Mar 23, 2025Updated 11 months ago
- python越南语分词器☆10Nov 14, 2019Updated 6 years ago
- ☆12Apr 22, 2024Updated last year
- Code for ACL 2024 findings paper "wav2vec-S: Adapting Pre-trained Speech Models for Streaming"☆10Apr 20, 2025Updated 10 months ago
- ☆11Jan 12, 2023Updated 3 years ago
- ☆26Oct 16, 2025Updated 4 months ago
- textgrid.hpp - a C++ TextGrid parser / writer☆10Aug 4, 2021Updated 4 years ago
- GraphQL and Rest API rewrite of the current Open Targets platform API☆15Updated this week
- This project is the implementation of Li-Roth paper "Learning Question Classifiers" on TREC dataset☆12Mar 7, 2017Updated 8 years ago
- 练习题,python 协同过滤ALS模型实现:商品推荐 + 用户人群放大☆10Jun 4, 2020Updated 5 years ago
- ☆11Jan 9, 2026Updated last month
- Sequence-to-Sequence Model for User Simulation☆10Feb 6, 2017Updated 9 years ago