Blue-Raincoat/SelectIT

Blue-Raincoat / SelectIT

☆24

Alternatives and similar repositories for SelectIT

Users who are interested in SelectIT are comparing it to the libraries listed below.

hexuandeng / Mono4SiMT
The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉
☆12 · Jul 19, 2023 · Updated 3 years ago
zhang-wei-chao / DC-PDD
This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibrat…
☆23 · May 21, 2025 · Updated last year
pygongnlp / PT-M2
[EMNLP 2022] Revisiting Grammatical Error Correction Evaluation and Beyond
☆20 · Nov 25, 2022 · Updated 3 years ago
pldlgb / nuggets
☆89 · Dec 29, 2023 · Updated 2 years ago
xypan0 / G-DIG
☆12 · Jun 30, 2024 · Updated 2 years ago
SunbowLiu / PTvsBT
On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021)
☆13 · Nov 21, 2021 · Updated 4 years ago
pzs19 / LEMMA
☆16 · Sep 4, 2025 · Updated 10 months ago
MaxyLee / 3AM
Official code and data of "3AM: An Ambiguity-Aware Multi-Modal Machine Translation Dataset"
☆12 · Dec 8, 2024 · Updated last year
hkust-nlp / deita
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]
☆599 · Dec 9, 2024 · Updated last year
BatsResearch / nayak-aclfindings24-code
☆22 · Jul 16, 2024 · Updated 2 years ago
limenlp / safer-instruct
This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17 · Feb 22, 2024 · Updated 2 years ago
tianyi-lab / Cherry_LLM
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…
☆416 · Jun 25, 2025 · Updated last year
xiatingyu / SFT-DataSelection-at-scale
☆34 · Feb 9, 2025 · Updated last year
Abbey4799 / CELLO
Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?" (AAAI 2024)
☆51 · Apr 19, 2024 · Updated 2 years ago
yongchanghao / multi-task-nat
☆11 · Jul 17, 2021 · Updated 5 years ago
google / wmt19-paraphrased-references
☆15 · Nov 5, 2020 · Updated 5 years ago
IronBeliever / CaR
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation
☆91 · Nov 13, 2024 · Updated last year
Cohere-Labs-Community / iterative-data-selection
☆30 · Nov 5, 2024 · Updated last year
NLP2CT / norm-nmt
Norm-Based Curriculum Learning for Neural Machine Translation (ACL 2020)
☆18 · Aug 1, 2020 · Updated 5 years ago
neulab / contextual-mt
A repository with the code related to experiments around context-aware machine translation
☆51 · Sep 22, 2025 · Updated 9 months ago
IST-DASLab / peft-rosa
A fork of the PEFT library, supporting Robust Adaptation (RoSA)
☆15 · Aug 16, 2024 · Updated last year
OverfitFlow / KAMG
[EMNLP 2020] Multi-label Few/Zero-shot Learning with Knowledge Aggregated from Multiple Label Graphs
☆17 · Jun 5, 2022 · Updated 4 years ago
layer6ai-labs / CMLMC
Code for the ICLR'22 paper "Improving Non-Autoregressive Translation Models Without Distillation"
☆18 · Mar 11, 2022 · Updated 4 years ago
THUMLP / TensorGCN_pytorch
☆20 · Oct 27, 2022 · Updated 3 years ago
tianyi-lab / Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
☆189 · Jun 25, 2025 · Updated last year
mt-upc / joint
Joint Source-Target Self Attention with Locality Constraints
☆20 · May 9, 2020 · Updated 6 years ago
NLP2CT / kNN-TL
[ACL 2023] kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation
☆17 · Jul 27, 2023 · Updated 2 years ago
gpt4life / alpagasus
Unofficial implementation of AlpaGasus
☆94 · Sep 23, 2023 · Updated 2 years ago
cindyxinyiwang / multiview-subword-regularization
PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"
☆26 · Jun 2, 2021 · Updated 5 years ago
Shujun-He / Google-Brain-Ventilator
☆11 · Nov 11, 2021 · Updated 4 years ago
li-aolong / TemplateGEC
ACL2023 (Oral): TemplateGEC: Improving Grammatical Error Correction with Detection Template
☆23 · Jul 10, 2023 · Updated 3 years ago
GTyingzi / Compare_Adversial
☆11 · Oct 29, 2022 · Updated 3 years ago
morning-hao / domain-self-instruct
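Inspired by self-instruct: beyond general-purpose LLMs, small vertical-domain LLMs can also be built for customized results, using GPT to obtain questions and answers as training data
☆18 · May 12, 2023 · Updated 3 years ago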
qinyiwei / InfoBench
☆61 · Aug 22, 2024 · Updated last year
listentm / CROWDSELECT
We systematically studied the factors that influence LLM-generated benchmarks. By using our code, you can generate high-quality QA datas…
☆20 · May 20, 2025 · Updated last year
TsinghuaC3I / Intuitive-Fine-Tuning
[ACL 2025, Main Conference, Oral] Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
☆30 · Aug 2, 2024 · Updated last year
cntswj / tprnn
Topological-LSTM for Information Cascade Modeling
☆12 · Nov 2, 2017 · Updated 8 years ago
zs1314 / Fraesormer
[ICME 2025 Oral] Official PyTorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"
☆13 · Mar 21, 2025 · Updated last year
Stanford-ILIAD / DPP-Batch-Active-Learning
Companion code to the preprint: E Bıyık, K Wang, N Anari, D Sadigh, "Batch Active Learning using Determinantal Point Processes". arXiv pr…
☆14 · Jul 25, 2024 · Updated last year