microsoft/Efficient-Large-LM-Trainer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/microsoft/Efficient-Large-LM-Trainer)

microsoft / Efficient-Large-LM-Trainer

☆39

Alternatives and similar repositories for Efficient-Large-LM-Trainer

Users that are interested in Efficient-Large-LM-Trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Georgetown-IR-Lab / covid-neural-ir
View on GitHub
☆24Oct 23, 2020Updated 5 years ago
NEUIR / ConAE
View on GitHub
[EMNLP 2022] This is the code repo for our EMNLP‘22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…
☆13Oct 20, 2022Updated 3 years ago
NEUIR / P3Ranker
View on GitHub
[SIGIR '22] Code for our SIGIR 2022 accepted paper : P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Pr…
☆18Sep 24, 2023Updated 2 years ago
thunlp / ReInfoSelect
View on GitHub
☆36Jun 12, 2023Updated 3 years ago
henryzhao5852 / BeamDR
View on GitHub
☆15Oct 10, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
thunlp / COVID19-IRQA
View on GitHub
☆32Mar 31, 2020Updated 6 years ago
Flawless1202 / Transformer
View on GitHub
A Pytorch-Lightning Implementation of Transformer Network
☆11Oct 22, 2020Updated 5 years ago
luyug / mores_plus
View on GitHub
☆13Jun 6, 2022Updated 4 years ago
castorini / TREC-COVID
View on GitHub
TREC-COVID results - this is a mirror of data on the TREC website in a more convenient format.
☆15Aug 31, 2020Updated 5 years ago
tlkh / t2t-tuner
View on GitHub
Convenient Text-to-Text Training for Transformers
☆18Dec 10, 2021Updated 4 years ago
OpenMatch / ANCE-Tele
View on GitHub
Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…
☆18Mar 25, 2024Updated 2 years ago
microsoft / EfficientLongSequenceModeling
View on GitHub
☆54Jan 19, 2023Updated 3 years ago
Victorwz / VaLM
View on GitHub
VaLM: Visually-augmented Language Modeling. ICLR 2023.
☆56Mar 6, 2023Updated 3 years ago
monologg / ko_lm_dataformat
View on GitHub
A utility for storing and reading files for Korean LM training 💾
☆35Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nlpapereading / nlpapereading
View on GitHub
☆58Sep 23, 2022Updated 3 years ago
lemurproject / ClueWeb22
View on GitHub
☆17Dec 11, 2024Updated last year
hpcaitech / ColossalAI-Pytorch-lightning
View on GitHub
☆24Nov 22, 2022Updated 3 years ago
Lightning-Universe / lightning-ColossalAI
View on GitHub
Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI
☆56Jul 6, 2026Updated 2 weeks ago
korean-named-entity / konec
View on GitHub
Korean Named Entity Corpus
☆25May 12, 2023Updated 3 years ago
lancopku / MUKI
View on GitHub
[Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
☆19Mar 16, 2023Updated 3 years ago
Beomi / transformers-language-modeling
View on GitHub
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
☆23May 20, 2021Updated 5 years ago
dki-lab / few-shot-bioIE
View on GitHub
True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning
☆12Jul 6, 2022Updated 4 years ago
thunlp / OpenMatch
View on GitHub
An Open-Source Package for Information Retrieval.
☆442Oct 7, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
wuch15 / HiTransformer
View on GitHub
ACL 2021: HiTransformer
☆13May 29, 2021Updated 5 years ago
OpenMatch / OpenMatch
View on GitHub
An Open-Source Package for Information Retrieval
☆167Jul 13, 2026Updated last week
ielab / relevation
View on GitHub
Information Retrieval Relevance Judging System
☆29Jan 17, 2022Updated 4 years ago
RunxinXu / ContrastivePruning
View on GitHub
Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》
☆25Dec 15, 2021Updated 4 years ago
google / retrieval-qa-eval
View on GitHub
☆42Sep 25, 2019Updated 6 years ago
fzyzcjy / ai_math_paper_list
View on GitHub
AI for Mathematics Paper List
☆17Jan 14, 2025Updated last year
microsoft / deepnmt
View on GitHub
☆31Jun 28, 2022Updated 4 years ago
tongshoujie / MATCH-TUNING
View on GitHub
MATCH-TUNING
☆15Aug 6, 2022Updated 3 years ago
paust-team / pko-t5
View on GitHub
bpe based korean t5 model for text-to-text unified framework
☆63Apr 17, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
qcznlp / uncertainty_attack
View on GitHub
☆23Sep 2, 2025Updated 10 months ago
MGheini / xattn-transfer-for-mt
View on GitHub
Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra…
☆33Sep 15, 2021Updated 4 years ago
JFChi / PLUE
View on GitHub
☆11May 25, 2023Updated 3 years ago
commoncrawl / ia-web-commons
View on GitHub
Web archiving utility library
☆11Jun 19, 2026Updated last month
GregxmHu / OccuBench
View on GitHub
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models
☆21Apr 14, 2026Updated 3 months ago
detail-novelist / novelist-triton-server
View on GitHub
Deploy KoGPT with Triton Inference Server
☆14Nov 18, 2022Updated 3 years ago
jason9693 / oslo-kogpt-finetunig
View on GitHub
kogpt를 oslo로 파인튜닝하는 예제.
☆23Aug 26, 2022Updated 3 years ago