☆39Jul 25, 2024Updated last year
Alternatives and similar repositories for Efficient-Large-LM-Trainer
Users who are interested in Efficient-Large-LM-Trainer are comparing it to the libraries listed below.
- ☆24Oct 23, 2020Updated 5 years ago
- [SIGIR '22] Code for our accepted SIGIR 2022 paper: P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Pr…☆18Sep 24, 2023Updated 2 years ago
- ☆36Jun 12, 2023Updated 2 years ago
- ☆15Oct 10, 2021Updated 4 years ago
- ☆32Mar 31, 2020Updated 6 years ago
- ↔️ Utilizing RBERT model structure for KLUE Relation Extraction task☆15Nov 15, 2022Updated 3 years ago
- A Pytorch-Lightning Implementation of Transformer Network☆11Oct 22, 2020Updated 5 years ago
- ☆13Jun 6, 2022Updated 3 years ago
- Convert pretrained RoBERTa models to various long-document transformer models☆11Apr 5, 2022Updated 4 years ago
- TREC-COVID results - this is a mirror of data on the TREC website in a more convenient format.☆15Aug 31, 2020Updated 5 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated 2 years ago
- Convenient Text-to-Text Training for Transformers☆18Dec 10, 2021Updated 4 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 7 months ago
- A deep learning library implemented in NumPy (with automatic differentiation support).☆15May 4, 2021Updated 5 years ago
- AI for Mathematics Paper List☆17Jan 14, 2025Updated last year
- ☆58Sep 23, 2022Updated 3 years ago
- Code for participation in the ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Jun 16, 2021Updated 4 years ago
- ☆24Nov 22, 2022Updated 3 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆56Sep 1, 2023Updated 2 years ago
- Repository for the Findings of ACL'23 paper Label Agnostic Pre-training for Zero-shot Text Classification☆12Aug 10, 2023Updated 2 years ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 3 years ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 3 years ago
- An Open-Source Package for Information Retrieval.☆443Oct 7, 2022Updated 3 years ago
- An Open-Source Package for Information Retrieval☆167Apr 27, 2026Updated 3 weeks ago
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》☆25Dec 15, 2021Updated 4 years ago
- Tools for the TREC CAsT benchmark☆30Dec 15, 2022Updated 3 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23May 20, 2021Updated 5 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- Portal Tutorial☆11Feb 3, 2018Updated 8 years ago
- BPE-based Korean T5 model for a unified text-to-text framework☆63Apr 17, 2024Updated 2 years ago
- ☆12Sep 14, 2021Updated 4 years ago
- Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra…☆33Sep 15, 2021Updated 4 years ago
- ☆11May 25, 2023Updated 2 years ago
- ☆19Nov 4, 2025Updated 6 months ago
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago
- Deploy KoGPT with Triton Inference Server☆14Nov 18, 2022Updated 3 years ago
- The official tool for creating proceedings for conferences of the Association for Computational Linguistics (ACL).☆41Jun 2, 2021Updated 4 years ago
- ☆102Dec 17, 2022Updated 3 years ago