☆39Jul 25, 2024Updated last year
Alternatives and similar repositories for Efficient-Large-LM-Trainer
Users that are interested in Efficient-Large-LM-Trainer are comparing it to the libraries listed below.
- ☆24Oct 23, 2020Updated 5 years ago
- Microsoft Assent is an enterprise scale product, internally used at Microsoft. It delivers a modern approvals experience for any approval…☆33Apr 3, 2026Updated 3 weeks ago
- [EMNLP 2022] This is the code repo for our EMNLP'22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…☆13Oct 20, 2022Updated 3 years ago
- ☆36Jun 12, 2023Updated 2 years ago
- ☆15Oct 10, 2021Updated 4 years ago
- ☆32Mar 31, 2020Updated 6 years ago
- ↔️ Utilizing RBERT model structure for KLUE Relation Extraction task☆15Nov 15, 2022Updated 3 years ago
- A PyTorch Lightning Implementation of Transformer Network☆11Oct 22, 2020Updated 5 years ago
- ☆13Jun 6, 2022Updated 3 years ago
- Convert pretrained RoBERTa models to various long-document transformer models☆11Apr 5, 2022Updated 4 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated 2 years ago
- Convenient Text-to-Text Training for Transformers☆18Dec 10, 2021Updated 4 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 6 months ago
- A deep learning library implemented in NumPy (supports automatic differentiation).☆15May 4, 2021Updated 4 years ago
- AI for Mathematics Paper List☆17Jan 14, 2025Updated last year
- ☆58Sep 23, 2022Updated 3 years ago
- Code for participation in the ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Jun 16, 2021Updated 4 years ago
- ☆24Nov 22, 2022Updated 3 years ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AI☆56Sep 1, 2023Updated 2 years ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 2 years ago
- An Open-Source Package for Information Retrieval.☆443Oct 7, 2022Updated 3 years ago
- True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning☆12Jul 6, 2022Updated 3 years ago
- An Open-Source Package for Information Retrieval☆167Mar 16, 2026Updated last month
- Source code for our AAAI'22 paper "From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression"☆25Dec 15, 2021Updated 4 years ago
- ACL 2021: HiTransformer☆13May 29, 2021Updated 4 years ago
- Tools for the TREC CAsT benchmark☆30Dec 15, 2022Updated 3 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23May 20, 2021Updated 4 years ago
- ☆42Sep 25, 2019Updated 6 years ago
- Information Retrieval Relevance Judging System☆29Jan 17, 2022Updated 4 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- ☆31Jun 28, 2022Updated 3 years ago
- Portal Tutorial☆11Feb 3, 2018Updated 8 years ago
- BPE-based Korean T5 model for text-to-text unified framework☆63Apr 17, 2024Updated 2 years ago
- ☆12Sep 14, 2021Updated 4 years ago
- ☆22Sep 2, 2025Updated 8 months ago
- Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra…☆33Sep 15, 2021Updated 4 years ago
- An example of fine-tuning kogpt with oslo.☆23Aug 26, 2022Updated 3 years ago
- ☆11May 25, 2023Updated 2 years ago