☆39Jul 25, 2024Updated last year
Alternatives and similar repositories for Efficient-Large-LM-Trainer
Users that are interested in Efficient-Large-LM-Trainer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆24Oct 23, 2020Updated 5 years ago
- [EMNLP 2022] This is the code repo for our EMNLP‘22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…☆13Oct 20, 2022Updated 3 years ago
- ☆15Oct 10, 2021Updated 4 years ago
- ☆32Mar 31, 2020Updated 6 years ago
- ↔️ Utilizing RBERT model structure for KLUE Relation Extraction task☆15Nov 15, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A Pytorch-Lightning Implementation of Transformer Network☆11Oct 22, 2020Updated 5 years ago
- ☆13Jun 6, 2022Updated 4 years ago
- Convert pretrained RoBerta models to various long-document transformer models☆11Apr 5, 2022Updated 4 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated 2 years ago
- Convenient Text-to-Text Training for Transformers☆18Dec 10, 2021Updated 4 years ago
- ☆54Jan 19, 2023Updated 3 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 7 months ago
- AI for Mathematics Paper List☆17Jan 14, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆58Sep 23, 2022Updated 3 years ago
- ☆17Dec 11, 2024Updated last year
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Jun 16, 2021Updated 4 years ago
- ☆24Nov 22, 2022Updated 3 years ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 3 years ago
- An Open-Source Package for Information Retrieval.☆442Oct 7, 2022Updated 3 years ago
- True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning☆12Jul 6, 2022Updated 3 years ago
- An Open-Source Package for Information Retrieval☆167May 25, 2026Updated 2 weeks ago
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》☆25Dec 15, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ACL 2021: HiTransformer☆13May 29, 2021Updated 5 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23May 20, 2021Updated 5 years ago
- Information Retrieval Relevance Judging System☆29Jan 17, 2022Updated 4 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)☆29Feb 9, 2022Updated 4 years ago
- 🦄 Shades of Purple — A professional theme with hand-picked & bold shades of purple for Base16.☆13Jan 13, 2023Updated 3 years ago
- MATCH-TUNING☆15Aug 6, 2022Updated 3 years ago
- ☆31Jun 28, 2022Updated 3 years ago
- Portal Tutorial☆11Feb 3, 2018Updated 8 years ago
- bpe based korean t5 model for text-to-text unified framework☆63Apr 17, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆22Sep 2, 2025Updated 9 months ago
- Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra…☆33Sep 15, 2021Updated 4 years ago
- kogpt를 oslo로 파인튜닝하는 예제.☆23Aug 26, 2022Updated 3 years ago
- ☆11May 25, 2023Updated 3 years ago
- Web archiving utility library☆11May 5, 2026Updated last month
- ☆19Nov 4, 2025Updated 7 months ago
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago