☆39Jul 25, 2024Updated last year
Alternatives and similar repositories for Efficient-Large-LM-Trainer
Users that are interested in Efficient-Large-LM-Trainer are comparing it to the libraries listed below
- ☆24Oct 23, 2020Updated 5 years ago
- ↔️ Utilizing RBERT model structure for KLUE Relation Extraction task☆15Nov 15, 2022Updated 3 years ago
- AI for Mathematics Paper List☆17Jan 14, 2025Updated last year
- Convert pretrained RoBERTa models to various long-document transformer models☆11Apr 5, 2022Updated 3 years ago
- Source code for our AAAI'22 paper 《From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression》☆25Dec 15, 2021Updated 4 years ago
- [EMNLP 2022] This is the code repo for our EMNLP'22 paper "Dimension Reduction for Efficient Dense Retrieval via Conditional Autoencoder"…☆12Oct 20, 2022Updated 3 years ago
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)☆11Oct 12, 2022Updated 3 years ago
- ☆16Dec 11, 2024Updated last year
- ☆59Sep 23, 2022Updated 3 years ago
- ☆13Jun 6, 2022Updated 3 years ago
- ☆15Oct 10, 2021Updated 4 years ago
- ☆32Mar 31, 2020Updated 5 years ago
- A deep learning library implemented with NumPy (supports automatic differentiation).☆15May 4, 2021Updated 4 years ago
- Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Tra…☆33Sep 15, 2021Updated 4 years ago
- TREC-COVID results - this is a mirror of data on the TREC website in a more convenient format.☆15Aug 31, 2020Updated 5 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 4 months ago
- Code for our SIGIR 2022 accepted paper : P3 Ranker: Mitigating the Gaps between Pre-training and Ranking Fine-tuning with Prompt-based L…☆18Sep 24, 2023Updated 2 years ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- MATCH-TUNING☆15Aug 6, 2022Updated 3 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated last year
- An Open-Source Package for Information Retrieval☆168Updated this week
- code for participation in ICDAR2021 Table Recognition track (Team Name: LTIAYN = Kaen Context)☆22Jun 16, 2021Updated 4 years ago
- ☆33Jun 3, 2025Updated 8 months ago
- text classification using ELMO☆16Dec 8, 2018Updated 7 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ACL-2022☆18May 19, 2022Updated 3 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆19Aug 28, 2023Updated 2 years ago
- ☆24Nov 22, 2022Updated 3 years ago
- This repository contains some of the code used in the paper "Training Language Models with Language Feedback at Scale"☆27Mar 30, 2023Updated 2 years ago
- [NAACL'22] TaCL: Improving BERT Pre-training with Token-aware Contrastive Learning☆94Jun 8, 2022Updated 3 years ago
- A comprehensive benchmark for evaluating deep research agents on academic survey tasks☆50Sep 4, 2025Updated 5 months ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 2 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- An MBTI test on large language models like GPT-3.☆27May 2, 2022Updated 3 years ago
- VaLM: Visually-augmented Language Modeling. ICLR 2023.☆56Mar 6, 2023Updated 2 years ago
- Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3☆23May 20, 2021Updated 4 years ago
- DocBankLoader is a dataset loader for DocBank, and can convert DocBank to the Object Detection models' format.☆25Mar 17, 2021Updated 4 years ago
- ☆12Sep 14, 2021Updated 4 years ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆120Oct 9, 2025Updated 4 months ago