jaketae / ensemble-transformers
Ensembling Hugging Face transformers made easy
☆63Updated last year
Related projects ⓘ
Alternatives and complementary repositories for ensemble-transformers
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 2 years ago
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆72Updated 2 years ago
- ☆21Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆92Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆75Updated 2 months ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆12Updated 11 months ago
- ☆55Updated last year
- ☆20Updated last year
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆29Updated last year
- Calculating Expected Time for training LLM.☆38Updated last year
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper☆13Updated 7 months ago
- This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.☆45Updated 2 years ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆39Updated last year
- AVocaDo : Strategy for Adapting Vocabulary to Downstream Domain☆22Updated 2 years ago
- exBERT on Transformers🤗☆10Updated 3 years ago
- PyTorch reimplementation of the paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization"☆16Updated 3 years ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆14Updated 7 months ago
- Dense hybrid representations for text retrieval☆62Updated last year
- Prompt-and-Rerank: A Method for Zero-Shot and Few-Shot Textual Style Transfer☆33Updated 2 years ago
- The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆43Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆44Updated last year
- Shared code for training sentence embeddings with Flax / JAX☆27Updated 3 years ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆81Updated 3 weeks ago
- ☆9Updated 2 months ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆51Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆27Updated 2 months ago
- Observe the slow deterioration of my mental sanity in the github commit history☆13Updated last year
- ☆15Updated 3 months ago