mrpeerat / SCT
SCT: An Efficient Self-Supervised Cross-View Training For Sentence Embedding (TACL)
☆14Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for SCT
- Implementation of ConGen: Unsupervised Control and Generalization Distillation For Sentence Representation (Finding of EMNLP 2022).☆21Updated last year
- Hugging Face RoBERTa with Flash Attention 2☆19Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆74Updated last month
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆12Updated 11 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆92Updated last year
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆27Updated 2 years ago
- A tiny BERT for low-resource monolingual models☆29Updated last month
- ☆16Updated last year
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆29Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- Using short models to classify long texts☆20Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆70Updated 8 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆22Updated 7 months ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆23Updated 2 months ago
- ☆62Updated 9 months ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆48Updated 2 months ago
- ☆25Updated last month
- Mr. TyDi is a multi-lingual benchmark dataset built on TyDi, covering eleven typologically diverse languages.☆72Updated 2 years ago
- Code & Data for Comparative Opinion Summarization via Collaborative Decoding (Iso et al; Findings of ACL 2022)☆21Updated last year
- Convenient Text-to-Text Training for Transformers☆19Updated 2 years ago
- Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting ir…☆30Updated 2 weeks ago
- Observe the slow deterioration of my mental sanity in the github commit history☆13Updated last year
- Do Multilingual Language Models Think Better in English?☆41Updated last year
- ☆20Updated last month
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆28Updated 3 weeks ago
- ☆20Updated 3 years ago
- ☆19Updated last year
- Framework for unified summarisation and evaluation of English documents using state-of-the-art models and measures.☆31Updated 5 months ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆66Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year