jeongukjae / smaller-labse
Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE
☆18Updated 3 years ago
Alternatives and similar repositories for smaller-labse:
Users that are interested in smaller-labse are comparing it to the libraries listed below
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- Megatron LM 11B on Huggingface Transformers☆27Updated 3 years ago
- exBERT on Transformers🤗☆10Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆25Updated 9 months ago
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering☆16Updated 2 years ago
- Implementation of stop sequencer for Huggingface Transformers☆16Updated last year
- Anh - LAION's multilingual assistant datasets and models☆27Updated last year
- Calculating Expected Time for training LLM.☆38Updated last year
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆24Updated last year
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆102Updated 2 years ago
- ☆21Updated 3 years ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining☆14Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'☆17Updated 2 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc…☆53Updated 6 months ago
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆82Updated 2 years ago
- Observe the slow deterioration of my mental sanity in the github commit history☆12Updated last year
- Training T5 to perform numerical reasoning.☆23Updated 3 years ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval☆28Updated 2 years ago
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆19Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Personal information identification standard☆19Updated last year
- KETOD Knowledge-Enriched Task-Oriented Dialogue☆32Updated 2 years ago
- reference pytorch code for intent classification☆44Updated 3 months ago
- Experiments with generating opensource language model assistants☆97Updated last year
- A PyTorch Implementation of the Luna: Linear Unified Nested Attention☆41Updated 3 years ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Updated last year
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentati…☆37Updated 2 years ago
- ☆30Updated 2 years ago