jeongukjae / smaller-labse
Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE
โ18Updated 3 years ago
Related projects โ
Alternatives and complementary repositories for smaller-labse
- exBERT on Transformers๐คโ10Updated 3 years ago
- Tutorial to pretrain & fine-tune a ๐ค Flax T5 model on a TPUv3-8 with GCPโ58Updated 2 years ago
- A Framework aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretrainingโ12Updated 11 months ago
- Implementation of stop sequencer for Huggingface Transformersโ15Updated last year
- Megatron LM 11B on Huggingface Transformersโ27Updated 3 years ago
- Anh - LAION's multilingual assistant datasets and modelsโ27Updated last year
- Calculating Expected Time for training LLM.โ38Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.โ92Updated last year
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paperโ13Updated 6 months ago
- Observe the slow deterioration of my mental sanity in the github commit historyโ13Updated last year
- KETOD Knowledge-Enriched Task-Oriented Dialogueโ31Updated last year
- Difference-based Contrastive Learning for Korean Sentence Embeddingsโ24Updated last year
- A Benchmark for Robust, Multi-evidence, Multi-answer Question Answeringโ16Updated last year
- Personal information identification standardโ19Updated 9 months ago
- data related codebase for polyglot projectโ19Updated last year
- ์ธ์ด๋ชจ๋ธ์ ํ์ตํ๊ธฐ ์ํ ๊ณต๊ฐ ํ๊ตญ์ด instruction dataset๋ค์ ๋ชจ์๋์์ต๋๋ค.โ19Updated last year
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorchโ72Updated last year
- reference pytorch code for intent classificationโ45Updated 3 weeks ago
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Treesโ23Updated last year
- โ23Updated last year
- NeuralWOZ: Learning to Collect Task-Oriented Dialogue via Model-based Simulation (ACL-IJCNLP 2021)โ36Updated 3 years ago
- โ20Updated 3 years ago
- baikal.ai's pre-trained BERT models: descriptions and sample codesโ12Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paperโ51Updated last year
- Consists of the largest (10K) human annotated code-switched semantic parsing dataset & 170K generated utterance using the CST5 augmentatiโฆโ33Updated last year
- Beyond LM: How can language model go forward in the future?โ15Updated last year