jeongukjae / smaller-labse

Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE

☆18

Alternatives and similar repositories for smaller-labse

Users who are interested in smaller-labse are comparing it to the libraries listed below.

facebookresearch / romqa
A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
☆16Updated 2 years ago
cimeister / typical-sampling
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆82Updated 3 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆93Updated 2 years ago
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58Updated 3 years ago
LAION-AI / Anh
Anh - LAION's multilingual assistant datasets and models
☆27Updated 2 years ago
Geotrend-research / smaller-transformers
Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.
☆105Updated 3 years ago
jason9693 / ETA4LLMs
Calculating Expected Time for training LLM.
☆38Updated 2 years ago
nreimers / se-pytorch-xla
☆21Updated 3 years ago
Beomi / exbert-transformers
exBERT on Transformers🤗
☆10Updated 4 years ago
hyunwoongko / summarizers
Package for controllable summarization
☆78Updated 2 years ago
Clyde013 / Paraphrase-OPT
Observe the slow deterioration of my mental sanity in the github commit history
☆12Updated 2 years ago
castorini / hf-spacerini
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆32Updated 2 years ago
monologg / EncT5
Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks
☆63Updated 3 years ago
facebookresearch / ketod
KETOD: Knowledge-Enriched Task-Oriented Dialogue
☆32Updated 2 years ago
CPJKU / wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
☆82Updated 10 months ago
hyunwoongko / megatron-11b
Megatron LM 11B on Huggingface Transformers
☆27Updated 4 years ago
amazon-science / transformers-data-augmentation
Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper
☆52Updated 2 years ago
huggingface / data-measurements-tool
Developing tools to automatically analyze datasets
☆74Updated 9 months ago
facebookresearch / ELECTRA-Fewshot-Learning
This repository contains the code for the paper "Prompting ELECTRA: Few-Shot Learning with Discriminative Pre-Trained Models".
☆48Updated 3 years ago
comet-ml / blog-serving-hugging-face-models
☆20Updated 4 years ago
philschmid / optimum-transformers-optimizations
☆30Updated 2 years ago
tlkh / t2t-tuner
Convenient Text-to-Text Training for Transformers
☆19Updated 3 years ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆39Updated 2 years ago
jungokasai / beam_with_patience
☆46Updated 3 years ago
huggingface / hffs
**ARCHIVED** Filesystem interface to 🤗 Hub
☆58Updated 2 years ago
salesforce / TaiChi
Open source library for few shot NLP
☆78Updated 2 years ago
wietsedv / gpt2-recycle
As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
☆48Updated 4 years ago
NathanGodey / headless-lm
Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…
☆27Updated last year
cisnlp / ofa
A framework that aims to wisely initialize unseen subword embeddings in PLMs for efficient large-scale continued pretraining
☆18Updated last year
bigscience-workshop / multilingual-modeling
BLOOM+1: Adapting BLOOM model to support a new unseen language
☆73Updated last year