tylerachang / goldfish
Goldfish: Monolingual language models for 350 languages.
☆12Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for goldfish
- ☆41Updated 2 weeks ago
- ☆24Updated last year
- Official implementation of "GPT or BERT: why not both?"☆36Updated last week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- Using short models to classify long texts☆20Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Updated 8 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- ☆46Updated this week
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆41Updated 10 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆44Updated last year
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆24Updated 2 months ago
- ☆22Updated 2 weeks ago
- ☆31Updated 10 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆39Updated 10 months ago
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆14Updated 9 months ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆29Updated last month
- ☆32Updated last year
- ☆45Updated 2 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆17Updated last month
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆50Updated this week
- PyTorch implementation for MRL☆18Updated 9 months ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆29Updated last year
- ☆14Updated 2 weeks ago
- QLoRA for Masked Language Modeling☆20Updated last year
- QLoRA with Enhanced Multi GPU Support☆36Updated last year