tylerachang / goldfish
Goldfish: Monolingual language models for 350 languages.
☆17Updated 9 months ago
Alternatives and similar repositories for goldfish
Users who are interested in goldfish are comparing it to the libraries listed below.
- Using short models to classify long texts☆21Updated 2 years ago
- ☆13Updated 2 weeks ago
- ☆12Updated 6 months ago
- 🕸 GlotCC Dataset and Pipeline -- NeurIPS 2024☆19Updated 2 months ago
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆55Updated 2 weeks ago
- Measuring and Controlling Persona Drift in Language Model Dialogs☆17Updated last year
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- Model implementation for the contextual embeddings project☆26Updated this week
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆81Updated 8 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆63Updated last year
- ☆12Updated last year
- ☆45Updated 4 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Updated 8 months ago
- ☆57Updated 8 months ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆72Updated last year
- ☆20Updated last month
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆56Updated last month
- ☆26Updated last week
- ☆20Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challenge☆58Updated last year
- 🔍 Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment☆11Updated 2 months ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆31Updated 7 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆48Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆26Updated last year
- Ranking of fine-tuned HF models as base models.☆35Updated 3 weeks ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Updated 2 years ago
- ☆22Updated 4 months ago
- ☆49Updated 7 months ago