penfever / wildchat-50mLinks
Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.
☆31Updated 8 months ago
Alternatives and similar repositories for wildchat-50m
Users that are interested in wildchat-50m are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- ☆55Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆123Updated 4 months ago
- A repository for research on medium sized language models.☆77Updated last year
- ☆88Updated last week
- Fork of Flame repo for training of some new stuff in development☆19Updated 2 weeks ago
- ☆39Updated last year
- Resa: Transparent Reasoning Models via SAEs☆46Updated 2 months ago
- ☆21Updated 4 months ago
- Aioli: A unified optimization framework for language model data mixing☆31Updated 11 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆111Updated 8 months ago
- ☆70Updated last year
- Leveraging Base Language Models for Few-Shot Synthetic Data Generation☆38Updated 2 months ago
- Lottery Ticket Adaptation☆40Updated last year
- ☆67Updated 8 months ago
- An automated data pipeline scaling RL to pretraining levels☆72Updated 2 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆53Updated last year
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆93Updated last year
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Updated 2 years ago
- ☆52Updated last year
- ☆89Updated last year
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆43Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆123Updated last year
- Verifiers for LLM Reinforcement Learning☆80Updated 8 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆73Updated 8 months ago
- ☆80Updated last month
- ☆41Updated 6 months ago
- ☆52Updated 6 months ago
- This is the official repository for Inheritune.☆116Updated 10 months ago
- Generate interleaved text and image content in a structured format you can directly pass to downstream APIs.☆29Updated last year