penfever / wildchat-50m
Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.
☆29 · Updated last month
Alternatives and similar repositories for wildchat-50m
Users who are interested in wildchat-50m are comparing it to the repositories listed below.
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 · Updated 8 months ago
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) ☆30 · Updated 2 months ago
- Official implementation of ECCV24 paper: POA ☆24 · Updated 9 months ago
- Aioli: A unified optimization framework for language model data mixing ☆25 · Updated 3 months ago
- ☆56 · Updated this week
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆32 · Updated 7 months ago
- ☆33 · Updated 10 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning? ☆25 · Updated last month
- Official implementation of "BERTs are Generative In-Context Learners" ☆27 · Updated last month
- ☆13 · Updated 4 months ago
- A repository for research on medium-sized language models. ☆76 · Updated 11 months ago
- ☆25 · Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 · Updated last year
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories ☆12 · Updated 3 weeks ago
- ☆21 · Updated 2 months ago
- Exploration of automated dataset selection approaches at large scales. ☆40 · Updated 2 months ago
- ☆48 · Updated 6 months ago
- PyTorch code for System-1.x: Learning to Balance Fast and Slow Planning with Language Models ☆22 · Updated 9 months ago
- MEXMA: Token-level objectives improve sentence representations ☆41 · Updated 4 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆29 · Updated last month
- This repo is based on https://github.com/jiaweizzhao/GaLore ☆27 · Updated 7 months ago
- ☆63 · Updated last month
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi… ☆23 · Updated 3 months ago
- ☆24 · Updated last year
- ☆56 · Updated this week
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 · Updated 11 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs. ☆24 · Updated 3 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given… ☆14 · Updated last year
- ☆20 · Updated 5 months ago
- ☆47 · Updated 8 months ago