arcee-ai / DAM
☆41 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for DAM
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆46 Updated 2 months ago
- A repository for research on medium sized language models. ☆74 Updated 6 months ago
- ☆24 Updated last year
- ☆35 Updated 3 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 4 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆33 Updated 8 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆46 Updated last month
- ☆27 Updated 5 months ago
- Code for NeurIPS LLM Efficiency Challenge ☆54 Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆72 Updated 2 months ago
- Set of scripts to finetune LLMs ☆36 Updated 7 months ago
- ☆22 Updated 2 months ago
- ☆62 Updated 3 months ago
- Minimal implementation of the "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" paper (arXiv 2401.01335) ☆28 Updated 8 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 Updated 8 months ago
- Data preparation code for CrystalCoder 7B LLM ☆42 Updated 6 months ago
- ☆41 Updated last month
- ☆45 Updated 2 months ago
- ☆94 Updated 2 months ago
- ☆37 Updated this week
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆40 Updated 8 months ago
- Evaluating LLMs with CommonGen-Lite ☆85 Updated 8 months ago
- Repo hosting code and materials related to speeding up LLM inference using token merging. ☆29 Updated 6 months ago
- Using multiple LLMs for ensemble Forecasting ☆16 Updated 10 months ago
- ☆52 Updated 2 weeks ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated 9 months ago
- ☆112 Updated last month
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆37 Updated 5 months ago
- ☆20 Updated last year
- ☆28 Updated 8 months ago