apple / ml-lucid-datagen
☆32 Updated last year
Alternatives and similar repositories for ml-lucid-datagen
Users that are interested in ml-lucid-datagen are comparing it to the libraries listed below
- Code and data for CoachLM, an automatic instruction revision approach for LLM instruction tuning. ☆61 Updated last year
- Unofficial implementation of AlpaGasus ☆91 Updated last year
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023) ☆63 Updated last year
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆101 Updated 2 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs. ☆85 Updated last year
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models" ☆42 Updated 7 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆77 Updated last year
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time ☆99 Updated 11 months ago
- Reformatted Alignment ☆113 Updated 7 months ago
- Reasoning by Communicating with Agents ☆28 Updated 2 weeks ago
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs ☆47 Updated 10 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆140 Updated 6 months ago
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆111 Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆34 Updated last year
- ☆68 Updated last year
- ☆69 Updated last year
- Code for ACL2023 paper: Pre-Training to Learn in Context ☆108 Updated 9 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆81 Updated 9 months ago
- FuseAI Project ☆86 Updated 3 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆47 Updated 5 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated last year
- ☆37 Updated this week
- Official implementation of ACL 2025 Findings paper "Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Text… ☆81 Updated this week
- ☆48 Updated last year
- 🚢 Data Toolkit for Sailor Language Models ☆90 Updated 2 months ago
- ☆15 Updated last month
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆55 Updated 7 months ago
- This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver… ☆19 Updated 6 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆74 Updated 11 months ago