apple / ml-lucid-datagen
☆28 · Updated 8 months ago
Related projects
Alternatives and complementary repositories for ml-lucid-datagen
- Unofficial implementation of AlpaGasus ☆84 · Updated last year
- This is the official repository of the paper "OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI" ☆86 · Updated this week
- ☆56 · Updated 9 months ago
- 🚢 Data Toolkit for Sailor Language Models ☆82 · Updated 4 months ago
- Reformatted Alignment ☆112 · Updated 2 months ago
- Code and data for CoachLM, an automatic instruction revision approach for LLM instruction tuning. ☆58 · Updated 8 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆33 · Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆124 · Updated 3 weeks ago
- Expert Specialized Fine-Tuning ☆148 · Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM ☆42 · Updated 6 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆74 · Updated 10 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs. ☆76 · Updated 9 months ago
- ☆28 · Updated 5 months ago
- Code and data for "Dynosaur: A Dynamic Growth Paradigm for Instruction-Tuning Data Curation" (EMNLP 2023) ☆62 · Updated 11 months ago
- Evaluating tool-augmented LLMs in conversation settings ☆72 · Updated 5 months ago
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection ☆81 · Updated 6 months ago
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time ☆94 · Updated 5 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al… ☆111 · Updated last year
- FuseAI Project ☆76 · Updated 3 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆107 · Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆72 · Updated 2 months ago
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs ☆43 · Updated 4 months ago
- A collection of instruction data and scripts for machine translation. ☆20 · Updated last year
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models" ☆38 · Updated last month
- The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models" ☆49 · Updated 4 months ago
- Fast LLM training codebase with dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆34 · Updated 10 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆73 · Updated 3 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆47 · Updated last month
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆68 · Updated 5 months ago