oooranz / Baby-CoThought
Baby's CoThought: Leveraging LLMs for Enhanced Reasoning in Compact Models
☆17Updated 2 months ago
Alternatives and similar repositories for Baby-CoThought:
Users that are interested in Baby-CoThought are comparing it to the libraries listed below
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆42Updated last year
- ☆119Updated 5 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆41Updated last year
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆26Updated last month
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆113Updated 6 months ago
- ☆122Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆13Updated 8 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆44Updated 3 months ago
- ☆142Updated 11 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆69Updated 2 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆83Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 6 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated 11 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆126Updated last year
- ☆96Updated 8 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆109Updated last year
- ☆17Updated 5 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆112Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆43Updated last year
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following☆79Updated 6 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆94Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆84Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆113Updated 4 months ago
- Evaluating LLMs with fewer examples☆147Updated 11 months ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆53Updated 9 months ago