declare-lab / flacuna
Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is already an excellent writing assistant, and the intention behind Flacuna was to enhance Vicuna's problem-solving capabilities. To achieve this, we curated a dedicated instruction dataset called Flan-mini.
☆110Updated last year
Related projects ⓘ
Alternatives and complementary repositories for flacuna
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆58Updated 7 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆111Updated last month
- ☆123Updated 6 months ago
- [IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection☆81Updated 6 months ago
- Self-Alignment with Principle-Following Reward Models☆148Updated 8 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆91Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆76Updated 7 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆114Updated 6 months ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆105Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆73Updated 9 months ago
- Unofficial implementation of AlpaGasus☆84Updated last year
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset☆154Updated 6 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆102Updated 6 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆72Updated 2 months ago
- Code repository for the c-BTM paper☆105Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆87Updated 3 months ago
- ☆132Updated last year
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆92Updated 9 months ago
- ☆111Updated last month
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆78Updated this week
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆124Updated 2 weeks ago
- ☆169Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆140Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆211Updated last year
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆109Updated 2 months ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆96Updated 4 months ago
- This is the official repository for Inheritune.☆105Updated last month
- ☆91Updated 7 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆87Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆41Updated 8 months ago