Flacuna was developed by fine-tuning Vicuna on Flan-mini, an instruction collection covering a wide range of tasks that we curated for this purpose. Vicuna is already an excellent writing assistant; the goal of Flacuna was to enhance its problem-solving capabilities.
☆111 · Sep 10, 2023 · Updated 2 years ago
Alternatives and similar repositories for flacuna
Users who are interested in flacuna are comparing it to the libraries listed below
- ☆25 · Sep 19, 2023 · Updated 2 years ago
- XmodelLM ☆38 · Nov 19, 2024 · Updated last year
- Official code for "3HAN: A Deep Neural Network for Fake News Detection" (ICONIP 2017) ☆88 · Jun 21, 2018 · Updated 7 years ago
- A Residual Network Design with less than 5 million trainable parameters achieving an accuracy of 96.04% on CIFAR-10. ☆27 · Jul 23, 2024 · Updated last year
- ☆44 · Jun 2, 2024 · Updated last year
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations… ☆29 · Dec 5, 2023 · Updated 2 years ago
- Adaptive Inter-Class Similarity Distillation for Semantic Segmentation (MTAP 2025) ☆29 · Nov 14, 2025 · Updated 3 months ago
- ☆37 · Oct 10, 2024 · Updated last year
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem. ☆16 · Jan 9, 2026 · Updated last month
- ☆29 · Jan 23, 2024 · Updated 2 years ago
- Code for fine-tuning Platypus fam LLMs using LoRA ☆629 · Feb 4, 2024 · Updated 2 years ago
- Create and share easy-to-make, built-to-last, innovative, and customizable experiences ☆34 · Feb 21, 2024 · Updated 2 years ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ☆555 · Oct 28, 2023 · Updated 2 years ago
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks. ☆552 · Mar 10, 2024 · Updated last year
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts… ☆199 · Jul 23, 2024 · Updated last year
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind ☆66 · Dec 21, 2023 · Updated 2 years ago
- The official implementation for Collaborative Word-based Pre-trained Item Representation for Transferable Recommendation. ☆25 · Jan 30, 2024 · Updated 2 years ago
- This repository holds the "Fully automated landmarking and facial segmentation on 3D photographs" files ☆30 · Oct 23, 2023 · Updated 2 years ago
- Repo for paper "Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration" ☆349 · May 8, 2024 · Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings. ☆92 · Jul 21, 2024 · Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆126 · May 7, 2024 · Updated last year
- A Data Source for Reasoning Embodied Agents ☆19 · Sep 18, 2023 · Updated 2 years ago
- ☆198 · Feb 9, 2024 · Updated 2 years ago
- Salesforce open-source LLMs with 8k sequence length. ☆725 · Jan 31, 2025 · Updated last year
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing" ☆20 · Apr 9, 2025 · Updated 10 months ago
- Code for our ICLR 2024 paper "PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts" ☆79 · May 5, 2024 · Updated last year
- [Preprint] Learning to Filter Context for Retrieval-Augmented Generation ☆196 · Apr 6, 2024 · Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆35 · Aug 9, 2023 · Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx… ☆137 · Aug 2, 2023 · Updated 2 years ago
- ☆19 · Oct 2, 2023 · Updated 2 years ago
- ☆250 · Feb 20, 2026 · Updated last week
- Fast & Simple repository for pre-training and fine-tuning T5-style models ☆1,017 · Aug 21, 2024 · Updated last year
- This is the official code for the paper "SAM-DA: UAV Tracks Anything at Night with SAM-Powered Domain Adaptation". ☆59 · Sep 27, 2024 · Updated last year
- ☆553 · Feb 8, 2026 · Updated 3 weeks ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context ☆486 · Mar 19, 2024 · Updated last year
- Count Tokens of Code (forked from gocloc) ☆44 · Aug 19, 2024 · Updated last year
- ☆45 · May 20, 2025 · Updated 9 months ago
- fastest vector database made in numpy ☆766 · Oct 9, 2025 · Updated 4 months ago
- Lightweight chat AI platform featuring custom knowledge, open-source LLMs, prompt-engineering, retrieval analysis. Highly customizable. F… ☆218 · Feb 14, 2024 · Updated 2 years ago