A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)
☆24Jul 26, 2024Updated last year
Alternatives and similar repositories for AlpaGasus
Users that are interested in AlpaGasus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Unofficial implementation of AlpaGasus☆95Sep 23, 2023Updated 2 years ago
- The source code of [WWW 2025] MoDiCF☆12Jul 12, 2025Updated 8 months ago
- Official code for "Traffic Speed Imputation with Spatio-Temporal Attentions and Cycle-Perceptual Training" (CIKM'22).☆13Mar 8, 2024Updated 2 years ago
- Code for "A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language Models"☆17Jul 20, 2025Updated 8 months ago
- ☆14Apr 21, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆47Jan 22, 2026Updated 2 months ago
- ☆18May 5, 2021Updated 4 years ago
- PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023.☆20Jun 28, 2024Updated last year
- Multi-Critic Policy Gradient Optimization for Quadcopter Coordination☆14Aug 10, 2021Updated 4 years ago
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆16Feb 25, 2025Updated last year
- ☆14Aug 15, 2024Updated last year
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving☆24Aug 25, 2025Updated 7 months ago
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated 2 years ago
- Multi-Agent Reinforcement Learning☆11Jun 16, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- [ACL 2025 Main] Official Repo for Paper "Measuring Data Diversity for Instruction Tuning: A Systematic Analysis and A Reliable Metric"☆36Feb 10, 2026Updated last month
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- [ICLR 2025] Breaking Mental Set to Improve Reasoning through Diverse Multi-Agent Debate☆19Apr 22, 2025Updated 11 months ago
- ☆25May 29, 2022Updated 3 years ago
- ☆13Jan 14, 2026Updated 2 months ago
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!☆13Oct 22, 2023Updated 2 years ago
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆18Sep 15, 2025Updated 6 months ago
- Implementation of our paper "Towards Consistent Document-Level Entity Linking: Joint Models for Entity Linking and Coreference Resolution…☆12Nov 13, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the benchmarking single-cell foundation models (scGPT, scBERT, and Geneformer) for cell-type annotation task using skewed single…☆15Dec 8, 2024Updated last year
- MICCAI 2024 code for the paper: EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing. EchoNet-Synthetic i…☆38Jun 16, 2025Updated 9 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆91Nov 13, 2024Updated last year
- ☆16Aug 1, 2024Updated last year
- We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Models on chart-to-…☆24Jan 27, 2026Updated 2 months ago
- An introduction to theorem proving in Lean for the impatient.☆19Apr 6, 2025Updated 11 months ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆14Jul 21, 2024Updated last year
- ☆12Jun 12, 2024Updated last year
- [WSDM 2025] Source code for "Spectrum-based Modality Representation Fusion Graph Convolutional Network for Multimodal Recommendation".☆37Dec 22, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- InternLM-7B微调, SFT/LoRA, instruction finetune☆13May 17, 2024Updated last year
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆20Mar 13, 2025Updated last year
- The Pre-lease github repository of ECHOPULSE: ECG CONTROLLED ECHOCARDIO- GRAMS VIDEO GENERATION☆42Feb 4, 2025Updated last year
- [KDD 2022] Multi-modal Siamese Network for Entity Alignment☆33Jul 10, 2025Updated 8 months ago
- ☆12Dec 22, 2024Updated last year
- ☆41Nov 19, 2022Updated 3 years ago