A better Alpaca Model Trained with Less Data (only 9k instructions of the original set)
☆24Jul 26, 2024Updated last year
Alternatives and similar repositories for AlpaGasus
Users that are interested in AlpaGasus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The source code of [WWW 2025] MoDiCF☆14Mar 26, 2026Updated 2 months ago
- Official code for "Traffic Speed Imputation with Spatio-Temporal Attentions and Cycle-Perceptual Training" (CIKM'22).☆13Mar 8, 2024Updated 2 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- Kernel Herding for probability density estimation☆14Feb 23, 2016Updated 10 years ago
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆24Apr 26, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the source code for: Context-aware Entity Typing in Knowledge Graphs.☆16May 10, 2022Updated 4 years ago
- ☆14Apr 21, 2023Updated 3 years ago
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆33Jul 8, 2025Updated 10 months ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆48Jan 22, 2026Updated 4 months ago
- ☆18May 5, 2021Updated 5 years ago
- Modified CartPole-v0 OpenAI Gym environment with various noisy cases and Reinforcement Learning based controller☆10Dec 5, 2017Updated 8 years ago
- ☆14Aug 15, 2024Updated last year
- [ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving☆25Apr 6, 2026Updated last month
- ☆23Jan 12, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"☆14Feb 21, 2024Updated 2 years ago
- ☆25May 29, 2022Updated 4 years ago
- First instruction-tuning dataset distilled from Claude2 (52k Alpaca prompts)!☆13Oct 22, 2023Updated 2 years ago
- ☆13Jan 14, 2026Updated 4 months ago
- Code for the benchmarking single-cell foundation models (scGPT, scBERT, and Geneformer) for cell-type annotation task using skewed single…☆15Dec 8, 2024Updated last year
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆91Nov 13, 2024Updated last year
- An introduction to theorem proving in Lean for the impatient.☆19Apr 6, 2025Updated last year
- We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Models on chart-to-…☆28Jan 27, 2026Updated 4 months ago
- This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data☆13Jul 21, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)☆11Oct 12, 2022Updated 3 years ago
- ☆12Jun 12, 2024Updated last year
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…☆35Jun 20, 2023Updated 2 years ago
- The Pre-lease github repository of ECHOPULSE: ECG CONTROLLED ECHOCARDIO- GRAMS VIDEO GENERATION☆45Feb 4, 2025Updated last year
- Calibrating LLM Confidence by Probing Perturbed Representation Stability☆18Jul 5, 2025Updated 10 months ago
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆21Mar 13, 2025Updated last year
- A Large-Scale Knowledge Graph for Automated Scientific Research☆106Updated this week
- Fast CUDA implementation of (differentiable) otam for PyTorch using Numba☆16Jun 21, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [KDD 2022] Multi-modal Siamese Network for Entity Alignment☆34Jul 10, 2025Updated 10 months ago
- ☆41Nov 19, 2022Updated 3 years ago
- The official repo for "Stepping Stones: A Progressive Training Strategy for Audio-Visual Semantic Segmentation", ECCV 2024☆18Oct 11, 2024Updated last year
- A program repair tool which modifies any bugged Python script based on cues from rest of program.☆20Jun 14, 2021Updated 4 years ago
- Anh - LAION's multilingual assistant datasets and models☆28Apr 5, 2023Updated 3 years ago
- Official Implementation of paper "Distilling Long-tailed Datasets" [CVPR 2025]☆21Aug 13, 2025Updated 9 months ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆19May 14, 2026Updated 2 weeks ago