☆18Feb 20, 2024Updated 2 years ago
Alternatives and similar repositories for Small2Large
Users that are interested in Small2Large are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆23Mar 11, 2024Updated 2 years ago
- Collection of training data management explorations for large language models☆341Aug 2, 2024Updated last year
- A demonstration of how to train a custom tokenizer similar to TikToken.☆15Jan 6, 2025Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆190Jun 25, 2025Updated last year
- babyLM WhisBERT code☆19May 27, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆24Jan 27, 2025Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆48Mar 29, 2024Updated 2 years ago
- Identification of Human Phenotype Entities☆11Nov 2, 2018Updated 7 years ago
- Official implementation of Language Models as Compilers: Simulating the Execution Of Pseudocode Improves Algorithmic Reasoning in Languag…☆23Apr 8, 2024Updated 2 years ago
- The official implementation of "ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering"☆63Jun 21, 2025Updated last year
- This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibrat…☆23May 21, 2025Updated last year
- Code and data for QueryAgent(ACL 2024)☆21Dec 19, 2024Updated last year
- ☆23Aug 7, 2023Updated 2 years ago
- arXiv 2024 | ZIP: entropy-law data selection for efficient LLM alignment.☆28Jun 10, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"☆32Jun 25, 2025Updated last year
- 🌏 Modular retrievers for zero-shot multilingual IR.☆30Mar 6, 2024Updated 2 years ago
- ☆100Jun 27, 2024Updated 2 years ago
- [ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs☆57May 26, 2025Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆197Mar 25, 2024Updated 2 years ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆54Jun 24, 2024Updated 2 years ago
- ☆32Jul 8, 2024Updated last year
- Code for the paper "Contextualized Weak Supervision for Text Classification"☆52Mar 25, 2021Updated 5 years ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆137Jul 10, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆12Jan 27, 2025Updated last year
- ☆38Aug 5, 2024Updated last year
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation"☆10Mar 8, 2024Updated 2 years ago
- ☆47Jun 11, 2025Updated last year
- ☆12Dec 7, 2024Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆597Dec 9, 2024Updated last year
- A pre-commit hook for Pyrefly.☆27Jun 19, 2026Updated last week
- Finetune Malaysian LLM for Malaysian context embedding task.☆23Apr 27, 2024Updated 2 years ago
- Practice recognizing chords in this Rust/Yew/Webassembly app☆16Jan 20, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Dec 30, 2020Updated 5 years ago
- Exploring aspects of similarity between spoken personal narratives by disentangling them into narrative clause types -- Supplementary inf…☆12Jul 14, 2020Updated 5 years ago
- Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg…☆31Oct 10, 2025Updated 8 months ago
- ☆11Feb 21, 2019Updated 7 years ago
- ☆10Oct 17, 2022Updated 3 years ago
- ☆52Jun 14, 2024Updated 2 years ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆58Nov 8, 2024Updated last year