linkedin / ControlLLM
Control LLM
☆14Updated last week
Alternatives and similar repositories for ControlLLM:
Users that are interested in ControlLLM are comparing it to the libraries listed below
- ☆16Updated 8 months ago
- ☆10Updated 2 months ago
- Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆12Updated last week
- ☆16Updated 3 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 8 months ago
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆12Updated 2 weeks ago
- Code for paper: Long cOntext aliGnment via efficient preference Optimization☆13Updated 2 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆25Updated 5 months ago
- ☆20Updated 5 months ago
- Implementation for the paper "Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning"☆9Updated 3 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 6 months ago
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated last month
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆25Updated last month
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆14Updated 8 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆19Updated 3 months ago
- Code for paper: "LASeR: Learning to Adaptively Select Reward Models with Multi-Arm Bandits"☆13Updated 6 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆29Updated 3 weeks ago
- Mosaic IT: Enhancing Instruction Tuning with Data Mosaics☆17Updated 2 months ago
- Exploration of automated dataset selection approaches at large scales.☆37Updated last month
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation☆10Updated 5 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM"☆13Updated 2 weeks ago
- ☆20Updated last month
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 2 weeks ago
- The code of arXiv paper: "Dynamic Scaling of Unit Tests for Code Reward Modeling"☆18Updated 3 months ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"☆17Updated last month
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆20Updated 2 months ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆20Updated last month
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆24Updated 2 months ago
- Project for SNARE benchmark☆10Updated 10 months ago