Hritikbansal / sparse_feedbackLinks

☆29

Alternatives and similar repositories for sparse_feedback

Users that are interested in sparse_feedback are comparing it to the libraries listed below

Sorting:

allenai / easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
☆48Updated last year
declare-lab / flacuna
Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…
☆111Updated last year
giangdip2410 / HyperRouter
Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"
☆33Updated last year
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆44Updated last year
EternityYW / Gemini-Commonsense-Evaluation
Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"
☆36Updated last year
PootieT / explain-then-translate
Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…
☆29Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 11 months ago
lunyiliu / CoachLM
Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.
☆60Updated last year
xhan77 / in-context-alignment
In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning
☆35Updated 2 years ago
shenao-zhang / SELM
The official implementation of Self-Exploring Language Models (SELM)
☆64Updated last year
allenai / super-benchmark
☆45Updated 4 months ago
clinicalml / co-llm
Co-LLM: Learning to Decode Collaboratively with Multiple Language Models
☆116Updated last year
tianyi-lab / C3PO
Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"
☆17Updated 4 months ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆35Updated 10 months ago
rosewang2008 / backtracing
Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.
☆89Updated last year
austrian-code-wizard / c3po
☆29Updated last week
voidism / Lookback-Lens
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
☆130Updated 11 months ago
facebookresearch / llm-cross-capabilities
Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"
☆42Updated 10 months ago
dinobby / MAgICoRE
☆24Updated 10 months ago
limenlp / safer-instruct
This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"
☆17Updated last year
facebookresearch / NeuralMemory
A Data Source for Reasoning Embodied Agents
☆19Updated last year
kaistAI / Janus
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆49Updated 8 months ago
choosewhatulike / case2code
☆15Updated 4 months ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆86Updated last year
yale-nlp / refdpo
☆16Updated last year
architsharma97 / dpo-rlaif
☆99Updated last year
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
amazon-science / controllable-readability-summarization
Generating Summaries with Controllable Readability Levels (EMNLP 2023)
☆13Updated last month
kernelmachine / silo-lm
SILO Language Models code repository
☆81Updated last year
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40Updated 8 months ago