BunsenFeng / model_swarm
☆13 Updated 3 months ago
Alternatives and similar repositories for model_swarm:
Users who are interested in model_swarm are comparing it to the repositories listed below.
- Official Implementation of "Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning" at EMNLP 2024 Main Conf… ☆26 Updated 2 months ago
- Direct preference optimization with f-divergences. ☆13 Updated 4 months ago
- Official repository of "Can Language Models Solve Graph Problems in Natural Language?". NeurIPS 2023 (Spotlight) ☆121 Updated 7 months ago
- [COLM'24] "Deductive Beam Search: Decoding Deducible Rationale for Chain-of-Thought Reasoning" ☆20 Updated 9 months ago
- ☆36 Updated last week
- ☆19 Updated last year
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ☆72 Updated 7 months ago
- LaTeX Drawing ☆11 Updated last year
- Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment" ☆61 Updated 3 months ago
- Official code for the paper "Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation" ☆20 Updated last year
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA… ☆29 Updated 4 months ago
- ☆39 Updated 4 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style ☆28 Updated this week
- GenRM-CoT: Data release for verification rationales ☆53 Updated 5 months ago
- Early release of the official implementation for "UniGraph: Learning a Unified Cross-Domain Foundation Model for Text-Attributed Graphs" ☆11 Updated 7 months ago
- What does the bot say? ACL 2024 ☆20 Updated 7 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision ☆119 Updated 6 months ago
- [ICML 2024] Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications ☆73 Updated last month
- Accepted LLM Papers in NeurIPS 2024 ☆34 Updated 5 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions ☆107 Updated 6 months ago
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral. ☆20 Updated 11 months ago
- Lightweight Adapting for Black-Box Large Language Models ☆23 Updated last year
- This is my attempt to create a Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g… ☆31 Updated 3 months ago
- ☆43 Updated 5 months ago
- Reproduction resources for the linear alignment paper; still in progress ☆17 Updated 10 months ago
- Official code repository for the main conference paper in ACL 2023: COLA: Contextualized Commonsense Causality Reasoning from the Causal I… ☆28 Updated last year
- ☆14 Updated last year
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference) ☆57 Updated 5 months ago
- ☆16 Updated 4 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models ☆54 Updated 11 months ago