cathyxl / MAgIC
☆33 Updated 9 months ago
Related projects
Alternatives and complementary repositories for MAgIC
- ☆53 Updated 2 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆28 Updated 2 weeks ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs. ☆61 Updated 3 weeks ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆32 Updated 9 months ago
- An Easy-to-use Hallucination Detection Framework for LLMs. ☆49 Updated 6 months ago
- ☆43 Updated last month
- ☆33 Updated last year
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆45 Updated 7 months ago
- my commonly-used tools ☆47 Updated 3 months ago
- [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View ☆27 Updated 3 weeks ago
- ☆31 Updated last year
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ☆38 Updated 3 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ☆67 Updated last month
- Code for https://arxiv.org/abs/2401.17139 (NeurIPS 2024) ☆17 Updated this week
- Mosaic IT: Enhancing Instruction Tuning with Data Mosaics ☆14 Updated 4 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr… ☆72 Updated 9 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios ☆43 Updated 7 months ago
- Reproduction of "RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment" ☆64 Updated last year
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models" ☆43 Updated last week
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ☆66 Updated 3 weeks ago
- Directional Preference Alignment ☆49 Updated last month
- Official repository for paper "GTA: A Benchmark for General Tool Agents" (NeurIPS 2024 D&B Track) ☆43 Updated this week
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models ☆47 Updated last month
- ☆28 Updated last week
- [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models" ☆82 Updated 8 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct ☆111 Updated last week
- [ICML 2024 Oral] Official code repository for MLLM-as-a-Judge. ☆53 Updated 3 months ago
- TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models ☆59 Updated 9 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆56 Updated 8 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization ☆52 Updated 2 months ago