yuanzhoulvpi2017 / mamba4transformers
☆11Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for mamba4transformers
- This the implementation of LeCo☆27Updated 4 months ago
- ☆38Updated 5 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆100Updated 2 weeks ago
- ☆17Updated 4 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆46Updated 2 weeks ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆18Updated last week
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆28Updated 5 months ago
- A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆31Updated this week
- Reformatted Alignment☆112Updated last month
- ☆40Updated 5 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆36Updated 8 months ago
- Codebase for Instruction Following without Instruction Tuning☆32Updated last month
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆68Updated 5 months ago
- ☆78Updated 2 months ago
- ☆48Updated 8 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆120Updated last week
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆38Updated 4 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆17Updated 8 months ago
- The GitHub repository for the paper "Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning" accepte…☆19Updated 8 months ago
- ☆62Updated 3 weeks ago
- [SIGIR'24] The official implementation code of MOELoRA.☆127Updated 4 months ago
- The code and data for the paper JiuZhang3.0☆35Updated 5 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆45Updated 7 months ago
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆54Updated last week
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆69Updated last month
- ☆78Updated 7 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆33Updated 10 months ago
- ☆77Updated 4 months ago
- Code and data for "Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue" (ACL 2024)☆21Updated 3 months ago