yuanzhoulvpi2017 / mamba4transformers
☆11Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for mamba4transformers
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"☆31Updated last month
- Code for https://arxiv.org/abs/2401.17139 (NeurIPS 2024)☆23Updated this week
- This the implementation of LeCo☆27Updated 3 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆26Updated 5 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆97Updated last week
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales☆30Updated last year
- Codebase for Instruction Following without Instruction Tuning☆30Updated last month
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆22Updated last month
- A trainable user simulator☆26Updated last month
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆32Updated 10 months ago
- Official repository for paper "GTA: A Benchmark for General Tool Agents" (NeurIPS 2024 D&B Track)☆43Updated last week
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆67Updated last month
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆36Updated 8 months ago
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "☆71Updated 3 weeks ago
- ☆37Updated 5 months ago
- ☆37Updated 4 months ago
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆34Updated 7 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆67Updated 4 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆111Updated this week
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆32Updated 3 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆35Updated last week
- ☆29Updated last year
- Code and data for "Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue" (ACL 2024)☆21Updated 3 months ago
- The official repository of the Omni-MATH benchmark.☆47Updated last week
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆38Updated 4 months ago
- ☆74Updated 4 months ago
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆19Updated 7 months ago
- ☆58Updated 4 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆18Updated last month