tianyi-lab / MoE-Embedding
Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"
☆61Updated 6 months ago
Alternatives and similar repositories for MoE-Embedding:
Users that are interested in MoE-Embedding are comparing it to the libraries listed below
- Large Language Models Can Self-Improve in Long-context Reasoning☆68Updated 4 months ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated last month
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆66Updated 3 weeks ago
- ☆91Updated last month
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆90Updated last week
- LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models☆74Updated 6 months ago
- ☆76Updated 3 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆74Updated last month
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆114Updated 3 weeks ago
- SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator☆69Updated 3 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆35Updated 2 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆61Updated 5 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"☆52Updated last week
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆99Updated last month
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆135Updated 2 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆180Updated last month
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆32Updated last year
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆120Updated last month
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆81Updated 2 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆84Updated 9 months ago
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆174Updated last month
- The demo, code and data of FollowRAG☆71Updated 4 months ago
- FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models☆42Updated last week
- ☆39Updated 3 weeks ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 2 months ago
- ☆45Updated 2 months ago
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs"☆52Updated 5 months ago
- ☆82Updated 5 months ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆57Updated 3 months ago
- The official implementation of Self-Exploring Language Models (SELM)☆63Updated 10 months ago