BestAnHongjun / SentenceVAE
Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context
☆32 Updated 10 months ago
Alternatives and similar repositories for SentenceVAE
Users who are interested in SentenceVAE are comparing it to the repositories listed below.
- [NeurIPS-2024] Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ☆85 Updated 9 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ☆37 Updated 4 months ago
- The official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation" ☆38 Updated 8 months ago
- Code for paper "Patch-Level Training for Large Language Models" ☆85 Updated 7 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning" ☆80 Updated last year
- RM-R1: Unleashing the Reasoning Potential of Reward Models ☆111 Updated this week
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding ☆50 Updated 6 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆50 Updated 3 weeks ago
- The official implementation for Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free ☆44 Updated last month
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models ☆85 Updated last year
- A comprehensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model… ☆47 Updated 2 weeks ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models ☆52 Updated 4 months ago
- Long Context Extension and Generalization in LLMs ☆57 Updated 9 months ago
- [NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding ☆18 Updated 8 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆104 Updated last month
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay ☆79 Updated 3 weeks ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context" ☆35 Updated 11 months ago
- Codebase for Instruction Following without Instruction Tuning ☆34 Updated 9 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models ☆78 Updated last year
- LightThinker: Thinking Step-by-Step Compression ☆59 Updated 2 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning ☆61 Updated 6 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains ☆142 Updated 2 weeks ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" ☆119 Updated 3 weeks ago
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation ☆69 Updated 3 weeks ago
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate" ☆108 Updated last month
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search" ☆25 Updated last month