AIFlowPlayer / SentenceVAELinks
Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context
β33Updated 11 months ago
Alternatives and similar repositories for SentenceVAE
Users that are interested in SentenceVAE are comparing it to the libraries listed below
Sorting:
- Code for paper "Patch-Level Training for Large Language Models"β85Updated 8 months ago
- [NeurIPS-2024] π Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623β86Updated 9 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimizationβ38Updated 4 months ago
- The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"β38Updated 9 months ago
- [ICML'24] The official implementation of βRethinking Optimization and Architecture for Tiny Language Modelsββ121Updated 6 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Modelsβ52Updated 5 months ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoningβ113Updated last week
- RM-R1: Unleashing the Reasoning Potential of Reward Modelsβ113Updated 3 weeks ago
- β90Updated 2 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning"β81Updated last year
- β52Updated 5 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Modelsβ78Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*β105Updated last month
- [ICLR 2025] MiniPLM: Knowledge Distillation for Pre-Training Language Modelsβ50Updated 7 months ago
- π This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.β86Updated this week
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswalβ¦β52Updated 2 years ago
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search"β25Updated 2 months ago
- Repo for "Z1: Efficient Test-time Scaling with Code"β63Updated 3 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domainsβ149Updated last month
- Long Context Extension and Generalization in LLMsβ57Updated 9 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"β91Updated 2 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paperβ33Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modelingβ50Updated last month
- β74Updated last year
- β18Updated 6 months ago
- Official Implementation of ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replayβ88Updated last month
- Large Language Models Can Self-Improve in Long-context Reasoningβ71Updated 7 months ago
- β55Updated last week
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Modelsβ58Updated 4 months ago
- β132Updated last month