thunlp / APB
Official Implementation of APB (ACL 2025 main)
☆28 Updated 4 months ago
Alternatives and similar repositories for APB
Users who are interested in APB are comparing it to the repositories listed below.
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆35 Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning? ☆25 Updated 3 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)" ☆39 Updated last month
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity". ☆23 Updated 7 months ago
- Open-Source LLM Coders with Co-Evolving Reinforcement Learning ☆78 Updated 2 weeks ago
- Codebase for Instruction Following without Instruction Tuning ☆34 Updated 8 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ☆37 Updated 3 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆38 Updated 3 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆38 Updated last year
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models" ☆51 Updated 11 months ago
- HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models ☆45 Updated 6 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆44 Updated 3 months ago
- [ICLR 2025] MiniPLM: Knowledge Distillation for Pre-Training Language Models ☆47 Updated 7 months ago
- [NeurIPS 2024] An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding ☆18 Updated 8 months ago
- Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters" ☆20 Updated last month
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts ☆40 Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆32 Updated 3 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆50 Updated 2 weeks ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO… ☆55 Updated 3 weeks ago
- The paper list of multilingual pre-trained models (Continual Updated). ☆22 Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines" ☆11 Updated 8 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models ☆52 Updated 4 months ago
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc. ☆39 Updated 9 months ago