TemporaryLoRA / Temp-LoRA
☆95Updated 9 months ago
Alternatives and similar repositories for Temp-LoRA:
Users that are interested in Temp-LoRA are comparing it to the libraries listed below
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆154Updated 7 months ago
- ☆55Updated 2 months ago
- A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to…☆53Updated last year
- ☆45Updated 7 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models☆173Updated 3 months ago
- Counting-Stars (★)☆78Updated 5 months ago
- This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems☆46Updated 2 weeks ago
- Repository of LV-Eval Benchmark☆58Updated 5 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆239Updated last month
- ☆78Updated last year
- ☆48Updated 10 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆64Updated last month
- ☆94Updated 4 months ago
- ☆36Updated 4 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆76Updated 10 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆128Updated 7 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆120Updated 2 weeks ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆138Updated 4 months ago
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- Reformatted Alignment☆113Updated 4 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆156Updated 7 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆105Updated 6 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆143Updated 6 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆144Updated 2 weeks ago
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆66Updated 6 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆136Updated 7 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆71Updated 7 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 10 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆209Updated 3 months ago