adds Sequence Parallelism into LLaMA-Factory
☆12Dec 31, 2024Updated last year
Alternatives and similar repositories for 360-LLaMA-Factory
Users that are interested in 360-LLaMA-Factory are comparing it to the libraries listed below.
- ☆18Dec 2, 2024Updated last year
- ☆22Aug 30, 2025Updated 8 months ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆27May 13, 2025Updated last year
- [NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"☆54May 21, 2025Updated last year
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆27Jul 9, 2024Updated last year
- adds Sequence Parallelism into LLaMA-Factory☆607Feb 5, 2026Updated 3 months ago
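- GPA calculator for Huazhong University of Science and Technology (HUST), including the HUST, standard, and Peking University algorithms☆13Mar 12, 2020Updated 6 years ago
- Labs for Zheng Qilong's 2021 Parallel Programming course at USTC☆11Jan 15, 2022Updated 4 years ago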
- ☆46Mar 4, 2025Updated last year
- [COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs☆65Mar 9, 2026Updated 2 months ago
- Repo for solving ARC problems with a Neural Cellular Automata☆24Mar 9, 2026Updated 2 months ago
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- ☆18May 7, 2023Updated 3 years ago
- A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue Modeling☆23Jul 31, 2021Updated 4 years ago
- CGED & CSC☆23Feb 27, 2020Updated 6 years ago
- Code for CS224n Assignment 2☆18Aug 29, 2018Updated 7 years ago
- (ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale☆49Jun 4, 2025Updated 11 months ago
- GroundCUA☆126Mar 24, 2026Updated 2 months ago
- USTC "Advanced Database Systems" lab: Storage and Buffer Manager☆22Dec 22, 2019Updated 6 years ago
- ☆74Jul 15, 2024Updated last year
- ☆23Nov 8, 2021Updated 4 years ago
- Companion resources for the textbook "Introduction to Data Science and Engineering"☆31Apr 17, 2021Updated 5 years ago
- Uses MTCNN to detect faces and MobileFaceNet to calculate similarity☆24Dec 24, 2018Updated 7 years ago
- ☆32Apr 14, 2024Updated 2 years ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆58Nov 8, 2024Updated last year
- 🤖 Long-form question answering in the legal domain. (AAAI 2024)☆46Feb 28, 2024Updated 2 years ago
- [ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects☆60Sep 17, 2024Updated last year
- NAACL 2022 Findings Paper: MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving☆33Aug 18, 2022Updated 3 years ago
- ☆79May 4, 2025Updated last year
- the world's first large-scale multi-modal short-video encyclopedia, where the primitive units are items, aspects, and short videos.☆67Nov 28, 2023Updated 2 years ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆153Dec 22, 2025Updated 5 months ago
- R1-like Computer-use Agent☆91Mar 21, 2025Updated last year
- SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs☆203Apr 8, 2026Updated last month
- Exploration of automated dataset selection approaches at large scales.☆54Mar 4, 2025Updated last year
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation☆82Oct 15, 2023Updated 2 years ago
- Data and code for SemEval 2019, Task 10: Math Question Answering☆48Oct 1, 2018Updated 7 years ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆73Jun 3, 2024Updated last year
- A grammatical error correction reading list maintained by BLCU ICALL Research Group☆47Sep 2, 2022Updated 3 years ago
- Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on math…☆73Jul 27, 2024Updated last year