adds Sequence Parallelism into LLaMA-Factory
☆12Dec 31, 2024Updated last year
Alternatives and similar repositories for 360-LLaMA-Factory
Users that are interested in 360-LLaMA-Factory are comparing it to the libraries listed below.
- ☆18Dec 2, 2024Updated last year
- ☆24Aug 30, 2025Updated 9 months ago
- Show windows menu on titlebar on DeepinV20☆10Jan 26, 2022Updated 4 years ago
- Code, Data and Model for Paper "Learning from Peers in Reasoning Models"☆26May 13, 2025Updated last year
- ☆34Feb 12, 2026Updated 4 months ago
- Tableau-based reasoner for ALCQ description logic☆13May 1, 2020Updated 6 years ago
- ☆26Oct 9, 2024Updated last year
- [NeurIPS 2025 Spotlight] Official repository for "Web-Shepherd: Advancing PRMs for Reinforcing Web Agents"☆56May 21, 2025Updated last year
- adds Sequence Parallelism into LLaMA-Factory☆607Feb 5, 2026Updated 4 months ago
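- GPA calculator for Huazhong University of Science and Technology (HUST), including the HUST, standard, and Peking University algorithms☆13Mar 12, 2020Updated 6 years ago
- Fully automated pipeline for translating and subtitling videos☆12Jul 4, 2024Updated last year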
- ☆10Mar 24, 2025Updated last year
- Website for MathVista☆21Jun 9, 2025Updated last year
- ☆46Mar 4, 2025Updated last year
- [COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs☆65Mar 9, 2026Updated 3 months ago
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- [WACV 2026 Round 1] Beyond Single Object Text-to-SVG Synthesis with Comprehensive Canvas Layout☆24Oct 11, 2025Updated 8 months ago
- A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue Modeling☆23Jul 31, 2021Updated 4 years ago
- ☆35Jul 20, 2025Updated 10 months ago
- A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis☆30May 1, 2025Updated last year
- A Magisk module that automatically adds user certificates to the system root CA store☆20May 17, 2025Updated last year
- This repo describes how to generate the ARC-Challenge submission for the BERT baseline☆25Nov 23, 2022Updated 3 years ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆75Jun 25, 2025Updated 11 months ago
- learning project☆19Aug 17, 2020Updated 5 years ago
- GroundCUA☆125Mar 24, 2026Updated 2 months ago
- ☆75Jul 15, 2024Updated last year
- Uses MTCNN to detect faces and MobileFaceNet to calculate similarity☆24Dec 24, 2018Updated 7 years ago
- checkra1n booter for windows (unofficial)☆25Aug 14, 2023Updated 2 years ago
- [ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects☆62Sep 17, 2024Updated last year
- 🤖 Long-form question answering in the legal domain. (AAAI 2024)☆46Feb 28, 2024Updated 2 years ago
- [AAAI 2026] The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants☆45Dec 11, 2025Updated 6 months ago
- A modular and easy-to-use framework for Test-Time Training (TTT) and Test-Time Adaptation (TTA) in Pytorch, making your networks more gen…☆31Jun 1, 2026Updated 2 weeks ago
- NAACL 2022 Findings Paper: MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving☆33Aug 18, 2022Updated 3 years ago
- English-Chinese-Japanese translation dataset of the terms in Genshin Impact☆41Jun 2, 2026Updated 2 weeks ago
- 🧹 Subtitle removal for game story screen recordings☆34Oct 27, 2024Updated last year
- the world's first large-scale multi-modal short-video encyclopedia, where the primitive units are items, aspects, and short videos.☆67Nov 28, 2023Updated 2 years ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆154Dec 22, 2025Updated 5 months ago
- ☆56Nov 22, 2024Updated last year
- (NeurIPS 2024) Official PyTorch implementation of LOVA3☆91Mar 21, 2025Updated last year