adds Sequence Parallelism into LLaMA-Factory
☆12Dec 31, 2024Updated last year
Alternatives and similar repositories for 360-LLaMA-Factory
Users that are interested in 360-LLaMA-Factory are comparing it to the libraries listed below
Sorting:
- ☆25Feb 12, 2026Updated 3 weeks ago
- ☆19Apr 9, 2024Updated last year
- Multi-Turn-Single-Intent Bert model for dialogue session classification☆25Dec 8, 2022Updated 3 years ago
- ☆18Dec 2, 2024Updated last year
- adds Sequence Parallelism into LLaMA-Factory☆605Feb 5, 2026Updated last month
- A Bilingual Multi-Domain Dataset For Task-Oriented Dialogue Modeling☆23Jul 31, 2021Updated 4 years ago
- CGED & CSC☆23Feb 27, 2020Updated 6 years ago
- 本代码是cs224n的作业2代码☆18Aug 29, 2018Updated 7 years ago
- ☆29Aug 20, 2023Updated 2 years ago
- NAACL 2022 Findings Paper: MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving☆33Aug 18, 2022Updated 3 years ago
- [AAAI 2026] The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants☆46Dec 11, 2025Updated 2 months ago
- use mtcnn detect face and mobilefacenet calculate similarity☆24Dec 24, 2018Updated 7 years ago
- 🤖 Long-form question answering in the legal domain. (AAAI 2024)☆44Feb 28, 2024Updated 2 years ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆58Nov 8, 2024Updated last year
- [ECCV2024] PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects☆58Sep 17, 2024Updated last year
- (ACL 2025) MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale☆49Jun 4, 2025Updated 9 months ago
- Cross-lingual GLUE☆49Jun 15, 2023Updated 2 years ago
- Data and code for SemEval 2019, Task 10: Math Question Answering☆48Oct 1, 2018Updated 7 years ago
- A PyTorch implementation of "Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study"☆50Dec 17, 2018Updated 7 years ago
- A grammatical error correction reading list maintained by BLCU ICALL Research Group☆47Sep 2, 2022Updated 3 years ago
- NLP的数据增强Demo☆48Feb 28, 2020Updated 6 years ago
- Long Context Extension and Generalization in LLMs☆63Sep 21, 2024Updated last year
- The dataset and the evaluation tool for NLPCC2018 Shared Task2--Grammatical Error Correction (GEC).☆55Mar 9, 2022Updated 4 years ago
- ☆67Jan 26, 2026Updated last month
- ☆63Mar 20, 2023Updated 2 years ago
- Mix of Minimal Optimal Sets (MMOS) of dataset has two advantages for two aspects, higher performance and lower construction costs on math…☆74Jul 27, 2024Updated last year
- R1-like Computer-use Agent☆89Mar 21, 2025Updated 11 months ago
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆72Jun 3, 2024Updated last year
- Resources of our paper at AAAI-19 ``Response Generation by Context-aware Prototype Editing"☆78May 28, 2019Updated 6 years ago
- [ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation☆80Oct 15, 2023Updated 2 years ago
- 这是一个seq2seq模型,编码器是bert,解码器是transformer的解码器,可用于自然语言处理中文本生成领域的任务☆74Aug 3, 2019Updated 6 years ago
- This repository contains the training code of ParetoQ introduced in our work "ParetoQ Scaling Laws in Extremely Low-bit LLM Quantization"☆118Oct 15, 2025Updated 4 months ago
- PyTorch implementation of MobileFaceNets☆116Dec 17, 2025Updated 2 months ago
- ☆132Jun 6, 2025Updated 9 months ago
- Google TPU optimizations for transformers models☆134Jan 23, 2026Updated last month
- 这里用来存储做人工智能项目的代码和参加数据挖掘比赛的代码☆111Jul 23, 2025Updated 7 months ago
- Source code for the paper A Memory-Augmented Neural Model for Automated Grading☆121Sep 2, 2019Updated 6 years ago
- tensorflow+bert+seq2seq 周公解梦。AI遇上玄学,说出你的梦境(dream),模型自动解析decode梦境的征兆。类似聊天机器人(chatbot,QA),你问我答。☆127Jan 3, 2020Updated 6 years ago
- Code and dataset of AAAI2020 Paper Neural Snowball for Few-Shot Relation Learning☆112Aug 6, 2020Updated 5 years ago