(撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。
☆36Aug 5, 2024Updated last year
Alternatives and similar repositories for LLM-Chinese
Users that are interested in LLM-Chinese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 📸 Code and Dataset for our ACL 2023 paper: "MPCHAT: Towards Multimodal Persona-Grounded Conversation"☆22Sep 5, 2023Updated 2 years ago
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆16Apr 24, 2024Updated 2 years ago
- Codes for paper "Stylized Story Generation with Style-Guided Planning"☆12May 9, 2021Updated 4 years ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆19Jul 20, 2023Updated 2 years ago
- 浏览器AI插件,一键把网页文章内容生成为思维导图,很方便。☆27Jul 4, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Jun 12, 2023Updated 2 years ago
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆32Jan 22, 2024Updated 2 years ago
- Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"☆12Jul 25, 2024Updated last year
- The repository of CLEME (EMNLP 2023) and CLEME2.0 (ACL 2025)☆12May 17, 2025Updated 11 months ago
- This repository releases the code and data for utterance rewriting in open-domain dialogues.☆18Feb 24, 2023Updated 3 years ago
- Personalized Response Generation via Generative Split Memory Network☆12Sep 6, 2021Updated 4 years ago
- Source code of paper "Alirector: Alignment-Enhanced Chinese Grammatical Error Corrector" (Findings of ACL 2024)☆13Mar 19, 2025Updated last year
- Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"☆25May 30, 2024Updated last year
- The repository of EMNLP 2023 "MixEdit: Revisiting Data Augmentation and Beyond for Grammatical Error Correction"☆12Nov 25, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Global ASP - African Storybook Project for the World☆18Dec 1, 2025Updated 5 months ago
- [ACL 2025] Can MLLMs Understand the Deep Implication Behind Chinese Images?☆23Apr 9, 2026Updated 3 weeks ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆17Jan 23, 2024Updated 2 years ago
- cracked prompt of famous coding agent and autodev☆24Mar 19, 2026Updated last month
- Emotional Chatbot using Reinforcement Learning☆14May 27, 2021Updated 4 years ago
- [ACL 2024] Dataset and Code of "ImplicitAVE: An Open-Source Dataset and Multimodal LLMs Benchmark for Implicit Attribute Value Extraction…☆16Jun 10, 2024Updated last year
- Towards Real-World Writing Assistance: A Chinese Character Checking Benchmark with Faked and Misspelled Characters☆16May 30, 2024Updated last year
- The official implementation of the paper "MLP Memory: A Retriever-Pretrained Memory for Large Language Models". (ICLR 2026)☆60Jan 28, 2026Updated 3 months ago
- [NOTE] I do not have enough ressources to maintain VMS, please use Ostris's AI-Tookit instead☆43Oct 3, 2025Updated 7 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Repo for the question-in-context rewriting baseline presented in Elgohary et al. "Can you unpack that? Learning to rewrite questions-in-c…☆24May 20, 2020Updated 5 years ago
- a autodl environment for native finetune stable diffusion.☆11Dec 7, 2022Updated 3 years ago
- 大语言模型工具集☆27Aug 1, 2025Updated 9 months ago
- Fetching confused chars, including same pronunciation, similar pronunciation and similar character pattern☆20Jan 20, 2023Updated 3 years ago
- The official code for the "System Combination via Quality Estimation for Grammatical Error Correction" paper, published in EMNLP 2023.☆16Jan 24, 2026Updated 3 months ago
- ☆12May 16, 2024Updated last year
- ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin…☆20Nov 6, 2024Updated last year
- ☆21Nov 14, 2022Updated 3 years ago
- Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"☆13May 8, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- C++ MPEG-2 TS Demux/Mux☆10Oct 19, 2018Updated 7 years ago
- The official repository for our EMNLP 2024 paper, Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretab…☆20Feb 23, 2025Updated last year
- Implements a minimalistic version of Stable Cascade training☆13Oct 24, 2024Updated last year
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆11Mar 14, 2023Updated 3 years ago
- ☆13Oct 23, 2018Updated 7 years ago
- TCP Trafficgenerator and Proxy☆11Jan 8, 2026Updated 3 months ago
- ☆27Feb 1, 2025Updated last year