baichuan and baichuan2 finetuning and alpaca finetuning
☆33Mar 10, 2025Updated last year
Alternatives and similar repositories for baichuan_finetuning
Users that are interested in baichuan_finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- qwen models finetuning☆107Mar 9, 2025Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆109Jul 19, 2023Updated 2 years ago
- Implementation of StyleTTS for Mandarin☆11Jun 22, 2023Updated 2 years ago
- 生成可用于darknet训练的车牌数据集☆16May 21, 2021Updated 5 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Aug 27, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆12Oct 10, 2021Updated 4 years ago
- Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"☆12Aug 16, 2022Updated 3 years ago
- 天池项目:新浪微博互动预测☆10Apr 25, 2020Updated 6 years ago
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆14Jan 5, 2023Updated 3 years ago
- ☆16May 16, 2025Updated last year
- Enhancing LangChain prompts to work better with RWKV models☆34May 30, 2023Updated 3 years ago
- Triton for AMD MI25/50/60. Development repository for the Triton language and compiler☆34Dec 15, 2025Updated 6 months ago
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- ☆17Mar 22, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- deep version SentiBank☆12Dec 16, 2014Updated 11 years ago
- ☆12Dec 22, 2024Updated last year
- Code & Data for our Paper "NaSGEC: Multi-Domain Chinese Grammatical Error Correction for Native Speaker Texts" (ACL 2023 Findings)☆97Feb 18, 2025Updated last year
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆15Apr 14, 2025Updated last year
- 本项目利用深度学习技术,实时检测人体3D姿态,并基于此预测未来人体动作。采用mmpose框架与多进程技术实现后端快速预测,利用混合现实Hololens2头戴显示器显示人物动作,做到实时抓取,实时预测,实时显示。☆12Oct 30, 2023Updated 2 years ago
- [EMNLP 2025 Findings] Retrieval-Augmented Machine Translation with Unstructured Knowledge☆15Sep 4, 2025Updated 9 months ago
- [COLING 2020] BERT-based Models for Chengyu☆17Dec 29, 2021Updated 4 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- ☆19Mar 25, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The repository to keep supporting files for my blog posts.☆17Sep 20, 2025Updated 8 months ago
- 基于cn-clip模型封装的本地图片搜索工具☆11Jul 6, 2023Updated 2 years ago
- Baichuan-13B 指令微调☆87Jul 14, 2023Updated 2 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- Ilya Sutskever 推荐的30篇Deep learning 必读论文 (中英文对照翻译版)☆14Dec 18, 2024Updated last year
- 基于c++ muduo网络库的集群聊天服务器,使用nginx实现负载均衡,使用reids消息队列实现跨服务器通信☆12Feb 23, 2024Updated 2 years ago
- Yet another Bloomfilter implementation in Python, compatible with Java's Guava library☆12Aug 10, 2024Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Sep 23, 2020Updated 5 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆22Oct 9, 2020Updated 5 years ago
- lightgbm☆14Jun 21, 2022Updated 3 years ago
- Template for assignment 2 of SUSTech CS209, 23 spring semester.☆10Apr 18, 2023Updated 3 years ago
- NS3 simulator for RDMA load balancing☆12Jan 31, 2025Updated last year
- A framework for evolving and testing question-answering datasets with various models.☆26Feb 28, 2024Updated 2 years ago
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech☆25Apr 20, 2022Updated 4 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago