baichuan and baichuan2 finetuning and alpaca finetuning
☆33Mar 10, 2025Updated last year
Alternatives and similar repositories for baichuan_finetuning
Users that are interested in baichuan_finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- qwen models finetuning☆107Mar 9, 2025Updated last year
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆110Jul 19, 2023Updated 2 years ago
- 生成可用于darknet训练的车牌数据集☆15May 21, 2021Updated 4 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Aug 27, 2023Updated 2 years ago
- ☆12Oct 10, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"☆12Aug 16, 2022Updated 3 years ago
- Xcbwin - a simple C++ class for graphical outputs using XCB☆12May 12, 2015Updated 10 years ago
- Llama2-SFT, Llama-2-7B微调(transformers)/LORA(peft)/推理☆27Jul 26, 2023Updated 2 years ago
- 用于生成文本纠错模型(如Gector)需要的大量数据。☆14Jan 5, 2023Updated 3 years ago
- InternLM-7B微调, SFT/LoRA, instruction finetune☆13May 17, 2024Updated last year
- clue chatyuan finetuning☆17Mar 10, 2025Updated last year
- This repository open-sources our GEC system submitted by THU KELab (sz) in the CCL2023-CLTC Track 1: Multidimensional Chinese Learner Tex…☆15Nov 25, 2023Updated 2 years ago
- 常用的NVIDIA docker☆15Sep 16, 2023Updated 2 years ago
- ☆12Dec 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- The code and data for the paper "Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation"☆13Oct 8, 2025Updated 5 months ago
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆15Apr 14, 2025Updated 11 months ago
- search_with_lepton 的自部署版☆13May 4, 2024Updated last year
- [EMNLP 2025 Findings] Retrieval-Augmented Machine Translation with Unstructured Knowledge☆14Sep 4, 2025Updated 6 months ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- The repository to keep supporting files for my blog posts.☆17Sep 20, 2025Updated 6 months ago
- 基于cn-clip模型封装的本地图片搜索工具☆11Jul 6, 2023Updated 2 years ago
- GAIIC2022商品标题实体识别Baseline,使用GlobalPointer实现,线上0.80349☆52Apr 9, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Baichuan-13B 指令微调☆90Jul 14, 2023Updated 2 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- Ilya Sutskever 推荐的30篇Deep learning 必读论文 (中英文对照翻译版)☆13Dec 18, 2024Updated last year
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition☆11Dec 4, 2021Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆26Sep 23, 2020Updated 5 years ago
- A speech signal processing library in Python with emphasis on deep learning.☆31Jul 16, 2022Updated 3 years ago
- ☆22Oct 9, 2020Updated 5 years ago
- 一个基于trie树的具有联想功能的文本编辑器。采用python和pyqt☆10Sep 7, 2016Updated 9 years ago
- ☆17Nov 3, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- lightgbm☆14Jun 21, 2022Updated 3 years ago
- 自然语言处理之中文文本分类(以垃圾短信识别为例)☆23Jun 4, 2020Updated 5 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- ☆11Dec 31, 2019Updated 6 years ago
- A framework for evolving and testing question-answering datasets with various models.☆23Feb 28, 2024Updated 2 years ago
- ☆39Jun 25, 2025Updated 9 months ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago