large language model training-3-stages+deployment
☆47Aug 14, 2023Updated 2 years ago
Alternatives and similar repositories for llm3s-conatiner
Users that are interested in llm3s-conatiner are comparing it to the libraries listed below
- Official completion of “Training on the Benchmark Is Not All You Need”.☆39Dec 31, 2024Updated last year
- ☆12Oct 30, 2022Updated 3 years ago
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆23Jul 26, 2023Updated 2 years ago
- Examples about using MGeo finetune models☆55Feb 9, 2023Updated 3 years ago
- DeepSpeed, LLM, Medical_Dialogue, medical large language model, pre-training, fine-tuning☆294Jun 7, 2024Updated last year
- ☆19Mar 24, 2023Updated 2 years ago
- A simple API, written mainly by ChatGPT, that accesses ChatRWKV through simple requests☆15May 19, 2023Updated 2 years ago
- Source code for our "D-REPTILE" paper at EACL 2021☆13Jan 19, 2021Updated 5 years ago
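- nlp_interview notes and answers: this repository mainly collects interview questions and reference answers for NLP algorithm engineers☆23Nov 16, 2023Updated 2 years ago
- [Ebook] From Zero to a Million Stores: a hands-on system design journey by an ordinary person without a computer science degree☆26Nov 11, 2025Updated 4 months ago
- Online Segmentation and POS tagger with Average Perceptron☆17Sep 18, 2017Updated 8 years ago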
- ☆109Jul 15, 2025Updated 8 months ago
- Code for the ACL 2022 (Long paper): "New Intent Discovery with Pre-training and Contrastive Learning".☆14Jul 18, 2022Updated 3 years ago
- ☆14Mar 16, 2018Updated 8 years ago
- Fine-Tune LLM Synthetic-Data application and "From Data to AGI: Unlocking the Secrets of Large Language Model"☆16Jul 5, 2024Updated last year
- ☆13Dec 23, 2019Updated 6 years ago
- Building data from Baidu Baike plus part-of-speech pair rules☆13Oct 2, 2019Updated 6 years ago
- SolrCloud Rebalance API Documentation☆13Jul 18, 2016Updated 9 years ago
- Synthetic data generation for evaluating LLM symbolic and logic reasoning☆22Mar 6, 2026Updated 2 weeks ago
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on…☆142Jan 15, 2024Updated 2 years ago
- Pretrain, decay, and SFT a CodeLLM from scratch 🧙♂️☆40May 15, 2024Updated last year
- Tianchi competition☆10Jul 4, 2021Updated 4 years ago
- Inference Code for Paper "Harder Tasks Need More Experts: Dynamic Routing in MoE Models"☆69Jul 30, 2024Updated last year
- Text classification using few-shot methods, based on the THUCNews dataset☆10Jun 4, 2020Updated 5 years ago
- PyTorch implementation of the paper: Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding. Su Zhu, Ruish…☆18Nov 10, 2021Updated 4 years ago
- ☆17Jan 21, 2024Updated 2 years ago
- Article tag extraction☆16Dec 17, 2018Updated 7 years ago
- ggml implementation of the baichuan13b model (adapted from llama.cpp)☆55Jul 27, 2023Updated 2 years ago
- ☆235May 10, 2024Updated last year
- ☆11Oct 2, 2023Updated 2 years ago
- ☆12Oct 20, 2023Updated 2 years ago
- Alpaca Chinese instruction fine-tuning dataset☆397Mar 26, 2023Updated 2 years ago
- A concise TensorRT tutorial☆26Aug 11, 2021Updated 4 years ago
- NLP models and codes for BAAI-JD joint project.☆10May 27, 2020Updated 5 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
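- LTX-Video-Trainer-GUI is a GUI tool for training LTX video LoRA models; it supports training LoRA models for video generation through a simple interface. The trainer provides an intuitive GUI so users can easily configure and launch training without writing complex code.☆13Jul 18, 2025Updated 8 months ago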
- LLM-MapBook: AI-Powered Maps for Storytelling. Extracts geo-coordinates from books, visualizes on interactive maps, offering immersive st…☆12Aug 27, 2024Updated last year
- Deployable similarity model☆17Oct 27, 2022Updated 3 years ago
- Top-1 solution for the 2021 iFLYTEK Vehicle Loan Default Prediction Challenge☆69Oct 17, 2021Updated 4 years ago