☆43Dec 15, 2023Updated 2 years ago
Alternatives and similar repositories for DeepSpeed-Chat-ChatGLM
Users that are interested in DeepSpeed-Chat-ChatGLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 实现moss int8的finetune和优化源moss项目模型保存问题☆17Jun 1, 2023Updated 2 years ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆50Mar 15, 2023Updated 3 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma…☆140Apr 28, 2023Updated 2 years ago
- 【技术篇】个人微信公众号对接chatGLM-6B☆15Apr 3, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- chatglm多gpu用deepspeed和☆408Jul 8, 2024Updated last year
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆21Aug 1, 2025Updated 7 months ago
- Implementation of Chinese ChatGPT☆289Nov 20, 2023Updated 2 years ago
- Experiments with AllenNLP on semantic parsing datasets☆17Dec 29, 2018Updated 7 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Mar 20, 2023Updated 3 years ago
- chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu☆165Aug 24, 2023Updated 2 years ago
- ChatGLM-6B 指令学习|指令数据|Instruct☆653Apr 10, 2023Updated 2 years ago
- moss chat finetuning☆51Apr 23, 2024Updated last year
- 收录实现中文版ChatGPT的各种技术路线,数据及其他资料☆35Jul 12, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 探索中文instruct数据在ChatGLM, LLaMA上的微调表现☆389Apr 4, 2023Updated 2 years ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆117Jun 5, 2023Updated 2 years ago
- TTS system base on FastSpeech2 and MelGAN.☆16Nov 26, 2020Updated 5 years ago
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Oct 21, 2024Updated last year
- deepspeed+trainer简单高效实现多卡微调大模型☆133May 27, 2023Updated 2 years ago
- ChatGLM-6B fine-tuning.☆136Apr 25, 2023Updated 2 years ago
- Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations☆29Aug 1, 2024Updated last year
- ☆59Aug 1, 2023Updated 2 years ago
- Why lasso can't produce sparse solution in pytorch☆21Jan 26, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆12Aug 15, 2022Updated 3 years ago
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 2 years ago
- Cluster Images using Perceptual Hash☆13Apr 22, 2016Updated 9 years ago
- Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.☆97Feb 5, 2024Updated 2 years ago
- ☆64Mar 18, 2026Updated last week
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆38Jul 25, 2024Updated last year
- code for ACL2024-main: BatchEval: Towards Human-like Text Evaluation☆19May 20, 2024Updated last year
- 企业事件抽取☆13May 20, 2021Updated 4 years ago
- chatglm 6b finetuning and alpaca finetuning☆1,537Mar 9, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21May 16, 2023Updated 2 years ago
- Neural Network Semantic Parser for Almond☆15Apr 11, 2019Updated 6 years ago
- SIGIR 2022: Contrastive Learning with Hard Negative Entities for Entity Set Expansion☆30Jan 6, 2023Updated 3 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Aug 27, 2023Updated 2 years ago
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,730Oct 12, 2023Updated 2 years ago
- Debug DeepSpeed-Chat step by step in IDE (在IDE里一步一步调试DeepSpeed-Chat)☆10Apr 17, 2023Updated 2 years ago
- 🤔 When in Doubt: Improving Classification Performance with Alternating Normalization [Findings of EMNLP2021]☆14Oct 29, 2021Updated 4 years ago