☆43Dec 15, 2023Updated 2 years ago
Alternatives and similar repositories for DeepSpeed-Chat-ChatGLM
Users that are interested in DeepSpeed-Chat-ChatGLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆49Mar 15, 2023Updated 3 years ago
- KuaiSearch PERKS☆12Nov 16, 2021Updated 4 years ago
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma…☆139Apr 28, 2023Updated 2 years ago
- 【技术篇】个人微信公众号对接chatGLM-6B☆15Apr 3, 2023Updated 3 years ago
- ☆84Sep 9, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of Chinese ChatGPT☆288Nov 20, 2023Updated 2 years ago
- Just for debug☆57Feb 15, 2024Updated 2 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Mar 20, 2023Updated 3 years ago
- chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu☆165Aug 24, 2023Updated 2 years ago
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆22Aug 1, 2025Updated 8 months ago
- ChatGLM-6B 指令学习|指令数据|Instruct☆653Apr 10, 2023Updated 3 years ago
- moss chat finetuning☆51Apr 23, 2024Updated last year
- Leveraging Ontological Schema Information in Embedding Models for Knowledge Graphs☆14Jun 16, 2015Updated 10 years ago
- 探索中文instruct数据在ChatGLM, LLaMA上的微调表现☆389Apr 4, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆118Jun 5, 2023Updated 2 years ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated 11 months ago
- TTS system base on FastSpeech2 and MelGAN.☆16Nov 26, 2020Updated 5 years ago
- Slurm SPANK plugin to let users change GPU compute mode in jobs☆13Mar 4, 2023Updated 3 years ago
- deepspeed+trainer简单高效实现多卡微调大模型☆133May 27, 2023Updated 2 years ago
- Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations☆29Aug 1, 2024Updated last year
- ☆59Aug 1, 2023Updated 2 years ago
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 3 years ago
- Beyond Clicks: Modeling Multi-Relational Item Graph for Session-Based Target Behavior Prediction☆21Jun 2, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The PyTorch implementation of ClickPrompt☆27Oct 14, 2023Updated 2 years ago
- Cluster Images using Perceptual Hash☆13Apr 22, 2016Updated 9 years ago
- Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.☆97Feb 5, 2024Updated 2 years ago
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆38Jul 25, 2024Updated last year
- code for ACL2024-main: BatchEval: Towards Human-like Text Evaluation☆19May 20, 2024Updated last year
- Pytorch implementation of vision models.☆12Dec 8, 2022Updated 3 years ago
- 企业事件抽取☆13May 20, 2021Updated 4 years ago
- chatglm 6b finetuning and alpaca finetuning☆1,536Mar 9, 2025Updated last year
- Large-scale exact string matching tool☆17Mar 7, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Neural Network Semantic Parser for Almond☆15Apr 11, 2019Updated 7 years ago
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21May 16, 2023Updated 2 years ago
- ☆20Jul 23, 2025Updated 8 months ago
- SIGIR 2022: Contrastive Learning with Hard Negative Entities for Entity Set Expansion☆30Jan 6, 2023Updated 3 years ago
- 百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本☆49Aug 27, 2023Updated 2 years ago
- [ACL 2023] UniTRec: A Unified Text-to-Text Transformer and Joint Contrastive Learning Framework for Text-based Recommendation☆27Feb 19, 2024Updated 2 years ago
- Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调☆3,723Oct 12, 2023Updated 2 years ago