☆43Dec 15, 2023Updated 2 years ago
Alternatives and similar repositories for DeepSpeed-Chat-ChatGLM
Users that are interested in DeepSpeed-Chat-ChatGLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 实现moss int8的finetune和优化源moss项目模型保存问题☆17Jun 1, 2023Updated 3 years ago
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆49Mar 15, 2023Updated 3 years ago
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma…☆138Apr 28, 2023Updated 3 years ago
- Toward Practical Entity Alignment Method Design: Insights from New Highly Heterogeneous Knowledge Graph Datasets☆17Feb 18, 2025Updated last year
- 【技术篇】个人微信公众号对接chatGLM-6B☆15Apr 3, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- chatglm多gpu用deepspeed和☆409Jul 8, 2024Updated last year
- ☆84Sep 9, 2023Updated 2 years ago
- youtube video recommendation(generation 4)☆21Oct 16, 2019Updated 6 years ago
- Just for debug☆57Feb 15, 2024Updated 2 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Mar 20, 2023Updated 3 years ago
- chatglm-6b微调/LORA/PPO/推理, 样本为自动生成的整数/小数加减乘除运算, 可gpu/cpu☆165Aug 24, 2023Updated 2 years ago
- [AAAI'25] CharacterBench: Benchmarking Character Customization of Large Language Models☆23Aug 1, 2025Updated 10 months ago
- ChatGLM-6B 指令学习|指令数据|Instruct☆651Apr 10, 2023Updated 3 years ago
- moss chat finetuning☆51Apr 23, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 探索中文instruct数据在ChatGLM, LLaMA上的微调表现☆389Apr 4, 2023Updated 3 years ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat☆117Jun 5, 2023Updated 3 years ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated last year
- Source code for ICDE 2020 paper Collective Entity Alignment via Adaptive Features (CEA).☆16Jun 10, 2020Updated 6 years ago
- TTS system base on FastSpeech2 and MelGAN.☆16Nov 26, 2020Updated 5 years ago
- GLCONet: Learning Multisource Perception Representation for Camouflaged Object Detection (TNNLS, 2024)☆15Jul 10, 2025Updated 11 months ago
- Slurm SPANK plugin to let users change GPU compute mode in jobs☆15Mar 4, 2023Updated 3 years ago
- EMNLP 2018: Multi-Head Attention with Disagreement Regularization; NAACL 2019: Information Aggregation for Multi-Head Attention with Rout…☆21Oct 9, 2020Updated 5 years ago
- ChatGLM-6B fine-tuning.☆135Apr 25, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- deepspeed+trainer简单高效实现多卡微调大模型☆133May 27, 2023Updated 3 years ago
- ☆12Aug 15, 2022Updated 3 years ago
- Beyond Clicks: Modeling Multi-Relational Item Graph for Session-Based Target Behavior Prediction☆21Jun 2, 2020Updated 6 years ago
- ☆15Nov 19, 2018Updated 7 years ago
- The PyTorch implementation of ClickPrompt☆27Oct 14, 2023Updated 2 years ago
- Cluster Images using Perceptual Hash☆13Apr 22, 2016Updated 10 years ago
- Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.☆97Feb 5, 2024Updated 2 years ago
- 对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF☆197May 23, 2023Updated 3 years ago
- [ACMMM 23] Official implementation of Object Segmentation by Mining Cross-Modal Semantics (First Uniformed model for SOD and/or COD with …☆18Sep 15, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆38Jul 25, 2024Updated last year
- Pytorch implementation of vision models.☆12Dec 8, 2022Updated 3 years ago
- 企业事件抽取☆13May 20, 2021Updated 5 years ago
- chatglm 6b finetuning and alpaca finetuning☆1,531Mar 9, 2025Updated last year
- “阿里灵杰”问天引擎电商搜索算法赛 13/2771☆10Jul 31, 2022Updated 3 years ago
- Large-scale exact string matching tool☆17Mar 7, 2025Updated last year
- Neural Network Semantic Parser for Almond☆15Apr 11, 2019Updated 7 years ago