chatglm_rlhf_finetuning
☆30 · Oct 10, 2023 · Updated 2 years ago
Alternatives and similar repositories for chatglm_rlhf
Users that are interested in chatglm_rlhf are comparing it to the libraries listed below.
- A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Huma… ☆140 · Apr 28, 2023 · Updated 2 years ago
- Reinforcement learning training for LLMs such as GPT-2, LLaMA, and BLOOM ☆27 · Sep 19, 2023 · Updated 2 years ago
- Apply RLHF directly to ChatGLM to raise or lower the probability of target outputs | Modify ChatGLM output with only RLHF ☆198 · May 23, 2023 · Updated 2 years ago
- aigc_serving: lightweight and efficient language model inference serving ☆24 · Jun 12, 2024 · Updated last year
- Shared data, prompt data, pretraining data ☆36 · Nov 30, 2023 · Updated 2 years ago
- deep learning ☆150 · May 6, 2025 · Updated 10 months ago
- moss chat finetuning ☆51 · Apr 23, 2024 · Updated last year
- Train LoRA using Microsoft's official implementation with Stable Diffusion models. ☆33 · May 9, 2023 · Updated 2 years ago
- ☆11 · Nov 16, 2019 · Updated 6 years ago
- MSBD5001 Big Data Computing Projects -- Algorithm Parallelization. Use PySpark APIs to implement DBSCAN algorithm. ☆18 · Aug 14, 2019 · Updated 6 years ago
- ChatGLM2-6B fine-tuning, SFT/LoRA, instruction finetune ☆110 · Jul 19, 2023 · Updated 2 years ago
- Combine Tencent's bert as service model and rasa_nlu for text classification ☆20 · Oct 29, 2022 · Updated 3 years ago
- clue chatyuan finetuning ☆17 · Mar 10, 2025 · Updated last year
- DiT for VAE (and Video Generation) ☆35 · Sep 2, 2024 · Updated last year
- chatglm 6b finetuning and alpaca finetuning ☆1,537 · Mar 9, 2025 · Updated last year
- Deploy a ChatGLM API on Kaggle, called the same way as the ChatGPT API ☆15 · Jun 30, 2023 · Updated 2 years ago
- Template for creating audio encoders compatible with X-ARES ☆19 · Feb 11, 2026 · Updated last month
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed ☆11 · Apr 14, 2023 · Updated 2 years ago
- Cluster Images using Perceptual Hash ☆13 · Apr 22, 2016 · Updated 9 years ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models" ☆14 · Nov 11, 2024 · Updated last year
- A Straightforward Pytorch Implementation of Gated Feedback RNNs ☆12 · May 8, 2017 · Updated 8 years ago
- A tool for manually annotating and ranking response data in the RLHF stage of large models. ☆256 · Aug 1, 2023 · Updated 2 years ago
- Dynamic Simulation Environments for Reinforcement Learning ☆13 · Apr 17, 2021 · Updated 4 years ago
- Currently covers only the reading-comprehension track ☆13 · Mar 31, 2021 · Updated 4 years ago
- Pytorch implementation of vision models. ☆12 · Dec 8, 2022 · Updated 3 years ago
- Full-parameter fine-tuning of ChatGLM2-6B, with efficient fine-tuning that supports multi-turn dialogue. ☆402 · Aug 17, 2023 · Updated 2 years ago
- Use ChatGLM to perform text embedding ☆45 · Apr 9, 2023 · Updated 2 years ago
- ☆15 · Aug 4, 2024 · Updated last year
- Commonly used NVIDIA Docker images ☆15 · Sep 16, 2023 · Updated 2 years ago
- ☆14 · Dec 26, 2022 · Updated 3 years ago
- Multimodal RAG using LlamaIndex, Qdrant, llama.cpp for document QA with local VisonLLM and embedding models ☆18 · Nov 8, 2024 · Updated last year
- A manually refined Chinese dialogue dataset and a piece of ChatGLM fine-tuning code ☆1,194 · May 3, 2025 · Updated 10 months ago
- An efficient distillation method for flow matching models ☆25 · Feb 1, 2026 · Updated last month
- Finetuning Stable Diffusion from Diffusers ☆11 · Mar 11, 2024 · Updated 2 years ago
- Implementing ReaRAG, a knowledge-guided reasoning model that enhances factual accuracy using iterative retrieval-augmented generation. Ad… ☆15 · Feb 2, 2026 · Updated last month
- ☆20 · Jul 15, 2025 · Updated 8 months ago
- Vector Base Amplitude Panning ☆20 · Oct 21, 2017 · Updated 8 years ago
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks. ☆17 · Aug 30, 2024 · Updated last year
- BERT fine-tuning with rasa-nlu support ☆46 · Jul 9, 2024 · Updated last year