用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]
☆21May 16, 2023Updated 2 years ago
Alternatives and similar repositories for LLaMA-MOSS-RLHF-LoRA
Users that are interested in LLaMA-MOSS-RLHF-LoRA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Dec 3, 2021Updated 4 years ago
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- ChatGLM-Peft-Tuning☆13Mar 19, 2023Updated 3 years ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- 对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF☆198May 23, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- moss chat finetuning☆51Apr 23, 2024Updated 2 years ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- Interpretable Models for NLP using PyTorch☆18Jan 22, 2018Updated 8 years ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 10 months ago
- The official implement of paper S2-VER: Semi-Supervised Visual Emotion Recognition☆11Apr 28, 2024Updated 2 years ago
- ☆13Apr 5, 2026Updated last month
- Official repository for the paper "Gradient-based Jailbreak Images for Multimodal Fusion Models" (https//arxiv.org/abs/2410.03489)☆19Oct 22, 2024Updated last year
- An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.☆14Jan 9, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code to generate the Inv3D dataset from our paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document …☆25Mar 6, 2024Updated 2 years ago
- Experiments with representation engineering☆14Feb 28, 2024Updated 2 years ago
- Code for ICCV2025 paper——IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves☆17Jul 11, 2025Updated 9 months ago
- [NLPCC 2024] Shared Task 10: Regulating Large Language Models☆14Jun 12, 2024Updated last year
- Lossless compression using Probabilistic Circuits☆16Mar 10, 2022Updated 4 years ago
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- Code Repository for ICML 2020 accepted paper, named "A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Le…☆12Jan 4, 2022Updated 4 years ago
- Graded projects of the course "Probabilistic Artificial Intelligence", ETH Zürich (Fall 2020). Topics: Gaussian Process Regression, Bayes…☆12Nov 3, 2021Updated 4 years ago
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…☆14Aug 8, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is a program to solve NER with HMM. The principles and details can refer to my blog: https://blog.csdn.net/weixin_41679411/article/d…☆11Nov 20, 2018Updated 7 years ago
- ☆11Jul 17, 2021Updated 4 years ago
- CoCo-Ex extracts meaningful concepts from natural language texts and maps them to conjunct concept nodes in ConceptNet, utilizing the max…☆13Apr 7, 2026Updated last month
- ☆23Dec 8, 2022Updated 3 years ago
- ☆15Sep 20, 2024Updated last year
- Hierarchical And Quantized AutoEncoders☆13Jun 12, 2020Updated 5 years ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 3 years ago
- [COLING 2022] Learning from Adjective-Noun Pairs: A Knowledge-enhanced Framework for Target-Oriented Multimodal Sentiment Classification☆14Apr 19, 2023Updated 3 years ago
- This is an official PyTorch implementation of Task-Adaptive Neural Network Search with Meta-Contrastive Learning (NeurIPS 2021, Spotlight…☆19Nov 24, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Safety-J: Evaluating Safety with Critique☆16Jul 28, 2024Updated last year
- Real-time multi-language unit test generation tool via LSP☆38Apr 8, 2026Updated last month
- ☆21Jun 16, 2025Updated 10 months ago
- ☆14Jul 27, 2021Updated 4 years ago
- The jailbreak-evaluation is an easy-to-use Python package for language model jailbreak evaluation.☆27Nov 4, 2024Updated last year
- An implementation of loopy belief propagation on a Bayesian Network (BN)☆11Feb 25, 2015Updated 11 years ago
- Tools for training schema-aware Web table embedding for unsupervised and supervised machine learning on tabular data☆21Apr 14, 2024Updated 2 years ago