lansinuote/Simple_RLHF_Llama3

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lansinuote/Simple_RLHF_Llama3)

lansinuote / Simple_RLHF_Llama3

☆31

Alternatives and similar repositories for Simple_RLHF_Llama3

Users that are interested in Simple_RLHF_Llama3 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yeshenpy / PMIC
View on GitHub
Original PyTorch implementation of PMIC from PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Colla…
☆22Mar 26, 2024Updated 2 years ago
lansinuote / Simple_RLHF
View on GitHub
☆116Jun 12, 2025Updated last year
ashishjamarkattel / reinforment-learning-with-human-feedback
View on GitHub
☆17Dec 31, 2023Updated 2 years ago
benjaminocampo / ISHate
View on GitHub
This repository contains the dataset and implementation details of the paper "An In-depth Analysis of Implicit and Subtle Hate Speech Mes…
☆10May 9, 2024Updated 2 years ago
CrystalSixone / DSRG
View on GitHub
Code for A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation
☆17Apr 25, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
aggiejiang / SWSR
View on GitHub
A new release of Chinese sexism dataset and lexicon
☆14May 23, 2023Updated 3 years ago
penzant / nlu_datasets_2018
View on GitHub
☆12Nov 9, 2018Updated 7 years ago
yushuiwx / MH-MoE
View on GitHub
☆20Nov 5, 2024Updated last year
wenhuchen / GPT2-Logic2Text
View on GitHub
The code for Template-GPT-2 Generation Model for Logic2Text Dataset
☆18Jun 1, 2020Updated 6 years ago
Tlntin / booking_simulator
View on GitHub
☆11Jan 6, 2024Updated 2 years ago
Y-L-LIU / MGTBench-2.0
View on GitHub
☆28Apr 18, 2025Updated last year
1783696285 / SKS
View on GitHub
The code of SKS
☆15Mar 22, 2022Updated 4 years ago
informagi / laps
View on GitHub
☆14Oct 18, 2024Updated last year
menggedu / EDL
View on GitHub
Code and data for paper named: Large language models for automatic equation discovery of nonlinear dynamics
☆14Mar 6, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sww9370 / RoCBert
View on GitHub
☆20Dec 26, 2022Updated 3 years ago
MunzirH / Applications-of-Physics-Informed-Machine-Learning
View on GitHub
🌌 Applications of Physics-Informed ML: A collection of notebooks from my Masters research, exploring how machine learning can solve scie…
☆12Apr 29, 2026Updated 3 months ago
lucylow / Covid_Control
View on GitHub
Machine learning to predict future number Covid19 Daily Cases (7-day moving average). Long Short Term Memory (LSTM) Predictor and Reinfor…
☆14Feb 21, 2021Updated 5 years ago
Elbria / xformal-FoST
View on GitHub
Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"
☆12Jun 7, 2021Updated 5 years ago
ZBayes / poc_project
View on GitHub
通用简单工具项目
☆22Oct 6, 2024Updated last year
zysxmu / DFSQ
View on GitHub
super-resolution; post-training quantization; model compression
☆14Nov 10, 2023Updated 2 years ago
DSKSD / Pytorch_models
View on GitHub
PyTorch study
☆14Oct 16, 2017Updated 8 years ago
language-agent-tutorial / language-agent-tutorial.github.io
View on GitHub
[EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks
☆10Nov 27, 2024Updated last year
BrianPulfer / idempotent-generative-network
View on GitHub
☆10Nov 26, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
chenzixuan99 / Awesome-LLM-based-Web-Agent-and-Tools
View on GitHub
A collection of some awesome public projects about LLM-based Web Agents and Tools.
☆13Apr 25, 2024Updated 2 years ago
xinleihe / toxic-prompt
View on GitHub
☆27Nov 20, 2023Updated 2 years ago
THU-MIG / Consolidator
View on GitHub
Official implementation for ICLR 2023 paper Consolidator: Mergeable Adapter with Grouped Connections for Visual Adaptation
☆16Jan 23, 2024Updated 2 years ago
lansinuote / Diffusion_Training_Examples
View on GitHub
☆90Aug 7, 2023Updated 2 years ago
lansinuote / Simple_LLM_DPO
View on GitHub
☆75Nov 13, 2023Updated 2 years ago
apalladi / covid_vaccine_model
View on GitHub
Epidemiological model to predict the spread of the epidemic, taking into account the vaccinations
☆14Apr 14, 2021Updated 5 years ago
SeunginLyu / EpidemicSpreading
View on GitHub
SIS epidemic model simulation in scale-free networks
☆16Oct 27, 2017Updated 8 years ago
chenyuntc / CVPR2024-award-posters
View on GitHub
posters for all CVPR2024 Award papers (Highlight and Oral)
☆14Jul 9, 2024Updated 2 years ago
puyuan1996 / MARL
View on GitHub
Implementation for mSAC methods in PyTorch
☆42Oct 10, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
owenliang / mnist-clip
View on GitHub
a super easy clip model with mnist dataset for study
☆179Mar 17, 2024Updated 2 years ago
LAMDA-RL / ODIS
View on GitHub
The implementation of ICLR 2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".
☆45Oct 31, 2024Updated last year
eric-haibin-lin / verl-data
View on GitHub
☆14May 12, 2025Updated last year
ceriottm / ale-notebooks
View on GitHub
Jupyter notebook for an introduction to atomic-scale machine learning class
☆18Nov 14, 2023Updated 2 years ago
Aolius / semi-fst
View on GitHub
Code for ACL 2022 paper "Semi-Supervised Formality Style Transfer with Consistency Training".
☆17May 21, 2022Updated 4 years ago
STAIR-BUPT / SCCD
View on GitHub
SCCD:基于会话的中文网络欺凌检测数据集
☆24Mar 9, 2025Updated last year
tommy-xq / SA2VP
View on GitHub
☆15Mar 23, 2024Updated 2 years ago