lansinuote/Simple_RLHF


☆116

Alternatives and similar repositories for Simple_RLHF

Users that are interested in Simple_RLHF are comparing it to the libraries listed below.

lansinuote / Simple_RLHF_Llama3
☆31 · Aug 7, 2024 · Updated last year
mst272 / transformer-pytorch
A PyTorch implementation of the Transformer: Attention Is All You Need
☆14 · Jun 7, 2024 · Updated 2 years ago
lansinuote / Simple_LLM_PPO
☆45 · Aug 9, 2024 · Updated last year
RitaRamo / extra
Retrieval-augmented Image Captioning
☆13 · Feb 16, 2023 · Updated 3 years ago
iOPENCap / awesome-unimodal-training
Text-only or language-free training for multimodal tasks (image/audio/video captioning, retrieval, text-to-image)
☆12 · Oct 15, 2024 · Updated last year
tongzhou21 / CogMG
☆10 · May 7, 2024 · Updated 2 years ago
allenai / beacon
On-the-fly Definition Augmentation of LLMs for Biomedical NER
☆14 · Apr 14, 2025 · Updated last year
yuanzhoulvpi2017 / SentenceEmbedding
☆121 · Jun 30, 2024 · Updated 2 years ago
benjaminocampo / ISHate
This repository contains the dataset and implementation details of the paper "An In-depth Analysis of Implicit and Subtle Hate Speech Mes…
☆10 · May 9, 2024 · Updated 2 years ago
hmchuong / CoLLM
[CVPR25] CoLLM: A Large Language Model for Composed Image Retrieval
☆28 · Mar 26, 2025 · Updated last year
ml-jku / semantic-image-text-alignment
☆25 · Jul 10, 2023 · Updated 3 years ago
Pillars-Creation / Visualglm-image-to-text
Adds some files missing from Visualglm so that Visualglm can be trained; the example performs physiognomy (face-reading) recognition on faces
☆13 · Jun 7, 2023 · Updated 3 years ago
season-lab / DroidReach
Framework for testing the reachability of native functions in Android applications.
☆11 · Aug 30, 2023 · Updated 2 years ago
zyds / transformers-code
A hands-on, step-by-step course on Huggingface Transformers; course videos are updated in sync on Bilibili and YouTube
☆4,037 · Jul 15, 2024 · Updated 2 years ago
Tobi-Tob / CityLearnTransformer
This repository is used to generate data and evaluate Decision Transformers on the CityLearn (Challenge 2022) environment for urban energ…
☆17 · Aug 22, 2023 · Updated 2 years ago
aggiejiang / SWSR
A new release of a Chinese sexism dataset and lexicon
☆14 · May 23, 2023 · Updated 3 years ago
liujunwen23 / MIRE
WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge
☆132 · Nov 11, 2024 · Updated last year
yushuiwx / MH-MoE
☆20 · Nov 5, 2024 · Updated last year
NgCafai / Transformer
Transformer: PyTorch Implementation of "Attention Is All You Need"
☆15 · Dec 13, 2023 · Updated 2 years ago
junyangwang0410 / Knight
SotA text-only image/video method (IJCAI 2023)
☆15 · Jan 9, 2024 · Updated 2 years ago
dqwang122 / CALMS
Code and dataset for 'Contrastive Aligned Joint Learning for Multilingual Summarization'
☆13 · Mar 24, 2022 · Updated 4 years ago
FHDO-ICLAB / Canakari
CAN 2.0B Controller in VHDL and Verilog
☆11 · Nov 22, 2023 · Updated 2 years ago
zhanshijinwat / Steel-LLM
Train a 1B LLM on 1T tokens from scratch as an individual
☆810 · Apr 27, 2025 · Updated last year
ZhengboZhang / VisBrowse-Bench
Official data and code for the paper "VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents".
☆15 · Mar 18, 2026 · Updated 4 months ago
1783696285 / SKS
The code of SKS
☆15 · Mar 22, 2022 · Updated 4 years ago
ztb-35 / MLTA
☆16 · Jan 21, 2026 · Updated 6 months ago
dmksjfl / DARC
Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.
☆22 · Mar 11, 2022 · Updated 4 years ago
lansinuote / Simple_Reinforcement_Learning
☆636 · Oct 31, 2024 · Updated last year
multimodal-art-projection / COIG-P
☆42 · Jul 15, 2025 · Updated last year
LSTM-Kirigaya / Quadrotor
Uses the DDPG algorithm to solve the quadrotor hovering control problem in rlschool (includes a good model trained for 9 hours)
☆10 · Jul 7, 2020 · Updated 6 years ago
pengzhangzhi / Awesome-List-Protein-Binding-Site-Prediction-
List of papers on protein binding site prediction
☆11 · Aug 11, 2023 · Updated 2 years ago
MIRALab-USTC / RL-SCPO
Code for the paper *Learning Robust Policy against Disturbance in Transition Dynamics via State-Conservative Policy Optimization*.
☆18 · Mar 26, 2022 · Updated 4 years ago
limafang / agent-arxiv-daily
🎓 Automatically updates agent papers daily using GitHub Actions (updates every 12 hours). Daily updates of agent-related papers, with Chinese translations of the abstracts included
☆37 · Updated this week
AkideLiu / MiniCache
☆14 · Sep 7, 2024 · Updated last year
takayama-lily / syanten
Mahjong shanten calculation (shanten number calculation for Japanese mahjong)
☆24 · Mar 22, 2023 · Updated 3 years ago
owenliang / chatgpt
Simple decoder-only GPT model in PyTorch
☆46 · May 19, 2024 · Updated 2 years ago
quan-dao / APRO3D-Net
APRO3D-Net: Attention-based Proposals Refinement for 3D Object Detection
☆19 · Jan 25, 2022 · Updated 4 years ago
Emma1066 / Self-Improve-Zero-Shot-NER
This is the github repository for the paper at NAACL 2024: Self-Improving for Zero-Shot Named Entity Recognition with Large Language Mode…
☆53 · Mar 17, 2024 · Updated 2 years ago
REXWindW / my_llm
An attempt to write an LLM from scratch by myself, with reference to llama and nanogpt
☆69 · Apr 27, 2024 · Updated 2 years ago