Train a tiny LLaMA model from scratch to repeat your words using Reinforcement Learning from Human Feedback (RLHF)
☆18May 23, 2024Updated last year
Alternatives and similar repositories for nanoRLHF
Users that are interested in nanoRLHF are comparing it to the libraries listed below.
- SFT+RL boosts multimodal reasoning☆48Jun 27, 2025Updated 10 months ago
- A collection of various NLP datasets, mainly Indonesia-related languages.☆15Apr 23, 2022Updated 4 years ago
- ☆12Jun 12, 2024Updated last year
- ☆13Sep 27, 2022Updated 3 years ago
- FIGMENT☆15Jan 27, 2020Updated 6 years ago
- Neural Paraphrase Generation based on OpenNMT-py☆12Jan 2, 2018Updated 8 years ago
- 🤖 AI debate system based on AutoGen | 🗣️ Supports Chinese interaction | 🔄 Multi-agent collaboration | 📝 Automatically records the debate process☆25Mar 4, 2026Updated 2 months ago
- The source of MNER-MI.☆18Dec 17, 2024Updated last year
- PyTorch automatic mixed precision training template☆18Apr 6, 2022Updated 4 years ago
- This repo consists of experimental results on the bdd100k dataset using different object detection algorithms (Faster-RCNN, FCOS, ATSS)☆11Jun 27, 2020Updated 5 years ago
- Social Distancing Analyzer using OpenCV and YOLO☆10Aug 30, 2024Updated last year
- [ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆18Apr 3, 2025Updated last year
- Indonesian T0 | Instruction-tuning for low-resource and extremely low-resource Austronesian languages☆17Jun 24, 2024Updated last year
- ☆22Oct 28, 2024Updated last year
- Implementation of our paper, "MM-Forecast: A Multimodal Approach to Temporal Event Forecasting with Large Language Models".☆18Apr 16, 2025Updated last year
- Unofficial reimplementation of ViR: Vision Retention Networks by Hatamizadeh et al. (https://arxiv.org/abs/2310.19731)☆18Jul 26, 2024Updated last year
- Indonesian Image Captioning using Attention-based Semantic Compositional Networks☆13Jul 31, 2019Updated 6 years ago
- ☆12May 20, 2025Updated 11 months ago
- Play with various big data technologies☆10Jul 12, 2017Updated 8 years ago
- Google MobileNets Implementation using Tensorflow☆18Jun 6, 2017Updated 8 years ago
- A minimalist immersive text-based cross-platform game☆31Jan 12, 2025Updated last year
- Scrape financial News from Yahoo and analyse the sentiment (PoC)☆20Jul 16, 2019Updated 6 years ago
- ☆13Dec 14, 2016Updated 9 years ago
- Visualization of topics in a document (documents), aimed to replace word cloud☆19May 10, 2016Updated 9 years ago
- Miscellaneous coding examples for reference☆14Aug 29, 2025Updated 8 months ago
- A library to create shell-like command processors☆16Sep 12, 2024Updated last year
- Multi-temporal Scene dataset for Scene Change Detection.☆15Apr 14, 2021Updated 5 years ago
- ☆11Apr 15, 2019Updated 7 years ago
- ☆12Sep 22, 2015Updated 10 years ago
- Just a helper script for invoking kohya converter (and maybe a cheeky inferencer to check it worked okay)☆11Aug 26, 2023Updated 2 years ago
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- python-training☆11Sep 12, 2018Updated 7 years ago
- Kaggle TalkingData AdTracking Fraud Detection Challenge 48th solution☆11May 18, 2018Updated 7 years ago
- Implementation of Adaptive Noise Reduction and Background Noise Classification using External Microphones on iOS☆16Apr 30, 2019Updated 7 years ago
- Natural Perturbation for Robust Question Answering☆12Apr 7, 2020Updated 6 years ago
- ☆18May 27, 2021Updated 4 years ago
- Get anyone's pinned GitHub repositories easily.☆12Jan 23, 2024Updated 2 years ago
- indoBERT Base-Uncased fine-tuned on Translated Squad v2.0☆19Dec 24, 2024Updated last year
- Cluster paraphrases by word sense☆12Jan 3, 2019Updated 7 years ago