Train a tiny LLaMA model from scratch to repeat your words using Reinforcement Learning from Human Feedback (RLHF)
☆18May 23, 2024Updated last year
Alternatives and similar repositories for nanoRLHF
Users that are interested in nanoRLHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- FIGMENT☆15Jan 27, 2020Updated 6 years ago
- Neural Paraphrase Generation based on OpenNMT-py☆12Jan 2, 2018Updated 8 years ago
- Social Distancing Analyzer using OpenCV and YOLO☆10Aug 30, 2024Updated last year
- ☆21Oct 28, 2024Updated last year
- use simple image processing to detect cars in videos☆11Sep 7, 2018Updated 7 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆22Feb 16, 2025Updated last year
- Indonesian Image Captioning using Attention-based Semantic Compositional Networks☆13Jul 31, 2019Updated 6 years ago
- Algorithm training.☆10Jul 21, 2020Updated 5 years ago
- Compiler for the Tiger programming language☆12Oct 27, 2018Updated 7 years ago
- A RPC Server implement base on Raft Paper in Golang☆10Jun 17, 2016Updated 9 years ago
- A minimalist immersive text-based cross-platform game☆32Jan 12, 2025Updated last year
- Telstra Network Disruptions - Predict service faults on Australia's largest telecommunications network☆13Oct 28, 2016Updated 9 years ago
- Scrape financial News from Yahoo and analyse the sentiment (PoC)☆20Jul 16, 2019Updated 6 years ago
- Visualization of topics in a document (documents), aimed to replace word cloud☆19May 10, 2016Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A library to create shell-like command processors☆16Sep 12, 2024Updated last year
- POS Tag for Indonesian language☆18Dec 24, 2016Updated 9 years ago
- Knowledge Graph based Question Answering benchmark.☆10Feb 1, 2020Updated 6 years ago
- Kaggle TalkingData AdTracking Fraud Detection Challenge 48th solution☆11May 18, 2018Updated 7 years ago
- Go client library for SeatGeek's Sixpack AB testing framework.☆20Sep 2, 2013Updated 12 years ago
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.☆14Jan 27, 2017Updated 9 years ago
- Convert Pytorch model to Tensorflow lite and deploy it in ESP32.☆21Oct 10, 2024Updated last year
- DataFest Competition April 2017 - Data Analysis on Expedia Click Data☆13Sep 30, 2017Updated 8 years ago
- Implementation of Adaptive Noise Reduction and Background Noise Classification using External Microphones on iOS☆16Apr 30, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆18May 27, 2021Updated 4 years ago
- An open-source food image embedding model☆27Dec 10, 2017Updated 8 years ago
- 自己实现的一些美颜美妆算法☆11Oct 11, 2020Updated 5 years ago
- An iOS Application to cancel noise when using headsets.☆10May 6, 2015Updated 10 years ago
- Get anyones pinned GitHub repositories easily.☆12Jan 23, 2024Updated 2 years ago
- Cluster paraphrases by word sense☆12Jan 3, 2019Updated 7 years ago
- indoBERT Base-Uncased fine-tuned on Translated Squad v2.0☆19Dec 24, 2024Updated last year
- lime-ner: extending LIME for Named Entity Recognition☆10Aug 15, 2018Updated 7 years ago
- ☆14Apr 6, 2014Updated 11 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- UCLA CS 188 (Winter 2023) course project.☆12Mar 31, 2023Updated 2 years ago
- 华东师范大学 校园网网关一键登录☆29Nov 20, 2022Updated 3 years ago
- Deep neural models for core NLP tasks☆13Nov 9, 2017Updated 8 years ago
- Unofficial PyTorch implementation of the Contrast Adaptative Sharpening (CAS) featured in AMD's FidelityFX.☆15Oct 5, 2023Updated 2 years ago
- A python library for easily querying morphological inflection models trained on Unimorph☆13Oct 23, 2022Updated 3 years ago
- Sandbox for playing with Neo4J and graph approaches to NLP☆12Jul 12, 2017Updated 8 years ago
- AAAI2024 Global Competition on Math Problem Solving and Reasoning☆14Oct 4, 2023Updated 2 years ago