Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF) to train and fine-tune the LLaMA2 model to follow human instructions, similar to InstructGPT or ChatGPT but on a much smaller scale.
☆56Mar 9, 2024Updated last year
Alternatives and similar repositories for InstructLLaMA
Users that are interested in InstructLLaMA are comparing it to the libraries listed below
- ☆10Aug 10, 2024Updated last year
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”☆17Feb 26, 2024Updated 2 years ago
- An Affordable LLM Pre-training Benchmark via Accurate Loss Prediction across Scales☆16Jun 6, 2024Updated last year
- Build DDPG models and test them on the stock market☆22Nov 19, 2018Updated 7 years ago
- Deep Reinforcement Learning Framework for Factor Investing☆30Mar 25, 2023Updated 2 years ago
- Financial Analysis and Algorithmic Trading Strategies in Python☆11Feb 16, 2023Updated 3 years ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆33Dec 14, 2023Updated 2 years ago
- This is for the capstone project "Optimal Execution of a VWAP order".☆37Nov 21, 2019Updated 6 years ago
- I use various Data Science and machine learning techniques to analyze customer data using STP framework. I preprocessed the data, perform…☆12Apr 26, 2020Updated 5 years ago
- Kontur Platform API Gateway☆11Aug 27, 2025Updated 6 months ago
- An algorithm that intelligently executes a crypto order over time via Coinbase☆12Oct 26, 2021Updated 4 years ago
- An empirical study of the Fama-French five-factor model on the Chinese market☆40Jul 18, 2022Updated 3 years ago
- Dataset2024☆11Jun 12, 2025Updated 8 months ago
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- RL algorithm for stock trading with multiple reward functions☆11Apr 21, 2024Updated last year
- Morphometric taxonomy of Central Europe☆35Feb 12, 2026Updated 2 weeks ago
- This project focuses on stock prediction; our goal is to implement a trading framework using DRL with LSTM.☆11Jun 1, 2018Updated 7 years ago
- ☆10Jul 21, 2019Updated 6 years ago
- Open Source Tsetlin Machine framework☆17Oct 15, 2018Updated 7 years ago
- Unity TTS plugin: Piper neural synthesis + OpenJTalk Japanese + Unity AI Inference Engine. Windows/Mac/Linux/Android/iOS ready. High-qual…☆18Updated this week
- USB Hid handler for nodejs☆11Sep 30, 2022Updated 3 years ago
- ☆13Jun 17, 2025Updated 8 months ago
- Code release for "Imagination Mechanism: Mesh Information Propagation for Enhancing Data Efficiency in Reinforcement Learning"☆13Oct 7, 2023Updated 2 years ago
- FinanceGPT-B☆10Mar 26, 2024Updated last year
- Data profiling tools for Big Data☆11Nov 17, 2025Updated 3 months ago
- scrape, clean and model IPO data with supervised ML☆10Aug 20, 2020Updated 5 years ago
- tooling for vectorizing the planet☆27Feb 17, 2025Updated last year
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Cooperative workspace driver☆10Jan 20, 2026Updated last month
- ☆10Feb 18, 2025Updated last year
- Predicting the Short-term Direction of Futures Contracts through Machine Learning☆14Oct 15, 2024Updated last year
- Utilities for AI-Assisted Mapping fAIr☆13May 21, 2025Updated 9 months ago
- a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages☆11Feb 9, 2025Updated last year
- The PyTorch implementation of "Modeling Financial Time Series using LSTM with Trainable Initial Hidden States"☆11Jul 15, 2020Updated 5 years ago
- Monlan is a collection of Data Science experiments (DRL and other approaches) in the FOREX algotrading field. Warning! It's my research pro…☆12Aug 1, 2022Updated 3 years ago
- ☆15Nov 20, 2025Updated 3 months ago
- Search engine prototype for Wikipedia articles☆10Jul 31, 2014Updated 11 years ago
- ☆11Jul 4, 2022Updated 3 years ago
- D3PE (Deep Data-Driven Policy Evaluation) aims to evaluate a large set of candidate policies from a fixed dataset to select the best ones.☆11Jun 2, 2022Updated 3 years ago