mingyin0312 / RL4LLM
RL significantly improves the reasoning capability of Qwen2.5-1.5B-Instruct
☆31Feb 23, 2025Updated 11 months ago
Alternatives and similar repositories for RL4LLM
Users that are interested in RL4LLM are comparing it to the libraries listed below
- Composition of Multimodal Language Models From Scratch☆15Aug 16, 2024Updated last year
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)☆44Jan 6, 2026Updated last month
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- A Python reimplementation of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆18Dec 1, 2023Updated 2 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆27Jul 14, 2021Updated 4 years ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆12Jan 12, 2021Updated 5 years ago
- Financial Analysis and Algorithmic Trading Strategies in Python☆11Feb 16, 2023Updated 2 years ago
- Work carried out for the Teknofest 2023 Turkish Natural Language Processing competition; using the SHAP analysis method, the model's predictions…☆28Mar 31, 2023Updated 2 years ago
- RL algorithm for stock trading with multiple reward functions☆11Apr 21, 2024Updated last year
- G4T0R2 - TEKNOFEST 2024 Turkish Natural Language Processing - Scenario Team #Acıkhack2024TDDİ☆10Jan 25, 2025Updated last year
- This project focuses on stock prediction; our goal is to implement a trading framework using DRL with LSTM.☆11Jun 1, 2018Updated 7 years ago
- Sample repository for my awesome Youtube viewers.☆10Jun 3, 2020Updated 5 years ago
- a recommendation list of math courses for people with no math background.☆11Mar 2, 2021Updated 4 years ago
- ☆10May 19, 2022Updated 3 years ago
- FinanceGPT-B☆10Mar 26, 2024Updated last year
- Mintlemon, a Turkish natural language processing library, was developed as part of the Teknofest Turkish Natural Language Processing Competition. The Nane&Limon team…☆44Jun 1, 2024Updated last year
- Hand Written Blots augmentation☆12Aug 28, 2025Updated 5 months ago
- Keyscan: AI-powered API key scanner for GitHub Gists.☆28Jan 1, 2026Updated last month
- Use pretrained BERT model to automatically generate grammar multiple choice questions (MCQ) from any news article or story.☆13Oct 2, 2019Updated 6 years ago
- Seamlessly integrate IoT data with AI agents, enabling the effortless parsing, processing, and utilization of IoT data streams.☆10Jan 27, 2025Updated last year
- Reasoning-based Evaluation and Ranking of Translations.☆19Jul 18, 2025Updated 6 months ago
- A purer, higher-compression-ratio tokenizer in Rust☆13Dec 21, 2024Updated last year
- Mac port of Torcs, The Open Racing Car Simulator☆11Jun 16, 2010Updated 15 years ago
- ☆11Oct 6, 2020Updated 5 years ago
- Professional Wargaming LLM Toolbox☆20Jul 9, 2025Updated 7 months ago
- The PyTorch implementation of "Modeling Financial Time Series using LSTM with Trainable Initial Hidden States"☆11Jul 15, 2020Updated 5 years ago
- ☆11Mar 25, 2023Updated 2 years ago
- Text-to-image generation using Huggingface stable diffusion ControlNet conditioning and AWS Translate's prompt translation function☆14Aug 25, 2023Updated 2 years ago
- ☆13May 25, 2023Updated 2 years ago
- R package for tracking Covid19 cases in San Francisco☆12Apr 2, 2023Updated 2 years ago
- ChatGPT solutions for the MLE interview☆14Dec 9, 2022Updated 3 years ago
- An overview of popular reranking models and architectures for 2 stage RAG pipelines☆20Jun 10, 2025Updated 8 months ago
- Concise Reasoning via Reinforcement Learning☆13Apr 16, 2025Updated 9 months ago
- Lossless normalization of uppercase characters☆11Jul 3, 2023Updated 2 years ago
- ☆15Apr 11, 2023Updated 2 years ago
- High-quality reference implementations of various algorithms for Inverse Reinforcement Learning☆13Jun 20, 2018Updated 7 years ago
- code for Towards Data Science article on prompt-loss-weight☆11Jun 4, 2025Updated 8 months ago
- todo: desc☆11Aug 12, 2021Updated 4 years ago
- Predicting the Short-term Direction of Futures Contracts through Machine Learning☆14Oct 15, 2024Updated last year