liushulinle / UloRLView external linksLinks
An Ultra-Long Output Reinforcement Learning Approach
☆23Jul 31, 2025Updated 6 months ago
Alternatives and similar repositories for UloRL
Users that are interested in UloRL are comparing it to the libraries listed below
Sorting:
- ☆54Jul 7, 2025Updated 7 months ago
- C^3-Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking☆37Jun 30, 2025Updated 7 months ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆45Sep 19, 2025Updated 4 months ago
- multicast learning in network programming course☆10Oct 30, 2020Updated 5 years ago
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆24Jan 5, 2026Updated last month
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 4 months ago
- ☆10Feb 22, 2022Updated 3 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 5 months ago
- Tutorial about noisy labels for SIBGRAPI 2020☆11Nov 6, 2020Updated 5 years ago
- 使用flutter仿QQ的界面UI功能效果☆12Dec 27, 2023Updated 2 years ago
- EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆27Jul 30, 2025Updated 6 months ago
- My collection of dotfiles☆15Updated this week
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated last year
- Official implementation for Text Generation Beyond Discrete Token Sampling☆21Aug 11, 2025Updated 6 months ago
- ☆21Sep 25, 2025Updated 4 months ago
- This is the source code for Efficient Sequential Recommendation for Long Term User Interest Via Personalization.☆19Nov 18, 2025Updated 2 months ago
- ☆18Jul 8, 2025Updated 7 months ago
- Awesome LLM for Cybersecurity☆11Nov 16, 2024Updated last year
- auth client for yuque oauth app☆10Jul 13, 2023Updated 2 years ago
- Enemies for your LLM☆35Jan 20, 2026Updated 3 weeks ago
- This repository is for the "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation".☆18Nov 18, 2025Updated 2 months ago
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆15Jan 6, 2026Updated last month
- This is a sample project where we can get the exact use case of pythons multi threading.☆11Oct 6, 2020Updated 5 years ago
- Cross-domain word representation learning☆10May 23, 2015Updated 10 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- Accelerating RL for LLM Reasoning with Optimal Advantage Regression☆34May 30, 2025Updated 8 months ago
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated 11 months ago
- ☆15Nov 18, 2025Updated 2 months ago
- Practice typing In your favorite programming language☆12Apr 27, 2014Updated 11 years ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- ☆54Oct 29, 2024Updated last year
- ☆10Jul 28, 2020Updated 5 years ago
- Official implementation of the paper “Reconsidering Overthinking: Penalizing Internal and External Redundancy in CoT Reasoning”☆20Aug 20, 2025Updated 5 months ago
- An official implementation for the KDD 2025 paper 'Unlocking the Power of Diffusion Models in Sequential Recommendation: A Simple and Eff…☆22Jun 4, 2025Updated 8 months ago
- macOS app for upscaling images using Vision and Core ML☆19Mar 26, 2024Updated last year
- A paddle implementation of "Masked Autoencoders Are Scalable Vision Learners"☆11Feb 24, 2022Updated 3 years ago
- 自定义模糊效果UICustomBlurEffect,基础模糊用法,包含OC和Swift两个版本。☆11Oct 11, 2020Updated 5 years ago
- flutter_nest_page_view☆17Apr 7, 2024Updated last year
- A perl script for searching and replacing in mathematics in LaTeX documents.☆13Jul 21, 2021Updated 4 years ago