CarlanLark / Lp-Reg-devLinks
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆31Updated last week
Alternatives and similar repositories for Lp-Reg-dev
Users that are interested in Lp-Reg-dev are comparing it to the libraries listed below
Sorting:
- TITAN : A Task-oriented Dialogue Dataset with Mixed-Initiative Interactions☆33Updated 2 years ago
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆21Updated 10 months ago
- ☆26Updated last year
- ☆29Updated last year
- CTFd_v3.5.1中文版本☆29Updated last year
- ☆75Updated 8 months ago
- Ethan daily algorithm practice☆18Updated last year
- Common implementations of VaR models☆11Updated last year
- ☆12Updated 8 months ago
- ☆16Updated last year
- [TMLR 2022] DHA: End-to-End Joint Optimization of Data Augmentation Policy, Hyper-parameter and Architecture☆30Updated last year
- 使用donut多模态模型,身份证识别,对身份证做端对端识别,无需中间处理,识别率达到商用☆18Updated last year
- 🎵 Unlocking the Power of Personalized Playlists 🎧 Discover your musical soulmate with MelodiCue's tailored recommendations.☆52Updated last year
- QRec is an algorithm that helps you quickly find the largest fixed-aspect, axis-aligned rectangle that can be inscribed in any given poly…☆27Updated 4 months ago
- a GUI☆28Updated last year
- An open-source toolbox focused on MP4 file processing and Dolby Digital audio encoding, providing efficient and user-friendly Dolby audio…☆42Updated 5 months ago
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆32Updated last year
- Number Animation Effect☆16Updated 11 months ago
- mobile predict☆25Updated 11 months ago
- ROSE: Robust Cross Supervision with Neighborhood Mining for Source-free Graph Domain Adaptation☆19Updated last year
- A library on numerical analysis and numerical calculation.☆42Updated 9 months ago
- ☆24Updated 10 months ago
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆40Updated 8 months ago
- ☆40Updated 4 months ago
- ☆11Updated last year
- Dynamic Topic Segmentation in Dialogues: Enhancing Boundaries with Topic-Aware Propagation☆42Updated 10 months ago
- a scalable short link generation service to improve marketing efforts☆21Updated last year
- Escape room game with quiz-solving and smart AI navigation. Unity + NavMesh + C#.☆41Updated 4 months ago
- AI agents united for smarter trading and copy strategies.☆32Updated 10 months ago
- ☆62Updated last year