CarlanLark / Lp-RegLinks
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆33Updated last month
Alternatives and similar repositories for Lp-Reg
Users that are interested in Lp-Reg are comparing it to the libraries listed below
Sorting:
- ☆86Updated 8 months ago
- (EMNLP 2025 Findings) Source Evaluation scripts for Humanity's Last Code Exam☆92Updated 2 months ago
- The codes for the paper One-bit Deep Hashing: Towards a Resource-Efficient Hashing Model with Binary Neural Networks (ACMMM24)☆45Updated 8 months ago
- [BIRD-INTERACT] Re-imagines Text-to-SQL evaluation via lens of dynamic interactions.☆429Updated last week
- 一个基于 Qt6 和 C++20 构建的现代化、功能丰富的通信调试平台。该应用程序集成了串口通信、TCP网络通信、JavaScript脚本引擎、数据可视化等核心功能,为开发者提供了专业的数据处理和协议解析能力。支持多种数据格式、实时监控、专业级数据可视化和完全可定制的样式系…☆84Updated last month
- ☆50Updated 6 months ago
- ☆100Updated 9 months ago
- [COLM 2025] Assessing Judging Bias in Large Reasoning Models: An Empirical Study https://arxiv.org/abs/2504.09946☆164Updated last month
- Spring项目:支持设置时间、价格、距离权重的个性化导航服务,并支持根据大量用户行驶状态更新道路情况和预计到达时间☆22Updated 6 months ago
- 基于Google的Gemini模型API开发VLM☆198Updated 6 months ago
- A comprehensive, production-ready framework for building intelligent AI agents with advanced capabilities including tool calling, persist…☆161Updated 2 months ago
- [ACL 2025 Oral] QAEncoder: Towards Aligned Representation Learning in Question Answering Systems☆176Updated 4 months ago
- [VLDB 2025] SimRN: Trajectory Similarity Learning in Road Networks based on Distributed Deep Reinforcement Learning☆105Updated 6 months ago
- The code for Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models (Finding of ACL2025)☆82Updated 4 months ago
- ☆75Updated 5 months ago
- 1.1☆81Updated last month
- 【最新国际股票】代号:Stock-Finex-多语言股票-功能:新股申购、大宗交易、股票配资、质押理财、在线客服-多国语言,最新股票源码-股票搭建-java股票☆80Updated 3 months ago
- ☆107Updated last year
- A simple and graceful HTTP request tool!☆82Updated 4 months ago
- MAX31855 full-featured driver library for general-purpose MCU and Linux.☆70Updated 2 weeks ago
- EmbodyHub☆79Updated 9 months ago
- A ChatGPT-based programming approach is proposed to assist in solving engineering computational problems. Using three-dimensional slope s…☆113Updated 2 months ago
- A project aims to improve LLMs' pixel reasoning ability.☆81Updated 2 months ago
- ☆50Updated 7 months ago
- A lightweight intelligent agent framework implementing the complete ReAct pattern☆173Updated 3 months ago
- ☆156Updated 3 months ago
- ☆51Updated 3 weeks ago
- Go bindings for the CUDA Driver and Runtime APIs, cuBLAS, and cuDNN.☆154Updated last month
- Connect INCA and mdf conversion☆33Updated last month
- ANIMAT is the first AI platform to integrate MMD and facial tracking for dynamic 3D Model, enabling realistic customization and upgrade o…☆83Updated 9 months ago