CarlanLark / Lp-Reg
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆33Updated 2 months ago
Alternatives and similar repositories for Lp-Reg
Users that are interested in Lp-Reg are comparing it to the libraries listed below
- ☆86Updated 9 months ago
- (EMNLP 2025 Findings) Source Evaluation scripts for Humanity's Last Code Exam☆93Updated 3 months ago
- Using multiple regression model for analyzing and predicting the stock price☆41Updated 9 months ago
- ☆50Updated 7 months ago
- The code for Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models (Findings of ACL 2025)☆83Updated 4 months ago
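- A modern, feature-rich communication debugging platform built with Qt6 and C++20. The application integrates core features such as serial communication, TCP network communication, a JavaScript scripting engine, and data visualization, giving developers professional data-processing and protocol-parsing capabilities. Supports multiple data formats, real-time monitoring, professional-grade data visualization, and a fully customizable style sys…☆85Updated last month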
- MAX31855 full-featured driver library for general-purpose MCU and Linux.☆70Updated last month
- Common 3rd party API Simulator with payment api demo. Utilize Spring boot, Redis, MySQL, Docker, Groovy, Velocity, etc.☆38Updated 8 months ago
- [BIRD-INTERACT] Re-imagines Text-to-SQL evaluation via lens of dynamic interactions.☆451Updated 2 weeks ago
- A modern web application for the Melbourne University Ultimate Frisbee Club, built with Next.js 15, TypeScript, and Tailwind CSS. This pl…☆101Updated 4 months ago
- The 1st dynamic phishing kit dataset☆202Updated 10 months ago
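- [Latest international stocks] Code name: Stock-Finex - multilingual stocks - Features: IPO subscription, block trading, stock margin financing, pledge-based wealth management, online customer service - multiple languages, latest stock source code - stock platform setup - Java stocks☆80Updated 4 months ago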
- [ACL 2025 Oral] QAEncoder: Towards Aligned Representation Learning in Question Answering Systems☆176Updated 4 months ago
- A fall detection system that fuses video and Wi-Fi☆67Updated 2 months ago
- A comprehensive, production-ready framework for building intelligent AI agents with advanced capabilities including tool calling, persist…☆163Updated 3 months ago
- ☆100Updated 10 months ago
- ☆209Updated last month
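- Provides utility functions commonly used in projects, such as timestamp and format conversion and data type checks. True to the name screw, it aims to be a small screw in the project development process.☆48Updated 3 months ago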
- ☆92Updated this week
- The code for paper "Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review" accepted by ACL 2025.☆103Updated 6 months ago
- A Quantum Computing Library in Rust which helps deploy your emulation☆121Updated this week
- [COLM 2025] Assessing Judging Bias in Large Reasoning Models: An Empirical Study https://openreview.net/pdf?id=SlRtFwBdzP☆164Updated 2 months ago
- EmbodyHub☆79Updated 10 months ago
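- [Latest international stocks] Code name: Stock-Finvest - multilingual stocks - Features: IPO subscription, block trading, stock margin financing, pledge-based wealth management, online customer service - multiple languages, latest stock source code - stock platform setup - Java stocks - global stock platform setup - optional stock data☆80Updated 4 months ago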
- Go bindings for the CUDA Driver and Runtime APIs, cuBLAS, and cuDNN.☆154Updated last week
- ☆82Updated 3 months ago
- Polyomino: Mapping cell locations via multi-layer regionalization constraints☆35Updated 2 weeks ago
- ☆233Updated 5 months ago
- semqreg package☆120Updated 4 months ago
- [VLDB 2025] SimRN: Trajectory Similarity Learning in Road Networks based on Distributed Deep Reinforcement Learning☆105Updated 7 months ago