Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆43Nov 18, 2025Updated 3 months ago
Alternatives and similar repositories for Lp-Reg-dev
Users who are interested in Lp-Reg-dev are comparing it to the libraries listed below
- This project provides a high-performance distributed RPC (Remote Procedure Call) system based on Spring Boot, Netty, and Zookeeper for ef…☆34Dec 29, 2024Updated last year
- Modular multi-agent orchestration framework powered by LangGraph and FastAPI.☆26Nov 10, 2025Updated 3 months ago
- A template project based on gdbus-codegen and glib☆43Jun 25, 2025Updated 8 months ago
- UHDmedi is an R package for estimating total, indirect, and direct effects in ultra-high dimensional mediation. It offers functions for s…☆11May 19, 2025Updated 9 months ago
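- This agent is an eBPF-based container anomaly detection framework that collects behavioral and metric features from containers and uses AI algorithms to automatically identify containers exhibiting anomalous behavior.☆119Apr 11, 2025Updated 10 months ago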
- GENERanno: A Genomic Foundation Model for Metagenomic Annotation☆306Updated this week
- ☆141Feb 7, 2026Updated 3 weeks ago
- A full-stack web application for data analysis and visualization, featuring AI-driven insights generation powered by Gemini.☆25Dec 23, 2024Updated last year
- ☆60May 15, 2025Updated 9 months ago
- MMDepth: Comprehensive MMEngine-based Framework for Monocular, Stereo & Multi-view Depth Estimation☆98Mar 4, 2025Updated last year
- MetaTrx: Comprehensive Cross-Species Transcriptome Analysis☆118Jun 4, 2024Updated last year
- Data and code for adaptive strategies for climate change adaptation: an application to flood risk management☆136Feb 13, 2025Updated last year
- Accelerating industrial internet communication through lightweight Android and Java applications – empowering operators to identify and b…☆44Jun 16, 2025Updated 8 months ago
- Improvements to animations based on Manim, designed to facilitate the demonstration of algorithms in data structures, operating systems, …☆207Dec 15, 2025Updated 2 months ago
- 2025 tech sharing (FullStack Frontend Focus): commonly used knowledge points. Code typed entirely by hand and verified with AI; quality content only!☆154Jul 2, 2025Updated 8 months ago
- A fast JSON5 encoder/decoder for Python☆43Apr 16, 2025Updated 10 months ago
- The pipeline of annotating plant disease resistance genes based on deep protein language and machine learning models☆104Jan 15, 2025Updated last year
- [ACL 2025 Findings] MegaAgent: A Large-Scale Autonomous LLM-based Multi-Agent System Without Predefined SOPs https://aclanthology.org/202…☆236Nov 18, 2025Updated 3 months ago
- ☆391May 5, 2025Updated 10 months ago
- ☆11Oct 30, 2021Updated 4 years ago
- ☆10Oct 30, 2021Updated 4 years ago
- FreeSwap Smart Contracts☆28Nov 16, 2024Updated last year
- ☆344Jul 4, 2025Updated 8 months ago
- One-click pre-training and SFT of your own LLM. Training a GPT has never been easier for beginners.☆367Feb 4, 2026Updated last month
- OasisDB: A minimal and lightweight vector database☆68Aug 24, 2025Updated 6 months ago
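- A minimal, efficient LLM agent development framework that is easy to integrate, flexibly extensible, strong at context management, and beginner-friendly☆102Jul 16, 2025Updated 7 months ago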
- A Go library implementation of the Model Controller Protocol (MCP). This library allows developers to easily parse MCP service configurat…☆48Apr 27, 2025Updated 10 months ago
- ☆14Oct 29, 2021Updated 4 years ago
- Advanced Unsupervised Image Enhancement with GAN☆247Nov 11, 2024Updated last year
- A simple library that lets you write JS in WeChat Mini Programs in a Vue-like style☆33Jun 29, 2018Updated 7 years ago
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆35May 7, 2024Updated last year
- Uses GPT to summarize papers related to a given title☆39Jul 16, 2023Updated 2 years ago
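- An extremely easy-to-use uniapp development framework: uni-plus is a cross-platform quick-start template powered by Uniapp + Vue3 + TS + Vite + Pinia + Unocss + WotUi, developed with VS Code, with rich code hints, error checking, type hints, pre-installed plugins, …☆272Mar 14, 2025Updated 11 months ago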
- GENERator: A Long-Context Generative Genomic Foundation Model☆444Feb 10, 2026Updated 3 weeks ago
- OpenMM, an open-source platform for molecular dynamics (MD) simulations, is supported by an MCP server that offers a structured communica…☆32May 31, 2025Updated 9 months ago
- ☆13Oct 30, 2021Updated 4 years ago
- An extension-point, plugin-based framework☆114Jan 25, 2026Updated last month