CarlanLark/Lp-Reg-dev

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CarlanLark/Lp-Reg-dev)

CarlanLark / Lp-Reg-dev

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

☆44

Alternatives and similar repositories for Lp-Reg-dev

Users that are interested in Lp-Reg-dev are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CarlanLark / Lp-Reg
View on GitHub
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆33Oct 5, 2025Updated 9 months ago
CarlanLark / Robust-AIGC-Detector
View on GitHub
Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?
☆33Jul 12, 2024Updated 2 years ago
JyAether / Aether
View on GitHub
☆389May 5, 2025Updated last year
keating666 / yzcbbs
View on GitHub
A Knowledge Base on Pre-made Dishes
☆105Jul 6, 2026Updated 2 weeks ago
suimuc / VIRES
View on GitHub
☆342Jul 4, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
GenerTeam / GENERanno
View on GitHub
GENERanno: A Genomic Foundation Model for Metagenomic Annotation
☆314Jun 15, 2026Updated last month
CarlanLark / IPGPF
View on GitHub
Code for EMNLP 2023 long paper: An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extracti…
☆19Feb 2, 2025Updated last year
ZinYY / TreeLoRA
View on GitHub
[ICML 2025] A pytorch implementation of the paper "TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical G…
☆350Dec 15, 2025Updated 7 months ago
ByteDance-Seed / EvaLearn
View on GitHub
EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…
☆431May 12, 2026Updated 2 months ago
THESIS-AGENT / AIRouter
View on GitHub
🚀 AIRouter - 智能AI路由器：为多个LLM提供商提供统一API接口，支持负载均衡、故障转移和智能路由 | Intelligent AI Router with unified API interface, load balancing, and smart r…
☆180Aug 28, 2025Updated 10 months ago
FinDii / EDA
View on GitHub
In-depth exploratory data analysis, including distribution characteristics, multidimensional correlation, missing value pattern, PCA, et…
☆17Nov 24, 2025Updated 8 months ago
wjf5203 / TokBench
View on GitHub
Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation.
☆152Jun 11, 2026Updated last month
renyuantime / openai-assistant
View on GitHub
Create production-ready, full-suite agents that offer: RAG (Retrieval-Augmented Generation) Function Calling Code Interpreter Streaming c…
☆103Jul 15, 2025Updated last year
tcztzy / swarmx
View on GitHub
Framework exploring ergonomic, lightweight multi-agent orchestration.
☆116Updated this week
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
greatInvoker / 2025-full-stack-tech-sharing
View on GitHub
2025技术分享（FullStack Frontend Focus），分享常用知识点。代码纯手打+AI验证，只做精品！！！
☆153Jul 2, 2025Updated last year
renxh4 / CompressPng
View on GitHub
☆405Aug 31, 2022Updated 3 years ago
xk-dragonfly / xk-RPC
View on GitHub
This project provides a high-performance distributed RPC (Remote Procedure Call) system based on Spring Boot, Netty, and Zookeeper for ef…
☆33Dec 29, 2024Updated last year
DejaOS / DejaOS
View on GitHub
JavaScript Runtime Environment In Embedded Device
☆385Updated this week
SSSYDYSSS / MetaTrx
View on GitHub
MetaTrx: Comprehensive Cross-Species Transcriptome Analysis
☆118Jun 4, 2024Updated 2 years ago
yx-fan / multi-agent-orchestration-framework
View on GitHub
Modular multi-agent orchestration framework powered by LangGraph and FastAPI.
☆27Nov 10, 2025Updated 8 months ago
yixinzhang98 / otc_med_chat_agent
View on GitHub
An AI-powered conversational agent for recommending over-the-counter medications based on user symptoms and needs. Built with Python and …
☆198Jul 29, 2025Updated 11 months ago
shenshanf / mmdepth
View on GitHub
MMDepth: Comprehensive MMEngine-based Framework for Monocular, Stereo & Multi-view Depth Estimation
☆98Mar 4, 2025Updated last year
GabePersson / EmoVision
View on GitHub
☆590Oct 11, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
hehefan / Translution
View on GitHub
☆141Feb 7, 2026Updated 5 months ago
garlic-byte / RL-LLM
View on GitHub
强化学习-大语言模型
☆68Jun 17, 2025Updated last year
Din829 / DbRheo-CLI
View on GitHub
A database operations and data analysis AI agent
☆432Aug 31, 2025Updated 10 months ago
Jiapeng-Pei / LLMSensitiveDataGoverance
View on GitHub
☆286Feb 21, 2026Updated 5 months ago
xcancloud / OpenAPIDesigner
View on GitHub
🔥 OpenAPIDesigner is an open-source OpenAPI specification design tool that allows developers to design, write, and validate OpenAPI spec…
☆400Mar 24, 2026Updated 4 months ago
HKUDS / SepLLM
View on GitHub
[ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"
☆572Jul 29, 2025Updated 11 months ago
HiGoalV / HiGoalVita
View on GitHub
HiGoalVita is a modular, layered, production ready AI RAG suite.
☆252May 22, 2025Updated last year
ZivJia / Cybersecurity-Doughnuts
View on GitHub
Fullstack engineer's checklist for your cybersecurity.
☆383Jul 11, 2024Updated 2 years ago
Xtra-Computing / MegaAgent
View on GitHub
[ACL 2025 Findings] MegaAgent: A Large-Scale Autonomous LLM-based Multi-Agent System Without Predefined SOPs https://aclanthology.org/202…
☆245Nov 18, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
SuperAier / FastDLP
View on GitHub
FastDLP
☆79Jul 25, 2025Updated 11 months ago
Rhythm-Byte / SchemaDiff
View on GitHub
☆246Nov 24, 2024Updated last year
HenryLiu0405 / Industrial-Internet-Communication-Accelerator
View on GitHub
Accelerating industrial internet communication through lightweight Android and Java applications – empowering operators to identify and b…
☆42Jun 16, 2025Updated last year
kaitoInfra / fast-twitter-api
View on GitHub
Simple yet powerful Twitter data retrieval SDK with multi-language support.No Limits, No Auth Required
☆183May 28, 2026Updated last month
IAAR-Shanghai / MaintainCoder
View on GitHub
☆47May 21, 2025Updated last year
ShuaiLyu0110 / SQL-o1
View on GitHub
SQL-o1: A Self-Reward Heuristic Dynamic Search Method for Text-to-SQL
☆197May 23, 2025Updated last year
ichason / Anti-Package-visibility-filtering
View on GitHub
bypass Package visibility filtering on Android（Android11+）
☆86Jul 19, 2025Updated last year