CarlanLark/Lp-Reg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CarlanLark/Lp-Reg)

CarlanLark / Lp-Reg

Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward

☆33

Alternatives and similar repositories for Lp-Reg

Users that are interested in Lp-Reg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CarlanLark / Lp-Reg-dev
View on GitHub
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆44Nov 18, 2025Updated 7 months ago
yunbeizhang / MM-Plan
View on GitHub
[ICLRW 2026 Best Short Paper Award] Visual Exclusivity Attacks: Automatic Multimodal Red Teaming via Agentic Planning
☆89Apr 15, 2026Updated 2 months ago
Judecoin / jude-eth-swap
View on GitHub
☆10Mar 4, 2025Updated last year
TianHongZXY / qaap
View on GitHub
[EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions
☆12Dec 18, 2023Updated 2 years ago
GUTS-W / red-black-tree
View on GitHub
红黑树的实现和分析（SDU CS Data Structures and Algorithms Course Design）
☆12Jan 9, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
open-event-hub / title2event_baselines
View on GitHub
[EMNLP'22] Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset
☆20Apr 4, 2023Updated 3 years ago
ruizheng20 / gpo
View on GitHub
The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".
☆17Jun 20, 2024Updated 2 years ago
Caxson / swiftagentx
View on GitHub
Enterprise-grade fast-response Agent framework.
☆220Jun 24, 2026Updated 2 weeks ago
prakharguptaz / EDGE-exemplars
View on GitHub
Code for the paper - Controlling Dialogue Generation with Semantic Exemplars (Naacl 2021) A semantic exemplar based retrieve-refine appro…
☆18Mar 26, 2021Updated 5 years ago
ellenmellon / INSCIT
View on GitHub
INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions
☆16Jan 21, 2025Updated last year
nancui0000 / adaptive-mogrpo
View on GitHub
Adaptive Weight Scheduling for Multi-Objective GRPO in Code Generation. Fixed multi-objective rewards cause reward hacking (short but bro…
☆49Apr 14, 2026Updated 2 months ago
gulucaptain / CameraNoise
View on GitHub
[ICML'26] CameraNoise helps control the faithful camera movement 🎞️ in the video diffusion.
☆209Jun 2, 2026Updated last month
allenai / drug-combo-extraction
View on GitHub
☆22Oct 20, 2022Updated 3 years ago
CarlanLark / IPGPF
View on GitHub
Code for EMNLP 2023 long paper: An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extracti…
☆19Feb 2, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
xytian1008 / SAS
View on GitHub
The official implementation of "Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition" …
☆73Apr 4, 2026Updated 3 months ago
pkunlp-icler / TSAR
View on GitHub
Source code for "A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction" @ NAACL 2022
☆19May 1, 2022Updated 4 years ago
xiaogang00 / ACT-VAE
View on GitHub
The code for the paper "Conditional Temporal Variational AutoEncoder for Action Video Prediction“
☆82Mar 27, 2022Updated 4 years ago
wxw850227 / jjjshop_food_java
View on GitHub
【java+springboot+vue3】三勾点餐系统，校园点餐系统，门店点餐系统，三勾餐饮系统，校园餐饮系统，门店餐饮系统
☆118May 21, 2025Updated last year
VincentJYZhang / USTC_Lecture
View on GitHub
USTC研究生学术报告选课脚本
☆18Dec 6, 2022Updated 3 years ago
bobbylkchao / ai-phone-agent
View on GitHub
AI Phone Agent: A starter kit to build AI agents that answer real phone calls and talk to customers in real time (OpenAI Realtime). Node.…
☆104Apr 18, 2026Updated 2 months ago
XIAOTsune / MatteBackgroundFree
View on GitHub
一款基于 SOTA 模型 BiRefNet 开发的高精度 AI 抠图工具
☆62Jan 22, 2026Updated 5 months ago
YuChuang1205 / FDEP-Framework
View on GitHub
Official code of the paper "Rethinking Infrared Small Target Detection: A Foundation- Driven Efficient Paradigm"
☆42Dec 8, 2025Updated 7 months ago
hwang-fu / minicontainer
View on GitHub
A Linux mini container runtime written in Go
☆162Dec 28, 2025Updated 6 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
CarlanLark / Robust-AIGC-Detector
View on GitHub
Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?
☆33Jul 12, 2024Updated last year
nealgatech-web / comment-moderation-api
View on GitHub
☆48Nov 11, 2025Updated 7 months ago
NUSTM / FS-ABSA
View on GitHub
Srouce code for SIGIR 2023 paper
☆24Jul 31, 2023Updated 2 years ago
clownrat6 / Novel_Theft
View on GitHub
轻小说文库 epub 解析打包
☆21May 3, 2020Updated 6 years ago
PrasannS / rlhf-length-biases
View on GitHub
☆27Mar 13, 2024Updated 2 years ago
ceilf6 / CPlusPlus
View on GitHub
算法与编程练习册答案，个人答案供同学们参考。 | Help classmates learn algorithms - design patterns.
☆77Jan 22, 2026Updated 5 months ago
huhouhua / go-nuget
View on GitHub
NuGet Go SDK
☆31Updated this week
PLUM-Lab / Event_Query_Extract
View on GitHub
☆25Dec 6, 2022Updated 3 years ago
thu-spmi / LABES
View on GitHub
Yichi Zhang et al. A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning. EMNL…
☆20Nov 5, 2020Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
zjy365 / gh-explorer
View on GitHub
AI-powered tool for analyzing GitHub trending repositories and URL metadata
☆27Jun 7, 2026Updated last month
lawrenceching / metricdump
View on GitHub
☆24Aug 16, 2024Updated last year
Horace-Maxwell / Harness_Engineering_Regression_Copilot
View on GitHub
Failure-first AI regression testing CLI for turning AI failures into local regression assets and PR gates. 把真实 AI 失败快速变成可执行回归资产和防止再次犯错清单。
☆82Apr 10, 2026Updated 2 months ago
alphadl / OOP-eval
View on GitHub
The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs
☆27Jan 15, 2025Updated last year
RunxinXu / TSAR
View on GitHub
Source code for "A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction" @ NAACL 2022
☆37May 7, 2022Updated 4 years ago
nuglifeleoji / Job-Recommendation-System
View on GitHub
Intelligent job recommendation platform using Java + MySQL + Redis. Supports location-based search, AI keyword extraction, and personaliz…
☆228Aug 31, 2025Updated 10 months ago
facebookresearch / llm-cross-capabilities
View on GitHub
Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"
☆43Oct 1, 2024Updated last year