Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆33Oct 5, 2025Updated 6 months ago
Alternatives and similar repositories for Lp-Reg
Users that are interested in Lp-Reg are comparing it to the libraries listed below.
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆44Nov 18, 2025Updated 4 months ago
- [EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions☆12Dec 18, 2023Updated 2 years ago
- Implementation and analysis of red-black trees (SDU CS Data Structures and Algorithms Course Design)☆12Jan 9, 2025Updated last year
- [EMNLP'22] Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset☆20Apr 4, 2023Updated 3 years ago
- The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".☆17Jun 20, 2024Updated last year
- Failure-first AI regression testing CLI for turning AI failures into local regression assets and PR gates. Quickly turns real AI failures into executable regression assets and a checklist for preventing repeat mistakes.☆32Apr 10, 2026Updated last week
- YiRage (Yield Revolutionary AGile Engine) - Multi-Backend LLM Inference Optimization. Extends Mirage with comprehensive support for CUDA,…☆36Updated this week
- Code for the paper - Controlling Dialogue Generation with Semantic Exemplars (NAACL 2021) A semantic exemplar based retrieve-refine appro…☆18Mar 26, 2021Updated 5 years ago
- INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions☆16Jan 21, 2025Updated last year
- Interactive visualization of Claude Code's source architecture☆64Apr 5, 2026Updated last week
- ☆22Oct 20, 2022Updated 3 years ago
- Code for EMNLP 2023 long paper: An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extracti…☆19Feb 2, 2025Updated last year
- The code for the paper "Conditional Temporal Variational AutoEncoder for Action Video Prediction"☆81Mar 27, 2022Updated 4 years ago
- Source code for "A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction" @ NAACL 2022☆19May 1, 2022Updated 3 years ago
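- [java+springboot+vue3] Sangou (三勾) food-ordering system: campus ordering system, store ordering system, Sangou catering system, campus catering system, store catering system☆112May 21, 2025Updated 10 months ago
- Course-selection script for USTC graduate academic seminars☆18Dec 6, 2022Updated 3 years ago
- A high-precision AI image-matting (background removal) tool built on the SOTA model BiRefNet☆57Jan 22, 2026Updated 2 months ago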
- Official code of the paper "Rethinking Infrared Small Target Detection: A Foundation- Driven Efficient Paradigm"☆42Dec 8, 2025Updated 4 months ago
- A Linux mini container runtime written in Go☆161Dec 28, 2025Updated 3 months ago
- ☆48Nov 11, 2025Updated 5 months ago
- Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?☆33Jul 12, 2024Updated last year
- Source code for SIGIR 2023 paper☆23Jul 31, 2023Updated 2 years ago
- EPUB parsing and packaging for 轻小说文库 (Light Novel Library)☆21May 3, 2020Updated 5 years ago
- ☆27Mar 13, 2024Updated 2 years ago
- NuGet Go SDK☆31Apr 2, 2026Updated 2 weeks ago
- Answers to the algorithms and programming workbook; personal answers provided for classmates' reference. | Help classmates learn algorithms - design patterns.☆77Jan 22, 2026Updated 2 months ago
- ☆25Dec 6, 2022Updated 3 years ago
- Yichi Zhang et al. A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning. EMNL…☆20Nov 5, 2020Updated 5 years ago
- AI-powered tool for analyzing GitHub trending repositories and URL metadata☆25Apr 1, 2026Updated 2 weeks ago
- ☆24Aug 16, 2024Updated last year
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs☆27Jan 15, 2025Updated last year
- Intelligent job recommendation platform using Java + MySQL + Redis. Supports location-based search, AI keyword extraction, and personaliz…☆220Aug 31, 2025Updated 7 months ago
- Source code for "A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction" @ NAACL 2022☆37May 7, 2022Updated 3 years ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆43Oct 1, 2024Updated last year
- Telegram AI assistant based on LangGraph, supporting long-term memory, web search, in-depth research, and multi-user permission managemen…☆68Dec 27, 2025Updated 3 months ago
- DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction☆40Apr 19, 2023Updated 2 years ago
- Code for NAACL 2022 paper (Main Track) "RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Ex…☆36Aug 2, 2022Updated 3 years ago
- NovaHook is a lightweight application-layer hook framework for HarmonyOS☆37Mar 16, 2025Updated last year
- Data and code supporting data examples analysis in the paper "Assessing the interconnectedness and systemic risk contagion in the Chinese…☆21Aug 26, 2024Updated last year