Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆33Oct 5, 2025Updated 5 months ago
Alternatives and similar repositories for Lp-Reg
Users that are interested in Lp-Reg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward☆43Nov 18, 2025Updated 4 months ago
- [EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions☆12Dec 18, 2023Updated 2 years ago
- 红黑树的实现和分析(SDU CS Data Structures and Algorithms Course Design)☆12Jan 9, 2025Updated last year
- [EMNLP'22] Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset☆20Apr 4, 2023Updated 2 years ago
- The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".☆17Jun 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- YiRage (Yield Revolutionary AGile Engine) - Multi-Backend LLM Inference Optimization. Extends Mirage with comprehensive support for CUDA,…☆36Jan 28, 2026Updated last month
- Code for the paper - Controlling Dialogue Generation with Semantic Exemplars (Naacl 2021) A semantic exemplar based retrieve-refine appro…☆18Mar 26, 2021Updated 5 years ago
- INSCIT: Information-Seeking Conversations with Mixed-Initiative Interactions☆16Jan 21, 2025Updated last year
- ☆22Oct 20, 2022Updated 3 years ago
- Code for EMNLP 2023 long paper: An Iteratively Parallel Generation Method with the Pre-Filling Strategy for Document-level Event Extracti…☆19Feb 2, 2025Updated last year
- The code for the paper "Conditional Temporal Variational AutoEncoder for Action Video Prediction“☆81Mar 27, 2022Updated 4 years ago
- Source code for "A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction" @ NAACL 2022☆19May 1, 2022Updated 3 years ago
- 【java+springboot+vue3】三勾点餐系统,校园点餐系统,门店点餐系统,三勾餐饮系统,校园餐饮系统,门店餐饮系统☆112May 21, 2025Updated 10 months ago
- USTC研究生学术报告选课脚本☆18Dec 6, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 一款基于 SOTA 模型 BiRefNet 开发的高精度 AI 抠图工具☆57Jan 22, 2026Updated 2 months ago
- Official code of the paper "Rethinking Infrared Small Target Detection: A Foundation- Driven Efficient Paradigm"☆43Dec 8, 2025Updated 3 months ago
- A Linux mini container runtime written in Go☆161Dec 28, 2025Updated 2 months ago
- ☆48Nov 11, 2025Updated 4 months ago
- Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?☆33Jul 12, 2024Updated last year
- 轻小说文库 epub 解析打包☆21May 3, 2020Updated 5 years ago
- Srouce code for SIGIR 2023 paper☆23Jul 31, 2023Updated 2 years ago
- ☆27Mar 13, 2024Updated 2 years ago
- NuGet Go SDK☆31Mar 20, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 算法与编程练习册答案,个人答案供同学们参考。 | Help classmates learn algorithms - design patterns.☆77Jan 22, 2026Updated 2 months ago
- ☆25Dec 6, 2022Updated 3 years ago
- Yichi Zhang et al. A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning. EMNL…☆20Nov 5, 2020Updated 5 years ago
- AI-powered tool for analyzing GitHub trending repositories and URL metadata☆25Mar 2, 2026Updated 3 weeks ago
- ☆24Aug 16, 2024Updated last year
- The first Object-Oriented Programming (OOP) Evaluation Benchmark for LLMs☆27Jan 15, 2025Updated last year
- Intelligent job recommendation platform using Java + MySQL + Redis. Supports location-based search, AI keyword extraction, and personaliz…☆219Aug 31, 2025Updated 6 months ago
- Source code for "A Two-Stream AMR-enhanced Model for Document-level Event Argument Extraction" @ NAACL 2022☆37May 7, 2022Updated 3 years ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆43Oct 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Telegram AI assistant based on LangGraph, supporting long-term memory, web search, in-depth research, and multi-user permission managemen…☆68Dec 27, 2025Updated 3 months ago
- Paper accepted by CIKM 2024. Codes of GongBU, a LLM fine-tuning platform for domain-specific adaptation.☆1,067Jan 22, 2026Updated 2 months ago
- DocEE: A Large-Scale and Fine-grained Benchmark for Document-level Event Extraction☆40Apr 19, 2023Updated 2 years ago
- Code for NAACL 2022 paper (Main Track) "RAAT: Relation-Augmented Attention Transformer for Relation Modeling in Document-Level Event Ex…☆36Aug 2, 2022Updated 3 years ago
- Data and code supporting data examples analysis in the paper "Assessing the interconnectedness and systemic risk contagion in the Chinese…☆21Aug 26, 2024Updated last year
- [Accepted by Information Fusion] Official code of the paper "Relational Representation Learning Network for Cross-Spectral Image Patch Ma…☆35Sep 13, 2025Updated 6 months ago
- NovaHook是一款轻量级鸿蒙应用层hook框架☆37Mar 16, 2025Updated last year