CarlanLark / Lp-Reg-dev
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆42Updated last month
Alternatives and similar repositories for Lp-Reg-dev
Users who are interested in Lp-Reg-dev are comparing it to the libraries listed below
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆41Updated 10 months ago
- ☆62Updated last year
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆21Updated last year
- ☆104Updated 11 months ago
- Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering☆42Updated last month
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆32Updated last year
- This search engine leverages the Boost library for efficient document search, featuring data preprocessing, index creation, and advanced …☆59Updated last year
- Concise Evaluation Benchmark for Large Language Models☆25Updated 5 months ago
- Voice-to-motion aerial robot using ESP32-S3, Node.js, Deepgram, ChatGPT, and Arduino.☆30Updated 6 months ago
- This script monitors the remaining traffic of VMs on Vultr, DigitalOcean, and Linode. If the remaining traffic is zero, it shuts down the…☆33Updated last year
- ☆49Updated 2 years ago
- Reinforcement Learning - Large Language Models☆68Updated 6 months ago
- ☆41Updated 9 months ago
- ☆57Updated last year
- Low-code core component: implementation of the data model☆56Updated last year
- ☆98Updated 9 months ago
- An iterative optimization system☆33Updated 6 months ago
- ☆40Updated 8 months ago
- Dynamic Topic Segmentation in Dialogues: Enhancing Boundaries with Topic-Aware Propagation☆42Updated last year
- Imagine building a whole operating system around just your notes.☆80Updated 10 months ago
- ☆28Updated 7 months ago
- A Chatbot with UI design is created, according to some certain datasets (can be replaced). Through statistical analysis and PINN model, i…☆27Updated 7 months ago
- [NeurIPS 25 @ ER] Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs☆73Updated last month
- ☆81Updated last month
- a demo but fun snake game created in https://aide.ink☆66Updated 11 months ago
- ☆33Updated 2 years ago
- A PyTorch quantization tool for machine learning models☆78Updated 10 months ago
- Enable Agents to conduct web3 operations, support wider applications for cross-chain bridging☆43Updated 11 months ago
- Training and evaluation code of EGTLM model.☆22Updated last year
- Store and download PseudoMeta R Package☆28Updated 6 months ago