CarlanLark / Lp-Reg-dev
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆41Updated 3 weeks ago
Alternatives and similar repositories for Lp-Reg-dev
Users who are interested in Lp-Reg-dev are comparing it to the libraries listed below.
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆41Updated 9 months ago
- ☆62Updated last year
- Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering☆42Updated last month
- Voice-to-motion aerial robot using ESP32-S3, Node.js, Deepgram, ChatGPT, and Arduino.☆30Updated 5 months ago
- This search engine leverages the Boost library for efficient document search, featuring data preprocessing, index creation, and advanced …☆59Updated last year
- Training and evaluation code of EGTLM model.☆22Updated last year
- ☆41Updated 9 months ago
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆21Updated last year
- ☆104Updated 10 months ago
- ☆43Updated 2 years ago
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆32Updated last year
- Low-code core component: an implementation of the data model☆56Updated last year
- ☆98Updated 9 months ago
- ☆49Updated 2 years ago
- A PyTorch quantization tool for machine learning models☆78Updated 9 months ago
- This script monitors the remaining traffic of VMs on Vultr, DigitalOcean, and Linode. If the remaining traffic is zero, it shuts down the…☆33Updated last year
- mobile predict☆25Updated last year
- FreeSwap Smart Contracts☆28Updated last year
- Store and download PseudoMeta R Package☆28Updated 5 months ago
- ☆57Updated last year
- ☆59Updated last year
- License plate detection and recognition using RPN with FPN and CRNN☆26Updated 10 months ago
- ☆33Updated 2 years ago
- A fun demo snake game created in https://aide.ink☆66Updated 10 months ago
- HACAN: Hybrid Attention-Driven Cross-Layer Alignment Network for Image-Text Retrieval☆79Updated 7 months ago
- An iterative optimization system☆33Updated 5 months ago
- ☆12Updated 9 months ago
- `cryptor` is a Go package for secure encryption and decryption using NaCl's `secretbox` from `golang.org/x/crypto`☆60Updated 6 months ago
- Enable Agents to conduct web3 operations, support wider applications for cross-chain bridging☆43Updated 11 months ago
- A chatbot with a UI, built on a set of datasets (which can be replaced). Through statistical analysis and PINN model, i…☆27Updated 6 months ago