CarlanLark / Lp-Reg-devLinks
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
☆42Updated 2 months ago
Alternatives and similar repositories for Lp-Reg-dev
Users that are interested in Lp-Reg-dev are comparing it to the libraries listed below
Sorting:
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆41Updated 11 months ago
- ☆62Updated last year
- This search engine leverages the Boost library for efficient document search, featuring data preprocessing, index creation, and advanced …☆59Updated last year
- ☆104Updated 11 months ago
- Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering☆42Updated 2 months ago
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆21Updated last year
- Modular multi-agent orchestration framework powered by LangGraph and FastAPI.☆26Updated 2 months ago
- a demo but fun snake game created in https://aide.ink☆66Updated last year
- ☆28Updated 8 months ago
- ☆43Updated 2 years ago
- Training and evaluation code of EGTLM model.☆22Updated last year
- A PyTorch quantization tool for machine learning models☆78Updated 10 months ago
- ☆41Updated 10 months ago
- 强化学习-大语言模型☆68Updated 7 months ago
- 低代码核心组件:数据模型的实现☆56Updated last year
- ☆59Updated last year
- ☆12Updated 10 months ago
- [NeurIPS 25 @ ER] Long-Context Modeling with Dynamic Hierarchical Sparse Attention for On-Device LLMs☆73Updated 2 months ago
- ☆49Updated 2 years ago
- Store and download PseudoMeta R Package☆28Updated 6 months ago
- HACAN: Hybrid Attention-Driven Cross-Layer Alignment Network for Image-Text Retrieval☆79Updated 8 months ago
- A Chatbot with UI design is created, according to some certain datasets (can be replaced). Through statistical analysis and PINN model, i…☆27Updated 7 months ago
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆32Updated last year
- A system demo based on Retrival Argument Generation to answer buddism question☆84Updated last year
- Voice-to-motion aerial robot using ESP32-S3, Node.js, Deepgram, ChatGPT, and Arduino.☆30Updated 6 months ago
- A 3D game involves melee combat and parkour system based on UE5.☆28Updated last year
- ☆81Updated 2 months ago
- ☆57Updated last year
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning☆33Updated 9 months ago
- ☆98Updated 10 months ago