tsinghua-fib-lab / SmartAgent
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
☆24Updated this week
Alternatives and similar repositories for SmartAgent:
Users that are interested in SmartAgent are comparing it to the libraries listed below
- ☆15Updated 7 months ago
- ☆18Updated 4 months ago
- ☆20Updated 8 months ago
- Self-Supervised Alignment with Mutual Information☆16Updated 9 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆36Updated last year
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents☆28Updated last month
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆41Updated 4 months ago
- This is a unified platform for performing prompting engineering in large language models (LLMs).☆12Updated last month
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 6 months ago
- ☆18Updated 4 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆16Updated 8 months ago
- ☆22Updated 5 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆23Updated 3 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆29Updated last year
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models☆18Updated 3 months ago
- ☆38Updated 4 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆55Updated 4 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆41Updated 4 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆44Updated 2 months ago
- [NAACL 25 main] Awesome LLM Causal Reasoning is a collection of LLM-based casual reasoning works, including papers, codes and datasets.☆42Updated 3 weeks ago
- ☆23Updated 9 months ago
- ☆20Updated 4 months ago
- ☆35Updated last week
- The code of arXiv paper: "Dynamic Scaling of Unit Tests for Code Reward Modeling"☆16Updated 2 months ago
- Dateset Reset Policy Optimization☆30Updated 11 months ago
- exploring whether LLMs perform case-based or rule-based reasoning☆28Updated last year