pettingllms-ai / PettingLLMsView external linksLinks
[ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system
☆105Feb 3, 2026Updated last week
Alternatives and similar repositories for PettingLLMs
Users that are interested in PettingLLMs are comparing it to the libraries listed below
Sorting:
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆64Sep 13, 2025Updated 5 months ago
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆25Updated this week
- Platform API Project seed☆12Nov 8, 2023Updated 2 years ago
- ☆32May 24, 2025Updated 8 months ago
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆171Dec 25, 2025Updated last month
- ☆14Nov 19, 2024Updated last year
- the datasets of our paper☆11Feb 26, 2024Updated last year
- A relatively simple, unified method for reporting on Kubernetes resource issues.☆12Mar 5, 2020Updated 5 years ago
- Automatic Thief Detection via CCTV with Alarm System and Perpetrator Image Capture using YOLOv5 + ROI. This project utilizes computer vis…☆14Oct 21, 2024Updated last year
- ☆13Nov 5, 2024Updated last year
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization☆12Dec 3, 2024Updated last year
- Application for Agent re-engineering for better and reliable Gen AI workflows.☆10Jul 20, 2025Updated 6 months ago
- Evaluation of Oasis Platform - simple install, UI and API☆14Feb 9, 2026Updated last week
- Reinforced Multi-LLM Agents training☆70Jan 18, 2026Updated 3 weeks ago
- ☆53Feb 19, 2025Updated 11 months ago
- AI_Powered_Dev_Search_Engine☆12Mar 10, 2024Updated last year
- Automaton & Cognition☆16Apr 14, 2024Updated last year
- A Bunyan stream to send events to Seq☆11May 7, 2025Updated 9 months ago
- This repository contains the source code for the cloud.gov.au website.☆12Dec 7, 2022Updated 3 years ago
- Effortlessly process invoices with AI! This project uses the Llama3.2 Vision Model for OCR, converting invoice images into structured, ma…☆10Feb 5, 2025Updated last year
- 🇰🇷 Korean LLM Datasets | Pre-training, SFT, DPO, RLHF, CoT | 한국어 LLM 데이터셋 큐레이션☆31Jan 20, 2026Updated 3 weeks ago
- Chain-of-thought 방식을 활용하여 llama2를 fine-tuning☆10Nov 18, 2023Updated 2 years ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 8 months ago
- A tool to explore ideas generated from artificial intelligence chats.☆10Apr 3, 2023Updated 2 years ago
- 사용자인증 API서비스☆10Apr 21, 2021Updated 4 years ago
- Talk to your shell in natural language. Locally.☆54Updated this week
- ☆11Aug 15, 2024Updated last year
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 4 months ago
- LLM as World Models using Bayesian inference☆16May 27, 2025Updated 8 months ago
- This repository contains reference implementation for multi-LLM ToM paper (accepted to EMNLP 2023), Theory of Mind for Multi-Agent Collab…☆18Jun 11, 2024Updated last year
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling☆11May 5, 2025Updated 9 months ago
- ☆34Feb 4, 2026Updated last week
- Automate Checkmarx Scanning and Onboarding Plus AWS Access☆12Jan 5, 2023Updated 3 years ago
- A modular, agentic-AI-based adaptive cybersecurity architecture for digital ecosystems. Combines Zero Trust, real-time telemetry, and int…☆21Jul 4, 2025Updated 7 months ago
- Amazon Bedrock 의 Nova, Claude 3.7 모델을 활용하여 pdf 도면을 파싱 합니다.☆12May 19, 2025Updated 8 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆175Sep 18, 2025Updated 4 months ago
- Official Implementation of ReALFRED (ECCV'24)☆44Oct 11, 2024Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆30Jan 25, 2026Updated 3 weeks ago