RDI-Foundation / agentbeats-tutorial
☆75Updated 3 weeks ago
Alternatives and similar repositories for agentbeats-tutorial
Users who are interested in agentbeats-tutorial are comparing it to the libraries listed below
- An interface library for RL post training with environments.☆1,112Updated this week
- open source interpretability platform 🧠☆689Updated this week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆496Updated 5 months ago
- large population models☆567Updated this week
- CodeScientist: An automated scientific discovery system for code-based experiments☆310Updated 2 months ago
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆251Updated 3 weeks ago
- An agent benchmark with tasks in a simulated software company.☆635Updated 2 months ago
- Course Materials for Interpretability of Large Language Models (0368.4264) at Tel Aviv University☆297Updated 3 weeks ago
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents☆583Updated 5 months ago
- (ACL 2025 Main) Code for MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents https://www.arxiv.org/pdf/2503.019…☆213Updated 3 months ago
- ☆696Updated 9 months ago
- Persona Vectors: Monitoring and Controlling Character Traits in Language Models☆344Updated 6 months ago
- An open-source tool for LLM prompt optimization.☆759Updated last week
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆129Updated 2 years ago
- Tool for generating high-quality synthetic datasets☆1,484Updated 3 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last month
- ⏰ AI conference deadline countdowns☆320Updated last week
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆126Updated 3 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆176Updated 2 weeks ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆426Updated last month
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆313Updated this week
- UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection☆1,106Updated this week
- ☆328Updated 6 months ago
- ☆281Updated 9 months ago
- Automatic evals for LLMs☆578Updated last month
- An alignment auditing agent capable of quickly exploring alignment hypotheses☆874Updated this week
- On the Theoretical Limitations of Embedding-Based Retrieval☆622Updated 4 months ago
- Repo for "Adaptation of Agentic AI"☆585Updated 2 weeks ago
- ☆270Updated 7 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆1,295Updated 3 weeks ago