TheAgentCompany / experiments
Open-sourced results for The Agent Company
☆22 Updated 2 months ago
Alternatives and similar repositories for experiments
Users that are interested in experiments are comparing it to the libraries listed below
- Workshop for Model Context Protocol ☆18 Updated 10 months ago
- OpenPipe Reinforcement Learning Experiments ☆32 Updated 10 months ago
- Test your local LLMs on the AIME problems ☆32 Updated 8 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆76 Updated last year
- Lego for GRPO ☆30 Updated 8 months ago
- Modified Beam Search with periodical restart ☆12 Updated last year
- A walk through HuggingFace smolagents ☆48 Updated 11 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆37 Updated 8 months ago
- ☆30 Updated last year
- The original Shared Recurrent Memory Transformer implementation ☆33 Updated 6 months ago
- ☆27 Updated 5 months ago
- ☆26 Updated 3 months ago
- Clue inspired puzzles for testing LLM deduction abilities ☆45 Updated 10 months ago
- A JAX Research Toolkit for Visualizing, Manipulating, and Understanding Gemma Models with Multi-modal Support based on Penzai. ☆89 Updated 3 weeks ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆35 Updated 9 months ago
- LLM reads a paper and produces a working prototype ☆60 Updated 9 months ago
- ☆87 Updated last year
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind ☆59 Updated 8 months ago
- Very minimal (and stateless) agent framework ☆44 Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale … ☆20 Updated 3 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆112 Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 Updated 3 months ago
- ☆14 Updated 7 months ago
- ☆56 Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆63 Updated 4 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models ☆46 Updated 6 months ago
- ☆29 Updated 8 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆125 Updated 6 months ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan… ☆29 Updated 10 months ago
- ☆20 Updated 6 months ago