TheAgentCompany / experiments
Open sourced result for The Agent Company
☆18Updated last week
Alternatives and similar repositories for experiments
Users who are interested in experiments are comparing it to the repositories listed below
- Workshop for Model Context Protocol☆18Updated 4 months ago
- A walk through HuggingFace smolagents☆28Updated 5 months ago
- Test your local LLMs on the AIME problems☆32Updated 2 months ago
- OpenPipe Reinforcement Learning Experiments☆30Updated 4 months ago
- Modified Beam Search with periodical restart☆12Updated 11 months ago
- What Would Portland Do? Generative agent experience☆13Updated last year
- ☆21Updated last month
- ☆27Updated 2 months ago
- LLM reads a paper and produces a working prototype☆57Updated 4 months ago
- ☆28Updated 11 months ago
- Lego for GRPO☆28Updated 2 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 8 months ago
- Clue inspired puzzles for testing LLM deduction abilities☆40Updated 4 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 6 months ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆33Updated last year
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆21Updated last month
- Code for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆10Updated 8 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆12Updated last month
- An open-source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆104Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆82Updated this week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆146Updated 5 months ago
- The repository for the paper "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"☆12Updated 7 months ago
- Example implementation of Iteration of Thought - Give a star if you like the project☆42Updated 7 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" using PyTorch and Zeta☆13Updated 9 months ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆39Updated 3 weeks ago
- The original Shared Recurrent Memory Transformer implementation☆30Updated last month
- ☆17Updated 3 months ago
- ☆76Updated this week
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆56Updated 2 months ago