TheAgentCompany / experimentsLinks
Open sourced result for The Agent Company
☆22Updated 2 months ago
Alternatives and similar repositories for experiments
Users that are interested in experiments are comparing it to the libraries listed below
Sorting:
- Workshop for Model Context Protocol☆18Updated 9 months ago
- A walk through HuggingFace smolagents☆35Updated 10 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆23Updated last year
- ☆30Updated last year
- Lego for GRPO☆30Updated 7 months ago
- Test your local LLMs on the AIME problems☆31Updated 7 months ago
- Modified Beam Search with periodical restart☆12Updated last year
- OpenPipe Reinforcement Learning Experiments☆32Updated 9 months ago
- ☆86Updated last year
- ☆25Updated 2 months ago
- Run LLMs on Replicate with vLLM☆26Updated 5 months ago
- ☆29Updated 7 months ago
- ☆55Updated last year
- LLM reads a paper and produce a working prototype☆60Updated 9 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆66Updated last year
- ☆63Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 11 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆115Updated 9 months ago
- ☆20Updated last year
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆37Updated last year
- Python library for Entities, relationships and schemas extraction from documents☆45Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Updated last week
- Verifiers for LLM Reinforcement Learning☆80Updated 4 months ago
- This repository contains expert evaluation interface and data evaluation script for the OpenScholar project.☆29Updated last year
- What Would Portland Do? Generative agent experience☆13Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- Curated resources about automated GUI computer-use via LLMs. Highly opinionated, focus is on quality vs quantity.☆23Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 2 months ago
- Clue inspired puzzles for testing LLM deduction abilities☆45Updated 9 months ago
- smolbox of recipies☆29Updated 8 months ago