google / langfun

OO for LLMs

☆835

Alternatives and similar repositories for langfun

Users who are interested in langfun are comparing it to the libraries listed below.

microsoft / sammo
A library for prompt engineering and optimization (SAMMO = Structure-aware Multi-Objective Metaprompt Optimization)
☆716 Updated last month
microsoft / Trace
End-to-end Generative Optimization for AI Agents
☆631 Updated last month
TheAgentCompany / TheAgentCompany
An agent benchmark with tasks in a simulated software company.
☆509 Updated last week
vndee / llm-sandbox
Lightweight and portable LLM sandbox runtime (code interpreter) Python library.
☆421 Updated this week
sierra-research / tau-bench
Code and Data for Tau-Bench
☆713 Updated 3 weeks ago
WecoAI / aideml
AIDE: AI-Driven Exploration in the Space of Code. The machine learning engineering agent that automates AI R&D.
☆972 Updated last week
OpenAutoCoder / Agentless
Agentless🐱: an agentless approach to automatically solve software development problems
☆1,846 Updated 7 months ago
ServiceNow / TapeAgents
TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle
☆288 Updated last week
ShengranHu / ADAS
[ICLR 2025] Automated Design of Agentic Systems
☆1,395 Updated 6 months ago
ganarajpr / awesome-dspy
An Awesome list of curated DSPy resources.
☆390 Updated 5 months ago
SalesforceAIResearch / xLAM
xLAM: A Family of Large Action Models to Empower AI Agent Systems
☆513 Updated this week
haizelabs / verdict
Inference-time scaling for LLMs-as-a-judge.
☆267 Updated 3 weeks ago
Holmeswww / AgentKit
An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natur…
☆454 Updated 7 months ago
SalesforceAIResearch / AgentLite
☆618 Updated 6 months ago
openai / mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
☆823 Updated last month
open-thought / system-2-research
System 2 Reasoning Link Collection
☆849 Updated 4 months ago
ServiceNow / AgentLab
AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…
☆372 Updated this week
harishsg993010 / LLM-Research-Scripts
☆434 Updated 10 months ago
MotleyAI / motleycrew
Flexible and powerful multi-agent AI framework
☆374 Updated this week
bespokelabsai / curator
Synthetic data curation for post-training and structured data extraction
☆1,468 Updated last week
NousResearch / Open-Reasoning-Tasks
A comprehensive repository of reasoning tasks for LLMs (and beyond)
☆448 Updated 10 months ago
agent-husky / Husky-v1
Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …
☆345 Updated last year
datadreamer-dev / DataDreamer
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
☆1,041 Updated 6 months ago
simbianai / taskgen
Task-based Agentic Framework using StrictJSON as the core
☆455 Updated 2 weeks ago
trotsky1997 / MathBlackBox
☆1,028 Updated 7 months ago
prometheus-eval / prometheus-eval
Evaluate your LLM's response with Prometheus and GPT4 💯
☆978 Updated 3 months ago
facebookresearch / MLGym
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
☆538 Updated last week
togethercomputer / open_deep_research
Together Open Deep Research
☆331 Updated 3 months ago
character-ai / prompt-poet
Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.
☆1,093 Updated 2 weeks ago
AnswerDotAI / rerankers
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,505 Updated 2 months ago