ArticulateAI / Agent-First-Organization
β24Updated this week
Alternatives and similar repositories for Agent-First-Organization:
Users that are interested in Agent-First-Organization are comparing it to the libraries listed below
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" π€β36Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 6 months ago
- CRMArena: Understanding the Capacity of LLM Agents to Perform Professional CRM Tasks in Realistic Environmentsβ40Updated 2 months ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuningβ43Updated last year
- Codebase accompanying the Summary of a Haystack paper.β75Updated 3 months ago
- Writing Blog Posts with Generative Feedback Loops!β45Updated 9 months ago
- β44Updated 7 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".β66Updated 6 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β73Updated 2 months ago
- β30Updated 6 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"β97Updated 3 months ago
- β47Updated last month
- β43Updated 3 months ago
- β36Updated 3 weeks ago
- β46Updated 2 months ago
- Automating enterprise workflows with multimodal agentsβ97Updated 3 months ago
- β24Updated last year
- Beating the GAIA benchmark with Transformers Agents. πβ75Updated 2 months ago
- β29Updated 9 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ87Updated 3 weeks ago
- Explore the use of DSPy for extracting features from PDFs πβ37Updated 10 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ53Updated 4 months ago
- Automatic Evals for Instruction-Tuned Modelsβ100Updated this week
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"β77Updated 2 months ago
- Universal text classifier for generative modelsβ21Updated 5 months ago
- Testing and evaluation framework for voice agentsβ82Updated last month
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"β118Updated 4 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"β49Updated 3 months ago
- β19Updated 2 months ago