AIRI-Institute / AriGraph
☆61Updated last week
Related projects: ⓘ
- ☆130Updated last week
- AWM: Agent Workflow Memory☆121Updated last week
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents☆102Updated 3 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆57Updated 3 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆73Updated 2 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆70Updated last month
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals".☆62Updated 2 months ago
- ☆61Updated 2 months ago
- ☆74Updated 9 months ago
- ☆111Updated 3 months ago
- CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/☆167Updated this week
- ☆75Updated 3 weeks ago
- ☆242Updated 2 weeks ago
- An implemtation of Everyting of Thoughts (XoT).☆114Updated 7 months ago
- Evaluating LLMs with CommonGen-Lite☆83Updated 6 months ago
- ☆90Updated last month
- Beating the GAIA benchmark with Transformers Agents. 🚀☆56Updated 2 weeks ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆41Updated 2 months ago
- Automating enterprise workflows with multimodal agents☆83Updated last month
- General multi-task deep RL Agent☆158Updated 3 months ago
- Code for the paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆140Updated 3 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆115Updated 3 months ago
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆106Updated last week
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 2 months ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆53Updated 3 months ago
- Code for the paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆30Updated 3 months ago
- RAFT, or Retrieval-Augmented Fine-Tuning, is a method comprising of a fine-tuning and a RAG-based retrieval phase. It is particularly sui…☆60Updated 3 weeks ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems.☆48Updated 3 weeks ago
- ☆68Updated 2 months ago