alextamkin / generative-elicitationLinks
☆132Updated last year
Alternatives and similar repositories for generative-elicitation
Users that are interested in generative-elicitation are comparing it to the libraries listed below
Sorting:
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- ☆65Updated last year
- ☆187Updated last year
- FireAct: Toward Language Agent Fine-tuning☆282Updated last year
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆366Updated last year
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆216Updated 2 months ago
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆301Updated 2 years ago
- The next generation of Multi-Modal Multi-Agent platform.☆106Updated 3 months ago
- 🍎APPL: A Prompt Programming Language. Seamlessly integrate LLMs with programs.☆259Updated 6 months ago
- ☆92Updated last year
- Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors (ACL Findings 2025)☆83Updated 2 months ago
- connecting humans and agents☆88Updated 8 months ago
- [ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".☆228Updated last year
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆56Updated last year
- Multimodal computer agent data collection program☆145Updated last year
- Official repo of Respond-and-Respond: data, code, and evaluation☆103Updated last year
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆229Updated 2 years ago
- Langchain implementation of HuggingGPT☆133Updated 2 years ago
- LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.☆164Updated 3 months ago
- ☆183Updated 7 months ago
- Reasoning by Communicating with Agents☆29Updated 4 months ago
- Deep Reasoning Translation (DRT) Project☆227Updated 3 months ago
- Official implementation for "OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities" (keep updating)☆59Updated last year
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆82Updated 3 months ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!☆50Updated 4 months ago
- ☆320Updated 11 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆86Updated last year
- [NeurlPS D&B 2024] Generative AI for Math: MathPile☆416Updated 4 months ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆219Updated 2 years ago
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?☆129Updated last year