phunterlau / paper_without_codeLinks
LLM reads a paper and produce a working prototype
☆57Updated 6 months ago
Alternatives and similar repositories for paper_without_code
Users that are interested in paper_without_code are comparing it to the libraries listed below
Sorting:
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 9 months ago
- ☆40Updated 10 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 10 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆111Updated 7 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 10 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆88Updated last week
- ☆51Updated last year
- ☆55Updated last year
- Score LLM pretraining data with classifiers☆54Updated 2 years ago
- The Library for LLM-based multi-agent applications☆91Updated 3 months ago
- Training setup for Langchain's Open Deep Research☆69Updated 2 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆121Updated 9 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆92Updated last month
- Very minimal (and stateless) agent framework☆45Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated 3 weeks ago
- ☆60Updated 4 months ago
- ☆61Updated 11 months ago
- ☆79Updated this week
- ☆84Updated last year
- Automatic Prompt Optimization☆47Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆44Updated last year
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆102Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆48Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 6 months ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆35Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆100Updated 2 weeks ago
- Simple Graph Memory for AI applications☆89Updated 5 months ago