HishamAlyahya / semantic_backprop
Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems"
☆70 Updated 6 months ago
Alternatives and similar repositories for semantic_backprop
Users that are interested in semantic_backprop are comparing it to the libraries listed below
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆71 Updated this week
- Code for ExploreTom ☆84 Updated 6 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆91 Updated 2 months ago
- accompanying material for sleep-time compute paper ☆95 Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆53 Updated 4 months ago
- ☆69 Updated 4 months ago
- Train your own SOTA deductive reasoning model ☆94 Updated 3 months ago
- Code for our paper PAPILLON: PrivAcy Preservation from Internet-based and Local Language MOdel ENsembles ☆35 Updated last month
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆115 Updated 4 months ago
- Set of scripts to finetune LLMs ☆37 Updated last year
- ☆61 Updated 3 weeks ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆104 Updated 6 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆57 Updated 2 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 Updated 9 months ago
- ☆51 Updated 7 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆131 Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆173 Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM ☆73 Updated last month
- ☆115 Updated 4 months ago
- ☆47 Updated last year
- ☆50 Updated 3 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 11 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆81 Updated 8 months ago
- ☆92 Updated 3 months ago
- Official Repo for CRMArena and CRMArena-Pro ☆92 Updated last week
- ☆124 Updated 2 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 5 months ago
- Automatic Prompt Optimization ☆38 Updated last year
- Simple GRPO scripts and configurations. ☆58 Updated 4 months ago
- ☆127 Updated 3 months ago