epfl-dlab / forc

Framework for Cost-Effective Language Model Choice

☆13

Alternatives and similar repositories for forc

Users who are interested in forc are comparing it to the libraries listed below.

OSU-NLP-Group / Middleware
Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)
☆37 Updated 6 months ago
ZonglinY / MOOSE
[ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …
☆42 Updated 8 months ago
dinobby / MAGDi
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆35 Updated last year
MurongYue / LLM_MoT_cascade
This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…
☆24 Updated last year
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42 Updated last year
scandukuri / assistant-gate
☆25 Updated last year
xxxiaol / QRData
Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data
☆41 Updated 4 months ago
GAIR-NLP / OlympicArena
[NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
☆102 Updated 4 months ago
abhika-m / FAVA
☆72 Updated last year
nuochenpku / LLaMA_Analysis
This is the official project for our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
☆30 Updated last year
allenai / super-benchmark
☆45 Updated 3 months ago
MLE-Dojo / MLE-Dojo
☆54 Updated 2 weeks ago
alonj / Same-Task-More-Tokens
The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models"
☆54 Updated last year
CarperAI / autocrit
A repository for transformer critique learning and generation
☆90 Updated last year
OSU-NLP-Group / llm-planning-eval
[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"
☆54 Updated last year
orionw / FollowIR
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆45 Updated last year
DAMO-NLP-SG / CaRing
Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs
☆36 Updated last year
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29 Updated 5 months ago
allenai / marg-reviewer
Code/data for MARG (multi-agent review generation)
☆44 Updated 7 months ago
Strong-AI-Lab / Logical-and-abstract-reasoning
Evaluation on Logical Reasoning and Abstract Reasoning Challenges
☆28 Updated 2 months ago
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆100 Updated last month
csitfun / LogiCoT
The instructions and demonstrations for building a GLM capable of formal logical reasoning
☆53 Updated 10 months ago
chenhongqiao / ToolDec
Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding
☆27 Updated last year
princeton-nlp / LM-Science-Tutor
☆43 Updated 11 months ago
msclar / symbolictom
☆21 Updated last year
QingruZhang / PASTA
PASTA: Post-hoc Attention Steering for LLMs
☆120 Updated 7 months ago
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆36 Updated last year
LiqiangJing / DSBench
DSBench: How Far are Data Science Agents from Becoming Data Science Experts?
☆58 Updated 4 months ago
oriyor / reasoning-on-cots
Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"
☆96 Updated last year
tianyang-x / SaySelf
Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"
☆107 Updated 9 months ago