robertvacareanu / llm4regression
Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their context, without any parameter updates.
☆160 Updated 4 months ago
Alternatives and similar repositories for llm4regression
Users who are interested in llm4regression are comparing it to the libraries listed below.
- Simple GRPO scripts and configurations. ☆59 Updated last year
- ☆86 Updated 2 years ago
- ☆68 Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊 ☆137 Updated last week
- Codebase accompanying the Summary of a Haystack paper. ☆80 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆112 Updated last year
- Automating enterprise workflows with multimodal agents ☆115 Updated last year
- ☆239 Updated 2 months ago
- ☆80 Updated last year
- ☆120 Updated last year
- ☆37 Updated last year
- ☆210 Updated 7 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆76 Updated last year
- Functional Benchmarks and the Reasoning Gap ☆89 Updated last year
- Train your own SOTA deductive reasoning model ☆107 Updated 11 months ago
- Code for NeurIPS LLM Efficiency Challenge ☆60 Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆128 Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆108 Updated 4 months ago
- ☆62 Updated 2 years ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆103 Updated 2 years ago
- ☆137 Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR. ☆112 Updated 9 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆277 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆51 Updated last year
- ☆125 Updated last year
- Evaluating LLMs with CommonGen-Lite ☆94 Updated last year
- An introduction to LLM Sampling ☆79 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆119 Updated last year
- Evaluation of neuro-symbolic engines ☆41 Updated last year
- Interpret text data with LLMs (sklearn compatible). ☆176 Updated 2 weeks ago