robertvacareanu / llm4regressionLinks
Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their context, without any parameter update
☆154Updated last year
Alternatives and similar repositories for llm4regression
Users that are interested in llm4regression are comparing it to the libraries listed below
Sorting:
- ☆88Updated last year
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆114Updated last year
- Train your own SOTA deductive reasoning model☆106Updated 6 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆120Updated 7 months ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆110Updated 9 months ago
- ☆43Updated 10 months ago
- Evaluation of neuro-symbolic engines☆39Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- ☆69Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆103Updated last year
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 9 months ago
- ☆54Updated 10 months ago
- Evaluating LLMs with CommonGen-Lite☆91Updated last year
- ☆229Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 11 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated 7 months ago
- Simple GRPO scripts and configurations.☆59Updated 7 months ago
- An introduction to LLM Sampling☆79Updated 9 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- Functional Benchmarks and the Reasoning Gap☆88Updated 11 months ago
- Code for ExploreTom☆86Updated 2 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆129Updated last week
- ☆145Updated last year
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆49Updated 4 months ago
- A library for benchmarking the Long Term Memory and Continual learning capabilities of LLM based agents. With all the tests and code you…☆78Updated 9 months ago
- ☆39Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- code for training & evaluating Contextual Document Embedding models☆197Updated 4 months ago