robertvacareanu / llm4regressionLinks
Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their context, without any parameter update
☆153Updated 10 months ago
Alternatives and similar repositories for llm4regression
Users that are interested in llm4regression are comparing it to the libraries listed below
Sorting:
- ☆87Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆118Updated 5 months ago
- Simple GRPO scripts and configurations.☆59Updated 6 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆72Updated 8 months ago
- Automating enterprise workflows with multimodal agents☆108Updated 10 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆105Updated 7 months ago
- Code for NeurIPS LLM Efficiency Challenge☆59Updated last year
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch 😊☆129Updated last week
- Train your own SOTA deductive reasoning model☆103Updated 5 months ago
- An introduction to LLM Sampling☆79Updated 7 months ago
- Functional Benchmarks and the Reasoning Gap☆88Updated 10 months ago
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆112Updated 10 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- ☆226Updated this week
- ☆79Updated last year
- ☆145Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- Code for ExploreTom☆84Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 10 months ago
- ☆53Updated 9 months ago
- ☆69Updated 11 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆45Updated 3 months ago
- ☆43Updated 9 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- ☆34Updated last week
- Score LLM pretraining data with classifiers☆55Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 6 months ago
- ☆38Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆102Updated last year