robertvacareanu / llm4regression
Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their context, without any parameter update
☆130Updated 4 months ago
Alternatives and similar repositories for llm4regression:
Users that are interested in llm4regression are comparing it to the libraries listed below
- ☆76Updated 7 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆183Updated 8 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 5 months ago
- The first dense retrieval model that can be prompted like an LM☆64Updated 4 months ago
- ☆48Updated 2 months ago
- ☆112Updated 5 months ago
- Set of scripts to finetune LLMs☆36Updated 10 months ago
- Functional Benchmarks and the Reasoning Gap☆82Updated 3 months ago
- ☆40Updated 8 months ago
- Code for NeurIPS LLM Efficiency Challenge☆54Updated 9 months ago
- ☆67Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought"☆89Updated last week
- code for training & evaluating Contextual Document Embedding models☆166Updated 2 weeks ago
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 4 months ago
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆103Updated 4 months ago
- ☆87Updated 11 months ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated last year
- Evaluating LLMs with CommonGen-Lite☆88Updated 10 months ago
- Evaluation of neuro-symbolic engines☆34Updated 5 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆100Updated last month
- ☆30Updated 6 months ago
- An introduction to LLM Sampling☆75Updated last month
- ☆108Updated 5 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- ☆110Updated 4 months ago
- Just a bunch of benchmark logs for different LLMs☆117Updated 6 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆158Updated 2 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆46Updated last year