robertvacareanu / llm4regression
Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their context, without any parameter update
☆115Updated last week
Related projects: ⓘ
- Automating enterprise workflows with multimodal agents☆83Updated last month
- Functional Benchmarks and the Reasoning Gap☆74Updated last month
- ☆85Updated 7 months ago
- Evaluating LLMs with CommonGen-Lite☆83Updated 5 months ago
- Official implementation of MetaTree: Learning a Decision Tree Algorithm with Transformers☆97Updated this week
- Codebase accompanying the Summary of a Haystack paper.☆65Updated 2 months ago
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆49Updated 3 weeks ago
- Attribute (or cite) statements generated by LLMs back to in-context information.☆107Updated 2 weeks ago
- Repository for the paper Stream of Search: Learning to Search in Language☆70Updated last month
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated 8 months ago
- Set of scripts to finetune LLMs☆36Updated 5 months ago
- Chat Markup Language conversation library☆53Updated 8 months ago
- RAFT, or Retrieval-Augmented Fine-Tuning, is a method comprising of a fine-tuning and a RAG-based retrieval phase. It is particularly sui…☆60Updated 2 weeks ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆73Updated 6 months ago
- ☆91Updated last month
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆93Updated 5 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- ☆71Updated 3 months ago
- ☆75Updated 3 weeks ago
- An automated tool for discovering insights from research papaer corpora☆131Updated 3 months ago
- This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.☆99Updated 4 months ago
- ☆68Updated last month
- AWM: Agent Workflow Memory☆121Updated this week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems.☆48Updated 3 weeks ago
- ☆130Updated last week
- ☆50Updated 2 months ago
- Mixtral finetuning☆19Updated 7 months ago
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆81Updated this week
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆57Updated 7 months ago