awslabs / extending-the-context-length-of-open-source-llms
☆51Updated last month
Alternatives and similar repositories for extending-the-context-length-of-open-source-llms:
Users that are interested in extending-the-context-length-of-open-source-llms are comparing it to the libraries listed below
- Writing Blog Posts with Generative Feedback Loops!☆47Updated 10 months ago
- Just a bunch of benchmark logs for different LLMs☆117Updated 6 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 4 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset☆51Updated 2 months ago
- The first dense retrieval model that can be prompted like an LM☆64Updated 4 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated 9 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆38Updated 10 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated 10 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- Tools for formatting large language model prompts.☆12Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆74Updated 3 months ago
- ☆76Updated 7 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- ☆19Updated 2 months ago
- ☆27Updated 2 months ago
- ☆24Updated last year
- A framework for evaluating function calls made by LLMs☆36Updated 6 months ago
- ☆18Updated 3 months ago
- ☆74Updated last year
- Generalist and Lightweight Model for Text Classification☆59Updated last week
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆99Updated last year
- Evaluating tool-augmented LLMs in conversation settings☆76Updated 7 months ago
- ☆48Updated 2 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆66Updated 3 months ago
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆52Updated 11 months ago
- ☆20Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆100Updated last month
- Code for NeurIPS LLM Efficiency Challenge☆54Updated 9 months ago