Arize-ai/LLMTest_NeedleInAHaystack

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Arize-ai/LLMTest_NeedleInAHaystack)

Arize-ai / LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

☆110

Alternatives and similar repositories for LLMTest_NeedleInAHaystack

Users that are interested in LLMTest_NeedleInAHaystack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gkamradt / needle-in-a-haystack
View on GitHub
Doing simple retrieval from LLM models at various context lengths to measure accuracy
☆2,355Jun 8, 2026Updated last month
jxnl / instructor-classify
View on GitHub
☆37May 5, 2025Updated last year
Muhtasham / summarization-eval
View on GitHub
📝 Reference-Free automatic summarization evaluation with potential hallucination detection
☆104Jan 15, 2024Updated 2 years ago
g588928812 / qlora
View on GitHub
QLoRA: Efficient Finetuning of Quantized LLMs
☆11Jul 22, 2023Updated 3 years ago
kubernetes-bad / reward-composer
View on GitHub
Lego for GRPO
☆30May 27, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ivanleomk / modal-grpo
View on GitHub
☆19Mar 16, 2025Updated last year
JoelNiklaus / LegalDatasets
View on GitHub
This repository serves as a collection of scrapers procuring and structuring various legal datasets
☆19Jun 16, 2023Updated 3 years ago
RamVegiraju / GenAI-Samples
View on GitHub
GenAI Examples
☆16Dec 13, 2024Updated last year
salesforce / summary-of-a-haystack
View on GitHub
Codebase accompanying the Summary of a Haystack paper.
☆82Jun 25, 2026Updated last month
strangeloopcanon / ReflectGPT
View on GitHub
Add ability to interrupt own message
☆14Apr 21, 2024Updated 2 years ago
shoggoth13 / aityping
View on GitHub
☆13Jul 16, 2023Updated 3 years ago
ArmelRandy / tree-of-problems
View on GitHub
[EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality
☆20Mar 4, 2025Updated last year
mobarski / aidapter
View on GitHub
Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)
☆20Sep 21, 2023Updated 2 years ago
som-shahlab / med-nota
View on GitHub
☆15Jun 11, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
axiomic-ai / axiomic
View on GitHub
Creating Generative AI Apps which work
☆17Apr 14, 2025Updated last year
elsatch / daily_hf_papers_abstracts
View on GitHub
This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file
☆16Jul 26, 2024Updated 2 years ago
AnswerDotAI / toolslm
View on GitHub
Tools to make language models a bit easier to use
☆67Updated this week
davanstrien / data-for-fine-tuning-llms
View on GitHub
☆80Jun 5, 2024Updated 2 years ago
ari-holtzman / newformer
View on GitHub
☆16Jul 20, 2023Updated 3 years ago
yxtay / python-project-template
View on GitHub
Starter template for python projects
☆18Feb 15, 2024Updated 2 years ago
uq-project / UQ
View on GitHub
UQ: Assessing Language Models on Unsolved Questions
☆30Aug 26, 2025Updated 11 months ago
dm4ml / motion
View on GitHub
Framework for building and maintaining self-updating prompts for LLMs
☆65Jun 9, 2024Updated 2 years ago
rrtucci / mappa_mundi
View on GitHub
Causal DAG Extraction from Text (DEFT)
☆66Jan 11, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
weaviate-tutorials / Hurricane
View on GitHub
Writing Blog Posts with Generative Feedback Loops!
☆52Mar 19, 2024Updated 2 years ago
hanneshapke / TensorFlow-World-Adv-Introduction-TF-Serving
View on GitHub
This repository contains all code examples for my TensorFlow World talk about "Advanced model deployments with TensorFlow Serving"
☆17Dec 8, 2022Updated 3 years ago
zocomputer / substrate-typescript
View on GitHub
Substrate TypeScript SDK
☆10Sep 20, 2024Updated last year
Goekdeniz-Guelmez / MLX-Benchmark
View on GitHub
The best benchmark for LLMs on Apple's MLX framework knowledge and coding tasks.
☆36Jun 12, 2026Updated last month
jesussantana / DeepLearning.AI-Introduction-to-Machine-Learning-in-Production
View on GitHub
In the first course of Machine Learning Engineering for Production Specialization, you will identify the various components and design an…
☆11Nov 4, 2021Updated 4 years ago
akandykeller / Wave_RNNs
View on GitHub
Official implementation of "Traveling Waves Encode the Recent Past and Enhance Sequence Learning" (ICLR 2024)
☆12Mar 15, 2024Updated 2 years ago
brycedrennan / pytest-modal-t
View on GitHub
Run all the tests at the same time with modal.com
☆11Mar 2, 2024Updated 2 years ago
qdrant / qdrant-haystack
View on GitHub
An integration of Qdrant ANN vector database backend with Haystack
☆46Jul 6, 2026Updated 3 weeks ago
sheetagent / sheetagent.github.io
View on GitHub
☆14Apr 25, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
AI-ANK / c3-python-nostream
View on GitHub
Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…
☆24Jan 7, 2024Updated 2 years ago
Arize-ai / prompt-learning
View on GitHub
☆316Apr 2, 2026Updated 3 months ago
muellerzr / minimal-trainer-zoo
View on GitHub
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆197May 6, 2024Updated 2 years ago
radekosmulski / machine_learning_notebooks
View on GitHub
☆27May 2, 2018Updated 8 years ago
hrishioa / meeting-diary
View on GitHub
Simple meeting diarization and speaker id assistant for meetings.
☆12Feb 10, 2025Updated last year
smitkiri / news-qa
View on GitHub
Reading comprehension based question-answering model for news articles.
☆11Jun 22, 2022Updated 4 years ago
ConsequentAI / fneval
View on GitHub
Functional Benchmarks and the Reasoning Gap
☆90Updated this week