llmonpy / needle-in-a-needlestack
☆111Updated 2 weeks ago
Alternatives and similar repositories for needle-in-a-needlestack:
Users that are interested in needle-in-a-needlestack are comparing it to the libraries listed below
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- a curated list of data for reasoning ai☆128Updated 6 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆130Updated this week
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆222Updated this week
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆82Updated last week
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Chat Markup Language conversation library☆55Updated last year
- ☆152Updated 7 months ago
- ☆74Updated last year
- Hallucinations (Confabulations) Document-Based Benchmark for RAG☆90Updated last week
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated 9 months ago
- Data preparation code for Amber 7B LLM☆85Updated 9 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 6 months ago
- Alice in Wonderland code base for experiments and raw experiments data☆127Updated last week
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆164Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime.☆136Updated last year
- Mistral7B playing DOOM☆127Updated 7 months ago
- Action library for AI Agent☆209Updated this week
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated 11 months ago
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.☆88Updated 7 months ago
- Low-Rank adapter extraction for fine-tuned transformers models☆169Updated 9 months ago
- Transformer GPU VRAM estimator☆50Updated 10 months ago
- Generate ideal question-answers for testing RAG☆126Updated 2 weeks ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆101Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆100Updated 10 months ago
- ☆72Updated 3 weeks ago
- Visualize the intermediate output of Mistral 7B☆339Updated 3 weeks ago
- ☆108Updated 5 months ago
- Functional Benchmarks and the Reasoning Gap☆82Updated 4 months ago