llmonpy / needle-in-a-needlestackLinks
☆115Updated 10 months ago
Alternatives and similar repositories for needle-in-a-needlestack
Users that are interested in needle-in-a-needlestack are comparing it to the libraries listed below
Sorting:
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- a curated list of data for reasoning ai☆140Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- Pivotal Token Search☆132Updated this week
- ☆164Updated 4 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated 2 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆146Updated 9 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆106Updated 2 months ago
- Alice in Wonderland code base for experiments and raw experiments data☆131Updated 2 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- Visualize the intermediate output of Mistral 7B☆381Updated 10 months ago
- ☆210Updated 5 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆170Updated last year
- Efficient vector database for hundred millions of embeddings.☆211Updated last year
- ☆45Updated 2 years ago
- Enforce structured output from LLMs 100% of the time☆248Updated last year
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆220Updated 2 weeks ago
- A repo to evaluate various LLM's chess playing abilities.☆85Updated last year
- Use the OpenAI Batch tool to make async batch requests to the OpenAI API.☆101Updated last year
- Chat Markup Language conversation library☆55Updated last year
- Mistral7B playing DOOM☆138Updated last year
- ☆124Updated last year
- ☆159Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated last year
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆219Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆250Updated 9 months ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆238Updated 2 years ago
- Your buddy in the (L)LM space.☆64Updated last year