llmonpy / needle-in-a-needlestack
☆116 Updated 5 months ago
Alternatives and similar repositories for needle-in-a-needlestack
Users who are interested in needle-in-a-needlestack are comparing it to the libraries listed below.
- a curated list of data for reasoning ai ☆136 Updated 11 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆87 Updated 3 weeks ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆140 Updated 4 months ago
- Pivotal Token Search ☆109 Updated this week
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 11 months ago
- Visualize the intermediate output of Mistral 7B ☆366 Updated 5 months ago
- Alice in Wonderland code base for experiments and raw experiments data ☆131 Updated 3 weeks ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆84 Updated 9 months ago
- Transformer GPU VRAM estimator ☆66 Updated last year
- Mistral7B playing DOOM ☆132 Updated last year
- Efficient vector database for hundreds of millions of embeddings. ☆206 Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime. ☆238 Updated 2 years ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆101 Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated last year
- Benchmark that evaluates LLMs using 651 NYT Connections puzzles extended with extra trick words ☆130 Updated this week
- Enforce structured output from LLMs 100% of the time ☆249 Updated 11 months ago
- ReLM is a Regular Expression engine for Language Models ☆106 Updated 2 years ago
- ☆157 Updated last year
- ☆210 Updated 2 weeks ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆240 Updated 5 months ago
- explore token trajectory trees on instruct and base models ☆134 Updated last month
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆82 Updated last year
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo… ☆175 Updated last year
- ☆122 Updated 11 months ago
- Your buddy in the (L)LM space. ☆64 Updated 9 months ago
- Build Secure and Compliant AI agents and MCP Servers. YC W23 ☆145 Updated last month
- Function Calling Benchmark & Testing ☆87 Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard. ☆187 Updated last year