llmonpy / needle-in-a-needlestackLinks
☆115Updated 10 months ago
Alternatives and similar repositories for needle-in-a-needlestack
Users that are interested in needle-in-a-needlestack are comparing it to the libraries listed below
Sorting:
- a curated list of data for reasoning ai☆140Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated last year
- SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?☆233Updated last month
- Pivotal Token Search☆141Updated last week
- Visualize the intermediate output of Mistral 7B☆381Updated 11 months ago
- ☆125Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Updated 10 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆91Updated 3 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆107Updated 3 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- Alice in Wonderland code base for experiments and raw experiments data☆131Updated 3 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆111Updated last year
- Synthetic Data for LLM Fine-Tuning☆120Updated 2 years ago
- ReLM is a Regular Expression engine for Language Models☆107Updated 2 years ago
- ☆164Updated 4 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆263Updated 10 months ago
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆170Updated last year
- Efficient vector database for hundred millions of embeddings.☆211Updated last year
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated last year
- Enforce structured output from LLMs 100% of the time☆249Updated last year
- Benchmark that evaluates LLMs using 759 NYT Connections puzzles extended with extra trick words