normal-computing / extended-mind-transformers
☆122 Updated 11 months ago
Alternatives and similar repositories for extended-mind-transformers
Users who are interested in extended-mind-transformers compare it to the libraries listed below.
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆84 Updated 9 months ago
- A strongly typed Python DSL for developing message-passing multi-agent systems ☆53 Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆221 Updated last year
- ☆47 Updated last year
- A framework for optimizing DSPy programs with RL ☆89 Updated this week
- Just a bunch of benchmark logs for different LLMs ☆119 Updated 11 months ago
- look how they massacred my boy ☆63 Updated 8 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 Updated last year
- smolLM with Entropix sampler on PyTorch ☆150 Updated 8 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆140 Updated 4 months ago
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆177 Updated this week
- A framework for orchestrating AI agents using a mermaid graph ☆77 Updated last year
- Inference-time scaling for LLMs-as-a-judge. ☆250 Updated last week
- Fast parallel LLM inference for MLX ☆198 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆106 Updated 7 months ago
- An implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- ☆111 Updated last year
- Chat Markup Language conversation library ☆55 Updated last year
- smol models are fun too ☆93 Updated 8 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchical branching pathways of LLM generations. ☆70 Updated 5 months ago
- Comprehensive analysis of the performance differences between QLoRA, LoRA, and full fine-tunes. ☆82 Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆137 Updated 2 months ago
- ☆154 Updated 7 months ago
- Plotting (entropy, varentropy) for small LMs ☆97 Updated last month
- Simple UI for debugging correlations of text embeddings ☆287 Updated last month
- ☆66 Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆240 Updated 4 months ago
- Train your own SOTA deductive reasoning model ☆96 Updated 4 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆447 Updated 9 months ago
- This is our own implementation of 'Layer Selective Rank Reduction' ☆239 Updated last year