Dicklesworthstone / llm_introspective_compression_and_metacognitionLinks
A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advanced capabilities like reasoning backtracking, latent thought optimization, and metacognitive control.
☆26Updated 8 months ago
Alternatives and similar repositories for llm_introspective_compression_and_metacognition
Users that are interested in llm_introspective_compression_and_metacognition are comparing it to the libraries listed below
Sorting:
- Pivotal Token Search☆132Updated last week
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆57Updated 9 months ago
- OpenPipe Reinforcement Learning Experiments☆32Updated 8 months ago
- Editor with LLM generation tree exploration☆80Updated 9 months ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Updated last year
- Simple LLM inference server☆20Updated last year
- lossily compress representation vectors using product quantization☆59Updated last month
- ☆62Updated 5 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆33Updated 8 months ago
- Clue inspired puzzles for testing LLM deduction abilities☆45Updated 8 months ago
- Latent Large Language Models☆19Updated last year
- entropix style sampling + GUI☆27Updated last year
- ☆40Updated last year
- Framework-Agnostic RL Environments for LLM Fine-Tuning☆39Updated last week
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆148Updated 9 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated last month
- ☆30Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated last year
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Updated last year
- Lego for GRPO☆30Updated 6 months ago
- look how they massacred my boy☆63Updated last year
- Modified Beam Search with periodical restart☆12Updated last year
- A library for building software agents using behavior trees and language models.☆90Updated 10 months ago
- ☆16Updated last year
- Approximating the joint distribution of language models via MCTS☆22Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆51Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 8 months ago
- ☆68Updated last year