Dicklesworthstone / llm_introspective_compression_and_metacognitionLinks
A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advanced capabilities like reasoning backtracking, latent thought optimization, and metacognitive control.
☆29Updated 9 months ago
Alternatives and similar repositories for llm_introspective_compression_and_metacognition
Users that are interested in llm_introspective_compression_and_metacognition are comparing it to the libraries listed below
Sorting:
- Pivotal Token Search☆142Updated last month
- Clue inspired puzzles for testing LLM deduction abilities☆44Updated 9 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆58Updated 10 months ago
- Approximating the joint distribution of language models via MCTS☆22Updated last year
- ☆62Updated 6 months ago
- entropix style sampling + GUI☆27Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆23Updated last year
- ☆40Updated last year
- ☆50Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆24Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 3 months ago
- Very minimal (and stateless) agent framework☆44Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 11 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- OpenPipe Reinforcement Learning Experiments☆32Updated 10 months ago
- lossily compress representation vectors using product quantization☆59Updated 2 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆55Updated 4 months ago
- Digital Red Queen: Adversarial Program Evolution in Core War with LLMs☆141Updated last week
- ☆30Updated last year
- An introduction to DSPy☆32Updated 4 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 9 months ago
- Simple LLM inference server☆20Updated last year
- An AI character interaction system with emotional modeling and advanced memory management☆17Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 9 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆150Updated 2 weeks ago
- look how they massacred my boy☆63Updated last year