LAION-AI / AIW
Alice in Wonderland code base for experiments and raw experiments data
☆129Updated 2 months ago
Alternatives and similar repositories for AIW:
Users that are interested in AIW are comparing it to the libraries listed below
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated last month
- look how they massacred my boy☆63Updated 6 months ago
- Functional Benchmarks and the Reasoning Gap☆85Updated 6 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆170Updated 3 months ago
- a curated list of data for reasoning ai☆133Updated 8 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆144Updated 2 months ago
- PyTorch implementation of models from the Zamba2 series.☆179Updated 2 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆186Updated 4 months ago
- ☆38Updated 8 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- Replicating O1 inference-time scaling laws☆83Updated 4 months ago
- ☆128Updated 2 weeks ago
- RWKV-7: Surpassing GPT☆82Updated 4 months ago
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.☆119Updated last week
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆125Updated 4 months ago
- Train your own SOTA deductive reasoning model☆83Updated last month
- Code for ExploreTom☆79Updated 4 months ago
- ☆80Updated 3 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 2 months ago
- smolLM with Entropix sampler on pytorch☆151Updated 5 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆189Updated 10 months ago
- Token Omission Via Attention☆126Updated 6 months ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆154Updated 6 months ago
- An introduction to LLM Sampling☆77Updated 4 months ago
- Code repository for the c-BTM paper☆106Updated last year
- EvaByte: Efficient Byte-level Language Models at Scale☆86Updated 3 weeks ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and …☆204Updated 4 months ago
- ☆49Updated last year
- ☆67Updated 8 months ago
- Evaluating LLMs with CommonGen-Lite☆89Updated last year