LAION-AI / AIWLinks
Alice in Wonderland code base for experiments and raw experiments data
☆131Updated last week
Alternatives and similar repositories for AIW
Users that are interested in AIW are comparing it to the libraries listed below
Sorting:
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆173Updated 5 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆140Updated 4 months ago
- Code for ExploreTom☆84Updated 6 months ago
- PyTorch implementation of models from the Zamba2 series.☆182Updated 5 months ago
- Train your own SOTA deductive reasoning model☆94Updated 3 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆102Updated 2 months ago
- ☆51Updated 7 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆80Updated last month
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆126Updated 6 months ago
- look how they massacred my boy☆63Updated 8 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆149Updated 4 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆63Updated 2 months ago
- ☆115Updated 4 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆190Updated last year
- Pivotal Token Search☆107Updated last month
- ☆134Updated 2 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 3 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆63Updated 7 months ago
- GRadient-INformed MoE☆263Updated 9 months ago
- Functional Benchmarks and the Reasoning Gap☆87Updated 8 months ago
- Code repository for the c-BTM paper☆106Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Token Omission Via Attention☆128Updated 8 months ago
- ☆61Updated 3 weeks ago
- ☆98Updated 5 months ago
- ☆79Updated 10 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆112Updated last month
- Replicating O1 inference-time scaling laws☆87Updated 6 months ago
- ☆68Updated 10 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆60Updated 2 months ago