LAION-AI / AIWLinks

Alice in Wonderland code base for experiments and raw experiments data

☆131

Alternatives and similar repositories for AIW

Users that are interested in AIW are comparing it to the libraries listed below

Sorting:

casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆173Updated 5 months ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆140Updated 4 months ago
facebookresearch / ExploreToM
Code for ExploreTom
☆84Updated 6 months ago
Zyphra / Zamba2
PyTorch implementation of models from the Zamba2 series.
☆182Updated 5 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆94Updated 3 months ago
OpenEvaByte / evabyte
EvaByte: Efficient Byte-level Language Models at Scale
☆102Updated 2 months ago
arcee-ai / DAM
☆51Updated 7 months ago
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆80Updated last month
Zyphra / tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆126Updated 6 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 8 months ago
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆149Updated 4 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆63Updated 2 months ago
SalesforceAIResearch / LaTRO
☆115Updated 4 months ago
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆190Updated last year
codelion / pts
Pivotal Token Search
☆107Updated last month
google-deepmind / mishax
☆134Updated 2 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101Updated 3 months ago
Aleph-Alpha-Research / scaling
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…
☆63Updated 7 months ago
microsoft / GRIN-MoE
GRadient-INformed MoE
☆263Updated 9 months ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆87Updated 8 months ago
kernelmachine / cbtm
Code repository for the c-BTM paper
☆106Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆118Updated last year
schwartz-lab-NLP / TOVA
Token Omission Via Attention
☆128Updated 8 months ago
allenai / infinigram-api
☆61Updated 3 weeks ago
LucasPrietoAl / grokking-at-the-edge-of-numerical-stability
☆98Updated 5 months ago
RobertCsordas / moeut
☆79Updated 10 months ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆112Updated last month
hughbzhang / o1_inference_scaling_laws
Replicating O1 inference-time scaling laws
☆87Updated 6 months ago
KaiNylund / lm-weights-encode-time
☆68Updated 10 months ago
benpry / why-think-step-by-step
Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"
☆60Updated 2 months ago