LAION-AI / AIW
Alice in Wonderland code base for experiments and raw experiments data
☆131 Updated this week
Alternatives and similar repositories for AIW
Users who are interested in AIW are comparing it to the libraries listed below
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆150 Updated 3 weeks ago
- Mixing Language Models with Self-Verification and Meta-Verification ☆112 Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆175 Updated last year
- ☆56 Updated last year
- ☆40 Updated last year
- Official repo for Learning to Reason for Long-Form Story Generation ☆74 Updated 9 months ago
- a curated list of data for reasoning ai ☆141 Updated last year
- ☆105 Updated last year
- Functional Benchmarks and the Reasoning Gap ☆89 Updated last year
- Pivotal Token Search ☆144 Updated last month
- look how they massacred my boy ☆63 Updated last year
- Experiments for efforts to train a new and improved t5 ☆76 Updated last year
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?" ☆69 Updated last year
- Train your own SOTA deductive reasoning model ☆107 Updated 11 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 Updated 3 months ago
- ☆91 Updated last month
- smolLM with Entropix sampler on pytorch ☆149 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆119 Updated last year
- ☆68 Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆125 Updated 6 months ago
- Code repository for the c-BTM paper ☆108 Updated 2 years ago
- ☆41 Updated last year
- EvaByte: Efficient Byte-level Language Models at Scale ☆115 Updated 9 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval ☆51 Updated last year
- ☆152 Updated 4 months ago
- Simple GRPO scripts and configurations. ☆59 Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language ☆152 Updated last year
- PyTorch implementation of models from the Zamba2 series. ☆186 Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention ☆119 Updated 2 years ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024) ☆198 Updated last year