subhashk01 / LLM-addition
LLMs represent numbers on a helix and manipulate that helix to do addition.
☆21Updated 2 months ago
Alternatives and similar repositories for LLM-addition:
Users that are interested in LLM-addition are comparing it to the libraries listed below
- NanoGPT-speedrunning for the poor T4 enjoyers☆61Updated 2 weeks ago
- Repository to create traveling waves integrate special information through time☆50Updated last month
- look how they massacred my boy☆63Updated 6 months ago
- Simple GRPO scripts and configurations.☆58Updated 2 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆39Updated last week
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆96Updated last month
- ☆128Updated 3 weeks ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆33Updated last month
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience"☆60Updated 2 weeks ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆135Updated last month
- ☆38Updated 8 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆62Updated last month
- ☆48Updated 5 months ago
- working implimention of deepseek MLA☆40Updated 3 months ago
- EvaByte: Efficient Byte-level Language Models at Scale☆87Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- ☆16Updated last month
- Official Code Release for "Training a Generally Curious Agent"☆20Updated 3 weeks ago
- σ-GPT: A New Approach to Autoregressive Models☆62Updated 8 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- ☆51Updated this week
- ☆65Updated this week
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 4 months ago
- ☆33Updated 3 months ago
- Collection of LLM completions for reasoning-gym task datasets☆19Updated this week
- ☆41Updated this week
- Official repo of paper LM2☆37Updated 2 months ago
- ☆63Updated 6 months ago
- ☆79Updated last year