lee-ny / teaching_arithmeticLinks

☆83

Alternatives and similar repositories for teaching_arithmetic

Users that are interested in teaching_arithmetic are comparing it to the libraries listed below

Sorting:

GFNOrg / gfn-lm-tuning
☆184Updated last year
princeton-nlp / TransformerPrograms
[NeurIPS 2023] Learning Transformer Programs
☆162Updated last year
roeehendel / icl_task_vectors
☆96Updated last year
UFO-101 / auto-circuit
A library for efficient patching and automatic circuit discovery.
☆73Updated 2 weeks ago
McGill-NLP / length-generalization
Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023
☆136Updated last year
gregorbachmann / Next-Token-Failures
☆88Updated last year
mega002 / ff-layers
The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…
☆94Updated 3 years ago
berlino / seq_icl
☆53Updated last year
dtsip / in-context-learning
☆234Updated last year
protagolabs / odyssey-math
☆84Updated 6 months ago
redwoodresearch / Easy-Transformer
☆121Updated last year
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆97Updated last year
nouhadziri / faith-and-fate
☆34Updated last year
KihoPark / linear_rep_geometry
☆103Updated 5 months ago
shunzh / Code-AI-Tree-Search
☆119Updated last year
ericwtodd / function_vectors
Function Vectors in Large Language Models (ICLR 2024)
☆175Updated 3 months ago
HazyResearch / skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆46Updated last year
ucl-dark / llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
☆113Updated last year
McGill-NLP / VinePPO
Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"
☆167Updated 2 months ago
abhishekpanigrahi1996 / transformer_in_transformer
☆45Updated last year
princeton-nlp / LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
☆78Updated last year
epfml / llm-baselines
nanoGPT-like codebase for LLM training
☆102Updated 2 months ago
Edward-Sun / easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
☆123Updated 10 months ago
princeton-nlp / USACO
Can Language Models Solve Olympiad Programming?
☆119Updated 6 months ago
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆221Updated 3 weeks ago
vwxyzjn / summarize_from_feedback_details
☆147Updated 8 months ago
likenneth / othello_world
Emergent world representations: Exploring a sequence model trained on a synthetic task
☆186Updated 2 years ago
MadryLab / DsDm
☆50Updated last year
p-lambda / incontext-learning
Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…
☆108Updated last year
logix-project / logix
AI Logging for Interpretability and Explainability🔬
☆124Updated last year