MichalRyszardWojcik / transformer-language-model
A clean, no-jargon mathematical definition of a transformer language model, with a Python implementation that focuses on clarity rather than efficiency.
☆11Updated 3 years ago
Alternatives and similar repositories for transformer-language-model
Users who are interested in transformer-language-model are comparing it to the repositories listed below
- LeanAgent is a novel lifelong learning framework for formal theorem proving that continuously generalizes to and improves on ever-expandi…☆38Updated 4 months ago
- ☆163Updated last year
- Evaluation of neuro-symbolic engines☆39Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆42Updated last year
- ☆44Updated 2 years ago
- Based on the tree of thoughts paper☆48Updated 2 years ago
- Causal DAG Extraction from Text (DEFT)☆66Updated 9 months ago
- ☆69Updated last year
- Sparse and discrete interpretability tool for neural networks☆64Updated last year
- An interactive exploration of Transformer programming.☆269Updated last year
- Neural theorem proving tutorial, version II☆39Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆109Updated 10 months ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)☆37Updated 2 years ago
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…☆215Updated 4 months ago
- Google Research☆46Updated 3 years ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆15Updated 4 months ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆21Updated 2 years ago
- git extension for {collaborative, communal, continual} model development☆215Updated 11 months ago
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated 2 years ago
- ☆45Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language models☆136Updated 6 months ago
- gzip Predicts Data-dependent Scaling Laws☆34Updated last year
- Understanding how features learned by neural networks evolve throughout training☆39Updated last year
- Simple GRPO scripts and configurations.☆59Updated 8 months ago
- ☆34Updated 10 months ago
- ☆68Updated 11 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆65Updated 2 years ago
- Learning Universal Predictors☆80Updated last year
- Multibackend Graph Neural Networks in Keras 3☆25Updated last year
- ☆30Updated last year