☆18Mar 18, 2024Updated last year
Alternatives and similar repositories for basedxl
Users that are interested in basedxl are comparing it to the libraries listed below
Sorting:
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025☆21Updated this week
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 2 years ago
- A repo to do interpretability of pre-trained acoustic models☆15Oct 15, 2023Updated 2 years ago
- Simple implementation of a GPT (training and inference) in PyTorch.☆13Dec 11, 2023Updated 2 years ago
- ☆17Dec 19, 2024Updated last year
- Minimalistic, hackable PyTorch implementation of SimSiam in ~400 lines. Achieves good performance on ImageNet with ResNet50. Features dis…☆21Nov 25, 2024Updated last year
- ☆18Dec 2, 2024Updated last year
- ☆32Jul 2, 2025Updated 8 months ago
- Experiments for efforts to train a new and improved t5☆76Apr 15, 2024Updated last year
- ☆16Mar 22, 2024Updated last year
- ☆20May 30, 2024Updated last year
- ☆19Dec 4, 2025Updated 3 months ago
- An implementation of the Llama architecture, to instruct and delight☆21May 31, 2025Updated 9 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- ☆23Oct 17, 2024Updated last year
- ☆26May 30, 2023Updated 2 years ago
- ☆24Sep 25, 2024Updated last year
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Jun 6, 2024Updated last year
- train with kittens!☆63Oct 25, 2024Updated last year
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆30Jan 28, 2026Updated last month
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆33Nov 29, 2024Updated last year
- Accelerated First Order Parallel Associative Scan☆195Jan 7, 2026Updated last month
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆78Dec 3, 2024Updated last year
- ☆35Apr 12, 2024Updated last year
- gzip Predicts Data-dependent Scaling Laws☆34May 28, 2024Updated last year
- Simple and efficient pytorch-native transformer training and inference (batched)☆79Apr 2, 2024Updated last year
- Experiment of using Tangent to autodiff triton☆82Jan 22, 2024Updated 2 years ago
- Pure Java Llama2 inference with optional multi-GPU CUDA implementation☆13Sep 2, 2023Updated 2 years ago
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 3 months ago
- Official Roadmap for Catalyst – Track upcoming features, improvements, and releases. Submit feature requests and help shape the future of…☆11Feb 12, 2025Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- ☆91Aug 18, 2024Updated last year
- Patch convolution to avoid large GPU memory usage of Conv2D☆95Jan 23, 2025Updated last year
- ☆44Jun 19, 2024Updated last year
- 我借用了DominikDoom大神的文件,为了方便我在布置翻译文件的时候方便下载。☆11May 21, 2023Updated 2 years ago
- CVPR 2023: PAniC-3D, Vtubers dataset downloader☆13Apr 22, 2023Updated 2 years ago