☆58Sep 17, 2025Updated 6 months ago
Alternatives and similar repositories for openCLT
Users that are interested in openCLT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated last month
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆69Updated this week
- ☆31Nov 30, 2025Updated 3 months ago
- ☆25Jun 16, 2024Updated last year
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆85Jan 12, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆25Feb 20, 2026Updated last month
- The Full Spectrum of Deepnet Hessians at Scale: Dynamics with SGD Training and Sample Size☆19May 19, 2019Updated 6 years ago
- Benchmarking Optimizers for LLM Pretraining☆56Dec 30, 2025Updated 2 months ago
- ☆48Jul 21, 2025Updated 8 months ago
- Public implementation of ICML'19 paper "White-box vs Black-box: Bayes Optimal Strategies for Membership Inference"☆18May 28, 2020Updated 5 years ago
- ☆35Jul 5, 2023Updated 2 years ago
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Aug 14, 2024Updated last year
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12May 24, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 科研工作专用ChatGPT/GLM拓展,特别优化学术Paper润色体验,模块化设计支持自定义快捷按钮&函数插件,支持代码块表格显示,Tex公式双显示,新增Python和C++项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持gpt-…☆13Jan 25, 2026Updated 2 months ago
- Monitoring the health of ARR☆29Jan 24, 2026Updated 2 months ago
- A framework that allows you to apply Sparse AutoEncoder on any models☆51Jul 11, 2025Updated 8 months ago
- A carefully curated collection of high-quality libraries, projects, tutorials, research papers, and other essential resources focused on …☆65Updated this week
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- ☆16Jul 10, 2024Updated last year
- 模型压缩的小白入门教程☆22Jul 7, 2024Updated last year
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11Oct 10, 2025Updated 5 months ago
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆27Oct 20, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Talk to ChatGPT and Generate image via any Matrix client!☆16Apr 25, 2023Updated 2 years ago
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers☆14Jun 7, 2024Updated last year
- A resource repository for representation engineering in large language models☆149Nov 14, 2024Updated last year
- We study toy models of skill learning.☆33Feb 3, 2026Updated last month
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆27Oct 14, 2025Updated 5 months ago
- ☆10Mar 4, 2024Updated 2 years ago
- ☆18Jun 20, 2025Updated 9 months ago
- CUDA implementation of Multidimensional Scaling☆15May 8, 2021Updated 4 years ago
- ACL2024: TTM-RE Memory-Augmented Document-Level Relation Extraction☆20Oct 6, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Pytorch routines for (Ker)nel (Mac)hines☆11Oct 10, 2025Updated 5 months ago
- ☆25May 20, 2025Updated 10 months ago
- Library that provides metrics to assess representation quality☆26Feb 5, 2025Updated last year
- Simple MoE - Day 17 of 365 Days of Repos☆18Jan 17, 2025Updated last year
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"☆27Jun 4, 2024Updated last year
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago