perceptron-ai-inc / perceptron
The official Python SDK for the Perceptron API
☆58 Updated last week
Alternatives and similar repositories for perceptron
Users that are interested in perceptron are comparing it to the libraries listed below
- ☆63 Updated last year
- ☆56 Updated last year
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆84 Updated 8 months ago
- ☆24 Updated 8 months ago
- ☆40 Updated last year
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun ☆56 Updated 10 months ago
- Fork of Flame repo for training of some new stuff in development ☆19 Updated 3 weeks ago
- ☆82 Updated 4 months ago
- ☆169 Updated 4 months ago
- Large multi-modal models (L3M) pre-training. ☆229 Updated 4 months ago
- A repository for research on medium sized language models. ☆77 Updated last year
- implementation of https://arxiv.org/pdf/2312.09299 ☆21 Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- EvaByte: Efficient Byte-level Language Models at Scale ☆115 Updated 9 months ago
- Focused on fast experimentation and simplicity ☆80 Updated last year
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod… ☆119 Updated 3 weeks ago
- Model Merging with Functional Dual Anchors ☆45 Updated 2 months ago
- H-Net Dynamic Hierarchical Architecture ☆81 Updated 4 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆61 Updated last year
- RS-IMLE ☆43 Updated last year
- https://x.com/BlinkDL_AI/status/1884768989743882276 ☆28 Updated 8 months ago
- Open Character Training ☆66 Updated 2 months ago
- GoldFinch and other hybrid transformer components ☆45 Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆127 Updated 3 months ago
- lossily compress representation vectors using product quantization ☆59 Updated 3 months ago
- DeMo: Decoupled Momentum Optimization ☆198 Updated last year
- Extending the Context of Pretrained LLMs by Dropping Their Positional Embedding ☆193 Updated 2 weeks ago
- ☆34 Updated last year
- Repository to create traveling waves integrate special information through time ☆56 Updated 5 months ago
- ☆91 Updated last year