perceptron-ai-inc / perceptron
The official Python SDK for the Perceptron API
☆58 Updated this week
Alternatives and similar repositories for perceptron
Users who are interested in perceptron are comparing it to the libraries listed below.
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆82 Updated 7 months ago
- ☆63 Updated last year
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025. ☆134 Updated 3 months ago
- Large multi-modal models (L3M) pre-training. ☆224 Updated 3 months ago
- Fork of Flame repo for training of some new stuff in development ☆19 Updated this week
- Model Merging with Functional Dual Anchors ☆44 Updated last month
- ☆56 Updated last year
- ☆152 Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆62 Updated last year
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod… ☆118 Updated 2 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 Updated last year
- Official Repo for InSTA: Towards Internet-Scale Training For Agents ☆55 Updated 6 months ago
- Focused on fast experimentation and simplicity ☆79 Updated last year
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆123 Updated 5 months ago
- ☆24 Updated 7 months ago
- RLP: Reinforcement as a Pretraining Objective ☆222 Updated 3 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆174 Updated 11 months ago
- Official repo for Learning to Reason for Long-Form Story Generation ☆73 Updated 8 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models ☆44 Updated 5 months ago
- RS-IMLE ☆43 Updated last year
- ☆40 Updated last year
- H-Net Dynamic Hierarchical Architecture ☆80 Updated 3 months ago
- A repository for research on medium sized language models. ☆77 Updated last year
- ☆80 Updated last year
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆62 Updated 3 months ago
- MEXMA: Token-level objectives improve sentence representations ☆42 Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆149 Updated 3 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆70 Updated last year
- ☆26 Updated 11 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun ☆56 Updated 10 months ago