σ-GPT: A New Approach to Autoregressive Models
☆75Aug 14, 2024Updated last year
Alternatives and similar repositories for sigma-gpt
Users that are interested in sigma-gpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆119Sep 22, 2024Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- ☆68Mar 19, 2026Updated last week
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 7 months ago
- ☆16May 14, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A flexible, fast and scalable python library for Self-Organizing Maps☆16Aug 9, 2025Updated 7 months ago
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio…☆15Mar 16, 2024Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT.☆31Jul 4, 2024Updated last year
- A Python implementation of COMO-CMA-ES, a non-elitist multiobjective Evolution Strategy☆16Jan 24, 2026Updated 2 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Oct 31, 2024Updated last year
- ☆10Nov 17, 2022Updated 3 years ago
- This is the official PyTorch implementation for our NAACL 2024 paper: "AnchorAL: Computationally Efficient Active Learning for Large and …☆22Apr 15, 2025Updated 11 months ago
- Your favourite classical machine learning algos on the GPU/TPU☆22Dec 14, 2025Updated 3 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A new model for quickly training and simulating adaptive leaky integrate-and-fire spiking neural networks.☆14Apr 9, 2024Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Jun 28, 2024Updated last year
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- ☆14Mar 22, 2024Updated 2 years ago
- ☆45Nov 1, 2025Updated 4 months ago
- An alternate reality web browser, powered by an LLM☆18Apr 29, 2024Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Apr 20, 2024Updated last year
- Multi-Modal Multi-Task (3MT) Road Segmentation, IEEE RA-L 2023☆15Feb 13, 2024Updated 2 years ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆33May 15, 2024Updated last year
- A fast and robust algorithm for temporal difference learning☆22Mar 16, 2026Updated last week
- ☆18Nov 25, 2023Updated 2 years ago
- FastHTML app that makes other FastHTML apps with LLMs☆19Sep 13, 2024Updated last year
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆37Mar 3, 2025Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- [CVPR 2025] Official implementation of the paper "SimMotionEdit: Text-Based Human Motion Editing with Motion Similarity Prediction"☆47Dec 11, 2025Updated 3 months ago
- ☆36Apr 30, 2024Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆111Mar 7, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Convert a regular GPT call into a ChatGPT call☆14Mar 2, 2023Updated 3 years ago
- Self-Alignment with Principle-Following Reward Models☆170Sep 18, 2025Updated 6 months ago
- ☆10Feb 9, 2025Updated last year
- ☆17Apr 19, 2024Updated last year
- Machine Learning for Mathematics Faculty (HSE) 2018☆18Jan 23, 2022Updated 4 years ago
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆14May 28, 2025Updated 9 months ago
- ☆16Mar 25, 2024Updated 2 years ago