σ-GPT: A New Approach to Autoregressive Models
☆75Aug 14, 2024Updated last year
Alternatives and similar repositories for sigma-gpt
Users that are interested in sigma-gpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Efficient World Models with Context-Aware Tokenization. ICML 2024☆122Sep 22, 2024Updated last year
- ☆12Aug 21, 2020Updated 5 years ago
- ☆84Mar 12, 2026Updated 2 months ago
- ☆16Jul 16, 2024Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official Implementation of Infinite-Resolution Integral Noise Warping for Diffusion Models [ICLR 2025]☆16Mar 15, 2025Updated last year
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Jun 5, 2024Updated last year
- Complete Reinforcement Learning Toolkit for Large Language Models!☆21Aug 2, 2025Updated 9 months ago
- Minimal Implimentation of VCRec (2024) for collapse provention.☆18Jan 28, 2025Updated last year
- ☆16May 14, 2024Updated 2 years ago
- Energy Consumption-Aware Tabular Benchmark For Neural Architecture Search☆11Aug 18, 2025Updated 9 months ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- ☆31Jun 25, 2024Updated last year
- A flexible, fast and scalable python library for Self-Organizing Maps☆16Aug 9, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- Effective transpose on Hopper GPU☆28Sep 6, 2025Updated 8 months ago
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio…☆15Mar 16, 2024Updated 2 years ago
- Minimal RLHF implementation built on top of minGPT.☆32Jul 4, 2024Updated last year
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Oct 31, 2024Updated last year
- ☆10Nov 17, 2022Updated 3 years ago
- Your favourite classical machine learning algos on the GPU/TPU☆22Dec 14, 2025Updated 5 months ago
- Source code for ICLR 2024 paper "GRAPH-CONSTRAINED DIFFUSION FOR END-TO-END PATH PLANNING"☆14Jun 4, 2024Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Jun 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Pre-production releases for Spacy in Catalan☆14Nov 30, 2021Updated 4 years ago
- ☆33May 15, 2024Updated 2 years ago
- ☆44Sep 19, 2024Updated last year
- splits videos into scenes with gpt-4o-mini and saves them separately☆12Dec 19, 2024Updated last year
- Exploration of the latent space of generative models on Lung-CT scans☆16Jan 25, 2023Updated 3 years ago
- This code was used to collect, process, and validate the REFLACX (Reports and Eye-Tracking Data for Localization of Abnormalities in Ches…☆19Apr 6, 2022Updated 4 years ago
- Codes of "Knee Cartilage Defect Assessment by Graph Representation and Surface Convolution"☆16Nov 11, 2022Updated 3 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Oct 12, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆111Mar 7, 2025Updated last year
- Convert a regular GPT call into a ChatGPT call☆14Mar 2, 2023Updated 3 years ago
- 8-bit computational substrates☆50Jun 28, 2024Updated last year
- ☆38Apr 30, 2024Updated 2 years ago
- Make new tmux windows and panes inherit the currently active conda environment.☆18Dec 22, 2025Updated 5 months ago
- Neural Network Genetic Algorithm library used for deep learning problems☆18Jun 2, 2021Updated 4 years ago