CarperAI / nmmo-environmentLinks
Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research
☆15Updated last year
Alternatives and similar repositories for nmmo-environment
Users that are interested in nmmo-environment are comparing it to the libraries listed below
Sorting:
- ☆26Updated 2 years ago
- ☆37Updated 3 years ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆132Updated last year
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆15Updated 3 months ago
- 🚀 Automatically convert unstructured data into a high-quality 'textbook' format, optimized for fine-tuning Large Language Models (LLMs)☆25Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆69Updated 2 years ago
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆39Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.☆169Updated 2 weeks ago
- ☆12Updated 4 months ago
- GoldFinch and other hybrid transformer components☆45Updated last year
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆42Updated 11 months ago
- Fast inference of Instruct tuned LLaMa on your personal devices.☆22Updated 2 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆12Updated 11 months ago
- ☆61Updated last year
- Pytorch implementation of the Gato paper from Deepmind☆11Updated 2 years ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019)☆37Updated 2 years ago
- Documentation for dynamic machine learning systems.☆29Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆65Updated 7 months ago
- Code base for internal reward models and PPO training☆24Updated 2 years ago
- ☆27Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆19Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- The Next Generation Multi-Modality Superintelligence☆69Updated last year
- I clearly unravel how I came to invent the supermanifold hypothesis in deep learning, (a part of a system called 'thought curvature') in …☆20Updated 2 years ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast☆149Updated last year
- Clean RL implementation using MLX☆33Updated last year
- 🏥 Health monitor for a Petals swarm☆39Updated last year
- ☆163Updated last year
- A2Perf is a benchmark for evaluating agents on sequential decision problems that are relevant to the real world. This repository contains…☆10Updated last year
- Inference code for LLaMA 2 models☆30Updated last year