☆126Aug 6, 2024Updated last year
Alternatives and similar repositories for extended-mind-transformers
Users that are interested in extended-mind-transformers are comparing it to the libraries listed below
Sorting:
- Exact OU processes with JAX☆59Jan 6, 2026Updated 2 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 5 months ago
- ☆19Dec 4, 2025Updated 3 months ago
- Developer showcase of projects built on Cartesia☆20Aug 28, 2024Updated last year
- Uncertainty quantification with PyTorch☆378Jan 27, 2026Updated last month
- ☆21Oct 6, 2023Updated 2 years ago
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20May 31, 2023Updated 2 years ago
- GPT2 Byte Pair Encoding implementation in Golang☆24Jul 9, 2025Updated 7 months ago
- A package for defining deep learning models using categorical algebraic expressions.☆61Jul 27, 2024Updated last year
- ☆40Jul 26, 2024Updated last year
- AI that dreams☆22Apr 10, 2023Updated 2 years ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45May 16, 2024Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆66Jun 29, 2023Updated 2 years ago
- Entailment self-training☆27May 30, 2023Updated 2 years ago
- A tracery Twitter bot, generating graphic scores to inspire musicians, composers, and anyone else.☆10Mar 12, 2016Updated 9 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆14Apr 30, 2025Updated 10 months ago
- A VPN written in Rust☆13Apr 17, 2025Updated 10 months ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- ☆11Dec 11, 2024Updated last year
- High-performance tokenized language data-loader for Python C++ extension☆14Jul 22, 2024Updated last year
- Drastically Reducing the Number of Trainable Parameters in Deep CNNs by Inter-layer Kernel-sharing☆14Mar 28, 2023Updated 2 years ago
- For ACL25 paper "WAFFLE: Multi-Modal Model for Automated Front-End Development" - by Shanchao Liang and Nan Jiang and Shangshu Qian and L…☆11May 28, 2025Updated 9 months ago
- A strongly typed Python DSL for developing message passing multi agent systems☆53Apr 9, 2024Updated last year
- ☆17Jun 20, 2023Updated 2 years ago
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 2 years ago
- ☆10Mar 1, 2025Updated last year
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18May 30, 2025Updated 9 months ago
- ☆48Jan 3, 2026Updated 2 months ago
- ☆35Aug 16, 2024Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆33Aug 14, 2024Updated last year
- nanoGPT using Equinox☆15Mar 3, 2023Updated 3 years ago
- A React Native Elements example app for android, iOS, and web that shares a single codebase☆13Aug 22, 2018Updated 7 years ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆14Feb 28, 2026Updated last week
- A Discord Bot for distilling papers, GitHub repos, Blogposts, and much more using the power of LLMs and vector search.☆13May 3, 2023Updated 2 years ago
- ☆14Mar 28, 2024Updated last year
- ☆23Jan 27, 2026Updated last month
- Simple implementation of a GPT (training and inference) in PyTorch.☆13Dec 11, 2023Updated 2 years ago
- ☆35Apr 12, 2024Updated last year
- A Ruby on Rails style framework for the DSPy (Demonstrate, Search, Predict) project for Language Models like GPT, BERT, and LLama.☆132Oct 16, 2024Updated last year