Decentralised-AI / LFM-Liquid-AI-Liquid-Foundation-Models
An open source implementation of LFMs from Liquid AI: Liquid Foundation Models
☆113Updated last year
Alternatives and similar repositories for LFM-Liquid-AI-Liquid-Foundation-Models
Users who are interested in LFM-Liquid-AI-Liquid-Foundation-Models compare it to the libraries listed below.
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆191Updated last week
- ☆199Updated 10 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆75Updated 10 months ago
- GRadient-INformed MoE☆264Updated last year
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆232Updated 7 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆159Updated last month
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆201Updated last year
- PyTorch implementation of models from the Zamba2 series.☆185Updated 8 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆96Updated this week
- Accompanying material for the sleep-time compute paper☆115Updated 5 months ago
- Code repository for Black Mamba☆256Updated last year
- ☆88Updated last week
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆343Updated 3 months ago
- ☆136Updated last year
- ☆78Updated last week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…☆342Updated 10 months ago
- Code for ExploreTom☆86Updated 3 months ago
- An extension of the nanoGPT repository for training small MOE models.☆196Updated 7 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆290Updated this week
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning☆524Updated 2 weeks ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆59Updated last year
- Simple & Scalable Pretraining for Neural Architecture Research☆296Updated last month
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆125Updated last month
- Source code for the collaborative reasoner research project at Meta FAIR.☆102Updated 5 months ago
- An open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 7 months ago
- ☆96Updated 2 weeks ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆268Updated this week
- An efficient implementation of the method proposed in "The Era of 1-bit LLMs"☆154Updated 11 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆324Updated 11 months ago
- Working implementation of DeepSeek MLA☆44Updated 9 months ago