Decentralised-AI / LFM-Liquid-AI-Liquid-Foundation-Models
An open source implementation of LFMs from Liquid AI: Liquid Foundation Models
☆93Updated 7 months ago
Alternatives and similar repositories for LFM-Liquid-AI-Liquid-Foundation-Models:
Users that are interested in LFM-Liquid-AI-Liquid-Foundation-Models are comparing it to the libraries listed below
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆163Updated 2 weeks ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆67Updated 5 months ago
- ☆176Updated 4 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM☆54Updated last year
- Code for ExploreTom☆81Updated 4 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 11 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆199Updated 9 months ago
- PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"☆167Updated last month
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆210Updated 2 months ago
- minimal GRPO implementation from scratch☆87Updated last month
- PyTorch implementation of models from the Zamba2 series.☆180Updated 3 months ago
- ☆129Updated 8 months ago
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆291Updated last week
- ☆115Updated last month
- RWKV-7: Surpassing GPT☆83Updated 5 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆109Updated 2 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆49Updated 3 months ago
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"☆154Updated 6 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated 2 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆215Updated 6 months ago
- Code repository for Black Mamba☆246Updated last year
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆231Updated 3 months ago
- Exploring Applications of GRPO☆189Updated this week
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 3 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- making the official triton tutorials actually comprehensible☆27Updated last month
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)☆153Updated 3 weeks ago
- Normalized Transformer (nGPT)☆174Updated 5 months ago
- Video+code lecture on building nanoGPT from scratch☆66Updated 10 months ago
- ☆186Updated this week