BICLab / SpikingBrain-7B
☆590Updated this week
Alternatives and similar repositories for SpikingBrain-7B
Users that are interested in SpikingBrain-7B are comparing it to the libraries listed below
- ☆619Updated 3 weeks ago
- Live-bending a foundation model’s output at neural network level.☆265Updated 5 months ago
- Continuous Thought Machines, because thought takes time and reasoning is a process.☆1,294Updated 2 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆859Updated 3 months ago
- ☆227Updated 6 months ago
- ☆1,187Updated 2 months ago
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆889Updated 3 months ago
- Tencent Hunyuan A13B (Hunyuan-A13B for short), an innovative and open-source LLM built on a fine-grained MoE architecture.☆748Updated 2 months ago
- Code release for "LLMs can see and hear without any training"☆450Updated 4 months ago
- DFloat11: Lossless LLM Compression for Efficient GPU Inference☆536Updated 3 weeks ago
- AlphaGo Moment for Model Architecture Discovery.☆1,072Updated last month
- Self-Adapting Language Models☆785Updated last month
- Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)☆618Updated this week
- MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining☆1,550Updated 3 months ago
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models☆187Updated last week
- Git Based Memory Storage for Conversational AI Agent☆608Updated last week
- Sparse Inferencing for transformer based LLMs☆197Updated last month
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆713Updated 3 months ago
- noise_step: Training in 1.58b With No Gradient Memory☆221Updated 8 months ago
- ☆296Updated last month
- GRadient-INformed MoE☆264Updated 11 months ago
- Docs for GGUF quantization (unofficial)☆258Updated last month
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆322Updated 10 months ago
- Verification of Google DeepMind's AlphaEvolve 48-multiplication matrix algorithm, a breakthrough in matrix multiplication after 56 years.☆120Updated 3 months ago
- ☆797Updated this week
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks☆200Updated 2 months ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆272Updated 3 weeks ago
- Atom of Thoughts for Markov LLM Test-Time Scaling☆586Updated 3 months ago
- ☆196Updated 4 months ago
- It takes a village to raise a child: Google DeepThink 🧠 but in LangGraph and free - an original algorithm for collaborative agents using…☆126Updated 2 weeks ago