IBM / analog-foundation-models
Code for paper "Analog Foundation Models"
☆30 Updated 4 months ago
Alternatives and similar repositories for analog-foundation-models
Users who are interested in analog-foundation-models are comparing it to the repositories listed below
- Train, tune, and infer Bamba model ☆137 Updated 8 months ago
- ☆29 Updated 3 months ago
- ☆19 Updated 11 months ago
- ☆52 Updated last year
- AI-Driven Research Systems (ADRS) ☆119 Updated last month
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆128 Updated 4 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 Updated 3 months ago
- Work in progress. ☆79 Updated 2 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models ☆46 Updated 6 months ago
- Repo hosting code and materials related to speeding up LLM inference using token merging. ☆37 Updated 4 months ago
- ☆119 Updated last month
- ☆97 Updated 2 weeks ago
- ☆39 Updated 6 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆261 Updated this week
- All information and news with respect to Falcon-H1 series ☆108 Updated 4 months ago
- Code for Bolmo: Byteifying the Next Generation of Language Models ☆117 Updated last month
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best… ☆59 Updated 10 months ago
- ☆46 Updated 8 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆30 Updated last year
- ☆21 Updated 6 months ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization ☆39 Updated this week
- Fork of Flame repo for training of some new stuff in development ☆19 Updated last month
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆102 Updated 5 months ago
- Efficient non-uniform quantization with GPTQ for GGUF ☆58 Updated 4 months ago
- Repository for creating traveling waves that integrate spatial information through time ☆56 Updated 6 months ago
- Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents. ☆117 Updated last week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆90 Updated 10 months ago
- ☆20 Updated 6 months ago
- Official Repository for Task-Circuit Quantization ☆24 Updated 8 months ago
- Training Proactive and Personalized LLM Agents ☆100 Updated 3 weeks ago