IBM / analog-foundation-modelsLinks
Code for paper "Analog Foundation Models"
☆27Updated last month
Alternatives and similar repositories for analog-foundation-models
Users that are interested in analog-foundation-models are comparing it to the libraries listed below
Sorting:
- ☆27Updated 4 months ago
- ☆51Updated last year
- ☆19Updated 7 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models☆40Updated 2 months ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Updated 10 months ago
- Work in progress.☆74Updated 3 months ago
- Fork of Flame repo for training of some new stuff in development☆18Updated last week
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆37Updated 2 weeks ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆101Updated last week
- ☆22Updated 2 months ago
- ☆36Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 8 months ago
- Official implementation of "Reasoning Path Compression: Compressing Generation Trajectories for Efficient LLM Reasoning"☆22Updated 4 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last week
- Train, tune, and infer Bamba model☆134Updated 4 months ago
- A repository for research on medium sized language models.☆78Updated last year
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆130Updated 10 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆77Updated 7 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.☆132Updated this week
- Lego for GRPO☆30Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆96Updated last week
- Samples of good AI generated CUDA kernels☆91Updated 4 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆29Updated 10 months ago
- Official Repository for Task-Circuit Quantization☆24Updated 4 months ago
- All information and news with respect to Falcon-H1 series☆91Updated last week
- GPTQ and efficient search for GGUF☆51Updated last month
- Repository to create traveling waves integrate special information through time☆55Updated 2 months ago
- look how they massacred my boy☆63Updated last year
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆59Updated 11 months ago
- MPI Code Generation through Domain-Specific Language Models☆14Updated 11 months ago