attentionmech / trunkLinks
LLM trunk in 2d
☆10Updated last month
Alternatives and similar repositories for trunk
Users that are interested in trunk are comparing it to the libraries listed below
Sorting:
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- Open source version of Anthropic's Clio: A system for privacy-preserving insights into real-world AI use☆15Updated 3 weeks ago
- OmegaViT (ΩViT) is a cutting-edge vision transformer architecture that combines multi-query attention, rotary embeddings, state space mod…☆14Updated last week
- Lego for GRPO☆28Updated last week
- aesthetic tensor visualiser☆22Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 3 months ago
- Modified Beam Search with periodical restart☆12Updated 8 months ago
- ☆28Updated 9 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 3 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆31Updated last month
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- Train your own SOTA deductive reasoning model☆93Updated 3 months ago
- We study toy models of skill learning.☆28Updated 4 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ☆48Updated 4 months ago
- How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆35Updated last month
- ☆33Updated 3 months ago
- MatFormer repo☆26Updated 5 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆49Updated 3 months ago
- ☆31Updated last year
- BH hackathon☆14Updated last year
- alternative way to calculating self attention☆18Updated last year
- Very minimal (and stateless) agent framework☆44Updated 4 months ago
- Clue inspired puzzles for testing LLM deduction abilities☆37Updated 2 months ago
- ☆33Updated 5 months ago
- ☆49Updated 7 months ago
- Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.☆133Updated this week
- ☆59Updated 2 weeks ago
- An introduction to LLM Sampling☆78Updated 5 months ago