joey00072 / Attention-as-graph

Alternative way of calculating self-attention

☆18

Alternatives and similar repositories for Attention-as-graph

Users that are interested in Attention-as-graph are comparing it to the libraries listed below

xjdr-alt / muzero_sketch
☆38 Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55 Updated 5 months ago
brendanhogan / completion_tree_view
☆13 Updated 3 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated 9 months ago
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32 Updated 5 months ago
SebastianBodza / EnsembleForecasting
Using multiple LLMs for ensemble Forecasting
☆16 Updated last year
Alignment-Lab-AI / KnowledgeBase
Never forget anything again! Combine AI and intelligent tooling for a local knowledge base to track, catalogue, annotate, and plan for you…
☆37 Updated last year
devadigapratham / CoDSPy
An intelligent code optimization system leveraging AI analysis, automated refactoring, and test generation. Built with DSPy and Gradio, i…
☆20 Updated 6 months ago
diicellman / dynamite-dogs
BH hackathon
☆14 Updated last year
catid / lllm
Latent Large Language Models
☆18 Updated 11 months ago
meetdavidwan / clamr
CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval
☆17 Updated last month
AtakanTekparmak / agento
Very minimal (and stateless) agent framework
☆44 Updated 6 months ago
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20 Updated 3 months ago
doomslide / autoloom
Approximating the joint distribution of language models via MCTS
☆21 Updated 9 months ago
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆21 Updated 2 months ago
andrew-silva / mlx-rlhf
An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.
☆32 Updated last year
KempnerInstitute / traveling-waves-integrate
Repository to create traveling waves that integrate spatial information through time
☆53 Updated 4 months ago
raphaelmansuy / iteration_of_tought
Example implementation of Iteration of Thought - give it a star if you like the project
☆42 Updated 7 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆28 Updated 2 months ago
lumpenspace / FRAG
Flexible, efficient, and context-aware generation from large unstructured knowledge sources.
☆17 Updated last year
knowrohit / know_medical_dialogues
KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…
☆24 Updated last year
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)
☆91 Updated 6 months ago
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66 Updated 10 months ago
axolotl-ai-cloud / axolotl-cookbook
☆34 Updated 4 months ago
leloykun / modded-nanogpt
NanoGPT (124M) quality in 2.67B tokens
☆28 Updated last month
lalalune / gptcoder
RAG Agent for the ARC AGI Challenge
☆21 Updated last year
Narsil / hf-chat
☆26 Updated 7 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103 Updated 4 months ago
yoheinakajima / babyagi_og
The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)
☆21 Updated 10 months ago
brendanhogan / picoDeepResearch
☆64 Updated 2 months ago