yacineMTB / just-large-models
Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44Sep 6, 2023Updated 2 years ago
Alternatives and similar repositories for just-large-models
Users who are interested in just-large-models are comparing it to the libraries listed below.
- Stampy's copy of Alignment Research Dataset scraper☆13Dec 26, 2025Updated last month
- OpenAI's human-eval sampling benchmark☆13Jan 29, 2024Updated 2 years ago
- Reversal Curse Experiment☆15Sep 24, 2023Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Jan 7, 2024Updated 2 years ago
- Port of Facebook's LLaMA model in C/C++☆16Jul 3, 2023Updated 2 years ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37May 14, 2024Updated last year
- Let's make sand talk☆588Oct 17, 2023Updated 2 years ago
- Simplex Random Feature attention, in PyTorch☆75Oct 10, 2023Updated 2 years ago
- This repository stores the source code for the Mistral Hackathon 2024 in Paris☆16Aug 23, 2024Updated last year
- learn from your favorite tech companies☆165Updated this week
- Comprehensive analysis of the difference in performance between QLoRA, LoRA, and full fine-tunes.☆83Sep 10, 2023Updated 2 years ago
- Generative cellular automaton-like learning environments for RL.☆20Jan 30, 2025Updated last year
- Stream of my favorite papers and links☆44Updated this week
- RAG Agent for the ARC AGI Challenge☆20Jul 1, 2024Updated last year
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆201Sep 24, 2023Updated 2 years ago
- papers.day☆93Dec 15, 2023Updated 2 years ago
- ☆57Apr 14, 2023Updated 2 years ago
- Code Interpreter Replica☆26Jul 14, 2023Updated 2 years ago
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆27Apr 21, 2023Updated 2 years ago
- ☆27Jul 9, 2024Updated last year
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames☆23Apr 7, 2024Updated last year
- Synthetic data derived by templating, few-shot prompting, transformations on public domain corpora, and Monte Carlo tree search.☆32Oct 8, 2025Updated 4 months ago
- LMQL implementation of tree of thoughts☆36Jan 31, 2024Updated 2 years ago
- ☆17Sep 1, 2024Updated last year
- ☆11Jun 22, 2016Updated 9 years ago
- Generate textbook-quality synthetic LLM pretraining data☆509Oct 19, 2023Updated 2 years ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Oct 31, 2024Updated last year
- ☆32Jun 6, 2024Updated last year
- ☆13Mar 1, 2023Updated 2 years ago
- column generation implementation based on google or-tools for cutting stock problem☆14Aug 19, 2025Updated 5 months ago
- ☆13Apr 27, 2021Updated 4 years ago
- Angular library for integrating Interswitch payments easily☆11Jul 30, 2021Updated 4 years ago
- ☆14Updated this week
- This repository contains the registries for components, agents and services, the second part of the autonolas-v1 protocol.☆15Updated this week
- EOSIO-Taurus - The Most Powerful Infrastructure for Decentralized Applications☆13Mar 29, 2024Updated last year
- ☆45Jun 2, 2023Updated 2 years ago
- ☆40Mar 25, 2023Updated 2 years ago
- High Quality Resources on GPU Programming/Architecture☆592Jul 26, 2024Updated last year
- My name is Ozymandias, King of Kings; Look on my Works, ye Mighty, and despair!☆40Aug 26, 2023Updated 2 years ago