Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44Sep 6, 2023Updated 2 years ago
Alternatives and similar repositories for just-large-models
Users that are interested in just-large-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Sep 26, 2023Updated 2 years ago
- Port of Facebook's LLaMA model in C/C++☆16Jul 3, 2023Updated 2 years ago
- Reversal Curse Experiment☆15Sep 24, 2023Updated 2 years ago
- ☆18Dec 1, 2023Updated 2 years ago
- Let's make sand talk☆590Oct 17, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆19Sep 26, 2023Updated 2 years ago
- Sample and play YouTube☆16May 4, 2025Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Jan 7, 2024Updated 2 years ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆31May 23, 2024Updated 2 years ago
- ☆10Aug 14, 2023Updated 2 years ago
- ☆11Aug 26, 2024Updated last year
- Simplex Random Feature attention, in PyTorch☆76Oct 10, 2023Updated 2 years ago
- A tool for turning boring log messages into fun interactions with anime characters.☆47Mar 13, 2023Updated 3 years ago
- learn from your favorite tech companies☆164Feb 9, 2026Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Port of OpenAI's Whisper model in C/C++☆10Jul 12, 2023Updated 2 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆83Sep 10, 2023Updated 2 years ago
- My solutions for Advanced Python Mastery (course by @dabeaz)☆11Jan 29, 2024Updated 2 years ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆38May 14, 2024Updated 2 years ago
- Stampy's copy of Alignment Research Dataset scraper☆24May 30, 2026Updated 2 weeks ago
- A Python program that simulates a satellite network using pygame, allowing users to create, configure, and visualize the network state ov…☆11Apr 25, 2023Updated 3 years ago
- AniPortrait with Gradio: Audio-Driven Synthesis of Photorealistic Portrait Animation☆22Mar 31, 2024Updated 2 years ago
- OpenC3 COSMOS Project Configuration Structure☆22Jun 1, 2026Updated 2 weeks ago
- ☆34Sep 10, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆19Feb 27, 2023Updated 3 years ago
- ☆27Jul 9, 2024Updated last year
- Stream of my favorite papers and links☆44Apr 19, 2026Updated 2 months ago
- ☆45Jun 2, 2023Updated 3 years ago
- ☆14Jan 7, 2024Updated 2 years ago
- Multilayer Authenticity Identifier (MAI), a CNN model that attempts to identify synthetic AI images.☆33Feb 15, 2025Updated last year
- ☆21Oct 6, 2023Updated 2 years ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆33Jun 20, 2023Updated 2 years ago
- papers.day☆93Dec 15, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 2 years ago
- 👋 Simple examples of enabling Multipath TCP with different programming languages☆23Apr 1, 2025Updated last year
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- Tiny inference-only implementation of LLaMA☆91Apr 3, 2024Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data☆508Oct 19, 2023Updated 2 years ago
- Poor man's Solidity REPL☆61Sep 20, 2022Updated 3 years ago
- Portable and lightweight brain segmentation in the terminal!☆20Apr 28, 2026Updated last month