cccntu / LoRAnanoGPTLinks

The simplest, fastest repository for training/finetuning medium-sized GPTs.

☆19

Alternatives and similar repositories for LoRAnanoGPT

Users that are interested in LoRAnanoGPT are comparing it to the libraries listed below

Sorting:

CERC-AAI / Robin
☆63Updated 8 months ago
LLM360 / k2-data-prep
☆20Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Updated last year
SonicCodes / subcloning
implementation of https://arxiv.org/pdf/2312.09299
☆20Updated 11 months ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
katzurik / Knowledge_Navigator
☆21Updated 3 months ago
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆56Updated 9 months ago
xaedes / llama.cpp
Port of Facebook's LLaMA model in C/C++
☆21Updated last year
facebookresearch / dual-system-for-visual-language-reasoning
Github repo for Peifeng's internship project
☆13Updated last year
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13Updated last year
Hellisotherpeople / llm_steer-oobabooga
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆43Updated last year
zarakiquemparte / zaraki-tools
☆27Updated last year
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆29Updated last week
Zyphra / Zyda_processing
☆34Updated 11 months ago
donaldafeith / Pytorch_Merge
Merge LLM that are split in to parts
☆25Updated last year
euclaise / supertrainer2000
☆49Updated last year
facebookresearch / PostText
PostText is a QA system for querying your text data. When appropriate structured views are in place, PostText is good at answering querie…
☆32Updated last year
huggingface / pixparse
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆21Updated 10 months ago
facebookresearch / DIG-In
This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.
☆20Updated last year
ConiferLabsWA / flan-ul2-alpaca
☆32Updated 2 years ago
iantbutler01 / ditty
A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.
☆16Updated 7 months ago
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆19Updated 5 months ago
kyegomez / FastFF
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16Updated 6 months ago
ibm-granite / granite-embedding-models
☆28Updated 4 months ago
LLM360 / crystalcoder-data-prep
Data preparation code for CrystalCoder 7B LLM
☆44Updated last year
eth-easl / fmengine
Utilities for Training Very Large Models
☆58Updated 8 months ago
sekstini / basedxl
☆17Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated last year
facebookresearch / MultiModalExplorer
Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…
☆27Updated last year
Zyphra / zcookbook
Training hybrid models for dummies.
☆21Updated 4 months ago