osanseviero / ml_timeline

☆588

Alternatives and similar repositories for ml_timeline

Users who are interested in ml_timeline are comparing it to the libraries listed below.

abacaj / awesome-transformers
A curated list of awesome transformer models.
☆669Updated 2 years ago
HazyResearch / ama_prompting
Ask Me Anything language model prompting
☆546Updated 2 years ago
booydar / recurrent-memory-transformer
[NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.
☆773Updated last year
NVlabs / prismer
The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
☆1,305Updated last year
abertsch72 / unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
☆1,063Updated last year
salesforce / xgen
Salesforce open-source LLMs with 8k sequence length.
☆722Updated 9 months ago
huggingface / large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
☆488Updated 2 years ago
zeno-ml / zeno-build
Build, evaluate, understand, and fix LLM-based apps
☆491Updated last year
teknium1 / GPTeacher
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,631Updated 2 years ago
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆823Updated 2 years ago
danielgross / LlamaAcademy
A school for camelids
☆1,208Updated 2 years ago
PiotrNawrot / nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,013Updated last year
HazyResearch / manifest
Prompt programming with FMs.
☆444Updated last year
facebookresearch / belebele
Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.
☆336Updated 11 months ago
conceptofmind / PaLM
An open-source implementation of Google's PaLM models
☆818Updated last year
stanford-crfm / ecosystem-graphs
☆270Updated 9 months ago
yxuansu / OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
☆301Updated 2 years ago
CarperAI / cheese
Used for adaptive human in the loop evaluation of language and embedding models.
☆307Updated 2 years ago
keerthanpg / talktopapers
☆212Updated 2 years ago
HazyResearch / meerkat
Explore and understand your training and validation data.
☆848Updated 10 months ago
HazyResearch / evaporate
This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Da…
☆492Updated last year
huggingface / community-events
Place where folks can contribute to 🤗 community events
☆426Updated last year
SkunkworksAI / hydra-moe
☆415Updated 2 years ago
srush / MiniChain
A tiny library for coding with large language models.
☆1,236Updated last year
madaan / memprompt
A method to fix GPT-3 after deployment with user feedback, without re-training.
☆330Updated 2 years ago
lucidrains / PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
☆824Updated 3 years ago
rmihaylov / falcontune
Tune any FALCON in 4-bit
☆464Updated 2 years ago
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆714Updated 2 years ago
sanjeevanahilan / nanoChatGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
☆293Updated last year
VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆506Updated 2 years ago