lando22 / GPT-3TLinks

Building language models to predict more than one token ahead to enable further ahead predictions.

☆12

Alternatives and similar repositories for GPT-3T

Users that are interested in GPT-3T are comparing it to the libraries listed below

Sorting:

emrgnt-cmplxty / SmolTrainer
☆19Updated last year
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated last year
kyegomez / Kosmos-X
The Next Generation Multi-Modality Superintelligence
☆71Updated 9 months ago
kyegomez / autogpt-tot
Simple Autogpt with tree of thoughts
☆14Updated 2 years ago
kyegomez / Finetuning-Suite
Finetune any model on HF in less than 30 seconds
☆58Updated 2 months ago
soochan-lee / RoT
Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…
☆43Updated last year
donaldafeith / Pytorch_Merge
Merge LLM that are split in to parts
☆25Updated last year
fblgit / tree-of-knowledge
ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu…
☆54Updated last year
toufunao / SCM4LLMs
☆33Updated 2 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆123Updated last year
Alignment-Lab-AI / AutoMaticAssistant
☆24Updated last year
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆66Updated last year
zarakiquemparte / zaraki-tools
☆27Updated last year
g588928812 / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆11Updated last year
Alignment-Lab-AI / Our-Projects
A repository of projects and datasets under active development by Alignment Lab AI
☆22Updated last year
jina-ai / jerboa
LLM finetuning
☆42Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 6 months ago
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆103Updated 2 weeks ago
mobarski / alpaca-libre
Reimplementation of the task generation part from the Alpaca paper
☆119Updated 2 years ago
qrdlgit / graph-of-thoughts
Based on the tree of thoughts paper
☆48Updated last year
jquesnelle / literAI
Generate visual podcasts about novels using open source models
☆25Updated 2 years ago
cognitivecomputations / SystemChat
☆30Updated 11 months ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated last year
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12Updated 8 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26Updated 7 months ago
venuv / LangSynth
Conduct consumer interviews with synthetic focus groups using LLMs and LangChain
☆43Updated last year
SkunkworksAI / CodeFusion
☆15Updated last year
OpenAccess-AI-Collective / ggml-webui
Deploy your GGML models to HuggingFace Spaces with Docker and gradio
☆35Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
☆73Updated last year
SLAM-group / newhope
☆22Updated last year