catid / oaillama3

Simple setup to self-host LLaMA3-70B model with an OpenAI API

☆19

Alternatives and similar repositories for oaillama3

Users who are interested in oaillama3 are comparing it to the libraries listed below.

cognitivecomputations / kraken
☆66 Updated last year
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31 Updated last year
Alignment-Lab-AI / AutoMaticAssistant
☆24 Updated last year
serp-ai / unsloth
5X faster 60% less memory QLoRA finetuning
☆21 Updated last year
teknium1 / ShareGPT-Builder
☆115 Updated 6 months ago
yoheinakajima / babyagi_og
The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)
☆21 Updated 8 months ago
teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30 Updated 2 years ago
stanford-futuredata / Megatron-LM
Ongoing research training transformer models at scale
☆38 Updated last year
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆45 Updated last year
ahmed-moubtahij / TokenHealer
☆22 Updated last year
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20 Updated 2 months ago
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58 Updated last year
PuchToTalk / DOOM-MistralAI
Mistral7B playing DOOM
☆28 Updated last year
impel-intelligence / dippy-bittensor-subnet
☆51 Updated last week
Alignment-Lab-AI / KnowledgeBase
never forget anything again! combine AI and intelligent tooling for a local knowledge base to track, catalogue, annotate, and plan for you…
☆37 Updated last year
catid / lllm
Latent Large Language Models
☆18 Updated 10 months ago
multimodalart / grog
Gradio UI for a Cog API
☆68 Updated last year
SonicCodes / subcloning
implementation of https://arxiv.org/pdf/2312.09299
☆21 Updated 11 months ago
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchical branching pathways of LLM generations.
☆70 Updated 4 months ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 Updated last year
smolorg / smoltropix
MLX port for xjdr's entropix sampler (mimics jax implementation)
☆64 Updated 7 months ago
zarakiquemparte / zaraki-tools
☆27 Updated last year
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32 Updated 3 months ago
Contextualist / lone-arena
Self-hosted LLM chatbot arena, with yourself as the only judge
☆41 Updated last year
bjj / exllamav2-openai-server
An OpenAI API compatible LLM inference server based on ExLlamaV2.
☆25 Updated last year
bdambrosio / AllTheWorldAPlay
All the world is a play, we are but actors in it.
☆50 Updated this week
lucyknada / detective-needle-llm
☆12 Updated 9 months ago
Maximilian-Winter / llama_cpp_function_calling
☆31 Updated last year
diicellman / dynamite-dogs
BH hackathon
☆14 Updated last year
teknium1 / transformers-gptq-quant
☆47 Updated last year