BrunoGeorgevich / llama3.cp

Adapted version of llama3.np (NumPy) to a CuPy implementation for the Llama 3 model.

☆36

Related projects ⓘ

Alternatives and complementary repositories for llama3.cp

monk1337 / auto-ollama
run ollama & gguf easily with a single command
☆47Updated 5 months ago
thooton / muse
Let's create synthetic textbooks together :)
☆70Updated 9 months ago
abgulati / hf-waitress
Serving LLMs in the HF-Transformers format via a PyFlask API
☆68Updated 2 months ago
mzbac / mlx-moe
Scripts to create your own moe models using mlx
☆86Updated 8 months ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆105Updated 2 weeks ago
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers model
☆162Updated 6 months ago
cognitivecomputations / kraken
☆64Updated 5 months ago
kerekovskik / autologic
autologic is a Python package that implements the SELF-DISCOVER framework proposed in the paper SELF-DISCOVER: Large Language Models Self…
☆57Updated 8 months ago
TheBlokeAI / AIScripts
Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub
☆155Updated last year
tdrussell / qlora-pipe
A pipeline parallel training script for LLMs.
☆83Updated this week
chigkim / Ollama-MMLU-Pro
☆64Updated last month
multiplexerai / Complex-to-Simple-RAG
☆39Updated 8 months ago
statchamber / ebook-to-chatml-conversion
idea: https://github.com/nyxkrage/ebook-groupchat/
☆81Updated 2 months ago
leafspark / AutoGGUF
automatically quant GGUF models
☆137Updated this week
Xalp / ECHO
Official homepage for "Self-Harmonized Chain of Thought"
☆83Updated last month
nath1295 / LLMFlex
A python package for developing AI applications with local LLMs.
☆141Updated 4 months ago
nath1295 / MLX-Textgen
A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.
☆53Updated this week
ComposioHQ / Composio-Function-Calling-Benchmark
Function Calling Benchmark & Testing
☆74Updated 4 months ago
multiplexerai / mplx_rag
Complex RAG backend
☆28Updated 7 months ago
LostRuins / datasetexplorer
Easily view and modify JSON datasets for large language models
☆62Updated last month
cognitivecomputations / spectrum
☆92Updated last month
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆172Updated 3 months ago
austinsilveria / tricksy
Fast approximate inference on a single GPU with sparsity aware offloading
☆38Updated 10 months ago
cognitivecomputations / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆231Updated 5 months ago
severian42 / Vodalus-Expert-LLM-Forge
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …
☆160Updated 3 months ago
mounta11n / plusplus-camall
After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…
☆56Updated 2 months ago
cognitivecomputations / grokadamw
☆116Updated 2 months ago
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆54Updated 2 months ago
4dh / GRDN
GRDN.AI app for garden optimization
☆69Updated 9 months ago
lucyknada / detective-needle-llm
☆12Updated last month
cognitivecomputations / OpenChatML
☆148Updated 3 months ago