shivendrra / SmallLanguageModelLinks

a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model

☆148

Alternatives and similar repositories for SmallLanguageModel

Users that are interested in SmallLanguageModel are comparing it to the libraries listed below

Sorting:

BhabhaAI / dataformer
Solving data for LLMs - Create quality synthetic datasets!
☆150Updated 6 months ago
Vaibhavs10 / gpu-poor-llm-notebooks
☆75Updated 10 months ago
adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…
☆82Updated last year
AK391 / dailypapersHN
☆86Updated 10 months ago
ai8hyf / OpenResearchAssistant
An automated tool for discovering insights from research papaer corpora
☆138Updated last year
AlexBodner / How_Much_VRAM
☆102Updated 11 months ago
aigeek0x0 / rag-with-langchain-colbert-and-ragatouille
Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB
☆122Updated last year
FareedKhan-dev / Building-llama3-from-scratch
LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.
☆175Updated 11 months ago
githubpradeep / notebooks
☆54Updated 6 months ago
togethercomputer / finetuning
Finetune Llama-3-8b on the MathInstruct dataset
☆111Updated 9 months ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 8 months ago
teknium1 / ShareGPT-Builder
☆116Updated 7 months ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆83Updated 3 months ago
geronimi73 / qlora-minimal
☆86Updated last year
AymenKallala / RAG_Maestro
Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.
☆168Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232Updated 9 months ago
vithursant / nanoGPT_mlx
Port of Andrej Karpathy's nanoGPT to Apple MLX framework.
☆111Updated last year
AtakanTekparmak / tiny_fnc_engine
tiny_fnc_engine is a minimal python library that provides a flexible engine for calling functions extracted from a LLM.
☆38Updated 10 months ago
Vaibhavs10 / notebooks
☆127Updated 4 months ago
brendanhogan / picoDeepResearch
☆65Updated 2 months ago
Pints-AI / 1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
☆320Updated 4 months ago
smolorg / smolvecstore
a tiny vectorstore implementation built with numpy.
☆62Updated last year
TrelisResearch / one-click-llms
One click templates for inferencing Language Models
☆203Updated this week
yoheinakajima / autofinetune
auto fine tune of models with synthetic data
☆76Updated last year
TrelisResearch / install-guides
Various installation guides for Large Language Models
☆72Updated 3 months ago
QuixiAI / grokadamw
☆134Updated 11 months ago
QuixiAI / OpenChatML
☆157Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
ComposioHQ / Composio-Function-Calling-Benchmark
Function Calling Benchmark & Testing
☆88Updated last year
cohere-ai / DiskVectorIndex
☆211Updated last month