cityzen95 / LLM_from_scratchLinks
Building LLMs from scratch following the book from S. Raschka
☆31Updated 4 months ago
Alternatives and similar repositories for LLM_from_scratch
Users that are interested in LLM_from_scratch are comparing it to the libraries listed below
Sorting:
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 5 months ago
- This is a repository for the course "From Beginner to LLM Developer" by Towards AI.☆11Updated 7 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆82Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆122Updated 6 months ago
- Fine tune Gemma 3 on an object detection task☆74Updated 3 weeks ago
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 9 months ago
- Notebooks for fine tuning pali gemma☆112Updated 3 months ago
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆31Updated last year
- A minimal yet unstoppable blueprint for multi-agent AI—anchored by the rare, far-reaching “Multi-Agent AI DAO” (2017 Prior Art)—empowerin…☆27Updated 6 months ago
- ☆59Updated last year
- Repository containing awesome resources regarding Hugging Face tooling.☆47Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- A collection of hand on notebook for LLMs practitioner☆49Updated 6 months ago
- ☆46Updated 4 months ago
- Starter template for your ML/AI projects (uv package manager, RestAPI with FastAPI and Dockerfile support)☆27Updated 6 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated 11 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Collection of autoregressive model implementation☆86Updated 3 months ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Updated last year
- Generic MCP Client to use any MCP tool in a chat☆44Updated 2 months ago
- Benchmarks for Business Document Foundation Models☆10Updated last year
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆101Updated 7 months ago
- ☆123Updated 2 weeks ago
- Notebooks to demonstrate TimmWrapper☆16Updated 6 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 3 weeks ago
- ☆17Updated last year
- ☆27Updated 3 weeks ago
- ☆86Updated 10 months ago
- code for training and using chess embeddings models☆12Updated last year