Building a Large Language Model (LLM) from scratch.
☆35Feb 4, 2025Updated last year
Alternatives and similar repositories for llm-scratch
Users that are interested in llm-scratch are comparing it to the libraries listed below.
- Colored Kimia Path24 Dataset: Configurations and Benchmarks with Deep Embeddings☆10Jun 6, 2024Updated last year
- Simple and beginner friendly C++ projects you can clone☆20Jun 6, 2025Updated 10 months ago
- Compare Naive Bayes, SVM, XGBoost, Bagging, AdaBoost, K-Nearest Neighbors, Random Forests for classification of Malaria Cells☆11Jun 5, 2019Updated 6 years ago
- Information about the CodedotAI reading group sessions.☆12Aug 16, 2021Updated 4 years ago
- Resturant-Recommendation-Multi-Modal RAG-using-Gemini☆13Dec 28, 2023Updated 2 years ago
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- A curated list of papers & resources linked to concept learning☆12Aug 9, 2023Updated 2 years ago
- Example of how to implement AG-UI protocol together with CrewAI agents.☆46Dec 15, 2025Updated 4 months ago
- ☆12Jan 21, 2019Updated 7 years ago
- C++ Programs Collection for Beginners☆23Oct 22, 2023Updated 2 years ago
- Developed a fast implementation of autoregressive models with fitting, ACF/PACF tests and forecasting.☆13Sep 8, 2019Updated 6 years ago
- Materials for the Ultimate Hybrid Search Workshop☆45Dec 13, 2024Updated last year
- A Python-based tool that monitors dark web sources for mentions of specific organizations for Threat Monitoring.☆25Apr 7, 2025Updated last year
- ☆65Mar 9, 2026Updated last month
- ☆145Oct 5, 2025Updated 6 months ago
- ☆17Feb 24, 2026Updated last month
- ☆15Jul 6, 2022Updated 3 years ago
- ☆13Jul 10, 2020Updated 5 years ago
- Build a Recommendation System Agent using LATS Agent Approach☆33Feb 26, 2025Updated last year
- LAReQA is a challenging benchmark for evaluating language agnostic answer retrieval from a multilingual candidate pool. This repository c…☆14May 19, 2020Updated 5 years ago
- Aussie AI Base C++ Library is the source code repo for the book Generative AI in C++, along with various other AI/ML kernels.☆21Aug 30, 2024Updated last year
- The Journey of RAG: From Notebook to Microservices☆27Feb 22, 2024Updated 2 years ago
- ICLR 2026: Agent-X Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks☆39Apr 5, 2026Updated last week
- ☆10Jan 31, 2022Updated 4 years ago
- A Python program that performs text summarization using a pronoun replacement method. This method initially identifies pronouns i…☆10Dec 5, 2018Updated 7 years ago
- T2NER: Transformers based Transfer Learning Framework for Named Entity Recognition (EACL 2021)☆11Sep 24, 2022Updated 3 years ago
- ☆12Apr 6, 2023Updated 3 years ago
- Materials for the Neural Network tutorial at PyData NYC 2019☆15Feb 15, 2023Updated 3 years ago
- A Powerful C++ Library for High-Performance Numerical Computing☆18Oct 5, 2024Updated last year
- Collections of Actions for Custom GPTs (some created by Captain Action)☆11Jan 7, 2024Updated 2 years ago
- ☆10Jul 26, 2025Updated 8 months ago
- This project provides spell-check help for Urdu-to-Hindi transliteration. The spelling errors in our case mostly comprise errors…☆10Aug 18, 2019Updated 6 years ago
- ☆15May 28, 2020Updated 5 years ago
- Mixture of Experts from scratch☆13Apr 12, 2024Updated 2 years ago
- [WACV 2024] TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding☆13May 30, 2024Updated last year
- This is a [forked version] for author's debugging. Please jump to https://github.com/QualityAssessment/DOVER for stable version to use.☆14Oct 29, 2023Updated 2 years ago
- Multilingual Neural Machine Translation using Transformers with Conditional Normalization.☆18Mar 24, 2023Updated 3 years ago
- Learning Imbalanced Datasets With Maximum Margin Loss☆12Jun 17, 2023Updated 2 years ago
- [ICIP2023] Code for the paper 'Action Anticipation with Goal Consistency'☆12Apr 5, 2024Updated 2 years ago