ThinamXx / build-GPT
Building GPT ...
β17Updated 5 months ago
Alternatives and similar repositories for build-GPT:
Users that are interested in build-GPT are comparing it to the libraries listed below
- Complete implementation of Llama2 with/without KV cache & inference πβ47Updated 11 months ago
- Fine-tune an LLM to perform batch inference and online serving.β110Updated this week
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systemsβ104Updated 3 months ago
- Making of cuda kernelβ15Updated 2 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 9 months ago
- Just some stuff for Interview questions, books, annotated paper, notes, cheat sheets etc etc related to ML,AI, Deep Learning and Data Scβ¦β114Updated last week
- A set of scripts and notebooks on LLM finetunning and dataset creationβ107Updated 7 months ago
- A repository containing general tutorials I'd like to share with the world.β43Updated 2 weeks ago
- zero-to-lightningβ30Updated last year
- β45Updated last month
- Deep Learning for Computer Visionβ54Updated 10 months ago
- Direct Preference Optimization Implementationβ16Updated last year
- Notebooks for fine tuning pali gemmaβ100Updated 3 weeks ago
- Prune transformer layersβ69Updated 11 months ago
- A repository consisting of paper/architecture replications of classic/SOTA AI/ML papers in pytorchβ189Updated last week
- A collection of hand on notebook for LLMs practitionerβ47Updated 3 months ago
- β80Updated 2 weeks ago
- This playlab encompasses a multitude of projects crafted through the utilization of Large Language Models, showcasing the versatility andβ¦β118Updated 2 weeks ago
- β43Updated this week
- RL significantly the reasoning capability of Qwen2.5-1.5B-Instructβ28Updated 2 months ago
- GPU Kernelsβ172Updated last week
- Repo for ML Models built from scratch such as Self-Attention, Linear +Logistic Regression, PCA, LDA. CNN, LSTM, Neural Networks using Nuβ¦β47Updated 3 months ago
- Collection of resources for RL and Reasoningβ25Updated 3 months ago
- Fine-Tuning Llama3-8B LLM in a multi-GPU environment using DeepSpeedβ17Updated 11 months ago
- β91Updated last month
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ102Updated last month
- Quantization of LLMs and benchmarking.β10Updated last year
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.β124Updated last year
- β87Updated last month
- β143Updated 9 months ago