shivendrra / SmallLanguageModel
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
☆133Updated 7 months ago
Alternatives and similar repositories for SmallLanguageModel:
Users that are interested in SmallLanguageModel are comparing it to the libraries listed below
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆118Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆145Updated last month
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆167Updated 6 months ago
- ☆82Updated last year
- Simple examples using Argilla tools to build AI☆53Updated 3 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆80Updated 8 months ago
- Finetune Llama-3-8b on the MathInstruct dataset☆106Updated 4 months ago
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆141Updated 10 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆63Updated 3 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆253Updated 2 months ago
- ☆61Updated 3 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆100Updated last week
- ☆14Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆89Updated 3 weeks ago
- A new benchmark for measuring LLM's capability to detect bugs in large codebase.☆29Updated 8 months ago
- Port of Andrej Karpathy's nanoGPT to Apple MLX framework.☆105Updated last year
- Training and Fine-tuning an llm in Python and PyTorch.☆41Updated last year
- Data extraction with LLM on CPU☆112Updated last year
- ☆87Updated last year
- An automated tool for discovering insights from research papaer corpora☆136Updated 8 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆100Updated 10 months ago
- ☆84Updated 4 months ago
- Claude API Test Project☆87Updated 9 months ago
- Andrej Kapathy's micrograd implemented in c☆29Updated 6 months ago
- ☆105Updated 2 months ago
- ☆111Updated 2 months ago
- ☆45Updated 10 months ago
- Scripts to create your own moe models using mlx☆86Updated 11 months ago
- a tiny vectorstore implementation built with numpy.☆60Updated 9 months ago