shivendrra / SmallLanguageModelLinks
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
☆153Updated last year
Alternatives and similar repositories for SmallLanguageModel
Users that are interested in SmallLanguageModel are comparing it to the libraries listed below
Sorting:
- Solving data for LLMs - Create quality synthetic datasets!☆151Updated 7 months ago
- ☆54Updated this week
- ☆75Updated 11 months ago
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆122Updated last year
- ☆86Updated 11 months ago
- ☆102Updated 11 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆83Updated last year
- Finetune Llama-3-8b on the MathInstruct dataset☆111Updated 10 months ago
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆168Updated last year
- ☆127Updated 5 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 9 months ago
- Efficient vector database for hundred millions of embeddings.☆207Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated last month
- A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)☆126Updated 11 months ago
- A lightweight evaluation suite tailored specifically for assessing Indic LLMs across a diverse range of tasks☆37Updated last year
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆129Updated last year
- This is an open-source version of OpenAI's O1 Model Series by Siraj Raval & O1-Preview☆96Updated 10 months ago
- Data extraction with LLM on CPU☆112Updated last year
- ☆88Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated last week
- ☆210Updated 2 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- Own your AI, search the web with it🌐😎☆89Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- a tiny vectorstore implementation built with numpy.☆63Updated last year
- A compact LLM pretrained in 9 days by using high quality data☆322Updated 4 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆63Updated 9 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆326Updated 2 months ago
- Various installation guides for Large Language Models☆72Updated 4 months ago