shivendrra / SmallLanguageModelLinks
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
☆146Updated last year
Alternatives and similar repositories for SmallLanguageModel
Users that are interested in SmallLanguageModel are comparing it to the libraries listed below
Sorting:
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆120Updated last year
- Solving data for LLMs - Create quality synthetic datasets!☆149Updated 5 months ago
- ☆74Updated 9 months ago
- ☆87Updated last year
- ☆89Updated last year
- Repository for fine-tuning gemma models using unsloth for indic languages☆94Updated last year
- ☆86Updated last year
- Various installation guides for Large Language Models☆70Updated 2 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 11 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 7 months ago
- ☆80Updated last year
- rl from zero pretrain, can it be done? we'll see.☆56Updated this week
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆80Updated last month
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆81Updated last year
- ☆101Updated 9 months ago
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆167Updated last year
- Chat with PDF using Zephyr 7B Alpha, Langchain, ChromaDB, and Gradio with Free Google Colab☆136Updated last year
- Examples of RAG using Llamaindex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆128Updated last year
- ☆115Updated 6 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- Efficient vector database for hundred millions of embeddings.☆206Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆197Updated last year
- This project enhances the construction of RAG applications by addressing challenges, improving accessibility, scalability, and managing d…☆145Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- Simple examples using Argilla tools to build AI☆53Updated 7 months ago
- ☆86Updated 9 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 7 months ago