An LLM cookbook for building your own model from scratch, all the way from gathering data to training a model
☆168 · Jun 25, 2024 · Updated last year
Alternatives and similar repositories for SmallLanguageModel
Users that are interested in SmallLanguageModel are comparing it to the libraries listed below
- coded with and corrected by Google Anti-Gravity ☆13 · Nov 23, 2025 · Updated 3 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆24 · Nov 25, 2024 · Updated last year
- ☆45 · Oct 13, 2023 · Updated 2 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 · Jan 19, 2024 · Updated 2 years ago
- Non Metric Space (Approximate) Library in R ☆12 · Feb 2, 2023 · Updated 3 years ago
- Seamless Voice Interactions with LLMs ☆12 · Oct 28, 2023 · Updated 2 years ago
- GSoC '17 - R language bindings for TensorFlow ☆13 · Sep 18, 2017 · Updated 8 years ago
- THOUGHTSCULPT, a general reasoning and search method for complex tasks ☆13 · Dec 13, 2024 · Updated last year
- A Pipe-Friendly Image Calculator ☆14 · Mar 3, 2022 · Updated 4 years ago
- Latent Large Language Models ☆19 · Aug 24, 2024 · Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio… ☆85 · May 29, 2024 · Updated last year
- RAG example using DSPy, Gradio, FastAPI ☆92 · Apr 11, 2024 · Updated last year
- a set of scripts to easily convert all training data from huggingface into alpaca instruct or sharegpt format, which should allow for eas… ☆18 · Mar 14, 2025 · Updated 11 months ago
- Solving data for LLMs - Create quality synthetic datasets! ☆151 · Jan 20, 2025 · Updated last year
- Real-world AI engineering dataset creation, SFT fine-tuning, and GRPO alignment ETL pipeline. ☆33 · Aug 27, 2025 · Updated 6 months ago
- Mixtral finetuning ☆19 · Feb 2, 2024 · Updated 2 years ago
- Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources including Web, PDF, and Speech data. Bui… ☆16 · May 17, 2024 · Updated last year
- R package for Byte Pair Encoding based on YouTokenToMe ☆16 · Sep 5, 2025 · Updated 6 months ago
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API. ☆388 · Apr 30, 2024 · Updated last year
- ☆24 · Mar 1, 2025 · Updated last year
- Labs for deep learning course. ☆16 · Jun 21, 2021 · Updated 4 years ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM. ☆507 · Aug 26, 2024 · Updated last year
- ☆21 · Oct 14, 2024 · Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆27 · Mar 6, 2024 · Updated 2 years ago
- R implementation of advanced optimizers for torch ☆26 · Jun 8, 2023 · Updated 2 years ago
- This project allows you to plug in a GitHub repository URL, generate vectors for a LLM and use ChatGPT models to interact. The main frame… ☆19 · Jun 4, 2023 · Updated 2 years ago
- Portfolio REgret for Confidence SEquences ☆21 · Jan 6, 2026 · Updated 2 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆48 · Sep 26, 2024 · Updated last year
- Collection of autoregressive model implementation ☆85 · Feb 23, 2026 · Updated 2 weeks ago
- 5X faster 60% less memory QLoRA finetuning ☆21 · May 28, 2024 · Updated last year
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e… ☆21 · May 2, 2024 · Updated last year
- Image Binarization for improving OCR and HTR ☆23 · Aug 18, 2022 · Updated 3 years ago
- MLX implementation of Meta's ESM-1 protein language model ☆21 · Apr 17, 2024 · Updated last year
- Experiments with BitNet inference on CPU ☆55 · Apr 1, 2024 · Updated last year
- FastAPI wrapper around DSPy ☆292 · Mar 11, 2024 · Updated last year
- Framework to evaluate LLM generated ReactJS code. ☆59 · Mar 25, 2024 · Updated last year
- prompt engineering experiments with DSPy GEPA and TextGrad ☆68 · Sep 2, 2025 · Updated 6 months ago
- R package for Byte Pair Encoding / Unigram modelling based on Sentencepiece ☆28 · Feb 9, 2026 · Updated last month
- PromptMII: Meta-Learning Instruction Induction for LLMs ☆47 · Jan 12, 2026 · Updated last month