keeeeenw / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆11Updated last year
Alternatives and similar repositories for TinyLlama
Users that are interested in TinyLlama are comparing it to the libraries listed below
Sorting:
- ☆53Updated 11 months ago
- OpenPipe Reinforcement Learning Experiments☆24Updated 2 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆22Updated last month
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- ☆66Updated 11 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 5 months ago
- Fast approximate inference on a single GPU with sparsity aware offloading☆38Updated last year
- ☆73Updated last year
- ☆24Updated 3 months ago
- entropix style sampling + GUI☆26Updated 6 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆52Updated 3 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆49Updated 3 months ago
- A pipeline parallel training script for LLMs.☆143Updated 2 weeks ago
- RWKV-7: Surpassing GPT☆84Updated 5 months ago
- Modified Beam Search with periodical restart☆12Updated 8 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆22Updated 4 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated last month
- ☆113Updated 4 months ago
- ☆33Updated 10 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 3 months ago
- ☆117Updated 8 months ago
- One Line To Build Zero-Data Classifiers in Minutes☆54Updated 7 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆22Updated 2 weeks ago
- ☆27Updated 8 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆26Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 6 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆93Updated 4 months ago
- ☆17Updated 4 months ago
- run ollama & gguf easily with a single command☆50Updated 11 months ago