lutzroeder / models
A minimal version of GPT-2 in 175 lines of PyTorch code.
β40Updated this week
Alternatives and similar repositories for models:
Users that are interested in models are comparing it to the libraries listed below
- An implementation of delta-iris in tinygradβ72Updated 7 months ago
- [WIP] A π₯ interface for running code in the cloudβ86Updated 2 years ago
- alternative way to calculating self attentionβ18Updated 10 months ago
- Can RL solve simple problems?β54Updated last year
- Exploration into the Firefly algorithm in Pytorchβ35Updated last month
- Github repo for storing LlamaDatasetsβ33Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptationsβ33Updated last year
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)β21Updated 6 months ago
- ππ§ A minimalistic tool to fine-tune your LLMsβ18Updated last year
- The package used to build the documentation of our Hugging Face reposβ107Updated this week
- https://mlabonne.github.io/blog/β36Updated 3 weeks ago
- LLM as a Chatbot Serviceβ16Updated last year
- β17Updated last year
- β22Updated last year
- Testing KAN-based text generation GPT modelsβ16Updated 10 months ago
- β60Updated last year
- Community Open Source Implementation of GPT4o in PyTorchβ29Updated 2 weeks ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, anβ¦β13Updated 2 weeks ago
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1β20Updated 2 weeks ago
- β89Updated last month
- A synthetic story narration dataset to study small audio LMs.β32Updated last year
- Deploy your HPC Cluster on AWS in 20min. with just 1-Click.β54Updated 2 months ago
- stream-of-consciousness experience of an AI's thinking process, complete with creative tangents and unexpected connections.β11Updated 2 months ago
- Chrome Extension for exploring Hugging Face datasets πβ49Updated 6 months ago
- Interactive Textbook Demoβ40Updated last year
- Low-Rank Adaptation of Large Language Models clean implementationβ8Updated last year
- The Swarm Ecosystemβ19Updated 8 months ago
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrationsβ¦β13Updated last year
- Hub for researchers exploring VLMs and Multimodal Learning:)β19Updated this week
- π₯ Health monitor for a Petals swarmβ36Updated 8 months ago