Plug in and play implementation of " Textbooks Are All You Need", ready for training, inference, and dataset generation
☆73Sep 18, 2023Updated 2 years ago
Alternatives and similar repositories for phi-1
Users that are interested in phi-1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆13Jan 29, 2024Updated 2 years ago
- Simple Autogpt with tree of thoughts☆14May 25, 2023Updated 3 years ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Nov 11, 2024Updated last year
- PegasusX: The Future of Multimodal Embeddings 🦄 🦄☆14Oct 16, 2024Updated last year
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations…☆12Mar 11, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Plug in and Play Prompt Technique to Boost Model reasoning by 40%☆11May 30, 2023Updated 3 years ago
- The open source implementation of the multi grouped query attention by the paper "GQA: Training Generalized Multi-Query Transformer Model…☆16Dec 11, 2023Updated 2 years ago
- Tiktok is an advanced multimedia recommender system that fuses the generative modality-aware collaborative self-augmentation and contrast…☆14Aug 18, 2023Updated 2 years ago
- The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"☆32Jun 22, 2026Updated last week
- Hugging Face RoBERTa with Flash Attention 2☆24Sep 14, 2025Updated 9 months ago
- Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2☆15Jun 27, 2025Updated last year
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Nov 11, 2024Updated last year
- ☆18Aug 11, 2022Updated 3 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆17Jun 22, 2026Updated last week
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆40Feb 1, 2024Updated 2 years ago
- An plug in and play pipeline that utilizes segment anything to segment datasets with rich detail for downstream fine-tuning on vision mod…☆20Feb 22, 2024Updated 2 years ago
- ProfitPilot closes deals for you effortlessly 24/7, just provide a list of customer and ProfitPilot will reach out on your behalf and clo…☆21Sep 7, 2023Updated 2 years ago
- Backdooring Neural Code Search☆14Sep 8, 2023Updated 2 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆27Nov 11, 2024Updated last year
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆22Jun 29, 2024Updated 2 years ago
- An simple pytorch implementation of Flash MultiHead Attention☆22Feb 5, 2024Updated 2 years ago
- Ongoing research training transformer models at scale☆37Jan 19, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Transform youtube URL into text 100x faster with whisperx☆20May 8, 2023Updated 3 years ago
- ☆21Aug 27, 2023Updated 2 years ago
- My Implementation of " Structure and Content-Guided Video Synthesis with Diffusion Models" by RunwayML☆26Jan 16, 2024Updated 2 years ago
- Masked Structural Growth for 2x Faster Language Model Pre-training☆25Apr 28, 2024Updated 2 years ago
- Finetuning Stable Diffusion from Diffusers☆11Mar 11, 2024Updated 2 years ago
- ☆17Apr 10, 2024Updated 2 years ago
- AthenaOS is a next generation AI-native operating system managed by Swarms of AI Agents☆40Jul 18, 2023Updated 2 years ago
- A multi-modal AI Model that can generate high quality novel videos with text, images, or video clips.☆64Aug 18, 2023Updated 2 years ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Oct 14, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".☆44Jun 9, 2023Updated 3 years ago
- [EMNLP 2023] Lion: Adversarial Distillation of Proprietary Large Language Models☆210Feb 11, 2024Updated 2 years ago
- unofficial Split Mean Flow Implementation from bytedance☆70Aug 12, 2025Updated 10 months ago
- Code repo for paper: ICML 2020 paper Natural lottery ticket winner: RL for ordinary neural circuits☆13Jun 1, 2020Updated 6 years ago
- The open source implementation of the cross attention mechanism from the paper: "JOINTLY TRAINING LARGE AUTOREGRESSIVE MULTIMODAL MODELS"☆37Mar 11, 2024Updated 2 years ago
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆100Oct 13, 2023Updated 2 years ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆70Aug 18, 2023Updated 2 years ago