kyegomez / Gemini
The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google
☆460, updated this week
Alternatives and similar repositories for Gemini
Users that are interested in Gemini are comparing it to the libraries listed below
- Mamba-Chat: A chat LLM based on the state-space model architecture 🐍 ☆938, updated last year
- Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens" ☆714, updated 2 years ago
- ☆228, updated 2 years ago
- A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest… ☆461, updated 2 weeks ago
- ☆717, updated last year
- ☆1,027, updated 11 months ago
- [ICLR-2025-SLLM Spotlight 🔥] MobiLlama: Small Language Model tailored for edge devices ☆668, updated 8 months ago
- Code for fine-tuning Platypus fam LLMs using LoRA ☆630, updated last year
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling ☆943, updated 2 months ago
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI ☆773, updated 2 years ago
- Fine-tuning LLMs using QLoRA ☆269, updated last year
- LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills ☆763, updated 2 years ago
- 👁️ + 💬 + 🎧 = 🤖 Curated list of top foundation and multimodal models! [Paper + Code + Examples + Tutorials] ☆637, updated last year
- ☆445, updated last year
- Reaching LLaMA2 Performance with 0.1M Dollars ☆987, updated last year
- An open-source implementation of Google's PaLM models ☆820, updated last year
- PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention… ☆294, updated last year
- Beyond Language Models: Byte Models are Digital World Simulators ☆333, updated last year
- Implementation of I-JEPA from "Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture" ☆282, updated last year
- Embed arbitrary modalities (images, audio, documents, etc.) into large language models. ☆189, updated last year
- Build high-performance AI models with modular building blocks ☆577, updated last week
- Inference code for Mistral and Mixtral hacked up into original Llama implementation ☆371, updated 2 years ago
- Effort to open-source NLLB checkpoints. ☆476, updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆599, updated 2 years ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch. ☆685, updated last year
- Train Models Contrastively in Pytorch ☆774, updated 10 months ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast ☆150, updated last year
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory". ☆821, updated last year
- ☆416, updated 2 years ago
- Finetuning Large Language Models on One Consumer GPU in 2 Bits ☆734, updated last year