lamm-mit / Cephalo-Phi-3-Vision-MoE
☆11 Updated 9 months ago
Alternatives and similar repositories for Cephalo-Phi-3-Vision-MoE:
Users who are interested in Cephalo-Phi-3-Vision-MoE are comparing it to the libraries listed below:
- Train, tune, and infer Bamba model ☆86 Updated 2 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆35 Updated 10 months ago
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs. ☆79 Updated this week
- A repository for research on medium sized language models. ☆77 Updated 9 months ago
- A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM ☆53 Updated 11 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆19 Updated 3 months ago
- Unofficial implementation of https://arxiv.org/pdf/2407.14679 ☆44 Updated 6 months ago
- [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding ☆109 Updated 3 months ago
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning ☆37 Updated 3 weeks ago
- IBM development fork of https://github.com/huggingface/text-generation-inference ☆60 Updated 2 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- NanoGPT (124M) quality in 2.67B tokens ☆28 Updated 2 weeks ago
- PB-LLM: Partially Binarized Large Language Models ☆151 Updated last year
- QuIP quantization ☆51 Updated last year
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated 10 months ago
- Collection of autoregressive model implementation ☆83 Updated last month
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning ☆61 Updated 7 months ago
- vLLM adapter for a TGIS-compatible gRPC server. ☆23 Updated last week
- Code for NeurIPS LLM Efficiency Challenge ☆57 Updated 11 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆33 Updated last year
- A pipeline for LLM knowledge distillation ☆96 Updated last month
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs) ☆45 Updated this week