GreenBitAI / gbx-lmLinks
Run GreenBitAI's Quantized LLMs on Apple Devices with MLX
☆25Updated 3 weeks ago
Alternatives and similar repositories for gbx-lm
Users that are interested in gbx-lm are comparing it to the libraries listed below
Sorting:
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- ☆53Updated last year
- Github repo for Peifeng's internship project☆13Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- A collection of reproducible inference engine benchmarks☆31Updated last month
- ☆54Updated last year
- Data preparation code for CrystalCoder 7B LLM☆44Updated last year
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.☆124Updated last week
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆44Updated 2 months ago
- Self-host LLMs with LMDeploy and BentoML☆19Updated 2 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆61Updated 9 months ago
- ☆48Updated 3 weeks ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 5 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 9 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆23Updated 2 months ago
- RWKV-7: Surpassing GPT☆88Updated 6 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆69Updated 2 weeks ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆85Updated last month
- Very minimal (and stateless) agent framework☆44Updated 4 months ago
- Official Repository for Task-Circuit Quantization☆20Updated this week
- 👷♂️Minion is Agent's Brain. Minion is designed to execute any type of queries, offering a variety of features that demonstrate its flex…☆19Updated this week
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆48Updated 10 months ago
- Implementation of nougat that focuses on processing pdf locally.☆81Updated 4 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last week
- ☆34Updated last month
- LLM reads a paper and produce a working prototype☆57Updated last month
- LLM based agents with proactive interactions, long-term memory, external tool integration, and local deployment capabilities.☆100Updated this week
- ☆26Updated last year
- ☆41Updated 5 months ago
- From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging☆71Updated last week