microsoft / T-MAC

Low-bit LLM inference on CPU with lookup table
583 · Updated this week

Related projects

Alternatives and complementary repositories for T-MAC