RichardKelley / hflmLinks
A simple library for working with Hugging Face models.
☆14Updated 5 months ago
Alternatives and similar repositories for hflm
Users that are interested in hflm are comparing it to the libraries listed below
Sorting:
- entropix style sampling + GUI☆26Updated 7 months ago
- ☆21Updated 3 months ago
- ☆28Updated 9 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21Updated last year
- Training hybrid models for dummies.☆23Updated 5 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 11 months ago
- Simple LLM inference server☆20Updated last year
- Modified Beam Search with periodical restart☆12Updated 9 months ago
- ☆38Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- An AI character interaction system with emotional modeling and advanced memory management☆16Updated 8 months ago
- ☆15Updated last year
- a lightweight, open-source blueprint for building powerful and scalable LLM chat applications☆28Updated last year
- Simple GRPO scripts and configurations.☆58Updated 4 months ago
- Course Project for COMP4471 on RWKV☆17Updated last year
- One Line To Build Zero-Data Classifiers in Minutes☆56Updated 9 months ago
- ☆10Updated 2 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆31Updated 3 months ago
- Rust bindings for CTranslate2☆14Updated 2 years ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 4 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- ☆53Updated last year
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 7 months ago
- ☆31Updated last year
- Modeling code for a BitNet b1.58 Llama-style model.☆25Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated last year