Nikityyy / lille
A powerful 130-million-parameter model trained from scratch as part of a truly open-source stack, including a custom tokenizer, dataset, and optimizer.
☆69 Updated 5 months ago
Alternatives and similar repositories for lille
Users who are interested in lille are comparing it to the repositories listed below
- ☆112 Updated 7 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs ☆86 Updated last year
- ~950 line, minimal, extensible LLM inference engine built from scratch. ☆406 Updated last month
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input ☆940 Updated 8 months ago
- A little(lil) Language Model (LM). A tiny reproduction of LLaMA 3's model architecture. ☆55 Updated 9 months ago
- ☆124 Updated 7 months ago
- ☆182 Updated 2 months ago
- look how they massacred my boy ☆63 Updated last year
- Exploration into the proposed architecture from Sapient Intelligence of Singapore 🇸🇬 ☆73 Updated 5 months ago
- GRadient-INformed MoE ☆264 Updated last year
- AI management tool ☆119 Updated last year
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- Live-bending a foundation model’s output at neural network level. ☆273 Updated 10 months ago
- Efficient non-uniform quantization with GPTQ for GGUF ☆58 Updated 4 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆110 Updated 11 months ago
- Implementation of the snake game based on a diffusion model ☆93 Updated last year
- Multi-Agent Step Race Benchmark: Assessing LLM Collaboration and Deception Under Pressure. A multi-player “step-race” that challenges LLM… ☆85 Updated 2 months ago
- A character-level language diffusion model trained on Tiny Shakespeare ☆849 Updated 3 weeks ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers. ☆347 Updated last year
- Sparse Inferencing for transformer based LLMs ☆217 Updated 6 months ago
- ☆159 Updated 9 months ago
- ☆137 Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆150 Updated last month
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆169 Updated 5 months ago
- Docs for GGUF quantization (unofficial) ☆366 Updated 6 months ago
- ☆304 Updated 3 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆61 Updated last year
- webgpu autograd library ☆33 Updated 8 months ago
- Clue inspired puzzles for testing LLM deduction abilities ☆45 Updated 10 months ago
- ☆107 Updated 3 months ago