at-aaims / forge
☆12Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for forge
- ☆12Updated last year
- Example of applying CUDA graphs to LLaMA-v2☆10Updated last year
- Packages and instructions for training and inference of LLMs on NVIDIA's new GH200 machines☆19Updated 2 months ago
- Inference code for LLaMA models☆38Updated last year
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆37Updated 5 months ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Updated 11 months ago
- ☆26Updated last year
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters☆104Updated last month
- ☆57Updated 11 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆13Updated 8 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Updated 8 months ago
- My Gen AI research☆11Updated 5 months ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆16Updated last year
- ☆41Updated 2 weeks ago
- FastFeedForward Networks☆18Updated 11 months ago
- Certified Reasoning with Language Models☆27Updated 11 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆60Updated last year
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆47Updated 9 months ago
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- NLP with Rust for Python 🦀🐍☆59Updated 5 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Updated 7 months ago
- look how they massacred my boy☆58Updated last month
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆37Updated last year
- ☆48Updated last year
- Data preparation code for Amber 7B LLM☆83Updated 6 months ago
- Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"☆14Updated last week
- ☆35Updated 3 weeks ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆92Updated 5 months ago
- Make triton easier☆41Updated 5 months ago
- I learn about and explain quantization☆25Updated 7 months ago