iamhankai / Forest-of-Thought
Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning
☆14Updated last week
Alternatives and similar repositories for Forest-of-Thought:
Users that are interested in Forest-of-Thought are comparing it to the libraries listed below
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆40Updated 6 months ago
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models☆128Updated 7 months ago
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆44Updated 4 months ago
- [ICML'24 Oral] The official code of "DiJiang: Efficient Large Language Models through Compact Kernelization", a novel DCT-based linear at…☆99Updated 7 months ago
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆119Updated this week
- Converting Mixtral-8x7B to Mixtral-[1~7]x7B☆20Updated 10 months ago
- mllm-npu: training multimodal large language models on Ascend NPUs☆90Updated 4 months ago
- ☆92Updated 9 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆38Updated 10 months ago
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs☆70Updated 2 months ago
- [NeurIPS 2024] Needle In A Multimodal Haystack (MM-NIAH): A comprehensive benchmark designed to systematically evaluate the capability of…☆109Updated last month
- FuseAI Project☆75Updated last month
- ☆54Updated 4 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆55Updated 9 months ago
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆61Updated last year
- An object detection codebase based on MegEngine.☆28Updated 2 years ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆80Updated last year
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆30Updated 6 months ago
- ☆32Updated 7 months ago
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆12Updated 9 months ago
- MLLM-DataEngine: An Iterative Refinement Approach for MLLM☆42Updated 7 months ago
- ☆31Updated 7 months ago
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆46Updated 6 months ago
- Code for paper "Patch-Level Training for Large Language Models"☆75Updated 2 months ago
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model☆23Updated last year
- Odysseus: Playground of LLM Sequence Parallelism☆64Updated 7 months ago
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆56Updated 2 months ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆35Updated 10 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆71Updated last year