git-disl / PokeLLMon
☆168Updated 2 months ago
Related projects: ⓘ
- AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval (https://arxiv.org/abs/2406.11200)☆140Updated last month
- The official implementation of Self-Play Preference Optimization (SPPO)☆461Updated last month
- WorldGPT: Empowering LLM as Multimodal World Model☆116Updated last month
- This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tas…☆60Updated 3 weeks ago
- [ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?☆133Updated 2 weeks ago
- Benchmarking LLMs via Uncertainty Quantification☆206Updated 7 months ago
- The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://a…☆350Updated last week
- A recipe for online RLHF.☆376Updated 3 weeks ago
- AAGPT is another experimental open-source application showcasing the capabilities of large language models, such as GPT-3.5 and GPT-4.☆154Updated last year
- We leverage 14 datasets as OOD test data and conduct evaluations on 8 NLU tasks over 21 popularly used models. Our findings confirm that …☆115Updated last year
- STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (https://stark.stanford.edu/)☆282Updated last month
- [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding☆203Updated 2 weeks ago
- Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models☆295Updated this week
- Pytorch Library for Relational Table Learning with LLMs.☆270Updated last week
- Grimoire is All You Need for Enhancing Large Language Models☆115Updated 6 months ago
- Recipes to train reward model for RLHF.☆634Updated last week
- A deployment, monitoring and autoscaling service towards serverless LLM serving.☆152Updated last week
- A Comprehensive Benchmark for Code Information Retrieval.☆61Updated last week
- The nanoGPT-style implementation of RWKV Language Model - an RNN with GPT-level LLM performance.☆193Updated 10 months ago
- [ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache☆46Updated last month
- The Official Implementation of PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling☆480Updated last month
- ☆347Updated 3 months ago
- Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"☆192Updated last month
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models☆81Updated 5 months ago
- ☆327Updated 4 months ago
- [ACL 2024] CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and …☆107Updated last month
- Empower Your Model with Longer and Better Context Comprehention☆50Updated last year
- [NeurIPS22] "Advancing Model Pruning via Bi-level Optimization" by Yihua Zhang*, Yuguang Yao*, Parikshit Ram, Pu Zhao, Tianlong Chen, Min…☆141Updated last year
- An interpretable large language model (LLM) for medical diagnosis.☆68Updated last week
- (AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions☆260Updated 5 months ago