Azure-Vision / SuperDebug
SuperDebug,debug如此简单!
☆17Updated 2 years ago
Alternatives and similar repositories for SuperDebug:
Users that are interested in SuperDebug are comparing it to the libraries listed below
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- ☆23Updated 10 months ago
- ☆33Updated last month
- [ACL 2024] The project of Symbol-LLM☆54Updated 9 months ago
- [ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain☆102Updated last year
- Open-Pandora: On-the-fly Control Video Generation☆33Updated 4 months ago
- Webpage for RLHFlow☆9Updated 2 months ago
- LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification☆44Updated last month
- Implementation of the methods described in our paper "Explicit Planning Helps Language Models in Logical Reasoning"☆22Updated 2 years ago
- ☆12Updated this week
- [ICML 2024] Self-Infilling Code Generation☆19Updated 11 months ago
- "Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiw…☆29Updated 11 months ago
- The code and data for the paper JiuZhang3.0☆43Updated 10 months ago
- ☆33Updated last year
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)☆23Updated last year
- [ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials☆28Updated last month
- Extending context length of visual language models☆11Updated 4 months ago
- MULTI-Benchmark: Multimodal Understanding Leaderboard with Text and Images☆35Updated 2 months ago
- Training and Benchmarking LLMs for Code Preference.☆33Updated 5 months ago
- ☆18Updated 3 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs☆23Updated 6 months ago
- ☆12Updated 7 months ago
- ☆17Updated last year
- Reproducing R1 for Code with Reliable Rewards☆167Updated last week
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆32Updated 6 months ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆28Updated 9 months ago
- ☆15Updated last year
- The rule-based evaluation subset and code implementation of Omni-MATH☆19Updated 3 months ago
- [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling☆82Updated 2 years ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"☆47Updated 9 months ago