☆49Aug 6, 2024Updated last year
Alternatives and similar repositories for LM-Science-Tutor
Users that are interested in LM-Science-Tutor are comparing it to the libraries listed below
Sorting:
- Javascripts☆11Feb 10, 2026Updated 3 weeks ago
- ☆11Jan 3, 2024Updated 2 years ago
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Mar 4, 2024Updated 2 years ago
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆75Sep 17, 2025Updated 5 months ago
- Utilities for processing and visualization of the SynScapes dataset☆13Mar 13, 2020Updated 5 years ago
- SuperCLUE高考作文机器自动阅卷系统☆17Jun 8, 2023Updated 2 years ago
- Official code for "Attribution Guided Factorization for Neural Networks Visualization"☆16Oct 23, 2022Updated 3 years ago
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- A Survey of Neural Dialogue Systems☆19Dec 31, 2021Updated 4 years ago
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks☆20May 10, 2022Updated 3 years ago
- [ACL 2025 Main] SceneGenAgent: Precise Industrial Scene Generation with Coding Agent☆35Nov 29, 2024Updated last year
- ☆28Nov 10, 2025Updated 3 months ago
- ☆24Jul 6, 2021Updated 4 years ago
- EMNLP2023 - InfoSeek: A New VQA Benchmark focus on Visual Info-Seeking Questions☆25May 30, 2024Updated last year
- ☆25Mar 26, 2024Updated last year
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset☆111May 22, 2025Updated 9 months ago
- [ACL 2024]Official GitHub repo for OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scie…☆184Jun 8, 2025Updated 8 months ago
- RePo: Language Models with Context Re-Positioning☆71Dec 24, 2025Updated 2 months ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Dec 11, 2020Updated 5 years ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆31Jul 9, 2024Updated last year
- This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'☆139May 30, 2025Updated 9 months ago
- 中文大语言模型评测第三期☆35Dec 30, 2025Updated 2 months ago
- a size profiler for cuda binary☆72Jan 15, 2026Updated last month
- [ICLR 2025] DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference☆49Jun 17, 2025Updated 8 months ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated last year
- Collection of iPython notebooks with some quick demos☆11May 25, 2017Updated 8 years ago
- ☆12Jan 11, 2026Updated last month
- Photorealism model use RealVisXL v4.0☆12Feb 20, 2024Updated 2 years ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- A Swedish Natural Language Understanding Benchmark☆11Dec 12, 2025Updated 2 months ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆11Sep 3, 2024Updated last year
- ☆12Aug 15, 2023Updated 2 years ago
- [CVPR2024] Learning from Synthetic Human Group Activities☆14Feb 24, 2025Updated last year
- The implement of ACL2024: "MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization"☆43Jun 15, 2024Updated last year
- Full disclosure for http://stackoverflow.com/questions/17465061/how-to-parse-space-separated-floats-in-c-quickly/17479702☆11Nov 6, 2016Updated 9 years ago
- Code and performance tests to demonstrate the COUNTLESS algorithm. https://medium.com/@willsilversmith/countless-high-performance-2x-down…☆10Oct 23, 2019Updated 6 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- Open Set Semantic Segmentation☆10Dec 23, 2020Updated 5 years ago