ByteDance-Seed / EvaLearn
EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in challenging tasks.
☆31 Updated this week
Alternatives and similar repositories for EvaLearn
Users who are interested in EvaLearn are comparing it to the repositories listed below.
- ☆65 Updated 8 months ago
- A Knowledge Base on Pre-made Dishes ☆106 Updated last week
- Please visit our demonstration website for interactive demonstrations ☆29 Updated 8 months ago
- ☆84 Updated this week
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge… ☆39 Updated 4 months ago
- ☆50 Updated last month
- This search engine leverages the Boost library for efficient document search, featuring data preprocessing, index creation, and advanced … ☆58 Updated 9 months ago
- License plate detection and recognition using RPN with FPN and CRNN ☆26 Updated 5 months ago
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving ☆22 Updated 6 months ago
- A fun demo snake game created in https://aide.ink ☆66 Updated 5 months ago
- [ICME 2024] Official Datasets and example of LLM-SAP: Large Language Model Situational Awareness Based Planning ☆33 Updated 3 months ago
- ☆43 Updated last year
- Unleashing the Power of Distributed Content Management and Transformation ☆75 Updated 8 months ago
- ☆53 Updated last month
- Training and evaluation code of EGTLM model. ☆23 Updated last year
- Participate in the open source security award program implemented by the China Cyberspace Security Association. ☆90 Updated 5 months ago
- ☆40 Updated 2 months ago
- File Explorer ☆86 Updated this week
- HACAN: Hybrid Attention-Driven Cross-Layer Alignment Network for Image-Text Retrieval ☆80 Updated last month
- ☆106 Updated 4 months ago
- Final Fantasy XIV English notes ☆97 Updated last year
- ☆44 Updated 2 months ago
- A numerical calculator that simulates weapons using a Markov chain to reach their highest critical value. ☆65 Updated 4 months ago
- A high-performance Swift wrapper for MaxMind's GeoIP2 databases, offering thread-safe IP geolocation lookups with optimized memory manage… ☆102 Updated last month
- Repository of "Modal-NexT: toward unified heterogeneous cellular data integration" ☆28 Updated last week
- ☆68 Updated last year
- ☆89 Updated 5 months ago
- ☆51 Updated 2 months ago
- A system demo based on Retrieval-Augmented Generation for answering questions about Buddhism ☆84 Updated 8 months ago
- Assignments, homework, and everything else from Northeastern University Miami ☆33 Updated this week