SciMT / SciMT-benchmark
☆11 Updated 10 months ago
Related projects
Alternatives and complementary repositories for SciMT-benchmark
- SciKnowEval: Evaluating Multi-level Scientific Knowledge of Large Language Models ☆14 Updated 3 weeks ago
- Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation, ICML 2024 ☆21 Updated 4 months ago
- Structured Chemistry Reasoning with Large Language Models ☆31 Updated 6 months ago
- PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes ☆19 Updated this week
- Pre-trained Language Model for Scientific Text ☆42 Updated 9 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging" ☆36 Updated last year
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent." ☆36 Updated last week
- ☆26 Updated last year
- [AAAI 2024] MELO: Enhancing Model Editing with Neuron-indexed Dynamic LoRA ☆21 Updated 7 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ☆69 Updated last month
- Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs ☆22 Updated last month
- ☆18 Updated last week
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆30 Updated 6 months ago
- Code for https://arxiv.org/abs/2401.17139 (NeurIPS 2024) ☆25 Updated last week
- ☆23 Updated 6 months ago
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective" ☆16 Updated last year
- A list of diffusion papers in the NLP domain that I have read, mainly on text generation; the table will continue to be updated. ☆31 Updated last week
- ☆15 Updated 3 months ago
- ☆37 Updated 4 months ago
- Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization ☆13 Updated last month
- ☆39 Updated last month
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT. ☆12 Updated 2 months ago
- ☆11 Updated last year
- ☆30 Updated this week
- ☆16 Updated 4 months ago
- Official implementation of paper "General Preference Modeling with Preference Representations for Aligning Language Models" (https://arxi… ☆18 Updated 3 weeks ago
- [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View ☆29 Updated last month
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888 ☆36 Updated 5 months ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability ☆12 Updated 5 months ago
- Code for paper "Diffusion Language Models Can Perform Many Tasks with Scaling and Instruction-Finetuning" ☆63 Updated 9 months ago