SalesforceAIResearch / MobileAIBench
★19 Updated last week
Related projects
Alternatives and complementary repositories for MobileAIBench
- Learning adapter weights from task descriptions ★15 Updated last year
- [NeurIPS-2024] Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623 ★71 Updated last month
- ★39 Updated last month
- ★46 Updated 2 weeks ago
- The official implementation of Self-Exploring Language Models (SELM) ★55 Updated 5 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ★48 Updated 7 months ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ★75 Updated last month
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ★54 Updated last week
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ★45 Updated 7 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ★39 Updated last month
- Code implementation of synthetic continued pretraining ★60 Updated last month
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal… ★44 Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ★97 Updated 7 months ago
- ★49 Updated 6 months ago
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ★26 Updated 2 weeks ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024] ★104 Updated last month
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,… ★43 Updated 4 months ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors ★69 Updated 9 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ★31 Updated 6 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ★31 Updated last month
- PASTA: Post-hoc Attention Steering for LLMs ★108 Updated 2 months ago
- ★34 Updated 3 months ago
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences" ★19 Updated last month
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios ★46 Updated 7 months ago
- [SafeGenAi @ NeurIPS 2024] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates ★60 Updated 3 weeks ago
- Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?" ★64 Updated last week
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning ★84 Updated 6 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" ★91 Updated 4 months ago
- Official code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le… ★68 Updated 8 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting" ★55 Updated 4 months ago