sjunhongshen / Tag-LLM
☆23Updated 6 months ago
Related projects
Alternatives and complementary repositories for Tag-LLM
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆60Updated 3 months ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers"☆40Updated last month
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆102Updated 6 months ago
- ☆20Updated this week
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).☆31Updated 3 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated 3 weeks ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆78Updated 8 months ago
- ☆148Updated 9 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆39Updated 9 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆34Updated 2 weeks ago
- ☆123Updated 6 months ago
- ☆68Updated 2 months ago
- A repository for research on medium sized language models.☆74Updated 5 months ago
- MEXMA: Token-level objectives improve sentence representations☆32Updated this week
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆142Updated 3 weeks ago
- ☆61Updated 2 months ago
- ☆55Updated 3 months ago
- ☆102Updated 2 months ago
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention"☆91Updated last month
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆51Updated 3 weeks ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆109Updated 2 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆45Updated last month
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind☆168Updated last month
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (arXiv 2401.01335)☆28Updated 8 months ago
- This is the official repository for Inheritune.☆105Updated last month
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆62Updated 2 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆93Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆71Updated last month
- This repository is maintained to release dataset and models for multimodal puzzle reasoning.☆44Updated 3 months ago