IIT-DM / Fin-Fact
A Benchmark Dataset for Multimodal Scientific Fact Checking
☆14Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for Fin-Fact
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Updated last year
- ☆36Updated 5 months ago
- ☆13Updated 8 months ago
- Code for Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks☆47Updated 7 months ago
- Official implementation of our paper "Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration".☆12Updated this week
- LLM guided text clustering☆72Updated last year
- Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". EMNLP 2023.☆18Updated 10 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆40Updated 9 months ago
- [Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory☆54Updated last year
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated last year
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆54Updated last week
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆48Updated 5 months ago
- Implementation code for the paper "Meta-learning via Language Model In-context Tuning" (ACL 2022)☆21Updated 2 years ago
- ☆27Updated 3 months ago
- ☆22Updated 2 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆28Updated 8 months ago
- When FLUE Meets FLANG: Benchmarks and Large Pretrained Language Model for Financial Domain☆45Updated 4 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆30Updated 3 months ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆34Updated 9 months ago
- Codebase for the paper "Schema-guided User Satisfaction Modeling for Task-oriented Dialogues"☆11Updated last month
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆57Updated last month
- ☆11Updated 11 months ago
- The benchmark proposed in paper: GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability☆16Updated 8 months ago
- [ICML 2023] Tuning Language Models as Training Data Generators for Augmentation-Enhanced Few-Shot Learning☆39Updated last year
- ☆42Updated 4 months ago
- Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"☆31Updated last month
- ☆22Updated 2 weeks ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆35Updated last month
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆15Updated 2 weeks ago
- ☆8Updated last year