2025

PrivacyBench: A Conversational Benchmark for Evaluating Privacy in Personalized AI

S. Mukhopadhyay, S. Reddy, S. Muthukumar, J. An, P. Kumaraguru

Preprint

2025

InterChart: Benchmarking Visual Reasoning Across Decomposed Charts

A. Iyengar*, S. Mukhopadhyay*, A. Qidwai*, S. Singh, D. Roth, V. Gupta

AACL 2025

2025

MapIQ: Evaluating Multimodal Large Language Models for Map QA

V. Srivastava, F. Lei, S. Mukhopadhyay, V. Gupta, R. Maciejewski

COLM 2025

2025

PRAISE: Enhancing Product Descriptions with LLM-Driven Structured Insights

A. Qidwai*, S. Mukhopadhyay*, P. Khatiwada*, D. Roth, V. Gupta

ACL 2025 (System Demonstrations)

2024

Unraveling the Truth: Do VLMs really Understand Charts?

S. Mukhopadhyay*, A. Qidwai*, A. Garimella, P. Ramu, V. Gupta, D. Roth

EMNLP 2024

* denotes equal contribution