I am an applied scientist at Salesforce Research working on conversational AI, with a focus on the evaluation of LLMs and LLM-based agents.

Some highlights from my previous work:

Are You Sure? Challenging LLMs Leads to Performance Drops in The FlipFlop Experiment

Salespeople vs SalesBot: Exploring the Role of Educational Value in Conversational Recommender Systems

Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions

MixQG: Neural Question Generation with Mixed Answer Types