Rajkumar Pujari




LLM-Human Schema Pipeline for Cultural Context Grounding of Conversations

Rajkumar Pujari, Dan Goldwasser
Under Submission

Abstract: Conversations often adhere to generally understood social norms that may vary across cultures. For example, while addressing parents by name is commonplace in the West, it is rare in most Asian cultures. Adherence or violation of such norms often dictates the tenor of conversations. Modeling relevant cultural information could help in building more effective systems for conversational understanding. However, such information is rarely available explicitly. Rather, it could be learned by observing repeating patterns of human behavior.

In this paper, we tackle this problem by introducing a Cultural Context Schema for conversations. It comprises (1) conversational information such as emotions, dialogue acts, etc., and (2) cultural information such as social norms, violations, etc. We generate ~110k social norm and violation descriptions for ~23k conversations from Chinese culture using GPT-3.5. We refine them using automated verification strategies which are evaluated against cultural expert judgements. We organize these descriptions into meaningful clusters which we call 'norm concepts' using an interactive human-in-loop concept discovery framework. We ground the norm concepts and the descriptions in conversations using symbolic explanations. Finally, we use the obtained dataset for downstream tasks such as emotion detection, and dialogue act identification. We show it significantly improves the empirical performance.