Zero-shot Multi-lingual Interrogative Qestion Generation for “People Also Ask” at Bing Rajarshee Mitra, Rhea Jain, Aditya Srikanth Veerubhotla, Manish Gupta {ramitra,rhea.jain,aditya.veerubhotla,gmanish}@microsoft.com Microsof India ABSTRACT Multi-lingual question generation (QG) is the task of generating natural language questions for single answer passage in any given language. In this paper, we design a system for supporting multi- lingual QG in the “People Also Ask” (PAA) module for Bing. For zero shot seting, the primary challenge is to transfer the knowl- edge from trained QG model in the pivot language to other lan- guages without further addition of training data in these languages. Compared to other zero-shot tasks, the diferentiating and chal- lenging aspect in QG is to preserve the question structure so that the resulting output is interrogative. Existing models for similar tasks tend to generate natural language queries or copy sub-span of the passage, failing to preserve the question structure. In our work, we demonstrate how knowledge transfer in multi-lingual IQG (Interrogative QG) can be signifcantly improved using aux- iliary tasks either in multi-task or pre-training task seting. We explore two kinds of tasks – cross-lingual translation and multi- lingual denoising auto-encoding of questions, especially when us- ing translate-train. Using data for 13 languages from Bing PAA as well as online A/B tests, we show that both of these tasks signif- icantly improve the quality of zero-shot IQG on non-trained lan- guages. CCS CONCEPTS Computing methodologies Neural networks; Natural lan- guage generation;• Information systems Web search en- gines; Content ranking. KEYWORDS interrogative question generation, deep learning, natural language processing, zero shot modeling, multi-lingual denoising auto-encoding, People Also Ask ACM Reference Format: Rajarshee Mitra, Rhea Jain, Aditya Srikanth Veerubhotla, Manish Gupta. 2021. Zero-shot Multi-lingual Interrogative Qestion Generation for “Peo- ple Also Ask” at Bing. In Proceedings of the 27th ACM SIGKDD Confer- ence on Knowledge Discovery and Data Mining (KDD ’21), August 14–18, Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for proft or commercial advantage and that copies bear this notice and the full cita- tion on the frst page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permited. To copy otherwise, or re- publish, to post on servers or to redistribute to lists, requires prior specifc permission and/or a fee. Request permissions from permissions@acm.org. KDD ’21, August 14–18, 2021, Virtual Event, Singapore © 2021 Association for Computing Machinery. ACM ISBN 978-1-4503-8332-5/21/08…$15.00 https://doi.org/10.1145/3447548.3469403 Q: deep learning framework Q: francia cultura y tradiciones (A) English (B) Spanish Figure 1: Examples of the “People Also Ask” module from Bing (retrieved 8-Feb-2021) 2021, Virtual Event, Singapore. ACM, New York, NY, USA, 9 pages. https: //doi.org/10.1145/3447548.3469403 1 INTRODUCTION Qestion generation (QG) involves producing natural language ques- tion from a passage, ofen accompanied by the actual answer to the question. QG is very useful in information retrieval spanning from question answering, passage ranking, machine reading com- prehension to conversational models. It can help in creating more question-passage mappings that can lead to beter ranking. For learning management systems, it could be useful for automatically designing quiz questions. In question answering, it could be useful in creating an ofine corpus of QA pairs beforehand for fast lookup. More recently, it has been shown that adding generated questions to passage helps in improved passage ranking [26]. Other uses in- clude creating FAQs automatically from unstructured content like product descriptions or processes. Te most widely known form of QG is answer-aware QG where the generated question relates to a specifc answer from within a big passage [25, 35, 36, 40]. Another form of QG involves gener- ating free form QG from passages [6, 7, 30, 34, 38] without any extra answer input. Commercial search engines ofen produce de- scriptive passage answer (or featured snippets) to user queries (e.g., “why did usa bomb japan”, “what kind of houses eskimos live in”). Here, QG plays a pivotal role in creating QA corpus that can be used for retrieval. In most setings like designing quiz questions, creating questions for QA systems, generating questions for Peo- ple Also Ask module 1 , we need well-formed questions and not key- word queries. We show examples of PAA module in English as well as Spanish in Fig. 1. “People Also Ask” module allows users to discover related questions pertaining to the current query. Such a module can be useful in (1) reducing gap between user’s infor- mation need and the query, especially for non-expert users, (2) 1 htps://www.searchenginewatch.com/2020/10/14/people-also-ask-paa-feature- uncovering-googles-hidden-gem/ ADS Track Paper KDD ’21, August 14–18, 2021, Virtual Event, Singapore 3414