An Open-Source Chatbot by Using ParlAI Kiran Mai Narnavaram Kennesaw State University Marietta, Georgia, USA knarnava@students.kennesaw.edu Dan Lo Kennesaw State University Marietta, Georgia, USA dlo2@kennesaw.edu ABSTRACT While existing work shows a great success, the accuracy, the re- sponse time, the interoperability, the extendibility, etc. remain to be improved. We proposed “ParlAIDialogTeacher” using the open- source ParlAI to improve the accuracy, the response time, the inter- operability, and the extendibility. Training is based on Maximum Likelihood Estimation (MLE) approach for the generative model. This platform eases sharing, training, and evaluating dialog models with multiple datasets available. Our results indicate the proposed model outperforms x, y, z in terms of response time and accuracy. Future work include extending the model with other genitive mod- els such as GAN. CCS CONCEPTS • Computing methodologies → Neural networks; Learning paradigms and algorithms; Semantic networks; Dialog systems. KEYWORDS deep learning, chatbot, ParlAI - Blended Skill Talk ACM Reference Format: Kiran Mai Narnavaram and Dan Lo. 2024. An Open-Source Chatbot by Using ParlAI. In 2024 ACM Southeast Conference (ACMSE 2024), April 18– 20, 2024, Marietta, GA, USA. ACM, New York, NY, USA, 2 pages. https:// doi.org/10.1145/3603287.3656160 1 INTRODUCTION AND RELATED WORK Chatbots, advancing digital communication, face challenges in achieving human-like conversational abilities [1], with specializa- tion enhancing eﬃciency but limiting versatility. Recent break- throughs in neural approaches [2], particularly large pre-trained Transformer models, have shown promise in open-source conver- sational AI, as evidenced by successes in the ConvAI2 competition and subsequent improvements in dialogue system evaluations [10]. ParlAI and BlenderBot 2.0 exemplify the collaborative spirit and potential for chatbots to engage in more meaningful dialogues. Figure 1 shows that ParlAI and BlenderBot 2.0 exemplify the col- laborative spirit and potential for chatbots to engage in meaningful dialogues. The development of Meena, a model with 2.6 billion pa- rameters, marks a signiﬁcant milestone in surpassing benchmarks and pushing the boundaries of chatbot capabilities [5]. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for proﬁt or commercial advantage and that copies bear this notice and the full citation on the ﬁrst page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s). ACMSE 2024, April 18–20, 2024, Marietta, GA, USA © 2024 Copyright held by the owner/author(s). ACM ISBN 979-8-4007-0237-2/24/04. https://doi.org/10.1145/3603287.3656160 Figure 1: Chatting Example 2 METHODOLOGY Our methodology revolutionizes chatbot development with a dy- namic Seq2Seq transformer architecture in ParlAI for response generation, diverges from static databases. Employs Hugging Face Tokenizers for Byte-Level BPE, tests models with 90M to 9.4B pa- rameters including a setup with four-layer encoder and 32-layer decoder [5]. Uses Maximum Likelihood Estimation (MLE) with an 8-layer RNN and GeLu-activated network [7], incorporates min- imum length and predictive training for authentic dialogue, and enhances computational eﬃciency via remote server ﬁne-tuning with ParlAI. 3 EXPERIMENTAL SETUP AND METRICS Figure 2 illustrates our approach utilizing the Fairseq toolkit for pre-training [6], with models of 2.7B and 9.4B parameters optimized by the Adam optimizer. We incorporated Megatron-LM’s model parallelism [9] for eﬃcient node distribution, leveraging vertical transformer layer slicing for reduced cross-GPU communication. A sublinear memory footprint, the Adafactor [8] is used to facilitate larger batch size precision training. 4 PERPLEXITY Automatic metric correlating with human judgment accelerates discourse model creation; perplexity, an uncertainty measure in Seq2Seq models. Our research demonstrates a positive correlation 2024 ACM Southeast Conference – ACMSE 2024 – Session 3: Posters – ISBN: 979-8-4007-0237-2 Marietta, Georgia, USA, April 18-20, 2024 323