International Journal of Scientific Engineering and Research (IJSER) www.ijser.in ISSN (Online): 2347-3878, Impact Factor (2014): 3.05 Volume 3 Issue 8, August 2015 Licensed Under Creative Commons Attribution CC BY A Survey On Analyzing Students’Academic Experiences Using Social Media Data Priya Lande 1 , Vipul Dalal 2 1 Vidyalankar Institute of Technology, University of Mumbai, Department of Computer Engineering, Mumbai, India 2 Vidyalankar Institute of Technology, University of Mumbai, Department of Computer Engineering, Mumbai, India Abstract: Students’ social media behavior reveals about their day-to-day life. Students post their experiences on social networking sites be it personal or academic. Analysis of these posts, however, is not an easy task. Pure manual analysis is not fruitful as data increases at a rapid rate. There is a workflow developed by the survey topic that assimilates the qualitative analysis and data mining techniques. The focus is primarily on engineering students’ posts so their problems can be analyzed. This uses the approach of multi-label classification, which is due to building of categories among students’ problems-heavy study load, sleep deprivation, lack of social engagement, etc. This enables social media to shed light on students’ academic experiences. Keywords: Education, web text analysis, social-networking, students 1. Introduction The online social life of people defines the complete picture of their life experiences. There is a lot that can be analyzed and explored beneath this social media data of the people. Students’ learning experiences can be studied from their online posts. But, the problem arises when these posts are counted on and on and it becomes a tedious job to study them. Hence, pure manual analysis cannot alone result out into productivity. It needs use of data mining techniques for the analysis. Students’ online conversations reveal aspects of their experiences that are not easily observed in formal classroom settings. Traversal of engineering students’ informal conversations to trace their root problems hence is essential. 2. Historic Development The study mainly targets the educational experiences of engineering students as engineering is the building block of our technology and future. The application of leant science is directly proportional to academic growth of the engineers. Below we discuss the contributions of various researchers in this field given in the literature, chronologically. Goffman in [1] has a theory called as the Goffman’s theory of social performance, which relates to the students as they are are liberal and open to a platform on which they are comfortable unlike surveys conducted in closed classrooms, group discussions, forums conducted on the topic of debate. Their opinions are not transparent. Hence it can be concluded that many social media websites available for people to stay connected and share their experiences is a platform where students find themselves comfortable to share their experiences. From [2] it can profoundly be said that for analyzing students’ posts considering twitter is a good option. Twitter uses the concept of hash-tag. The symbol # is a hash-tag that consists of all the related keywords and related content of the topic specified in the hash-tag. Tweet classification is the prime reason for the choice of twitter for the study. In [3] a framework is proposed for supervised sentiment classification based on twitter data. It avails 15 smileys and 50 hash-tags as training labels for various sentiments. It cuts the requirement of manual intervention into labeling. It allows discerning and classifying multiple types of sentiments of short text. It also assesses multiple types of features like punctuation, patterns, words and n-gram for drawing out sentiments. There are two more methods that provide for automated recognition of sentiment types that overlap and deal within their interdependencies. In [4] the study proposed a method to bring out the sentiment into positive and negative, this also aids in cutting down the requirement of manual efforts for the same job. The primary idea is to use tweets along with emoticons for distant supervision learning. This method works better when used in combination with machine learning algorithm. They give high accuracy of the result for sentiment classified when using this method. The classification algorithm takes into account the number of classes .in the given document. Classification can result into: Single-label classification Multi-label classification Single label classification is when each data item can be labeled for a single class only. Multi-label classification is when each data item can be labeled for more than two classes. Multi-label classification has been attracting many studies therefore, in [4] Naïve-Bayes multi-label classifier is explored as it is one of the most popular multi-label classifier. Implementation of this classifier is done by breaking down the multi-label classification problem into multiple single-label classification problems. This can be aggregated into a method called as one versus all or binary relevance. It basically works over the assumption of independent categories and providing training to binary classifier for each label set. In [5] the hash-tag #enggproblems was most talked about hash- tag for engineering students. Tweets and posts were collected Paper ID: IJSER15447 115 of 116