More than words-Social networks’ text mining for consumer brand sentiments.pdf
《More than words-Social networks’ text mining for consumer brand sentiments.pdf》由会员分享,可在线阅读,更多相关《More than words-Social networks’ text mining for consumer brand sentiments.pdf(11页珍藏版)》请在淘文阁 - 分享文档赚钱的网站上搜索。
1、More than words: Social networks text mining for consumer brand sentiments Mohamed M. Mostafa Instituto Universitrio de Lisboa, Business Research Unit, Avenida das Foras Armadas, Lisbon, Portugal a r t i c l ei n f o Keywords: Consumer behavior Global brands Sentiment analysis Text mining Twitter a
2、b s t r a c t Blogs and social networks have recently become a valuable resource for mining sentiments in fi elds as diverse as customer relationship management, public opinion tracking and text fi ltering. In fact knowl- edge obtained from social networks such as Twitter and Facebook has been shown
3、 to be extremely valu- able to marketing research companies, public opinion organizations and other text mining entities. However, Web texts have been classifi ed as noisy as they represent considerable problems both at the lexical and the syntactic levels. In this research we used a random sample o
4、f 3516 tweets to evaluate con- sumers sentiment towards well-known brands such as Nokia, T-Mobile, IBM, KLM and DHL. We used an expert-predefi ned lexicon including around 6800 seed adjectives with known orientation to conduct the analysis. Our results indicate a generally positive consumer sentimen
5、t towards several famous brands. By using both a qualitative and quantitative methodology to analyze brands tweets, this study adds breadth and depth to the debate over attitudes towards cosmopolitan brands. ? 2013 Elsevier Ltd. All rights reserved. 1. Introduction Opinions expressed in social netwo
6、rks play a major role in infl u- encing public opinions behavior across areas as diverse as buying products, capturing the pulse of stock markets and voting for the president (Bai, 2011; Eirinaki, Pisal, Leong, Lee, and RQ2. Can companies effectively use the blogosphere to redesign their marketing a
7、nd advertising campaigns? This paper is organized as follows. Next section provides a brief literature review on the major areas of SA applications. Section 3 deals with the method used to conduct the analysis. In this section issues related to research design and, sampling and data analysis techniq
8、ues are presented. In Section 4, the results of sentiment analysis are presented. Finally, Section 5 presents research implica- tions and limitations. This section also explores avenues for future research. 2. Literature review SA techniques have been recently utilized in applications such as extrac
9、ting suggestions from consumers product reviews (e.g., Vishwanath Papacharissi (1) for buy; (0) for hold; (?1) for sell; and (?2) for strong sell. In this study the authors also used a weighting scheme to assign weights for each sentiment obtained based on the reputation and previous accuracy of the
10、 poster. Using a simulated environment to mimic real trading, the authors reported around four percent increase in returns over one month based on sentiment analysis. Bollen, Mao, and Zeng (2011) found that the aggregation of mil- lions of tweets posted daily on Twitter can be used to predict stock
11、market over time. The authors used measures such as daily Twitter posts over around ten months to predict the Dow Jones Industrial average closing values. To cross-validate the results, the authors also used the resulting time series of Twitter moods to detect the general publics response towards th
12、e outcome of the US presiden- tial campaign. Other studies investigated the relationship between investors sentiments and other factors such as stock returns fol- lowing a major earthquake (Shan & Gong, 2012), air disasters involving US vs. foreign airlines (Kaplanski & Levy, 2010) and local sports
13、events (Chang, Chen, Chou, & Lin, 2012). 3. Method 3.1. Twitter sampling Twitter is a microblogging service that was launched formally on July 13, 2006. Unlike other social media, Twitter is considered a microblog because its central activity revolves around posting short updates or tweets using the
14、 Web or mobile/cell phones. The maximum size of the blog is 140 characters-roughly the size of a newspaper headline. According to S (2012), a marketing research company, there are now around 500 million active twitters. Fig. 2 shows top ranked countries according to ac- tive tweets in 2012 (S, 2012)
15、. Tweets are available publicly as a default, and are also directly broadcasted to the users followers (Bliss, Klouman, Harris, Danforth, & Dodds, 2012). A recent analysis of Twitter activities found that more than 80% of the users either update their followers on what they actually doing or dissemi
16、nate information regarding their daily experiences (Thelwall, Buckley, & Paltoglou, 2011). Since Twitter is the most large, popular and well-known microblog Web site, it was selected to conduct the analysis reported in this study. The data used repre- sent a random set of Twitter posts from July 18,
17、 2012, to August 17, 2012. The data comprised 3516 tweets for sixteen brands. To guar- antee representativeness, sample selection has been varied by day of the week and hours in the day. Our sample size is comparable in size to Qiu, He, Zhang, Shi, Bu, and Chens (2010) sample, which in- cluded 3783
18、opinion sentences. Table 1 shows the random sample of tweets for each brand included in the study. Following Thelwall et al. (2011), only tweets in English was chosen in order to remove complications that might arise with analyzing multilingual tweets. Table 2 shows a sample of tweets for Air India
19、with a manual classifi cation of customers sentiments. As can be seen from the ta- ble, tweets represent a very noisy environment in which messages posted to virtual audience includes abbreviated words, the and the hashtag (#) characters, and heteroglossia-referring to other voices in the tweets in
20、order to convey interpersonal and ideational Fig. 3. Proximity plot based on Egypt Air tweets. M.M. Mostafa/Expert Systems with Applications 40 (2013) 424142514245 meanings (Bakhtin, 1981). Huang, Thornton, and Efthimiadis (2010) found that the hashtag was invented by Twitter users early in 2008 to
21、help followers fi nd a specifi c tweet or post. As opposed to the hashtag, the character has been introduced to address a tweet to another follower, which allows Twitter to function effec- tively as a collaboration and conversation system (Honeycutt & Herring, 2009). 3.2. Lexicon Categorizing words
22、for SA is a major step in applying the tech- nique. Broadly speaking, there are two widely used methods for sentiment orientation identifi cation: the lexicon-based approach and the corpus-based method (Miao, Li, & Zeng, 2010). However, since the corpus-based method has rarely been used to analyze s
23、entiment orientation, we will focus here on the lexicon-based method. Nevertheless, both methods require a pre-defi ned dictio- nary or corpus of subjective words. The sentiment is determined by comparing tweets against the expert-defi ned entry in the dic- tionary, which makes it easy to determine
24、the polarity of a specifi c sentence. Thus, it is crucial to have an accurate classifi er to be used to construct indicators of sentiment. Previous research has typi- cally incorporated lexicons such as the manually coded General In- quirer (Stone, Dunphy, Smith, & Ogilvie, 1966), which includes ove
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 数字图像处理
链接地址:https://www.taowenge.com/p-4062591.html
限制150内