Real-time sentiment analysis of Twitter public stream
Tekijät
Päivämäärä
2015Sentiment analysis on Twitter public stream has been a topic of research recently. Several non-commercial libraries and software were developed to perform sentiment analysis, however none of them performed the analytics in real-time for Twitter data. Performing the same task in real-time can gives us insight of Twitter users public opinions regarding recent happenings of the time that analysis was made. In this thesis work, we propose a full-stack architecture with a software prototype that performs real- time sentiment analysis on Twitter public stream. We address the problem using large- scale online learning and specifically online parallel decision trees. Large-scale learning is utilized due to the fact that social media website such as Twitter produce data with high volume (around 5800 tweets per second in 2014) and in addition, there is a high time constraint (up to seconds) in real-time analytics in both learning, processing and query response time. Moreover, Twitter stream data arrives instance-by-instance and therefore we have utilized online learning with incremental and per-instance learning flexibility. SAMOA is a framework that provides support for a set of scalable online learning algorithms such as Vertical Hoeffding Tree. We use SAMOA’s VHT learner with Apache Storm as our Stream Processing Engine. However, utilizing only VHT and Apache Storm cannot solve the problem at hand. Therefore, we also developed an open- source Java library called Sentinel that enables real-time Twitter stream reading, in- memory pre-processing computations and data structures, feature selection, frequent miner algorithms and etc. that completes our architecture. In Chapter 3, we show the architecture of our solution and its applicability and usefulness is shown in chapter 4.
...
Asiasanat
Metadata
Näytä kaikki kuvailutiedotKokoelmat
- Pro gradu -tutkielmat [28143]
Samankaltainen aineisto
Näytetään aineistoja, joilla on samankaltainen nimeke tai asiasanat.
-
Samsung and Volkswagen Crisis Communication in Facebook and Twitter : A Comparative Study
Zhang, Boyang; Veijalainen, Jari; Kotkov, Denis (SCITEPRESS Science And Technology Publications, 2017)Since September 2015 at least two major crises have emerged where major industrial companies producing consumer products have been involved. In September 2015 diesel cars manufactured by Volkswagen turned out to be ... -
Spreading ideologies through tweets : examining extreme and moderate Muslims usage of Twitter
Salameh, Ahmad (2018)Twitter enables groups with certain agendas to organize and distribute their ideologies. This research compares the different practices performed by the extreme and the moderate Muslims to build their networks and recruit ... -
“Congratulations, you’re on TV!” : middle-space performances of live tweeters during the FIFA World Cup
Salomaa, Elina; Lehtinen, Esa (Elsevier, 2018)Social television has transformed the traditional role of the television viewers by providing ‘ordinary people’ access to the public stage. This article describes how public access to television affected the dialogues of ... -
Congresswoman Alexandria Ocasio-Cortez’s 2020 re-election campaign on Twitter : A discourse analytic study
Salonen, Roosa-Mari (2021)Pro Gradu –tutkielma tarkastelee yhdysvaltalaisen edustajanhuoneen jäsenen Alexandria Ocasio-Cortezin poliittista viestintää ja viestintästrategioita sosiaalisessa mediassa. Sosiaalisen median kasvattaessa suosiotaan myös ... -
Conversational Gatekeeping : Social Interactional Practices of Post-Publication Gatekeeping on Newspapers’ Facebook Pages
Salonen, Margareta; Olbertz-Siitonen, Margarethe; Uskali, Tero; Laaksonen, Salla-Maaria (Routledge, 2023)Digital platforms, such as social media networks, have become intertwined in the news ecosystem, leading news media to lose their role as the sole gatekeeper in the public space. This development has given an active voice ...
Ellei toisin mainittu, julkisesti saatavilla olevia JYX-metatietoja (poislukien tiivistelmät) saa vapaasti uudelleenkäyttää CC0-lisenssillä.