A Hybrid Approach for the Sentiment Analysis of Turkish Twitter Data

Shehu, H.A.; Tokat, Sezai

Please use this identifier to cite or link to this item: https://hdl.handle.net/11499/37377

Full metadata record

DC Field	Value	Language
dc.contributor.author	Shehu, H.A.	-
dc.contributor.author	Tokat, Sezai	-
dc.date.accessioned	2021-02-02T09:25:33Z
dc.date.available	2021-02-02T09:25:33Z
dc.date.issued	2020	-
dc.identifier.issn	2367-4512	-
dc.identifier.uri	https://hdl.handle.net/11499/37377	-
dc.identifier.uri	https://doi.org/10.1007/978-3-030-36178-5_15	-
dc.description.abstract	Social media now plays an important role in influencing people’s sentiments. It also helps analyze how people, particularly consumers, feel about a particular topic, product, or idea. One of the recent social media platforms that people use to express their thoughts is Twitter. Because Turkish is an agglutinative language, its complexity makes sentiment analysis difficult. In this study, a total of 13K Turkish tweets were collected from Twitter using the Twitter API, and their sentiments were analyzed using machine learning classifiers. Two kinds of classifiers were adopted: random forests and support vector machines. Preprocessing methods were applied to the collected data to remove links, numbers, punctuation, and meaningless characters. After the preprocessing phase, unsuitable data were removed, and 10,500 of the 13K downloaded tweets were taken as the main dataset. Each tweet was classified as positive, negative, or neutral based on its content. The main dataset was converted to a stemmed dataset by removing stopwords, tokenizing, and stemming, in that order. Subsets of 3,000 and 10,500 stemmed tweets, with equal distribution across the classes, were designated the first and second datasets for the testing phase. Experimental results showed that support vector machines perform better at classifying negative and neutral stemmed data, while the random forest algorithm performs better at classifying positive stemmed data. A hybrid approach, consisting of a hierarchical combination of random forests and support vector machines, was therefore developed and used to classify the data. Finally, the applied methodologies were tested on both the first and the second dataset. Neither support vector machines nor random forests alone reached an accuracy of 77% on the first dataset or 72% on the second, whereas the developed hybrid approach achieved accuracies of up to 86.4% and 82.8% on the first and second datasets, respectively. © 2020, Springer Nature Switzerland AG.	en_US
dc.language.iso	en	en_US
dc.publisher	Springer	en_US
dc.relation.ispartof	Lecture Notes on Data Engineering and Communications Technologies	en_US
dc.rights	info:eu-repo/semantics/closedAccess	en_US
dc.subject	Artificial intelligence	en_US
dc.subject	Sentiment analysis	en_US
dc.subject	Social media	en_US
dc.subject	Turkish	en_US
dc.subject	Twitter	en_US
dc.title	A Hybrid Approach for the Sentiment Analysis of Turkish Twitter Data	en_US
dc.type	Book Part	en_US
dc.identifier.volume	43	en_US
dc.identifier.startpage	182	en_US
dc.identifier.endpage	190	en_US
dc.authorid	0000-0003-0193-8220	-
dc.identifier.doi	10.1007/978-3-030-36178-5_15	-
dc.relation.publicationcategory	Book Chapter - International	en_US
dc.identifier.scopus	2-s2.0-85083455867	en_US
dc.identifier.wos	WOS:000678771000015	en_US
dc.owner	Pamukkale University	-
item.languageiso639-1	en	-
item.openairetype	Book Part	-
item.openairecristype	http://purl.org/coar/resource_type/c_18cf	-
item.fulltext	No Fulltext	-
item.grantfulltext	none	-
item.cerifentitytype	Publications	-
crisitem.author.dept	10.10. Computer Engineering	-
Appears in Collections:	Mühendislik Fakültesi Koleksiyonu; Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection; WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection
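
The abstract in the record above describes a preprocessing step (removing links, numbers, and punctuation) and a hierarchical combination of random forests and support vector machines. The abstract does not specify the exact rules, so the following Python sketch is only one plausible reading. The names `preprocess` and `hybrid_predict`, the regular expressions, and the ordering of the hierarchy (trust the random forest for "positive", defer to the SVM otherwise) are assumptions made for illustration. The classifiers are passed in as plain callables, and the stand-ins in the usage part are toy keyword rules, not trained models.

```python
import re
import string
from typing import Callable

# Assumed patterns; the paper's actual cleaning rules are not given in the abstract.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
DIGIT_RE = re.compile(r"\d+")
# Map every ASCII punctuation character to a space.
PUNCT_TABLE = str.maketrans({ch: " " for ch in string.punctuation})


def preprocess(tweet: str) -> str:
    """Remove links, numbers, and punctuation, then collapse whitespace."""
    text = URL_RE.sub(" ", tweet)
    text = DIGIT_RE.sub(" ", text)
    text = text.translate(PUNCT_TABLE)
    return " ".join(text.split())


# A classifier here is any function from text to a label string.
Classifier = Callable[[str], str]


def hybrid_predict(text: str, rf: Classifier, svm: Classifier) -> str:
    """Hypothetical hierarchy: accept the random forest's 'positive' label,
    and defer every other case to the SVM."""
    if rf(text) == "positive":
        return "positive"
    return svm(text)


# Usage with toy stand-in classifiers (keyword rules, not trained models).
def toy_rf(text: str) -> str:
    return "positive" if "harika" in text.lower() else "neutral"


def toy_svm(text: str) -> str:
    return "negative" if "kötü" in text.lower() else "neutral"


cleaned = preprocess("Harika bir gün! https://t.co/abc 123 #mutlu")
print(cleaned)  # Harika bir gün mutlu
print(hybrid_predict(cleaned, toy_rf, toy_svm))
```

In a real pipeline the two callables would wrap trained models applied to the stemmed, tokenized text; the hierarchy is kept as a separate function so that a different ordering can be tried without touching the preprocessing.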

Scopus citations: 17 (checked on Jan 4, 2025)
Web of Science citations: 12 (checked on Mar 30, 2025)
Page views: 340 (checked on Mar 4, 2025)
