Logo PTI Logo rice

Proceedings of the 2024 Ninth International Conference on Research in Intelligent Computing in Engineering

Annals of Computer Science and Information Systems, Volume 42

Machine Learning-Based Prediction Models for Sentiment Analysis on Online Customer Reviews: A Case Study on Airbnb

, , , , ,

DOI: http://dx.doi.org/10.15439/2024R117

Citation: Proceedings of the 2024 Ninth International Conference on Research in Intelligent Computing in Engineering, Vijender Kumar Solanki, Tran Duc Tan, Pradeep Kumar, Manuel Cardona (eds). ACSIS, Vol. 42, pages 103116 ()

Full text

Abstract. In the last decade, with the rise of sharing economy, in particular Airbnb, customers are not merely buyers but also actively share their thoughts and experiences toward goods and services. Sentiment analysis, a sophisticated technological approach, has emerged as a pivotal tool to extract people's opinions as well as sentiments from written language. On the other hand, assessing the price of a listing has always been a daunting task for hosts and guests. While numerous pricing models for Airbnb have been proposed, achieving precise accuracy remains a challenge. As a result, this paper aims to investigate whether incorporating the sentiment scores derived from online customer reviews could improve the accuracy of Airbnb price prediction or not. First, online customer reviews on Airbnb are examined using natural language processing techniques to seek the guest sentiment and its association with listings prices. Once sentiment scores are calculated, they are used as an additional attribute to forecast Airbnb listings price. Several machine learning models are employed, including Linear Regression, Ridge Regression, Support Vector Machine, XGBoost and Random Forest. The experimental results show that the inclusion of sentiment scores slightly decreases model performance in the case of three Asian economies (Hong Kong, Japan and Taiwan). Overall, Random Forest without sentiment variable is the best-performing model among five models for Airbnb price prediction.

References

  1. Moon, H., Miao, L., Hanks, L. and Line, N.D. (2019). Peer-to-peer interactions: Perspectives of Airbnb guests and hosts. International Journal of Hospitality Management, 77, pp. 405-414.
  2. Negi, G. and Tripathi, S. (2022). Airbnb phenomenon: a review of literature and future research directions. Journal of Hospitality and Tourism Insights, http://dx.doi.org/10.1108/JHTI-04-2022-0133.
  3. Dogru, T., Mody, M. and Suess, C. (2019). Adding evidence to the debate: quantifying Airbnb’s disruptive impact on ten key hotel markets. Tourism Management, 72, pp. 27-38.
  4. Mody, M.A., Jung, S., Dogru, T. and Suess, C. (2023). How do consumers select between hotels and Airbnb? A hierarchy of importance in accommodation choice. International Journal of Contemporary Hospitality Management, 35(4), pp. 1191-1218.
  5. Mahyoub, M., Al Ataby, A., Upadhyay, Y., and Mustafina, J. (2023). AIRBNB Price Prediction Using Machine Learning. In: 2023 15th International Conference on Developments in eSystems Engineering (DeSE), Iraq, 09-12 January, pp. 166-171.
  6. Jiang, L., Li, Y., Luo, N., Wang, J. and Ning, Q. (2022). A Multi-Source Information Learning Framework for Airbnb Price Prediction. In: 2022 IEEE International Conference on Data Mining Workshops (ICDMW), USA, 28 November – 01 December, pp. 1-7.
  7. Lawani, A., Reed, M.R., Mark, T. and Zheng, Y. (2019). Reviews and price on online platforms: Evidence from sentiment analysis of Airbnb reviews in Boston. Regional Science and Urban Economics, 75, pp. 22-34.
  8. Adamiak, C. (2019). Current state and development of Airbnb accommodation offer in 167 countries. Current Issues in Tourism, 25(19), pp. 3131–3149.
  9. Chen, T., Samaranayake, P., Cen, X.Y., Qi, M. and Lan, Y.C. (2022). The Impact of Online Reviews on Consumers’ Purchasing Decisions: Evidence From an Eye-Tracking Study. Frontier in Psychology, 13:865702, http://dx.doi.org/10.3389/fpsyg.2022.865702.
  10. Guo, J., Wang, X. and Wu, Y. (2020). Positive emotion bias: Role of emotional content from online customer reviews in purchase decisions. Journal of Retailing and Customer Services, 52, 101891, pp. 1-21.
  11. Ma, E., Cheng, M. and Hsiao, A. (2018). Sentiment analysis – a review and agenda for future research in hospitality contexts. International Journal of Contemporary Hospitality Management, 30(11), pp. 3287-3308.
  12. Chang, W.L. and Wang, J.Y. (2018). Mine is yours? Using sentiment analysis to explore the degree of risk in the sharing economy. Electronic Commerce Research and Applications, 28, pp. 141-158.
  13. Zeng, G., Cao, X., Lin, Z. and Xiao, S.H. (2020). When online reviews meet virtual reality: Effects on consumer hotel booking. Annals of Tourism Research, 81, 102860.
  14. Gavilan, D., Avello, M. and Martinez-Navarro, G. (2018). The influence of online ratings and reviews on hotel booking consideration. Tourism Management, 66, pp. 53-61.
  15. Lee M., Jeong M. and Lee J. (2017). Roles of negative emotions in customers’ perceived helpfulness of hotel reviews on a user-generated review website: A text mining approach. International Journal of Contemporary Hospitality Management, 29(2), pp. 762–783.
  16. Lee, C.K.H., Tse, Y.K., Zhang, M., and Ma, J. (2020). Analysing online reviews to investigate customer behaviour in the sharing economy: The case of Airbnb. Information Technology and People, 33(3), pp. 945-961.
  17. Amat-Lefort, N., Barravecchia, F. and Mastrogiacomo, L. (2023). Quality 4.0: big data analytics to explore service quality attributes and their relation to user sentiment in Airbnb reviews. International Journal of Quality and Reliability Management, 40(4), pp. 990-1008.
  18. Tran, T., Ba, H. and Huynh, V.N. (2019). Measuring hotel review sentiment: An aspect-based sentiment analysis approach. International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making, Springer, pp. 393–405.
  19. Zhao, Y., Xu, X. and Wang, M. (2019). Predicting overall customer satisfaction: Big data evidence from hotel online textual reviews. International Journal of Hospitality Management, 76, pp. 111-121.
  20. Zhu, E., Wu, J., Liu, H. and Li, K. (2022). A Sentiment Index of the Housing Market in China: Text Mining of Narratives on Social Media. The Journal of Real Estate Finance and Economics, 66, pp. 77–118.
  21. Zhao, Y., Xu, X. and Wang, M. (2019). Predicting overall customer satisfaction: Big data evidence from hotel online textual reviews. International Journal of Hospitality Management, 76, pp. 111-121.
  22. Zhang, C., Tian, Y.X. and Fan, Z.P. (2022). Forecasting sales using online review and search engine data: A method based on PCA–DSFOA–BPNN. International Journal of Forecasting, 38(3), pp. 1005-1024.
  23. Picasso, A., Merello, S., Ma, Y., Oneto, L. and Cambria, E. (2019). Technical analysis and sentiment embeddings for market trend prediction. Expert Systems with Applications, 135, pp. 60-70.
  24. Symitsi, E., Stamolampros, P. and Karatzas, A. (2021). Augmenting household expenditure forecasts with online employee-generated company reviews. Public Opinion Quarterly, 85, pp. 463-491.
  25. Wu, D.C., Zhong, S., Qiu, R.T.R. and Wu, J. (2022). Are customer reviews just reviews? Hotel forecasting using sentiment analysis. Tourism Economics, 28(3), pp. 795-816.
  26. Gibbs, C., Guttentag, D., Gretzel, U., Morton, J. and Goodwill, A. (2018). Pricing in the sharing economy: a hedonic pricing model applied to Airbnb listings. Journal of Travel and Tourism Marketing, 35(1), pp. 46-56.
  27. Ye, P., Qian, J., Chen, J., Wu, C.H., Zhou, Y., Mars, S.D., Yang, F. and Zhang, L. (2018). Customized regression model for airbnb dynamic pricing. In: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19 23 August, pp. 932-940.
  28. Kwok, L. and Xie, K.L. (2019). Pricing strategies on Airbnb: Are multi-unit hosts revenue pros?. International Journal of Hospitality Management, 82, pp. 252-259.
  29. Wang, D. and Nicolau, J.L. (2017). Price determinants of sharing economy based accommodation rental: A study of listings from 33 cities on Airbnb.com. International Journal of Hospitality Management, 62, pp. 120-131.
  30. Faye (2021). Methodological discussion of Airbnb’s hedonic study: A review of the problems and some proposals tested on Bordeaux City data. Annals of Tourism Research, 86, 103079.
  31. Liu, Y. (2021). Airbnb pricing based on statistical machine learning models. In: 2021 International Conference on Signal Processing and Machine Learning (CONF-SPML), CA, USA, 14 November 2021, pp. 175-185.
  32. Ganu, G., Elhadad, N. and Marian, A. (2009). Beyond the stars: improving rating predictions using review text content. Twelfth International Workshop on the Web and Databases, June 28, WebDB, USA.
  33. Chica-Olmo, J., González-Morales, J.G. and Zafra-Gómez, J.L. (2020). Effects of location on Airbnb apartment pricing in Málaga. Tourism Management, 77, 103981.
  34. Guttentag, D. (2019). Progress on Airbnb: a literature review. Journal of Hospitality and Tourism Technology, 10(4), pp. 814-844.
  35. Sites, D. (2013). Compact https://github.com/CLD2Owners/cld2.
  36. Mohamed, E.D. (2023). gtranslate, Github repository, https://github.com/mohamed 180/gtranslate.
  37. Al-Shabi (2020). Evaluating the performance of the most important Lexicons used to Sentiment analysis and opinions Mining. International Journal of Computer Science and Network Security, 20(1), pp. 51-57.
  38. Maulud, D., and Abdulazeez, A.M. (2020). A review on linear regression comprehensive in machine learning. Journal of Applied Science and Technology Trends, 1(4), pp. 140-147.
  39. McDonald, G.C. (2009). Ridge regression. Wiley Interdisciplinary Reviews: Computational Statistics, 1(1), pp. 93-100.
  40. Saleh, A.M.E., Arashi, M., and Kibria, B.G. (2019). Theory of ridge regression estimation with applications. John Wiley & Sons.
  41. Wauters, M., and Vanhoucke, M. (2014) Support vector machine regression for project control forecasting, Automation in Construction, 47, pp. 92-106.
  42. Chen, T., & Guestrin, C. (2016). XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
  43. Yassine Al Amrani, Mohamed Lazaar, Kamal Eddine El Kadiri (2018). Random Forest and Support Vector Machine based Hybrid Approach to Sentiment Analysis. Procedia Computer Science. Volume 127, Pages 511-520, ISSN 1877-0509.
  44. Long, C. K., Trung, H. Q., Thang, T. N., Dong, N. T., & Van Hai, P. (2021). A knowledge graph approach for the detection of digital human profiles in big data. Journal of Science and Technology: Issue on Information and Communications Technology, 19(6.2), 6-15.
  45. Long, C. K., Van Hai, P., Tuan, T. M., Lan, L. T. H., Chuan, P. M., & Son, L. H. (2022). A novel fuzzy knowledge graph pairs approach in decision making. Multimedia Tools and Applications, 1-30.
  46. Long Cu Kim and Hai Pham Van (2018). Intelligent Collaborative Decision Model for Simulation of Disaster Data in Cities and Urbanlization. International Journal of Advanced Research (IJAR), Vol. 6, Issue 07.
  47. C. K. Long et al. (2020). A Big Data Framework for eGovernment in Industry 4.0. Open Computer Science, ISSN: 2299-1093.
  48. Hai Van Pham, Long Kim Cu, (2020). Intelligent Rule-based Support Model Using Log Files in Big Data for Optimized Service Call Center Schedule. Proceedings of International Conference on Research in Intelligent Computing in Engineering, ISBN 978-981-15-2780-7.
  49. C.K.Long et al. (2021). Disease Diagnosis in the Traditional Medicine: A Novel Approach based on FKG-Pairs. Journal of Research and Development on Information and Communication Technology, Vol. 2021(2), pp. 59-68.
  50. Pham, H. V., Long, C. K., Khanh, P. H., & Trung, H. Q. (2023). A Fuzzy Knowledge Graph Pairs-Based Application for Classification in Decision Making: Case Study of Preeclampsia Signs. Information, 14(2), 104.
  51. Cu Kim Long, Pham Van Hai, et al. (2023). A novel Q-learning-based FKG-Pairs approach for extreme cases in decision making. Engineering Applications of Artificial Intelligence, Vol. 120, 2023, ISSN 0952-1976.