Comparison of machine learning algorithms in restaurant revenue prediction

Stepan Gogolev*, Evgeniy M. Ozhegov

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingAcademicpeer-review

18 Downloads (Pure)

Abstract

In this paper, we address several aspects of applying classical machine learning algorithms to a regression problem. We compare the predictive power to validate our approach on a data about revenue of a large Russian restaurant chain. We pay special attention to solve two problems: data heterogeneity and a high number of correlated features. We describe methods for considering heterogeneity—observations weighting and estimating models on subsamples. We define a weighting function via Mahalanobis distance in the space of features and show its predictive properties on following methods: ordinary least squares regression, elastic net, support vector regression, and random forest.

Original languageEnglish
Title of host publicationAnalysis of Images, Social Networks and Texts - 8th International Conference, AIST 2019, Revised Selected Papers
EditorsWil M.P. van der Aalst, Vladimir Batagelj, Dmitry I. Ignatov, Valentina Kuskova, Sergei O. Kuznetsov, Irina A. Lomazova, Michael Khachay, Andrey Kutuzov, Natalia Loukachevitch, Amedeo Napoli, Panos M. Pardalos, Marcello Pelillo, Andrey V. Savchenko, Elena Tutubalina
PublisherSpringer, Cham
Pages27-36
Number of pages10
ISBN (Print)9783030395742
DOIs
Publication statusPublished - 2020
Externally publishedYes
Event8th International Conference on Analysis of Images, Social Networks and Texts - Kazan Federal University, Kazan, Russian Federation
Duration: 17 Jul 201919 Jul 2019
Conference number: 8
https://2019.aistconf.org/

Publication series

SeriesCommunications in Computer and Information Science
Volume1086CCIS
ISSN1865-0929

Conference

Conference8th International Conference on Analysis of Images, Social Networks and Texts
Abbreviated titleAIST 2019
Country/TerritoryRussian Federation
CityKazan
Period17/07/1919/07/19
Internet address

Keywords

  • Machine learning
  • Revenue prediction
  • Weighted regression

Cite this