Professional Documents
Culture Documents
Tourism Management: Juan Pedro Mellinas, Eva Martin-Fuentes
Tourism Management: Juan Pedro Mellinas, Eva Martin-Fuentes
Tourism Management: Juan Pedro Mellinas, Eva Martin-Fuentes
Tourism Management
journal homepage: http://www.elsevier.com/locate/tourman
A R T I C L E I N F O A B S T R A C T
Keywords: Booking.com provides a massive database compiling millions of reviews about thousands of accommodations
Booking.com worldwide that hotel managers and academics have extensively consulted during the past decade. In 2019–2020,
Scores however, the famous website changed several aspects of its methods of calculating hotel scores, the most
Reviews
important one being a change from its peculiar 2.5–10 scale to a more conventional 1–10 scale. Such novelties
Scale
Ratings
may cause changes in hotel scores that do not reflect changes, if any, in customer satisfaction. This article offers
an initial investigation into the nature and consequences of those changes that professionals and academics
should consider to avoid errors in future studies that involve using Booking. com’s database.
* Corresponding author.
E-mail addresses: juan.mellinas@upct.es (J.P. Mellinas), eva.martin@udl.cat (E. Martin-Fuentes).
https://doi.org/10.1016/j.tourman.2020.104280
Received 22 August 2020; Received in revised form 28 November 2020; Accepted 5 December 2020
Available online 17 December 2020
0261-5177/© 2020 Elsevier Ltd. All rights reserved.
J.P. Mellinas and E. Martin-Fuentes Tourism Management 85 (2021) 104280
new system, mostly due to the drop in scores that they had observed as a such information is of great interest not only to hoteliers, who seem to
result. In addition, on the Partner Hub, one such owner posted “We urge have begun detecting the effects of changes in the system and have
BDC [Booking Dot Com] to go back to the previous system where the shown their concern, but also to the scientific community that uses the
final review is a calculation based on the category reviews” and received database with increasing regularity.
200 comments in response, in most of which various owners complain
about the drop in scores with the new system (Booking.com Partner 2. Methodology
Hub, 2019). Some examples of the new situation are provided in the
comments: Booking.com provides the reviews of each hotel on two webpages,
which have nearly identical content but different formats, as can be
“(Vicki Webber) We too are experiencing unfair difficulties with the
observed at the following URLs for the same hotel:
new rating system.
EXAMPLE: ***. 10 for all the individual scores and then a 9 overall? - Reservation website: https://www.booking.com/hotel/us/pennsyl
vania-new-york.html
EXAMPLE: ***. 9.8 for all the individual scores and then an 8 - Reviews website: https://www.booking.com/reviews/us/hotel/
overall? pennsylvania-new-york.html
This new system is flawed and simply does not work.”
Since Booking.com implemented its new system in September 2019,
“(pibomarco) But when you will receive 7,5 for all individual scores we have tracked the scores received by a random sample of hotels and
and a 9 overall in this case you will probably not complain.) Some observed that the system’s implementation has been gradual — that is,
times you will receive a higher overall score and sometimes higher scores calculated with the former system (i.e., scores with decimals)
individual scores.” have been combined with others seemingly calculated with the new
system (i.e., scores in whole numbers) on the reservation website.
Table 1 lists different situations that may cause Booking. com’s
However, on the reviews website, all scores shown were calculated with
scores for each hotel to increase or decrease due to the mentioned
the old system until April, when scores with the new system began
changes. However, because we do not know which of the listed elements
appearing.
will have more or less weight, we do not know whether the new system
In March 2020, Booking.com updated its webpage describing how
can make high and low scores rise or fall.
reviews are collected (Booking.com, 2020). Since then, all reviews
Scholars have paid great attention to whether hotel scores on Boo
registered on the reservation website have had scores calculated with
king.com will indeed drop, as well as to what extent, due to the
the new system, albeit with the same reviews — same hotel, user, and
change in the system as predicted (Martin-Fuentes et al., 2020; Mellinas
date — being shown on the reviews website, accompanied by scores
et al., 2016) and as the comments of some hoteliers in online forums
calculated with the former system. Such divergence indicates, with total
suggest.
precision, the difference between scores obtained with the old and new
Because scores along with reviews using the former system will
systems for hundreds of reviews and hotels.
remain active until September 2022, the most orthodox way to analyze
On March 27, 2020, we attempted to create a large database of such
changes would be to take the score for a sample of hotels from
reviews by looking for ones posted during March. However, because
September 2019 (i.e., all scores from the old system) and repeat the
most countries had recently announced the closure of borders and hotels
operation in September 2022 such that all reviews use the new system.
due to the COVID-19 pandemic, the task proved exceptionally difficult.
Collecting large volumes of online reviews through a big data and an
Ultimately, we compiled a database of hotels in cities where the lock
alytics approach could offer accurate results on the variations produced
down arrived somewhat later: London, Las Vegas, New York City,
during those three years. Doing so, however, would require a waiting
Miami, Rio de Janeiro, and Moscow. In sum, the database comprised 19
period, and variations could then stem from both the new system and
hotels with 400 total reviews registered in March and with scores from
changes in the conditions of each hotel after 3 years. Beyond that, the
both the old and new systems. When hotels began to open in some
exceptional situation affecting the period — that is, the COVID-19
countries in June 2020, we tried to expand that sample of 400 reviews.
pandemic — could also affect the scores.
However, it was impossible because we noticed that Booking.com now
Against that background, the purpose of this article is to report a
showed the same scores on “Reservation website” and “Reviews
preliminary study that involved estimating real variations that may
website".
occur in Booking. com’s scores for hotels around the world, without
To check the normality of the data, a Kolmogorov-Smirnov test was
having to wait 2 years to obtain definitive results. We anticipate that
performed, which confirmed that the Booking.com ratings were not
following a normal distribution, as can be seen in the results section.
Then, we performed a nonparametric kernel density estimator test
Table 1 following the same procedure as Mariani and Borghi (2018), which is
Possible situations with Booking.com’s new scoring system.
used with different populations (e.g. old and new Booking.com score),
Very dissatisfied Very satisfied and when they do not follow normality, we can construct a density
Might affect Very dissatisfied customers can Customers who are very function for each sample according to this criterion and classify the new
negatively assign a rating of 1 or 2 instead satisfied in all aspects can individual simply by assigning it to the population where there is the
of 2.5. assign a maximum score on a highest density value, which will mean that over that, there is more
If the guest had a horrible scale of 4 smileys (i.e., equal to
influence of that population.
experience, then evaluating 10), but when offered a scale of
some objectively positive aspect 1–10, they can select 8 or 9 if
(e.g., location or staff) no longer they feel that the service was 3. Results
raises the average. great but not perfect.
Might affect The guest may consider that It is not necessary to assign the
The average score for the sample of reviews obtained with the old
positively though the hotel is of low maximum score to all
quality, the overall value for the parameters in order to obtain a system was 7.606 and with the new system 7.062 (Table 2). The dif
price is high, thereby score of 9 or 10. ference of 0.544 was slightly higher than the difference of 0.470
encouraging the guest to assign determined in a comparative study of hotel ratings on Booking.com and
a high score in the overall Priceline.com (Mellinas et al., 2016) and substantially higher than dif
rating.
ferences calculated when hotel scores on Booking.com were compared
2
J.P. Mellinas and E. Martin-Fuentes Tourism Management 85 (2021) 104280
Table 2
Descriptive statistics of the Booking.com scoring system.
N Min Max Mean SD Skewness Stat. Error Kurtosis Stat. Error
New scoring system 400 1 10 7.062 2.64 -.878 .122 -.040 .243
Old scoring system 400 2.5 10 7.606 2.03 -.678 .122 -.293 .243
3
J.P. Mellinas and E. Martin-Fuentes Tourism Management 85 (2021) 104280
Fig. 3. Kernel density estimator test for the new and old scoring systems on Booking.com.
affected by this change. It can be difficult for them to understand that Ignoring both the new scale and using reviews made with different
Booking.com is implementing a new system that is more rational and scales to calculate the mean could cause systematic errors in results
more in accordance with market standards and that they do not intend to obtained during the next 3 years. If authors, reviewers, and editors
harm a specific group of hoteliers. remain unaware of that possibility, then articles with erroneous results
For academics, this article can serve as an initial reference for future could be published in academic journals. This problem could be
studies on changes that Booking. com’s new scoring system may cause. extended to other review platforms, which can also make changes and
The preliminary data offered here, unlike the data obtained in the not advertise them. Hoteliers and academics must fully know the char
future, are distorted neither by the traumatic experience of COVID-19 acteristics of the information they obtain from the Internet, to avoid
nor by real changes that may occur in each hotel in the near future. errors in their analysis.
From 2015 to 2020, the scientific community has worked intensively
with databases provided free of charge by Booking.com. In most cases, Impact statement
scholars have taken into account the 2.5–10 scale used by the website
and thus avoided errors committed prior to the publication of the article Booking.com altered in 2019–2020 how its calculates hotel scores,
that made that unexpected scale known (Mellinas et al., 2015). In the from using a 2.5–10 scale to a 1–10 scale, from using scores based on the
coming years, academics who continue examining these reviews should average of six items to user-generated global scores, and from deleting
bear in mind that the system now uses a 1–10 scale and should calculate reviews after 2 years to deleting them after 3 years.
scores based on reviews from the last 3 years to prevent substantial Hotels around the world, especially those with medium and low
statistical errors. Studies based on comparing hotel ratings on Booking. scores, are going to see their scores significantly reduced by these
com (e.g. Nicolau et al., 2020) at an initial date and a later that do not changes. Therefore, hoteliers have to analyze whether drops in their
take into account the new scale could give totally wrong results. scores stem from the deterioration of their services or from changes to
In that light, it should be taken into account that, because Booking.co Booking. com’s methods. Attempting to take corrective measures to
m now uses reviews from the past 3 years to calculate final scores, these solve a seeming problem that does not in fact exist would be futile, if not
final scores derive from the average of individual scores obtained with damaging.
the old system (2.5–10) and the new one (1–10). Therefore, all studies Academics should also consider these changes when working with
based on hotel ratings on Booking.com collected between September scores obtained from Booking.com database through the end of 2022.
2019 (i.e., first scores with the new system) and September 2022 (i.e., 3
years since the total application of the new system) will have an Credit author statement
important methodological limitation that must be clearly indicated.
Once the new system is completely implemented on Booking.com, Juan Pedro Mellinas contributed to conception, Methodology, data
research using ratings can continue to be carried out with any meth collection, building the literature review and writing the manuscript.
odology previously used with the old system such as OLS regression Eva Martin-Fuentes contributed to the literature review, data análisis
analysis (M. M. Mariani, Borghi, & Kazakov, 2019), Tobit regression and writing the manuscript.
models (M. M. Mariani et al., 2020), or Support Vector Machine (Mar
tin-Fuentes, Mateu, & Fernandez, 2018), depending on the goal of each Declaration of competing interest
research.
For example, researchers who collect hotel scores on Booking.com in None.
October 2021 should consider that those scores originate from the
average user scores between October 2018 and October 2021. That Acknowledgments
circumstance implies the following situations:
This work was partially funded by the Spanish Ministry of the
- From October 2018 to September 2019, scores represent the old Economy and Competitiveness: research project TURCOLAB ECO 2017-
system; 88984-R. This research article has received a grant for its linguistic
- From September 2019 to March 2020, scores represent a mix of the revision from the Language Institute of the University of Lleida (Spain)
old and new systems; and (2020 call).
- From March 2020 to October 2021, scores (will) exclusively repre
sent the new system.
4
J.P. Mellinas and E. Martin-Fuentes Tourism Management 85 (2021) 104280
References Parra, E., Mellinas, J. P., Martínez María-Dolores, S.-M., Bernal Garcia, J. J., & Gutiérrez-
Taño, D. (2018). Effects of reviews scales on hotel online reputation. Turitec, 98–116,
2018.
Booking. (2020). Reviews: How does it work? Booking.Com. https://www.booking.com/re
Phillips, P., Antonio, N., de Almeida, A., & Nunes, L. (2020). The influence of geographic
views.en-gb.html.
and psychic distance on online hotel ratings. Journal of Travel Research, 59(4),
Bookingcom. (2020). How is my guest review Score generated? Booking.Com for partners.
722–741. https://doi.org/10.1177/0047287519858400
https://partner.booking.com/en-us/help/guest-reviews/how-my-guest-review-sco
re-generated.
Booking.com Partner Hub. (2019). Hosts asking for justice about the new review system.
Booking.Com for Partners. https://partner.booking.com/en-gb/community/pa Juan Pedro Mellinas. He holds a PhD in Business Administra
rtner-feedback/hosts-asking-justice-about-new-review-system. tion; a MSc Tourism Planning and Management and a BA in
Mariani, M., Baggio, R., Fuchs, M., & Höepken, W. (2018). Business intelligence and big Business Administration. Currently he is lecturer for the
data in hospitality and tourism: A systematic literature review. International Journal Department of Business Management at the Universidad
of Contemporary Hospitality Management, 30(12), 3514–3554. https://doi.org/ Politécnica de Cartagena. He has experience working in inter
10.1108/IJCHM-07-2017-0461 national corporations and as entrepreneur for 15 years. His
Mariani, M. M., & Borghi, M. (2018). Effects of the Booking.com rating system: Bringing research focuses on online reviews in websites like Booking and
hotel class into the picture. Tourism Management, 66, 47–52. https://doi.org/ TripAdvisor among others. He has published in Tourism Man
10.1016/j.tourman.2017.11.006 agement, Annals of Tourism research and Tourism Review,
Mariani, M. M., Borghi, M., & Gretzel, U. (2019). Online reviews: Differences by among other journals.
submission device. Tourism Management, 70, 295–298. https://doi.org/10.1016/j.
tourman.2018.08.022
Mariani, M. M., Borghi, M., & Okumus, F. (2020). Unravelling the effects of cultural
differences in the online appraisal of hospitality and tourism services. International
Journal of Hospitality Management, 90, 102606. https://doi.org/10.1016/j.
ijhm.2020.102606
Martin-Fuentes, E., Mateu, C., & Fernandez, C. (2018). Does verifying users influence
Eva Martin-Fuentes. She holds an international PhD in Engi
rankings? Analyzing Booking.Com and tripadvisor. Tourism Analysis, 23(1), 1–15.
neering and Information Technologies; a MSc in Tourism
https://doi.org/10.3727/108354218X15143857349459
Planning and Management; a BA in Advertising and Public
Martin-Fuentes, E., Mellinas, J. P., & Parra-Lopez, E. (2020). Online travel review rating
Relations; and a BA in Tourism. She is lecturer for the
scales and effects on hotel scoring and competitiveness. Tourism Review.
Department of Business Management at the University of
Mellinas, J. P., Martínez María-Dolores, S.-M., & Bernal García, J. J. (2015). Booking.
Lleida (Spain) where she has been recently recognized with the
com: The unexpected scoring system. Tourism Management, 49, 72–74. https://doi.
Teaching Excellence Award for the areas of tourism manage
org/10.1016/j.tourman.2014.08.019
ment and social media. She has been working as a tourism
Mellinas, J. P., Martínez María-Dolores, S.-M., & Bernal García, J. J. (2016). Effects of the
manager at the Tourism Board of Lleida and on events orga
Booking.com scoring system. Tourism Management, 57, 80–83. https://doi.org/
nization at the University of Lleida. She has published in
10.1016/j.tourman.2016.05.015
Journal of Hospitality and Tourism Management, International
Murphy, C. (2017). Report: 78% of all online hotel reviews come from the top four sites.
Journal of Hospitality Management and Tourism Review,
Revinate https://learn.revinate.com/blog/report-78-of-all-online-hotel-reviews-com
among other journals.
e-from-the-top-four-sites.
Nicolau, J. L., Mellinas, J. P., & Martín-Fuentes, E. (2020). The halo effect: A longitudinal
approach. Annals of Tourism Research, 83, 102938. https://doi.org/10.1016/j.
annals.2020.102938