Professional Documents
Culture Documents
CS Exam GuidelinesV1.5 Compressed
CS Exam GuidelinesV1.5 Compressed
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
IP Ma m I
r o m ri nd P
2 3 f andu uri
20
Satisfaction
2 5, rya M
ly u
y , Ju by S
a .6
u esd 6.254
T .1
2
17
Guidelines
A guide to providing satisfaction ratings for search results
Version 1.5
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
1. Introduction 4 12. Flights 17
1. Search Needs and Satisfaction 4 13. Movies/TV Shows/Books/Music Tu 18
17 esda
2. The Query 5 5.How to Assign Ratings 2.1 y, J
6.2 ul 19
54 y 2
3. Steps in the Grading Process 5 1. When to Grade Highly Satisfying (HS) .6 by S5, 202 19
ury 3 fr
4. Definitions IP 6 2. When to Grade Satisfying (S) a M om 22
r o m ri an IP
2 3 f andu du
ri
2.Result Validation
5,
20 a M 8 3. When to Grade Somewhat Satisfying (SS) 24
y
u ly 2 Sur
y,1. Wrong
J b y Language 8 4. When to Grade Not Satisfying (NS) 25
e s da 54.6
2
Tu .16.5. Content Unavailable
2 8 6.Grading Specific Situations & Result Types 27
17
6. Inappropriate 9 1. Ambiguous Queries (Multiple Interpretations) 28
3.Satisfaction Principles 11 2. Locale Sensitivity 30
1. Satisfaction Scale 11 3. English Results in Non-English Locales 31
2. Degrees of Separation 12 4. Redirected Pages 31
3. Think About the Meaning, Not Just Matching Words 13 5. Apps 32
4. Consider User Effort 13 6. News 33
5. Consider Source Quality 13 7. Maps 34
4.Overview of Result Types 14 5. Web Video 35
1. Web Results 14 6. Dictionary, Stocks, Weather, Knowledge / Answers , Sports 36
2. Apps 14 7. Web Results (also called Suggested Web Sites) 36
3. Maps 14 8. Web Images 36
Tu 4. Stocks 15 7. Common Grading Mistakes 39
17 esda
2.1 y, J
6.2 5. ul Dictionary 15 1. Failing to Use Web Search 39
54 y 2
.6
by 5, 20
6. Weather
Su 23 15 4. Failing to Visit Destination Page 40
rya fro
M m IP
7. Sports and IP 15 3. Ignoring Time and Place r o m ri
du
40
uri 3f
2 02 Man
8. News 16 3. Ignoring Conceptual Distance y
,
25 urya 40
u l S
, J .6 by
9. Web Images 16 4. Ignoring Relevance Grading Principlessd 54
a y 41
e
Tu .16.2
10. Web Video 16 8.Examples: Satisfaction Rating 17
2 43
11. Answers and Knowledge 17 1. Highly Satisfying 43
2. Satisfying Examples 45
3. Somewhat Satisfying Examples 48 Tu
17 esda
4. Not Satisfying Examples 50 2.1 y, J
6.2 ul
54 y 2
.6
9.Other Aspects Related to Search Satisfaction Grading 52 by 5, 20
Su 23
rya fro
IP
1. Overall Preference Rating (OPR) 52 Ma m I
o m i nd P
3 fr ndur uri
2
6. Writing
20 a MComments
a 54
5, y
u ly 2 Sur
10.OPR y, &bComment Examples
J y 55
e s da 54.6
u 16.2
TVersion
2. History 61
17
1.5 (3rd February, 2023) 61
1.4 (21st March, 2023) 61
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
1. Introduction Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
1. Search Needs and Satisfaction 4 1. Search Needs and Satisfaction by 5, 20
Su 23
rya fro
2. The Query IP 5 Ma m I
r o m ri nd P
2 3 f andu Search engine users are trying to accomplish a task (or urachieve
i a goal)
3. Steps in5, 2the 0 Grading
M Process 5
l y 2 u rya that requires some information or quick access to some other
u S
4. Definitions
y, J .6 by 6
a
esd 54 resource, such as an app.
Tu .16.2
2
17
A user s information need or search need is de ned as the
information or resource that the user needs in order to accomplish
A search service may return many di erent types of results. How are
their task. The user's query is an attempt to express that need to the
these graded? What is a satisfying search result? In these guidelines
search engine. If the search results enable the user to accomplish their
we talk about what constitutes a search query, the di erent types of
task, we say that the search need is satis ed.
results, and how to grade them. In addition we describe some typical
grading tasks that use the principles learned in satisfaction grading.
We say that a result is satisfying if it satis es the search need of a
query. Results can be more satisfying or less satisfying depending on
how well or how completely they satisfy the need.
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
A search need → a search query f u
uri A search query → results returned 023 and
, 2 M
y 25 urya
You may assume all searches are made on an Apple iOS mobile u l S
a y, J .6 by
device. esd 54
Tu .16.2
2
17
• The query itself 2. Validate the result to make sure it can be graded, as explained in the
Result Validation section. Following step (1) is crucial for correct
• Web Search links you will use to research the possible intents and validation.
interpretations of the query
3. Assign the satisfaction rating per the guidelines outlined in
• The language of the user. We do not want to return results in other
languages • Relevance Principles
• Assigning a Satisfaction Rating
• The location of the user. We want to return results appropriate for
• Special Situations
their area (e.g. locations of business).
Tu
1• esd
72 Date of query. We want to return results that are relevant in time. When assigning your grade, be on the lookout for common mistakes!
.16 ay, Ju
.25 ly
4.6 25 Details can be found in Common Mistakes made.
by , 20
S 23
⚠ Unlessuryou ya fhave
r been speci cally instructed otherwise, skip/release to the
Ma om I ⚠ Search engines often correct query spelling errors and/or rpredict IP
next task if any nof du the above information about the query is missing and their
P o m ri
(“autocomplete”) what a partially typed query was intended 3 f anduIf the web search
ri
absence a ects your ability to provide a grade 2 02 toMbe.
,
25 urya of the query, you
results show results for a corrected or autocompletedly version
should grade your result as if the user typed dthe Ju by S
ay, corrected
4.6 or completed query.
3. Steps in the Grading Process es
Tu .16.2
5
2
Examples: 17
The grading of results consists of the following steps. • Query is “fac,” result is “facebook.com”. Grade as if the query was “facebook.”
• Query is “ted cruise,” result is a wikipedia page about U.S. senator Ted Cruz.
Grade as if the query was “ted cruz.”
What is Search Need and Relevance Page 5 of 61
ff
fi
fi
4. Definitions
Tu
The following terms are used throughout these guidelines: 17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
IP Ma m I
r o m ri nd P
3 f andu u
20
2
5, rya M
Term Definition Examples ri
ly 2 u
y , Ju by S • Stephen Curry
a .6
u esd 6.254
T .1
2 • Yellowstone National Park
17
• Jupiter
• Médecins Sans Frontières
A person, place, organization, business, product, service, or • Starbucks
Named Entity event whose name would normally be capitalized in English. • Post-It Notes
(This includes ctional entities.) • Skype
• Super Bowl LI
• Boxer Rebellion
• Frodo Baggins
• photosynthesis
• elephant
A word or phrase describing a concept or object of study • ROC curve
(other than a named entity) that users may wish to learn • linear algebra
Tu
17 esda more about. Knowledge terms may come from any eld of • cancer
2.1 y, J
6.2 ul
54 y 2 Knowledge Term study, including: science, technology, mathematics, medicine, • oligarchy
.6
by 5, 20
Su 23
rya fro
history, philosophy, literature, art, economics, etc. They are • veto
Ma m I IP
nd P most often noun phrases, but may also be other parts of • existentialism r o m ri
uri 3f du
speech. • metaphor 2 02 Man
,
ly 25 urya
• impressionism u S
a y, J .6 by
sd 4
• interest rate Tue .16.25
2
17
5. Content Unavailable
1. Wrong Language
Flag result as content unavailable in any of these situations:
A result is in the wrong language if it is neither in English nor in the
language of the user s locale. • A result is a web/news or videos result but does not show a page
when clicked.
However, there are a few exceptions that are NOT considered wrong
language results: • If at least one image in web-images group result is not visible
1. Result (e.g. amazon.co.jp) is the same country-speci c site as • Result requires log-in or subscription to access, speci cally where the
requested by the query ( amazon.co.jp ), even if the requested site user would be able to see the content of the page by logging in, but
Tu is not in your locale.
es
you cannot.
17
2.1 day, J
6.2 ul
54 y 2 and result are in the same language, even though it s not the
2. Query .6 • The browser presents a dialog box warning of a privacy or security
by 5, 20
primary 2
Su language
rya 3 fro for this locale. issue on the page.
Ma m I IP
nd P r o m ri
• Required information for this result type is missing 3 f andu no distance
02 M(e.g.
3. User is visiting urianother country, query is for a local business or
, 2
shown for Maps result). 25 urya
attraction, result is in the language of the visited country (i.e. where u ly S
a y, J .6 by
query was submitted), and there is no equivalent result in the user s esd 54
Tu .16.2
own locale language. ⚠ Even if there is enough content to provide a rating
17
2 but the page is behind a pay-wall/
log-in, please check the Content Unavailable ag
• Violent or harmful: the result should not intentionally incite imminent • Illegal: We also manually remove reported results in those
violent, physically dangerous, or illegal activities, nor provide circumstances that are required by law in the corresponding locale
information that leads to immediate harm. (e.g., images of child abuse, content related to sex tra cking,
copyright infringement, etc.) and when action is required to keep
• Sexually explicit: the result should not have overtly sexual or people safe (e.g., involuntary posting of sensitive personal
pornographic material, de ned by Webster s Dictionary as "explicit information, etc). Movie streaming sites such as those posing as free
Tu descriptions or displays of sexual organs or activities that are
17 esda movies are also part of this category
2.1principally
y intended to stimulate erotic without su cient
6.2 , Jul
54 y 2 ⚠ Content that might otherwise be considered inappropriate is acceptable
aesthetic
.6
b 5, 2 or emotional feelings.
y S 02
ury 3 fr if it occurs in a medical, educational, ne art, or journalistic context, and
a M om IP
• Contradicting an expert
I
du P
consensus on public interest topics: the should not be agged (e.g Wikipedia). m
fro dur
i
ri 23 n
result should not contradict well-established or expert consensus on , 20 a Ma
25 ry
a popular topic or issue. This includes misleading or inaccurate J uly by Su
Examples y,
da 54.6
information. es
Tu .16.2
2
• User searched for [tinyzone] and the
17 result is https://
tinyzonetv.to/ which contains pirated content.
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
1. Satisfaction Scale
When judging how satisfying each result is, you ll use the following scale
Almost all users would want to see this result. Many users would be interested in seeing this Some users may nd this result useful, but it s This result has nothing to do with the query, or
It s authoritative, accurate, up-to-date, and result. Satisfying results often provide probably not what most searchers were looking provides incorrect information, and should not
addresses the most likely search need(s). If the supplementary information that is one step for. It s often only indirectly related to the be shown.
Tuser
u is asking a speci c question, the result away from the query topic. search need or assumes an uncommon
17 esda
2.1 ythe
gives
6.2 , Julcorrect answer clearly and concisely. For example, if the query is a restaurant, it interpretation of the query. All results agged as Inappropriate ,
54 y 2 might be a review of the restaurant; if the Content Unavailable , or Wrong Language
.6
by 5, 20
Su 23 query is a company, it might be the current should be rated as Not Satisfying.
rya fro
Ma m I stock price, or news about the company. IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
Satisfaction Scale a y, J .6 by
esd 54
Tu .16.2
2
17
Each time we pass through one of these relationships, we increase the distance from the original concept
A Rolling Stone magazine review of the album. The singer's o cial site and Rob She eld's Twitter.
Somewhat Satisfying
The reviewer Rob She eld's Twitter. Random article from same issue of Rolling Stone
Not Satisfying
Tu
17 esda Degrees of Separation
2.1 y, J
6.2 ul
54 y 2
We can .6 think
by 5, 20 of these relationships as degrees of separation so in this example, the review of the Lemonade album is two degrees of separation
Su 23
from Beyoncé. rya fro
IP
Ma m I m ri
nd P r o
f ndu
uri 23 havea
When Grading results, each degree of separation from the concept mentioned in the query, that is, the number of relationships5,you 20 a M to traverse to
y 2 ur y
get to the result, lowers the grade by one level. See table above. ul S
a y, J .6 by
esd 54
Tu .16.2
2
17
It's also possible for a result to contain all the query words and not be High Quality Low Quality
3. Maps
These results help the user navigate to a place. Usually they have
address and distance from the user. If it s a business it often has hours
of operation.
Tu
17 esda
2.
2.1 Apps
y
6.2 , Jul
54 y 2
.6
by 5, 20
Su take2
Cards that rya 3 frothe user to the Apple app store (or open an app on the
Ma m I IP
device). Usually ndthey P have an icon of the app and the star ratings. r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
7. Sports
These cards are meant to display sports scores, or latest scores for a
5. Dictionary team (and dates of upcoming matches). Some examples
This card shows the de nition of word. When the user interacts with
this card it provides detailed usage.
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
9. Web Images
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Query is the name of a well-known app; result is the a. Query is “facebook”, result is the Facebook app.
1 App Query Official App
app with that name b. Query is “calculator,” result is the built-in Calculator app.
App Regularly Used Query is the name of a business; result is an app a. Query is “b of a,” result is the Bank of America mobile banking app.
2 Business to Interact with regularly used to interact with that business. See b. Query is “dominos,” result is the Domino’s Pizza app, which allows
Business details under “Apps” in “Additional Guidance”. users to place orders.
Tu
17 esda Query is looking for a specific location / business /
2.1 y, J a. Query is “1234 market street sf”; result is a Map for that exact address
6.2 ul institution / point of interest, or the closest example
54 y 2 b. Query is “new york public library”; result is a Map to that location
.6
by 5, 20 of a chain business / type of business, and the
Su 23 c. Query is “larry and joe’s”; result is a Map to a restaurant with that name
rya fro result showed that location on a map.
Ma m I in the same town where user is located IP
3 Maps
nd P Query Closest Map r o m ri
d. Query is “closest lowe’s”; result is a Map showing 3 f ndu Lowe’s store
02 Mathe
uri
Queries with a map intent often have a distance 2
location closest to the user’s location. y 25, urya
qualifier e.g. "nearest", "closest", "near me". Also ul by S
e. Query is “starbucks”; result is a Map y, J showing the closest Starbucks
such queries often relate to business where one es d 54.6
a
branch. Tu .16.2
must physically go to e.g. gas stations, cinema halls 2
17
Query is the name of a creative work (music album, movie, a. Query is “fleabag,” result is https://en.wikipedia.org/wiki/
4 Creative Work Performer/Creator etc.); result is a representation of the creator/performer (e.g., Phoebe_Waller-Bridge, the wikipedia page about the creator and
Tu
17 esda artist’s official site). star of that television series.
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Query is a named entity, result is an authoritative page (other a. Query is “facebook,” result is news story “Facebook agrees to
6 Named Entity News than official online presence) providing news about that pay FTC $5 billion fine for various privacy violations,” dated
entity. the same day the search was performed.
Query is asking for specific piece of information with a simple a. Query is “barack obama age,” result is https://
Embedded Correct right answer, and the result contains that answer, but the en.wikipedia.org/wiki/Barack_Obama.
7 Exact Question
Answer user has to take an action (e.g., follow link to destination b. Query is “cambridge library hours,” result is https://
page and read it) to get the answer. www.cambridgema.gov/cpl/hoursandlocations.
a. Query is “zillow”, result is the video “Living Large in a Tiny Home” from
Query is the name of the entity; result is not their
Zillow’s YouTube channel.
official website, but is a site, page, video, or app
Company/Product/ Related Site/Video/ b. Query is “sonicare” (brand of electric toothbrush), result is website for
3 related to their business. For example, this might be
Named Entity App Oral-B (a competing brand of electric toothbrush).
a 3rd party site about that company or its products,
c. Query is “billy idol” (singer), result is wikipedia page for Generation X, a
or a site for a competing product or service.
band from the 1970s he was in before he became famous.
Query is the name of an event or named entity; a. Query is “super bowl news,” result is a news story “Patriots Come from.
Stale but Valid News result is a news story about an earlier event or early Behind to Defeat Falcons in Super Bowl LI.” The story is still accurate,
Tu 4 Named Entity or Event
e
17 da s Story news about the entity. The news story must still be but it describes something that happened in 2017, not in the most
2.1 y, J valid. recent or upcoming Super Bowl.
6.2 ul
54 y 2
.6
by 5, 20
Su 23 Query is the name of a general concept or event a. Query is “dogs”, result is wikipedia page for the dog breed Beagle.
rya fro
(such as a TV show); result is about a specific b. Query is “suits” (a TV show that ran for 9 seasons), IP
5
Ma m I
General
nd P Query
Overly Specific
r o m result
ri
is https://
f d u
uri Result instance of that concept or event (such as a www.peacocktv.com/watch-online/tv/suits/8003089882869075112/
23 n
, 20 a Ma
particular episode of that show). seasons/5, a page where viewers can lstream 25 rythe 5th season.
J u y by Su
y,
es da 54.6
Tu .16.2
2
17
This result has nothing to do with the query, provides incorrect information, or fails the validation step, and should not be shown.
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Implicitly Locale-Sensitive.
Query does not explicitly ask for results in a particular Any results from a di erent locale (even if they’re in the Query is “ticketmaster”; user is located in US. Result is
locale, but the user need is inherently locale- correct language) should be automatically graded as ticketmaster.co.uk. Grade as NS, since user did not
speci c (e.g., local law information, country-speci c “NS”. express any interest in UK events.
merchant sites, nearby real-world business).
The user’s locale is one where most users understand English uently (i.e. ES-US)
Grade the result normally, the same way you would if it were in the locale language.
and would likely be interested in English-language results.
Grade the result one level lower than you would if it were in the locale language.
The user’s locale is one where many users understand English uently (i.e. Western
⚠ Results that would have been NS should still be graded as NS
Europe) and would possibly be interested in English-language results.
The user’s locale is one where relatively few users understand English uently and
Grade the result as NS.
would be unlikely to be interested in English-language results.
4. Redirected Pages
Tu
1If
72 the
esd result displayed URL gets redirected to a di erent URL, then you should grade the page you re redirected to as if that were the result.
.16 ay, Ju
.25 ly
4.6 25
by , 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Rule 1 under HS refers to cases where the query is the name of a well-known app —
a service that is best known as an app. Examples: Instagram, Spotify, and Candy Crush
⚠ A well-known app is not the same thing as a well-known company!
Rule 3 under HS refers to cases where the query is a business and the result is an
app “regularly used to interact with that business.” Meaning, the app is a common
way that customers or clients perform the ordinary tasks they need to do business
with that company.
1. If the query is the name of a bank, then the app should allow the user to
perform mobile banking tasks.
⚠ Just because a company has an app does not mean that it’s regularly used 2. If the query is the name of a restaurant chain, then the app should allow the
user to order food at that restaurant.
to interact with that business. For example, the query “dell” refers to the name
3. If the query is the name of an airline, then the app should allow the user to
of a computer company. But their app “Dell@Retail 2019” is described as “a
make reservations, choose their seat assignment, and check ight status.
Tu chance for our global retail partners to immerse themselves in the design,
17 esda
2.1 performance, 4. If the query is the name of a retail chain, then the app should allow the user
y
6.2 , Jul and vision driving Dell’s innovation.” This app is NOT used
to browse and purchase items sold by that chain.
regularly
54.6 25 by Dell’s customers and should NOT be graded HS.
y
,
by 2
Su 023
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Timely Article: up to 3 months older than the search date Either S or SS if it's about the query topic.
Current Event
May never be graded better than SS even if
Stale Article: more than 3 months older than the search date
it's about the query topic.
Time sensitivity does not impact the relevance grade of the results for these types of queries. Examples of historical events are
Historical Events
Notre Dame re, Harry and Meghan wedding, Sandy Hook shooting, Pope Benedict resigns, etc.
Tu ⚠ You might see articles with dates in the future! For these rare occurrences, grade it the same way as a timely article,
17 esda
2.1 y, J
6.2 ul as long as the date is not more than 3 months newer than the search date.
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I ⚠ News items are never HS. Why? one news organization – even one reporter – may actually write several stories IP
nd P r o m ri
uri about the same event. Maybe one person wants to get an overview of an event while another wants the latest updates. 023 f andu
,2 M
Or one person only likes stories from Fox News while another prefers MSNBC. For these reasons, we can't say that ualy 25 Surya
given news story is one that almost everyone wants to see. So it is mistake to rate a news result as Highly Satisfying.
a y, J .6 by
es d 54
Tu .16.2
2
17
Maps Results
Grading Maps
Maps result is correct and near the user, but is not the closest one S
Business
Maps result is correct, and is still accessible to the user but is not close. SS
• Query is "prime video" and result description is: "prime time video, 2511 springs rd ne, hickory, nc 28601- distance: 529 mi
• Query is "Lakers" and result description is: "great lakes brewing company, 2516 market ave, cleveland, oh 44113 - distance: 2,165 miles
5. Web Video
• If a query speci cally refers to a particular video (e.g., lemonade o cial video,
stepanov elements of programming lecture ), the desired result should be
graded as Highly Satisfying regardless of its popularity.
•T For other results, and for more general queries where many di erent video results
us
17 ecould
2.1 day, J satisfy the user's need (e.g., guitar lesson ), then popularity may factor into
6.2 ul
your
54 decision;
y
.6 25, you may want to grade a video with millions of views higher than a
by 20
similar Sone
ury 3with
2
fr only a handful.
a M om IP
an IP r o m ri
d
• When decidinguron your grade, think about whether video results are what user is 3f du
i 2 02 Man
,
looking for when typing the query. ly 25 urya
u S
a y, J .6 by
esd 54
⚠ You are not required to watch the entire video to arrive at a rating Tu .16.2
2
17
• Answers: If the query is an explicit question, see HS7. Grade on what is visible.
Please click on the thumbnail and grade the destination page(after redirects).
8. Web Images
A group of web images should be graded as a single result. Check to see if all the images have the
following properties:
Tu
17 esda
2.1 Image
1. 6.2 , Jul displays correct subject. The image must actually show the subject of the query. For
y
54 y 2
example,
.6
by 5, 20if the query is dodecahedron, the image must actually show that geometric gure and
Su 23
not some rya other
fr
Ma om I
one. Missing images (or ones that do not load) do not have this property. IP
nd P r o m ri
uri 3f du
2. Subject clearly shown. All images in the set must clearly show the subject of the query. The 2 02 Man
Query: Men 25 uin
, yaBlack
r
ly
subject should not be blocked, out of focus, too far away, or otherwise di cult to see clearly. , J u by S
ay .6
u esd 6.254
T .1
3. Subject is focus of image. In cases where the image includes multiple people or objects, it 17
2
should be clear who or what is the subject of the query. (For example, if the query is Joe Biden,
4 Property #1 violated for any image Mark as Content Unavailable and Grade as Not Satisfying
Examples:
Tu
17 esda
2.1 y, J
6.2 ul is David Beckham, result is set shown above. It has all the desired properties, so you would grade as Highly Satisfying.
• Query54 y 2
.6
by 5, 20
Su 23
a M from
• Query is rydodacahedron (a geometric shape); result set is shown on the right below. Neither the second image nor the last image in this IP set are
an IP r o m ri
dodecahedrons, du so they violate property #1. Therefore you would grade this Not Satisfying.
ri 3f
02 Man
du
, 2
ly 25 urya
• Query is ta y brodesser-akner (an author); result set is on the left below. Two of the images in the set are problematic;y,one yS
Ju bshows part of a poster
a
d 54. 6
es
for an event featuring the author, and another shows her with another person, both partly cut o . Neither of these violates Tu .16.2 property #1 because
72
both attempt to represent the author and not something else that would confuse or mislead the user, like a picture of1 a di erent author. But each
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
1. Failing to Use Web Search 3. Falsely Assuming Dominant Interpretation. If you have heard of a
result, you may assume that it's the dominant interpretation. But this
1. Misunderstanding Query Meaning. The query may be a common is not always true.
word that you think you know. But the web search may show that
• Example: Query is "u of m scholarships," result is a page about
the primary meaning is something entirely di erent.
scholarships at the University of Michigan. A grader who knew
• Example: Query is "canada goose"; result is the wikipedia page nothing about the subject might conclude that this is a great
about that kind of bird. If you had not heard of the Canada Goose result, and rate it Highly Satisfying. But looking at the web results
clothing brand, you might assume that the bird page is what shows that the query has no dominant intent. It might be referring
almost all users would want to see. But by looking at the web to the University of Minnesota, or the University of Manitoba, or
search results, you can tell that this is not the case. many other things. Therefore the grade cannot be HS.
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
ly 2 u
y , Ju by S
a .6
u esd 6.254
T .1 Query Result(s) Rating Explanation
2
17 Instagram is best known as an app, so result is what
instagram Instagram app HS
almost all users would want to see. (Rule HS1)
Almost all users searching for a celebrity would want
olivia rodrigo O cial website for the pop star, oliviarodrigo.com HS
to see that person's o cial web site. (HS4)
saw
A knowledge card for a named entity is Highly
HS
Satisfying. (HS5)
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
2. Satisfying
.6
by 5, 20 Examples
Su 23
rya fro
Ma m I IP
m ri
QuerynduriP Result(s) Rating Explanation 23 f andu r o
20
2 5, rya M
J uly by Su
,
ay 4.6
The query is asking an implicit
esd 6question
5 (how to change
T .1 .2
u
instagram.com change Instagram password. This web 2 page has the authoritative
O cial instructions on how to change instagram password S 17
pass answer, but the user has to click on the result to visit the
page in order to see the answer. (S7)
Probably not what most users were looking for. (If they had
camden county college Home page for library at the college SS
wanted the library, they would have mentioned it in the query.)
A very popular interview with BTS. and tv show host, but not very
bts
2018 video of interview with the band SS relevant given that it is several years old, and several newer
[searched in 2022]
interviews are available.
cao Irish website about applying to undergraduate There is a grocery chain in Florida called CAO, so it's unlikely that
SS
[user is in Florida] programs in Ireland. the user had the Irish website in mind.
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
how many weeks has it been Despite matching some words in the query, this
https://www.answers.com/Q/
since march 25th NS result is for a totally di erent year and does not
How_many_weeks_has_it_been_since_April_27_2009
[query issued in April 2021] give the user any useful information. (NS6)
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
After providing satisfaction ratings for every result, you will be asked to choose
which side you prefer. This is called the Overall Preference Rating (OPR).
The rating scale is About the Same, Slightly Better, Better and Much Better.
OPR Criteria:
Use the following criteria to decide on the OPR:
1. Prefer the side whose results have higher satisfaction grades.
2. If there are multiple results, prefer the side where results with higher
satisfaction are ranked higher.
3. If there are multiple results, prefer the side with a more varied result set. This
might be a variety of result types (maps, apps, web pages, etc.), satisfying a
variety of meanings of the query.
Tu
ed
72 sNote
14.
.16 ay, Ju that the side with more results is not necessarily better.
.25 ly
4.6 25
5. If you by re, 2 having trouble deciding which side is better, choose About the Same.
Su 023
rya fro
Ma m I IP
nd P r o m ri
How much these uricriteria a ect OPR also depend on the position of the result. For example, 3f du
2 02 Man
if the satisfaction rating of the results in position 1 are di erent, that should have a bigger y
,
25 urya
u l S
impact on OPR than if the satisfaction rating of results in position 4 are di erent. a y, J .6 by
esd 54
Tu .16.2
2
17
When one side is does not have results, OPR choice has some special guidance. Depending on the product (browser or
Tu phone) the following guidelines
17 esda
will be automatically be shown in the template 2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
2 Su
P rya 3 fro
• Prefer the side I Ma m I Satisfying
f rom WITH
u ri results ONLY when the side with results has at least one result graded Somewhat Satsifying, Satisfying or Highlynd P
2 3 and uri
2 0 M
• Do not 2choose5 , ya "About The Same .
u ly Sur
J y
s day, 4.6 b
e 5
Tu .16.2
OR 2
17
• Prefer the side WITH results ONLY when the side with results has at least one result graded Satisfying or Highly Satisfying
• Do not choose "About The Same .
In neither case should you choose About the Same in other words a side with a result can never be as good as a side without.
Tu
17 esda
2.1 y, J
6.2 ul
54 y 2
.6
by 5, 20
Su 23
rya fro
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Since only the last result is di erent, and the last result on the left is
Query 3: apollo project
less bad than the one on the right, we conclude that the left side is
Location: Cincinnati, OH on Feb. 13, 2020. Slightly Better.
LEFT RIGHT
Apollo Space Program wikipedia article Apollo Space Program wikipedia article
(en.wikipedia.org/wiki/Apollo_program) (en.wikipedia.org/wiki/Apollo_program)
LEFT RIGHT
query was on Feb. 13, 2020, we assume the user wanted the most Official video for Ramos' 2021 song
NBC News article from February 2021
"Blessings"
recent award winner at the time, announced at the ceremony on Official video for Ramos' 2021 song
Anthony Ramos instagram page
February 8, 2020. “Say Less"
Slightly Slightly Much
Much Better Better About the Same Better
• Result #1 on the left (same as #2 on right) contains the answer, but Better Better Better
Turequires visiting the page and scrolling all the way to the bottom to
17 esnd
2.1 day, it. Result #1 on the right gives us the answer right away, without
6.2 Jul
even y
54 having to click on it.
.6 25,
by2 OPR Explanation: The query refers to an actor and singer who
Su 023
rya fro appeared in the original cast of the musical Hamilton. P
• Result #2 Monan the
m
I left is a YouTube video from a non-authoritative I
m ri
du P r o
3 f andu
source (a random ri fan), and it s very outdated ̶ from 2011. • Results L1, R1, and R4 all all Highly Satisfying. 2 02 All the rest of the
2 5, rya M
y
results on both sides are Satisfying. , Jul by S u
• Result #3 on the left is related to best actor winners, but doesn t ay .6
u esd 6.254
actually contain the answer the user is looking for. • T .1
The set on the right is more diverse, 2 providing more di erent
17
types of results.
OPR Explanation: Both sides have the same results, but they are
ranked di erently. Since the search was done in 2021, it s most likely
OPR Explanation: The query can refer to many di erent things or that the new 2021 documentary about Tina Turner ( Tina ) is what the
people, and the web search results make it clear that none of them is a user was looking for. Since the only di erence is the ranking, and the
dominant interpretation. Furthermore, these results all seem to be only right side ranking is clearly better than the left side (moving the best
Somewhat Satisfying, since it isn t likely that most users in the United result into position #1), it s Better.
Tu
e
1States
72 sday, were searching for (say) an Indonesian app or an Israeli Singer
.16 Ju
from .25 the
4. ly 21990s.
5
Therefore the two sides are About the Same.
6b
y S , 202
ury 3 fr
a M om IP
an IP r o m ri
du 3f du
ri
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
Query: monster hunter stories 2 OPR Explanation: Both sides have the brief Knowledge card describing
the person (with links to her o cial website and twitter feed). The left
Location: Miami, FL on 2021-08-10. side also has web videos for two of her songs, while the right side also
has her o cial website and Twitter feedResults R2 and R3 are more
Tu LEFT RIGHT
17 eWikipedia
sda valuable than L2 and L3, but the lack of any videos makes the right
2.1 y, J entry for the video game Wikipedia link to Monster Hunter Stories
6.2 Monster
54
uly Hunter Stories 2: Wings of Ruin side only Slightly Better.
.6 25,
by 2 Slightly Slightly Much
Much Better Su 02Better About the Same Better
rya 3 fro Better Better Better
Ma m I IP
nd P r o m ri
uri 3f du
2 02 Man
,
ly 25 urya
u S
OPR Explanation: The user speci cally asked for Monster Hunter a y, J .6 by
esd 54
Stories 2 . The left side has a more general result (it s about the entire Tu .16.2
2
17
video game series), while the right is about the exact thing the user
asked about, so the right is Better. To be Much Better, the right side
OPR Explanation: Both have same third result. Both have the same
Highly Satisfying info card, but it s ranked better on the left. Of the
remaining results, the one on the left might be useful, while the one on
the right is Not Satisfying. Both of these di erences favor the left side,
so it is Better.
Location:
T Paxtonia, PA 2021-09-22.
u
17 esda
2.1 y, J
6.2 ul
54 y 2 LEFT RIGHT
.6
by 5, 20
Su 23
Official fr
rya website Official UK website
Ma om I IP
m ri
nd P r o
du
uri 3f
Twitter handle Huffington Post News App 2 02 Man
,
ly 25 urya
u S
Much Better Better
Slightly
About the Same
Slightly
Better
Much a y, J .6 by
Better Better Better esd 54
Tu .16.2
2
17
๏ In Section 2 regarding the query, if the research links do not work, copy the phrase into the search engine (e.g. Google/Bing) with the appropriate
locale.
๏ Added some guidance on permanently closed maps results. See Maps guidance (2)
1.4
Tu (9th February, 2023)
17 esda
2.1 y, J
6.2 ul
๏ If 5at y2
4.6 least one image in web-images group result is not visible then ag as Content Unavailable (see section in Content Unavailable)
by 5, 20
Su 23
rya fro
๏ Updated table Ma mof
nd IP
advice to suggest this in Grading Speci c Advice for Web Images o
IP
m ri
r du
uri 3f
2 02 Man
,
ly 25 urya
u S
a y, J .6 by
esd 54
Tu .16.2
2
17
fi
fi
fl
fi