
Task Overview

Given a Webpage, analyze it to identify the Primary Entity it describes,
then answer Yes or No on whether that Primary Entity belongs to the
shown Entity Type.

Note that a given Primary Entity can have multiple Types. For the
purpose of this HitApp, we don't need to know the best or most
appropriate Type. We simply want to know whether the Entity Type
assignment is correct or not.

Process Steps

1. Read the web page on the left.
2. Read the Entity Type on the right.
3. Determine the primary entity of the web page and decide if the
Entity Type for it is correct.
4. Is the displayed Entity Type for the given Webpage correct?
Choose ‘Yes’ or ‘No’ as the answer. If you see a blank screen,
open the page in a new tab. If it does not open in the new tab
either, or the page does not exist, click 'Page Not Found'.
5. Click “Submit”.

Rating Examples

For example, all of the following are correct for the "Barack Obama"
Entity:

https://en.wikipedia.org/wiki/Barack_Obama - People
https://en.wikipedia.org/wiki/Barack_Obama - Politician
https://musicbrainz.org/artist/0de4d19f-05c8-4562-a3c0-7abdc144f1d5 - Artist
https://twitter.com/BarackObama - InternetCeleb

Similarly, all of the following are correct for the "Michael Jordan"
Entity:

https://en.wikipedia.org/wiki/Michael_Jordan - People
https://en.wikipedia.org/wiki/Michael_Jordan - SportsPlayer
https://www.nba.com/news/historynba-legend-michael-jordan - SportsPlayer
https://www.imdb.com/name/nm0003044/ - Actor
https://www.nba.com/hornets/executive-staff - Businessman
https://www.forbes.com/profile/michael-jordan/?sh=360b9aa72d83 - Businessman
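
The rule illustrated by these examples (any valid Type counts as correct,
not only the best one) can be sketched as a simple membership check. This
is only an illustration: the `ACCEPTED_TYPES` table and the `rate`
function are hypothetical names, and the table holds just the Types listed
in the examples above, not an official list.

```python
# Hypothetical lookup built only from the rating examples above.
ACCEPTED_TYPES = {
    "Barack Obama": {"People", "Politician", "Artist", "InternetCeleb"},
    "Michael Jordan": {"People", "SportsPlayer", "Actor", "Businessman"},
}


def rate(primary_entity: str, shown_type: str) -> str:
    """Answer 'Yes' if the shown Type is any one of the entity's valid Types."""
    valid_types = ACCEPTED_TYPES.get(primary_entity, set())
    return "Yes" if shown_type in valid_types else "No"


print(rate("Michael Jordan", "Actor"))  # a valid Type, even if not the "best" one
print(rate("Michael Jordan", "Album"))  # not a Type of this entity
```

The point of the sketch is that the rater never ranks Types; the answer
depends only on whether the shown Type is among the valid ones.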

Additional information about the definition and examples of each Entity
Type is shown below as well as on the Hit page itself.

Entity Types:

Adult Website

This Entity Type refers to a website that contains sexually explicit
content.

Album

This Entity Type refers to a Musical Album, which is a collection of
tracks, often by the same artist/band, that are typically sold together.
Videotaped live albums showing track listings should also be
considered as Album, not Movie. For example:

• https://en.wikipedia.org/wiki/Marvin_Gaye:_Live_in_Montreux_1980
is an Album, not a Movie.

The following page should be considered as Song, not Album:

• https://baike.baidu.com/item/Damn!/58228273

Animal

This Entity Type refers to all Animals, excluding People.

Auto

This Entity Type refers to all kinds of automobiles.

Book

This Entity Type refers to all kinds of written works that are bound and
self-contained. The Entity typically has an author, a publisher, a date
of first publication, etc. Examples:

• Written works that have been published as stand-alone books.
• Novels
• Dissertations (e.g. The Structure of Evolutionary Theory).
• Unpublished novels (e.g. Prince Jellyfish).
• Comic Books
• eBooks, such as Kindle versions of books.

Composition

This Entity Type only refers to Symphonies, Orchestra, Sheet Music
(Score/Notation).

Wikipedia Pages ending in (song) are not Composition type.

Composition is different from the Song Entity Type. Normal Song
information/play pages, with or without Lyrics, should be considered as
Song type, not Composition.

Fictional Character

This Entity Type refers to characters found in fictional works. Examples:

• https://en.wikipedia.org/wiki/Hawkgirl

Field of Study

This Entity Type refers to the areas of knowledge organized around
common research and theories. Examples:

• History
• Culinary Arts
• Physics
• Health Condition, Symptom, Diagnostic Procedure, Diagnostic
Test
• Physical Exercise, Workout

Generic Food Sources

This Entity Type refers to general Food sources that are consumed
directly or used in preparation of other Food. Examples:

• Chickpea - http://en.wikipedia.org/wiki/Chickpea
• Squid as Food - http://en.wikipedia.org/wiki/Squid_as_food

Please note: Use your judgment to understand the difference between
Animals and Animals as Food. For example, Wikipedia makes such a
distinction:

• Fish as Food: https://en.wikipedia.org/wiki/Fish_as_food
• Fish as Animals: https://en.wikipedia.org/wiki/Fish

Movie

This Entity Type refers to a form of entertainment that enacts a story
by video. A film can have any running time and be presented in
a theatrical, television, internet-streaming or direct-to-home video
presentation. General evidence to look for: Duration, Release Date,
Producer, Director, Cast etc. Examples:

• Films released in theatres
• Short films
• Educational, promotional, or institutional/corporate films (for
example: Your Safety First).
• TV movies
• Direct-To-Video films

Movie Series

This Entity Type refers to Movie-Series pages that describe multiple
movies from a sequence of films instead of a single movie. General
hints to look for:

• Wikipedia page title sometimes ends with "(film series)".
• Wikipedia normally has a table containing all the movies that are
part of the series.

Examples:

• https://en.wikipedia.org/wiki/Marvel_Animated_Features
• https://en.wikipedia.org/wiki/American_Pie_(film_series)

Media Franchise

This Entity Type refers to pages that describe a collection of related
multimedia works in which several derivative works have been produced
from an original creative work. Derivative works include:

• Print - Books/Manga/Comics
• Films
• TV Shows
• Videogames
• Theme Park

General hints to look for:

• Wikipedia page title sometimes ends with "(franchise)".
• Wikipedia normally has a table containing all the multimedia
works that are part of the franchise.

Examples:

• https://en.wikipedia.org/wiki/Star_Wars - Books, Films, Games,
TV
• https://en.wikipedia.org/wiki/Aladdin_(franchise) - Films and TV,
Theatre, Games, Theme park

Organization

This Entity Type refers to groupings of people or other organizations
that affiliate for some reason. Examples:

• Non-profit organizations
• Government agencies
• Companies
• Schools or Universities
• Sports Teams
• Music Bands
• Child Care, Senior Care, Pet Care Centers
• Radio Stations etc.
• Magazines and Newspapers are Organizations

People

This Entity Type refers to a human being (man, woman, or child) known
to have actually existed.

• People, celebrities, and politicians are People
• Deceased people
• Web Active Person
• Famous Generic People
• Actor
• Internet Celebrity
• Artist
• Doctor
• Academic Author
• Sports Player
• Electronic Sports Player

These are not people:

• Organizations such as: https://www.youtube.com/vaguardpao or
https://sports.yahoo.com/nhl/teams/car/ (Carolina Hurricanes)

Place

This Entity Type refers to anything with spatial extent and a fixed
location on Earth. It is something you could find on a map and could
be used to provide information about the position of something else.

• Restaurants and Local Businesses from pages such as
Yelp/Menupix should be considered as a Place.
• Cities, towns, municipalities, villages, hamlets, neighborhoods,
settlements, communes, etc. (Long Beach, New York)
• Geographic features like lakes, rivers, swamps, oceans, seas, bays,
mountains, hills, islands, geologic formations, etc. (Dead Sea).
• Man-made structures like buildings, roads, monuments, historic
landmarks, racetracks, airports, graveyards, archaeological sites.
• Governmental districts, electorates, and constituencies (e.g.
Pennsylvania House of Representatives).
• Geographic coordinates (e.g. 36th parallel south and 160th
meridian east).

Confusion between Organization and Place: Webpages primarily about
the location of a local business should be considered as Place. If the
webpage is about the Organization in general, such as its CEO, date of
foundation, revenue etc., then it should be considered as
Organization.

Product

This Entity Type refers to any tangible commodities produced and
then consumed by the consumer to satisfy current wants or needs.
Products can be sold and bought. Examples:

• Product pages on Amazon
• Drugs
• Packaged Food

Recipe

This Entity Type refers to a set of instructions that describes how to
prepare a culinary dish.

Software / Application

This Entity Type refers to Computer Software, Application, Operating
System, or a Program. Examples:

• Applications such as Microsoft Word as well as specific versions
of Applications like Microsoft Word 2010.
• Browsers like Google Chrome
• Operating Systems like Apple iOS

Tricky cases:

• Webpages that are primarily about Computer Video Games
should be considered Video Game type instead of Software.
The exception is a Webpage that is primarily about the Software
itself: for example, an Apple App Store or Google Play App Store
link of a game can be considered Software.

Song

This Entity Type refers to:

• Information pages about Songs
• Pages with Play button such as Spotify/iTunes
• Lyrics pages
• Video performance pages such as YouTube videos of Songs

Hint

• Wikipedia page title normally ends with "(song)"

The following page should be considered as Song, not Album:

• https://baike.baidu.com/item/Damn!/58228273

Please Note: Song is different from the Composition Entity Type.
Composition is only about Symphonies, Sheet Music Scores,
Orchestra. All other Song pages should be considered as Song type.
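
The Wikipedia title hints mentioned in these guidelines ("(song)",
"(film series)", "(franchise)") can be sketched as a small suffix lookup.
This is a minimal illustration only: `SUFFIX_HINTS` and `hinted_type` are
hypothetical names, and a matching suffix is just a hint, so the page
content still has to be read before answering.

```python
from typing import Optional

# Title suffixes named as hints in the guidelines, mapped to the hinted Type.
SUFFIX_HINTS = {
    "(song)": "Song",
    "(film series)": "Movie Series",
    "(franchise)": "Media Franchise",
}


def hinted_type(page_title: str) -> Optional[str]:
    """Return the Entity Type hinted by the title suffix, or None if no hint applies."""
    title = page_title.strip().lower()
    for suffix, entity_type in SUFFIX_HINTS.items():
        if title.endswith(suffix):
            return entity_type
    return None


print(hinted_type("American Pie (film series)"))
print(hinted_type("Star Wars"))  # no suffix, so no hint; judge from the page itself
```

Note that a title without a suffix, such as the Star Wars page, can still
be a Media Franchise, which is why the function returns no hint rather
than "No".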

TV Show / Series

This Entity Type refers to a TV program, factual or fictional, that is
broadcast on television. It may be a one-off broadcast
or a TV series that has seasons and episodes. This type includes
regular broadcasts (e.g. TV news), TV series or miniseries, television
specials, and infomercials. Examples:

• TV series (e.g. Game of Thrones, True Blood).
• TV miniseries (e.g. Band of Brothers).
• Reality TV shows (e.g. American Idol).
• TV talk shows (e.g. The Oprah Winfrey Show, The Ellen
DeGeneres Show).
• TV news programs (e.g. CNN Newsroom, Newsround, ABC World
News).
• TV documentary or documentary series.
• Television specials (e.g. A Muppet Family Christmas).
• TV game shows (e.g. The Price Is Right, Jeopardy!, Takeshi's
Castle).
• TV coverage of sporting events (e.g. Red Bull X-Fighters, UFC 90:
Silva vs. Côté, The NBA on ABC).
• TV infomercials (e.g. Weekend Marketplace).
• TV cartoons or animated programs (e.g. Simpsons, Naruto).

TV Season

This Entity Type refers to a season of a serialized TV Show. A season is
a regular run of episodes that defines a self-contained part of the
television program. A program often has more than one season, if picked
up by a network. Examples:

• https://en.wikipedia.org/wiki/Agents_of_S.H.I.E.L.D._(season_6)
• https://en.wikipedia.org/wiki/America's_Got_Talent_(season_4)

Please note that sports season information pages, like the Wikipedia
examples below, are not TV Seasons:

• https://en.wikipedia.org/wiki/1983_Chicago_Cubs_season
• https://en.wikipedia.org/wiki/1948–49_New_York_Knicks_season

Instead, these sports seasons are of Generic type. However, sports
season broadcast pages on IMDb, like the examples below, should be
considered as TV Seasons:

• https://www.imdb.com/title/tt0407423/episodes?ref_=tt_eps_sm
• https://www.imdb.com/title/tt7135328/episodes?season=5&ref_=tt_eps_sn_5

TV Episode

This Entity Type refers to a self-contained part of a particular serial TV
Show.

Video Game

This Entity Type refers to video games and hand-held electronic
games. It encompasses all computerized games, incorporating
platforms from computer to console, from game arcade to mobile
device. Examples:

• All video games regardless of platform, such as games for
computers, consoles, arcades, and cellphones.
• Computer game expansions and expansion packs such as The
Sims: House Party and Half-Life: Blue Shift.
• Computer game remakes such as Metal Gear Solid: The Twin
Snakes.
• Internet game applications such as Farmville.
• Video game compilations such as Sonic Jam.
• Video game modifications such as Multi Theft Auto.
• Hand-held electronic games such as Mattel Auto Race.

Generic

This Entity Type refers to Entities that do not belong to any of the
specific types defined above.
