A CrossPlatform Personalized Recommender System For Connecting ECommerce and Social Network - 2023 - MDPI

future internet
Article
A Cross‑Platform Personalized Recommender System for
Connecting E‑Commerce and Social Network
Jiaxu Zhao 1 , Binting Su 2, *, Xuli Rao 1 and Zhide Chen 3, *
1 Department of Computer Science, Fuzhou Polytechnic, Fuzhou 350108, China

2 Network and Data Center, Fujian Normal University, Fuzhou 350007, China
3 College of Computer and Cyber Security, Fujian Normal University, Fuzhou 350007, China
* Correspondence: bintingsu@fjnu.edu.cn (B.S.); zhidechen@fjnu.edu.cn (Z.C.); Tel.: +86‑180‑6047‑8480 (B.S.)
Abstract: In this paper, we build a recommender system for a new study area: social commerce,
which combines rich information about social network users and products on an e‑commerce plat‑
form. The idea behind this recommender system is that a social network contains abundant infor‑
mation about its users which could be exploited to create profiles of the users. For social commerce,
the quality of the profiles of potential consumers determines whether the recommender system is a
success or a failure. In our work, not only the user’s textual information but also the tags and the
relationships between users have been considered in the process of building the user profiling model. A
topic model has been adopted in our system, and a feedback mechanism has also been designed in this
paper. Then, we apply a collaborative filtering method and a clustering algorithm in order to obtain a high
recommendation accuracy. We do an empirical analysis based on real data collected on a social net‑
work and an e‑commerce platform. We find that the social network has an impact on e‑commerce,
so social commerce could be realized. Simulations show that our topic model has a better perfor‑
mance in topic finding, meaning that our profile‑building model is suitable for a social commerce
recommender system.
Keywords: social commerce; recommender system; topic model; CFUA
Citation: Zhao, J.; Su, B.; Rao, X.; Chen, Z. A Cross-Platform Personalized Recommender System for Connecting E-Commerce and Social Network. Future Internet 2023, 15, 13. https://doi.org/10.3390/fi15010013
Academic Editor: Michael Sheng
Received: 4 November 2022
Revised: 14 December 2022
Accepted: 23 December 2022
Published: 27 December 2022
Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
1. Introduction
E-commerce, the activity of buying or selling online, has generated significant business value, and it is becoming ever more common for both consumers and companies to purchase products or services online. The rise of various e-commerce platforms results in competition between companies, and they continue to explore new sales models. In recent years, social commerce, defined as a subset of e-commerce involving social networks, has become one of the most popular topics [1]. In social commerce, users are encouraged by a platform via financial rewards to share/diffuse/broadcast information about the various products sold on that platform via social networks. For example, Amazon and Twitter launched a seamless cross-platform shopping service, ‘AmazonCart’, in 2014, which allows Twitter users to purchase Amazon items in tweets while browsing Twitter. Another example is Alibaba, the largest e-commerce platform in China. A large number of Alibaba vendors promote their commodities through Sina Weibo (the largest Chinese social network). This indicates that e-commerce has evolved from a stand-alone platform to one that incorporates social information. RT Wigand et al. indicated that the rapid development of social media and web technology may have the potential to transform e-commerce from a product-oriented environment to a social and customer-centered one [2]. Tajvidi M et al. found that, in social commerce, consumer–consumer interaction and consumer–seller interaction enhance consumers’ intention to co-create brand value [3]. Adam et al. provided evidence that social media use significantly influences the development of e-government and the diffusion of e-commerce globally [4].
Future Internet 2023, 15, 13. https://doi.org/10.3390/fi15010013 https://www.mdpi.com/journal/futureinternet

Recommender systems for e‑commerce companies have been well studied [5,6]. How‑
ever, most of the existing recommender systems use only information from e‑commerce
to make recommendations, such as consumers’ purchase history logs or rating scores of
purchased commodities. With the integration of e‑commerce and social platforms, new
recommender systems should be designed, and they should make full use of social net‑
work user information and product information. Some studies have begun to use social
network information to improve the accuracy of the recommendations on e‑commerce plat‑
forms. Damian et al. proposed a web recommender system for e‑commerce that traces
clients and analyzes their activities on Facebook [7]. However, the user profiles are only
based on keywords from the users’ activity and their friends’ activities. Hao Ma et al.
improved the recommender system by adding social contextual information, i.e., social
tags and latent information obtained by heterogeneous data mining [8]. Zhao et al. ex‑
tracted the demographic information of both products and social network users’ activities,
and then leveraged the demographic information to improve the recommendation perfor‑
mance on e‑commerce websites [9]. They also proposed a cross‑platform recommender
system that operates by learning both users’ and products’ features from data collected
from e‑commerce websites using recurrent neural networks and then applying a modified
gradient boosting trees method to transform users’ social networking features into user
embeddings [10]. Using these features, their work realized cross‑site product recommen‑
dation and solved the cold‑start problem. With the rise of AI technology, Pan et al. pro‑
posed a unified framework of active transfer learning for cross‑system recommendation,
which used an active learning principle to construct entity correspondences across sys‑
tems [11]. Xiang et al. integrated the fuzzy association rule and complex preference into a
recommendation model to improve the efficiency of the traditional collaborative filtering
recommendation algorithm [12]. However, these cross‑platform recommendation meth‑
ods solely rely on sparse social network data and e‑commerce data. These methods do not
fully integrate textual information, tagging information and behavioral information from
the social network. In fact, the social network contains abundant, detailed, time‑resolved,
real‑world user data, e.g., tweets (microblogs), tags and relations with other users, which
motivates us to extract users’ information and capture users’ interest profiles for cross‑
platform recommendations.
In this paper, we propose a novel cross‑platform recommender system (CPRec) to
make full use of the abundant information on social networks to improve recommenda‑
tion accuracy. In CPRec, we build both a user profile and a commodity profile from data
on social networks and e‑commerce. An improved topic model for detecting users’ inter‑
est profiles from their historical released information is designed, which is based on Latent
Dirichlet Allocation (LDA) [13]. The users’ tags and their followees’ profiles will be used
when we are building the users’ profiles. After obtaining a user profile, we make recom‑
mendations based on the recommended score as calculated from the users’ profiles and the
commodity profiles. Considering that each user will take different actions after he/she re‑
ceives the recommended products, a feedback mechanism is designed for the CPRec. Since
a user‑commodity‑score matrix will be obtained if the CPRec starts to work, we develop
an improved collaborative filtering algorithm that combines user profiles to make further
recommendations. Finally, in order to show the performance of the proposed system, we
evaluate and analyze the CPRec based on two platforms, Sina Weibo and Alibaba. The
contributions of this paper are summarized as follows:
• We propose a novel cross‑platform personalized recommender system, CPRec, for
recommending e‑commerce commodities to users on social network platforms.
• An interest mining process is proposed to build the user interest profiles, which makes
full use of users’ information on social networks.
• We propose three subdivisions for CPRec, i.e., recommendations for individuals, a
feedback mechanism and an improved collaborative filtering algorithm.
• The experimental results validate the feasibility of the CPRec, the veracity of user profiling and the superior performance of our improved collaborative filtering algorithm compared with some existing algorithms.
The remainder of the paper is organized as follows. In Section 2, an overview of our proposed cross-platform recommender system is given. System building and user profiling are presented in Section 3, and commodity profiling and recommendation subdivisions are described in detail. Section 4 discusses the experimental results. Section 5 concludes this paper and outlines future work.
2. System Model
2.1. Preliminary
2.1.1. Social Networks
Social networks can be illustrated as a graph of the relationships and interactions within a group of individuals, and they play a fundamental role as a medium for the spread of information, ideas and influence among their members. In this paper, we consider a general model of social networks, which is abstracted as a set of nodes and a set of edges between the nodes. Each node can be considered as an individual or as a collective unit such as a department, organization or family. There exists an edge between two nodes if they have a relation. Figure 1 shows a brief instance of a social network, which contains four nodes (users in the social network) and their relations (following a person). In Figure 1, user a follows user c while a is being followed by b. In the social network, the user followed by other users is defined as a followee, and those who follow this user are called followers. In practice, users are most likely to follow a user whose interests align with their own. In Sina Weibo, users typically write a short message (limited to 140 characters) and upload some pictures to show moments in their lives or interesting things.
Figure 1. An abstract graph of social network.
The short message is an important part of our model. While a short microblog may be unable to depict the full scope of a user’s interests or thoughts, we collect the user’s historical microblogs and divide them into groups to analyze. The user’s microblogs from a uniform time period $\Delta t$ will be represented as the microblog group $M_{\Delta t}$. For a given user, his/her entire microblog history $M$ can be denoted as $M = (M_{\Delta t_0}, M_{\Delta t_1}, M_{\Delta t_2}, \ldots, M_{\Delta t_n})$, where $t_0$ represents the current time slot. Users’ tags are another useful source of information. Selecting tags is an important, though optional, part of the registration process after users create their accounts with the social network. Tags give obvious information about the interests that users want to present to others, such as singing, eating, shopping, traveling, or IT. Let $T_u = (T_1, T_2, T_3, \ldots, T_m)$ denote the tags that user $u$ selected. Another data source is followees. We denote the user $u_i$’s followees as $F_{u_i} = (F_1, F_2, F_3, \ldots, F_l)$.
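The three data sources above (time-sliced microblogs, tags and followees) could be held in a small container such as the one sketched below. This is only an illustrative sketch: the class name `SocialUser`, the helper `group_by_slot` and the use of numeric timestamps are assumptions made for the example and are not part of the original system.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class SocialUser:
    """Hypothetical container for one user's raw social-network data."""
    user_id: str
    # microblog_groups[i] holds the microblogs of time slot Δt_i; index 0 is the current slot.
    microblog_groups: List[List[str]] = field(default_factory=list)
    tags: List[str] = field(default_factory=list)        # T_u
    followees: List[str] = field(default_factory=list)   # F_u (user ids)


def group_by_slot(posts: List[Tuple[float, str]], now: float, slot_length: float) -> List[List[str]]:
    """Split (timestamp, text) pairs into groups M_Δt0, M_Δt1, ... by age.

    Slot 0 contains the most recent posts. Posts dated after `now` are skipped.
    """
    groups = {}
    for timestamp, text in posts:
        if timestamp > now:
            continue
        index = int((now - timestamp) // slot_length)
        groups.setdefault(index, []).append(text)
    last = max(groups) if groups else -1
    # Empty slots are kept so that list positions still match slot indices.
    return [groups.get(i, []) for i in range(last + 1)]
```

Keeping empty slots in the returned list means that the position of a group can later be used directly as its age when a time weight is applied.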
2.1.2. E-Commerce
In this paper, the e-commerce data source that we adopted is Taobao, which is a subsidiary of Alibaba and the most successful e-business platform in China. As of 2014, it had generated a total volume of 1.172 trillion RMB.
Commodities on e-commerce platforms are the key factor that we concentrate on, and they can be described by the commodity’s name, its description and the buyer’s information. These three kinds of data will be used to define the commodity profiles.
2.2. System Model of the Cross-Platform Recommender System (CPRec)
We concentrate on an idealized cross-platform process in which an e-commerce company or a retailer R wants to recommend suitable commodities to a social network user u. The first thing that the retailer R should do is to understand what the user u prefers. To give an example, if R recommends a basketball to a user who enjoys soccer, this user will not be satisfied with the recommendation. With our recommender system, there exists an opportunity for retailers to learn about a social network user’s interests, supposing that our system has the necessary permissions to get the social network user’s information. R could maintain a user’s profiles with his/her existing information from social networks through our profiling model. The next question is how to make recommendations. Retailers naturally understand that users are more likely to purchase those commodities that most closely match their interests. Figure 2 depicts an overview of our CPRec. The first part is what we call the original data collection period, in which data for “potential consumer profiling” and “commodity profiling” would be collected. The original data used in user profiling include three types: historical microblogs, a user’s tags and information about the user’s followees (the people whom that user follows). Information about a commodity (the commodity name, commodity description and buyers’ information) would be collected analogously. The second part refers to the profiling process whereby the data on the user and commodity will be analyzed and used to obtain profiles of both. From the original data, we aim to build both a stable interest profile and a temporal interest profile for each user, due to the fact that a user’s preferences may evolve over time, but his or her general preferences may also be relatively stable. In the third part, namely, the recommendation process, we offer three subdivisions for the CPRec, and the different subdivisions realize different functions.
Figure 2. System model of the proposed cross-platform recommender system (CPRec).
3. The Proposed CPRec
3.1. User Profiling
User profiling is the core of the proposed recommender system. It is hardly possible to recommend suitable goods to a consumer without knowing his/her profiles. In this section, we aim to identify each user’s profiles, or what we may call their interests. Considering that users’ interests may vary, we indicate that the users’ interest profiles consist of two components: stable interest and temporal interest. Furthermore, the stable interests have the characteristic of being time-immune, which means that they only change slightly as time passes. However, some interests may be generated due to reasons like the influence of a hot social trend, and we define these interests as temporal interests, meaning that they are short-time interests. In our model, we employ a scheme with time-weighting to capture both stable interests and temporal interests from a user’s historical microblogs. Then, considering tags to be part of the interest criteria that users set for tagging themselves at the initial time, which could be powerful evidence for defining a user’s stable interests, we propose an algorithm to combine the interest profiles drawn from these two sources. Last, we integrate the profiles of a user’s followees.
3.1.1. Latent Interest Profiles Obtained by Microblogs
Many studies related to users’ posted messages have been conducted [14–16], including research focusing on the problem of identifying influential users in a social network by taking into account the similarity of the topics that users post about, which is decided by a user’s posted messages [17]. However, we wish to detect a user’s latent interest profiles through the messages posted naturally by that user. The method we adopt is based on Latent Dirichlet Allocation (LDA), an unsupervised machine learning technique which has been widely used to detect latent topics in documents. LDA treats a single document as “a bag of words”, which means that it views a document as a vector of word counts. Each document is represented as a probability distribution over various topics, while each topic is represented as a probability distribution over a number of words, as shown in Figure 3.
Figure 3. An abstract representation of LDA principle.
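To make the “bag of words” view concrete, the short sketch below turns one microblog into a vector of word counts over a fixed vocabulary. The function name `bag_of_words`, the whitespace tokenization and the sample vocabulary are assumptions chosen for illustration; real microblog text (especially Chinese text) would need a proper tokenizer.

```python
from collections import Counter
from typing import List


def bag_of_words(text: str, vocabulary: List[str]) -> List[int]:
    """Return the word-count vector of `text` over `vocabulary`.

    Word order is discarded; only how often each vocabulary word occurs is kept.
    """
    counts = Counter(text.lower().split())
    return [counts[word] for word in vocabulary]


vocabulary = ["phone", "camera", "travel"]
vector = bag_of_words("New phone great camera phone", vocabulary)
# vector is [2, 1, 0]: "phone" occurs twice, "camera" once, "travel" never.
```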
Standard LDA may not fit the writing of microblog users, the reason being that a single microblog will always be short and contain only one topic, so the method we adopt is the Microblogs Topic Discovery Model, which is based on the twitter-LDA in [18]. In Section 2, we have introduced a method for dividing a user’s microblogs into groups such that $M = (M_{\Delta t_0}, M_{\Delta t_1}, M_{\Delta t_2}, \ldots, M_{\Delta t_n})$. $M_{\Delta t_i}$ denotes all of the microblogs that the user released during time $\Delta t_i$, and it will be processed to obtain the user’s interest profiles during time $\Delta t_i$.
Suppose that there are $T$ hidden topics in microblogs $M_{\Delta t_i}$, and that each topic $t$ has a word distribution $\phi^t$, alongside a background word distribution $\phi^B$. $\pi$ denotes a Bernoulli distribution that manages the choice between background words and topic words. $\theta^u$ is the topic distribution of user $u$. Each multinomial distribution is governed by some symmetric Dirichlet distribution. Gibbs sampling is used to perform model inference. We leave out the derivation details and the sampling formulas here. Figure 4 describes the generation process of microblogs, and we illustrate the plate notation of the model in Figure 5.
Figure 4. Generation process of Microblogs.
Figure 5. Plate notation of Microblogs Topic Discovery Model.
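The generative story behind this kind of model can be sketched in a few lines. The code below follows the usual Twitter-LDA description (one topic per microblog, and a per-word Bernoulli switch between the background distribution and the topic distribution); it is not a reproduction of Figure 4, and it only simulates data rather than performing Gibbs sampling. The hyperparameter names `alpha`, `beta` and `gamma`, their default values and the helper names are assumptions made for the example.

```python
import random
from typing import List, Tuple


def sample_dirichlet(concentrations: List[float], rng: random.Random) -> List[float]:
    """Draw from a Dirichlet distribution by normalizing independent gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in concentrations]
    total = sum(draws)
    return [d / total for d in draws]


def sample_index(probabilities: List[float], rng: random.Random) -> int:
    """Draw one index according to a discrete probability vector."""
    return rng.choices(range(len(probabilities)), weights=probabilities, k=1)[0]


def generate_microblogs(vocab: List[str], num_topics: int, num_posts: int,
                        words_per_post: int, alpha: float = 0.5, beta: float = 0.5,
                        gamma: float = 1.0, seed: int = 0) -> List[Tuple[int, List[str]]]:
    """Simulate one user's microblogs; returns (topic, words) pairs."""
    rng = random.Random(seed)
    size = len(vocab)
    phi_topics = [sample_dirichlet([beta] * size, rng) for _ in range(num_topics)]  # phi^t
    phi_background = sample_dirichlet([beta] * size, rng)                           # phi^B
    pi = sample_dirichlet([gamma, gamma], rng)  # pi[0]: background word, pi[1]: topic word
    theta_u = sample_dirichlet([alpha] * num_topics, rng)                           # theta^u
    posts = []
    for _ in range(num_posts):
        topic = sample_index(theta_u, rng)       # a single topic for the whole microblog
        words = []
        for _ in range(words_per_post):
            is_topic_word = rng.random() < pi[1]  # Bernoulli switch governed by pi
            distribution = phi_topics[topic] if is_topic_word else phi_background
            words.append(vocab[sample_index(distribution, rng)])
        posts.append((topic, words))
    return posts
```

Inference runs this story in reverse: given only the observed words, it estimates the topic of each microblog and the distributions that most plausibly produced them.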
As the result of the method we adopt, the microblogs set $M_{\Delta t_i}$ has been patterned as a distribution over all topics to obtain latent topics and to represent the topic distribution of the microblogs collection as a topic vector, each entry of which denotes the weight of the representative words for each topic. Moreover, the topics that users focus on are the latent interests that users have, so, leveraging LDA, we depict $M_{\Delta t_i}$ as an interest vector $P_{\Delta t_i}$:

$$P_{\Delta t_i} = \{(I_1, \lambda_{u,1}), (I_2, \lambda_{u,2}), (I_3, \lambda_{u,3}), \ldots, (I_n, \lambda_{u,n})\} \tag{1}$$

where $I = (I_1, I_2, I_3, \ldots, I_n)$ denotes the set of interest vectors, $I_i$ represents a kind of interest (lurking in topics, e.g., eating, IT, traveling), and $\lambda_{u,i}$ is the degree that $I_i$ occupies in user $u$’s interest. Taking all the $P_{\Delta t_i}$ into account, we formulate $P_{u_M}$ (the latent interest vector in user $u$’s microblogs) as

$$P_{u_M} = P_{\Delta t_0} + P_{\Delta t_1} + P_{\Delta t_2} + \cdots + P_{\Delta t_n} = \sum_{i=0}^{n} P_{\Delta t_i} \tag{2}$$

Note that time is a factor which influences a user’s interests, and some interests submerge or weaken. We therefore add a time weight function $f(t)$ to each $P_{\Delta t_i}$. There are two requirements for $f(t)$:
• $f(t)$ must be monotonically decreasing, for the reason that the current interest vector $P_{\Delta t_0}$ should have more weight in $P_{u_M}$.
• The value of $f(t)$ should lie in the range of [0, 1].
Inspired by Li et al. [19], we found that three kinds of time function could be used to describe the curve of time weight: an exponential function ($f(t) = e^{-\lambda t}$), a logistic function ($f(t) = \frac{2}{e^{\lambda t} + 1}$) and a damping function ($f(t) = (1 + \lambda t)e^{-\lambda t}$). $\lambda$ is the decay rate. Figure 6 shows the diagrams of these three functions ($\lambda = 0.1$), in which we can clearly notice that all of the functions are monotonically decreasing and moving to zero in the end, which is suitable to describe interest attenuation. Allowing for the idea that a user’s interests should not change quickly, we choose the damping function as our time weight function $f(t)$. Hence, the calculation formula for $P_{u_M}$ is updated as

$$P_{u_M} = P_{\Delta t_0} f(t_0) + P_{\Delta t_1} f(t_1) + P_{\Delta t_2} f(t_2) + \cdots + P_{\Delta t_n} f(t_n) = \sum_{i=0}^{n} P_{\Delta t_i} f(t_i) \tag{3}$$
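The three candidate time weight functions and the weighted sum of Equation (3) can be sketched as follows. Interest vectors are represented here as dictionaries mapping an interest label to its degree $\lambda_{u,i}$, and the age $t_i$ of a slot is taken to be its index $i$; both choices, like the function names, are assumptions made for the example rather than details given in the text.

```python
import math
from typing import Callable, Dict, List


def exponential_weight(t: float, lam: float = 0.1) -> float:
    """f(t) = exp(-lam * t)."""
    return math.exp(-lam * t)


def logistic_weight(t: float, lam: float = 0.1) -> float:
    """f(t) = 2 / (exp(lam * t) + 1)."""
    return 2.0 / (math.exp(lam * t) + 1.0)


def damping_weight(t: float, lam: float = 0.1) -> float:
    """f(t) = (1 + lam * t) * exp(-lam * t); decays more slowly near t = 0."""
    return (1.0 + lam * t) * math.exp(-lam * t)


def aggregate_profiles(slot_profiles: List[Dict[str, float]],
                       weight: Callable[[float, float], float] = damping_weight,
                       lam: float = 0.1) -> Dict[str, float]:
    """Time-weighted sum of per-slot interest vectors, as in Equation (3).

    slot_profiles[i] is P_Δt_i, with index 0 being the current slot.
    ASSUMPTION: the age t_i of slot i is simply i.
    """
    total: Dict[str, float] = {}
    for i, profile in enumerate(slot_profiles):
        w = weight(i, lam)
        for interest, degree in profile.items():
            total[interest] = total.get(interest, 0.0) + w * degree
    return total
```

All three functions equal 1 at $t = 0$ and decrease toward 0. Because $(1 + \lambda t) \geq 1$ for $t \geq 0$, the damping function never falls below the exponential function, which matches the stated preference for interests that fade slowly.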
Then $P_{u_M}$ can be rewritten as follows:

$$P_{u_M} = \{(I_1, \lambda_{u,1}), (I_2, \lambda_{u,2}), (I_3, \lambda_{u,3}), \ldots, (I_n, \lambda_{u,n})\} \tag{4}$$

3.1.2. Interest Profiles Obtained from Tags
A user $u$’s tags are denoted as $T_u = (T_1, T_2, T_3, \ldots, T_m)$, which correspond to the user’s interests $I' = (I'_1, I'_2, I'_3, \ldots, I'_m)$. Considering that tags are dominant and stable indicators of a user’s interests, we set $P_{u_T}$ (interest vector in user $u$’s tags) as in the following equation:

$$P_{u_T} = \{(I'_1, c_1), (I'_2, c_1), (I'_3, c_1), \ldots, (I'_m, c_1)\} \tag{5}$$

where $c_1$ is a constant and the value of $c$ corresponds to the interest degree of $I'$.
3.1.3. Interest Profiles Obtained by Followees’ Profiles
In the real world, we have more connections with people who have tastes and habits similar to ours. The motivation for a user to follow another user in a social network is determined by whether he/she has nearly the same interests as this user. Namely, the followees’ profiles can mirror the user’s profiles to some extent. Therefore, we expand a user’s profiles via the people whom the user is following. Suppose user $u$ follows $l$ people $F_u = (F_1, F_2, F_3, \ldots, F_l)$ and their profiles $P_{u_{F_i}}$ have already been created. Hence, $P_{u_F}$, the interest profiles reflected by the followees, is calculated by

$$P_{u_F} = \sum_{i=1}^{l} \xi P_{u_{F_i}} \tag{6}$$

where $\xi$ is the reduction factor.
Figure 6. Diagrams of three time functions.

Figure 6. Diagrams of three time functions.
3.1.4. Stable Interest Profiles and Temporal Interest Profiles

We have acquired three kinds of interest profiles ($P_{u_M}$, $P_{u_T}$, $P_{u_F}$), which are obtained from a user’s historical microblogs, tags and followees, respectively. In this section, we will propose an algorithm to integrate $P_{u_M}$, $P_{u_T}$ and $P_{u_F}$, and then define stable interest profiles and temporal interest profiles. Firstly, Algorithm 1 is the procedure of integrating $P_{u_M}$ and $P_{u_T}$, and its result we define as $P_u$. Then, we follow the same procedure for integrating $P_u$ and $P_{u_F}$, which we do not detail here. We get the user’s profiles $P_u$ eventually, and we define $P_u$ as the user’s stable interest profile, while the temporal interest profiles refer to the current interest vector $P_{\Delta t_0}$, which is decided by the recent user data.
Algorithm 1. Procedure for integrating Pu M and PuT

Input: Interest sequences $I = (I_1, I_2, I_3, \ldots, I_n)$ and $I' = (I'_1, I'_2, I'_3, \ldots, I'_m)$, interest vectors $P_{u_M}$ and $P_{u_T}$
Output: User u’s interest vector $P_u$
1. initialize $P_u = P_{u_M}$;
2. for i = 1: m (the number of $I'$);
3. if $I'_i = I_j$ for some j,
4. update interest vector $(I_j, \lambda_{u,j})$ to $(I_j, \lambda_{u,j} + c_1)$ in $P_u$;
5. else
6. insert $(I'_i, c_1)$ into $P_u$;
7. end for;
8. Output $P_u$;
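The merging step of Algorithm 1 can be sketched in a few lines of Python. This is only an illustrative sketch: representing an interest vector as a dictionary from interest name to weight is our assumption, the function name `merge_profiles` is hypothetical, and the numbers are made up for the example.

```python
def merge_profiles(base, extra):
    """Merge the interest vector `extra` into `base`, as in Algorithm 1.

    Both arguments are assumed to be dicts mapping interest name -> weight.
    """
    merged = dict(base)  # step 1: start from a copy of the base profile
    for interest, weight in extra.items():  # step 2: loop over the second profile
        # steps 3-6: add the weight to a matching interest, or insert a new one
        merged[interest] = merged.get(interest, 0.0) + weight
    return merged


# Hypothetical values, chosen only to show both branches of the algorithm.
p_um = {"Soccer": 0.03, "Mobile": 0.02}
p_ut = {"Soccer": 0.01, "Java": 0.01}
p_u = merge_profiles(p_um, p_ut)
```

Here "Soccer" appears in both profiles, so its weights are summed, while "Java" is inserted as a new interest. The same function could be reused for merging the result with the followee profile, since the text states that the same procedure applies.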
3.2. Commodity Profiling

The name of a commodity is made up of commodity details. For example, “Motorola Moto 360 2nd Gen Smartwatch for Most Apple iOS and Android Cell Phones (Men’s, 42 mm, Black w/Black Leather)” is the trade name of a product sold on Amazon. Generally, e‑commerce platforms limit the length of the trade name, so each component carries important information about the product. Observing this trade name, we divide it into several key components: “Motorola”, “Moto 360”, “Smartwatch”, “Apple”, “iOS”, “Android”, “Cell Phones”, “Men’s”, “Black”, “w/Black” and “Leather”. These key components show consumers the nature of the commodity concisely and explicitly. We define these components as commodity profile components $N = (n_1, n_2, n_3, \ldots, n_k)$, where each $n_i$ is one component, such as “Motorola”. The other crucial aspect we concern ourselves with is the commodity’s classifying labels, which were filed in the e‑business database when the vendor put this item on the shelf. For instance, “Cell Phones & Accessories”, “Accessories” and “Smart Watch Accessories” are three classifying labels for the commodity we mentioned above. From these three labels, consumers understand that this product belongs to those classifications. Let $L = (l_1, l_2, l_3, \ldots, l_r)$ be the collection of a commodity’s classification labels. For each commodity c, $P_{c_N} = \{(n_1, c_2), (n_2, c_2), (n_3, c_2), \ldots, (n_k, c_2)\}$ and $P_{c_L} = \{(l_1, c_3), (l_2, c_3), (l_3, c_3), \ldots, (l_r, c_3)\}$ are the profiles obtained from that commodity’s name and classifying labels, respectively. By combining $P_{c_N}$ and $P_{c_L}$, a commodity profile $P_c$ can be obtained using Algorithm 2.
Algorithm 2. Procedure for integrating Pc N and Pc L

Input: Component sequence $N = (n_1, n_2, n_3, \ldots, n_k)$ and label sequence $L = (l_1, l_2, l_3, \ldots, l_r)$, interest vectors $P_{c_N}$ and $P_{c_L}$
Output: Commodity c’s profiles $P_c$
1. initialize $P_c = P_{c_N}$;
2. for i = 1: r;
3. if $l_i = n_j$ for some j,
4. update interest vector $(n_j, c_2)$ to $(n_j, c_2 + c_3)$ in $P_c$;
5. else
6. insert $(l_i, c_3)$ into $P_c$;
7. end for;
8. Output $P_c$;
3.3. Recommendation Subdivisions

In this section, we introduce three recommendation subdivisions. We propose three subdivisions because the recommendation process is complex, and a complete recommendation scheme must reflect that complexity. Recall the process by which we intend to recommend a commodity from an e‑commerce platform to a social network user. At first, we consider the question of how we can recommend a single commodity to an individual. As we have developed a method for obtaining user profiles and commodity profiles, they become the means to resolve this question. However, when the individual receives an item recommendation based on both profiles, different actions may ensue, as he/she may accept, reject or ignore it. We consider these actions as feedback on this item because the different actions represent different levels of acceptance of the recommended item. Then, we design a feedback mechanism by taking such feedback actions into account. By regarding the three feedback actions as three user‑item scores, a user‑commodity recommendation matrix will be acquired. Finally, we will employ an improved collaborative filtering algorithm, which will make recommendations based on the user‑commodity recommendation matrix. Consequently, in this section, we will introduce a recommendation scheme containing three subdivisions: Recommendations for individuals, the Feedback mechanism of the recommendation scheme, and Recommendations by the improved collaborative filtering algorithm.
3.3.1. Recommendations for Individuals

Since the proposed cross‑platform recommender system is to recommend commodi‑
ties across platforms, the profiles we obtain from both platforms turn into the bridge con‑
necting user and commodity. Due to the fact that consumers are more likely to spend
money on commodities that fit their habits, we try to compute the similarity between the
user profiles Pu and the commodity profiles Pc . For a given user u and commodity c, the
recommended score $RScore(P_u \mid P_c)$ (the cosine similarity of $P_u$ and $P_c$) is computed as follows:
$$RScore(P_u \mid P_c) = \frac{P_u^{T} P_c}{\sqrt{P_u^{T} P_u}\,\sqrt{P_c^{T} P_c}} \tag{7}$$
Supposing $\chi$ is the critical value of recommendation, then
$$\begin{cases} RScore \ge \chi \rightarrow \text{recommend} \\ RScore < \chi \rightarrow \text{do not recommend} \end{cases}$$
which means that, if the relevance of u and c is high enough, we recommend c to u. Otherwise, c is not a suitable recommended item for u.
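As an illustration of Equation (7) and the threshold rule, the following Python sketch computes the cosine similarity of two sparse profiles. Storing profiles as dictionaries, treating missing interests as weight zero, and returning 0 for an empty profile are our assumptions; the names `rscore` and `should_recommend` are hypothetical.

```python
import math


def rscore(p_u, p_c):
    """Cosine similarity of a user profile and a commodity profile.

    Profiles are assumed to be dicts mapping a term to its weight;
    a term missing from a profile is treated as weight 0.
    """
    dot = sum(w * p_c[term] for term, w in p_u.items() if term in p_c)
    norm_u = math.sqrt(sum(w * w for w in p_u.values()))
    norm_c = math.sqrt(sum(w * w for w in p_c.values()))
    if norm_u == 0.0 or norm_c == 0.0:
        return 0.0  # assumption: an empty profile matches nothing
    return dot / (norm_u * norm_c)


def should_recommend(p_u, p_c, chi):
    """Apply the threshold rule: recommend when RScore >= chi."""
    return rscore(p_u, p_c) >= chi
```

The critical value `chi` is left as a parameter because the text does not fix it.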
3.3.2. Feedback Mechanism

When dealing with individual recommendations, the RScore plays an important role.
However, once a user receives the item recommendation, different actions may be taken
by the user. Supposing that a user u receives a recommendation of commodity c, one of
the three actions below would be taken:
• Click recommended item, browse and buy finally.
• Click recommended item, browse but do not buy.
• Do not click recommended item.
Different actions indicate different levels of acceptance (emotions) of the recommended item. Table 1 lists the details regarding these three actions as different types of feedback from the user, which correspond to three rankings of the recommended commodity, defined as rank(high), rank(middle) and rank(low), respectively. rank(high) means that the result of recommending the commodity with profiles $P_c$ to the user with profiles $P_u$ is perfect, while rank(middle) means the result is generally positive and rank(low) means the result is bad.
Table 1. Different actions and different emotions.

| User Action | Emotion |
|---|---|
| Click recommended item, browse it and buy it finally. | Very high positive |
| Click recommended item, browse it but do not buy it. | Moderate positive |
| Do not click recommended item. | Negative |
Then $RScore(P_u \mid P_c)$ in the recommendation database will be updated according to the user’s feedback. Formally, we define $FScore(P_u \mid P_c)$ as the feedback score and have
$$FScore(P_u \mid P_c) = \begin{cases} +\omega, & rank = rank(high) \\ 0, & rank = rank(middle) \\ -\omega, & rank = rank(low) \end{cases} \tag{8}$$
where $\omega$ is a positive real number. Hence, $RScore(P_u \mid P_c)$ is updated by the formula
$$RScore(P_u \mid P_c)' = RScore(P_u \mid P_c) + FScore(P_u \mid P_c) \tag{9}$$
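The update in Equations (8) and (9) is small enough to sketch directly. In this Python sketch, encoding the three ranks as the strings "high", "middle" and "low" is our assumption, and the function names are hypothetical.

```python
def fscore(rank, omega):
    """Feedback score of Equation (8) for a rank given as a string."""
    if omega <= 0:
        raise ValueError("omega must be a positive real number")
    if rank == "high":      # clicked, browsed and bought
        return omega
    if rank == "middle":    # clicked and browsed, but did not buy
        return 0.0
    if rank == "low":       # did not click
        return -omega
    raise ValueError(f"unknown rank: {rank!r}")


def update_rscore(rscore, rank, omega):
    """Updated recommended score of Equation (9)."""
    return rscore + fscore(rank, omega)
```

For example, with a hypothetical `omega` of 0.1, a purchase raises a score of 0.5 to about 0.6, and an ignored recommendation lowers it to about 0.4.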
3.3.3. Recommendations Based on Collaborative Filtering Algorithm Using User Profiles

Once the proposed cross‑platform recommender system starts to work by recommending different commodities to different users, a user‑commodity‑feedback score matrix will be obtained. We apply the collaborative filtering (CF) method to produce the predicted likeness score of a given item for a given user with the help of the user‑commodity‑feedback score matrix. CF is one of the most successful recommendation techniques to date. The basic idea of CF‑based algorithms is to provide item recommendations or predictions based on the opinions of other like‑minded users, and CF characterizes consumers and products implicitly by their previous interactions. Considering that we have user interest profiles, we want to improve the traditional collaborative filtering algorithm by combining it with the similarity of user profiles between users.
Suppose that we have a list of m users $U = \{u_1, u_2, u_3, \ldots, u_m\}$ and a list of n commodities $C = \{c_1, c_2, c_3, \ldots, c_n\}$. The user‑commodity‑score matrix is denoted as $R_{m \times n}$.
$$R = \begin{pmatrix} R_{1,1} & \cdots & R_{1,n} \\ \vdots & \ddots & \vdots \\ R_{m,1} & \cdots & R_{m,n} \end{pmatrix} \tag{10}$$
where $R_{i,j}$ ($1 \le i \le m$, $1 \le j \le n$) represents the feedback score that user i gives to commodity j. $R_{m \times n}$ is usually a sparse matrix, and the recommendation method aims to predict the unknown scores in $R_{m \times n}$.
Measuring the similarity between users is quite important in CF. There are three popular methods of measurement used in CF, which are Cosine Similarity, Pearson Correlation Coefficient Similarity and Modified Cosine Similarity. In our paper, Pearson Correlation Coefficient Similarity has been used. The formula is shown below.
$$sim(u_a, u_b) = \frac{\sum_{c_k \in C_a \cap C_b} (R_{a,k} - \bar{R}_a) \times (R_{b,k} - \bar{R}_b)}{\sqrt{\sum_{c_k \in C_a \cap C_b} (R_{a,k} - \bar{R}_a)^2}\,\sqrt{\sum_{c_k \in C_a \cap C_b} (R_{b,k} - \bar{R}_b)^2}} \tag{11}$$
where $sim(u_a, u_b) \in [-1, 1]$ denotes the similarity of user a and user b. $R_{a,k}$ and $R_{b,k}$ denote the scores that $u_a$ and $u_b$ give to commodity $c_k$. $\bar{R}_a$ and $\bar{R}_b$ are the average scores that $u_a$ and $u_b$ give to commodities. $C_a \cap C_b$ denotes the set of commodities to which both $u_a$ and $u_b$ have given a score.
$P_{a,j}$ indicates the predicted score that $u_a$ gives to $c_j$, where $u_a$ has never given a score to $c_j$. Firstly, we need the set S of users who have given scores to $c_j$. Then we calculate the similarity between $u_a$ and every user in S, choose the k users with the greatest similarity as the set of neighbor users $S(u_a)$ and calculate $P_{a,j}$ by
$$P_{a,j} = \bar{R}_a + \frac{\sum_{u_b \in S(u_a)} (R_{b,j} - \bar{R}_b) \times sim(u_a, u_b)}{\sum_{u_b \in S(u_a)} sim(u_a, u_b)} \tag{12}$$
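The similarity and prediction steps of Equations (11) and (12) can be sketched in Python as follows. The nested-dictionary layout for the sparse score matrix, the function names, and the fallbacks used when a denominator is zero are our assumptions rather than part of the method described above.

```python
import math


def mean_score(ratings, user):
    """Average of all scores the user has given."""
    scores = ratings[user].values()
    return sum(scores) / len(scores)


def pearson_sim(ratings, a, b):
    """Pearson similarity over the commodities both users have scored."""
    common = set(ratings[a]) & set(ratings[b])
    mean_a, mean_b = mean_score(ratings, a), mean_score(ratings, b)
    num = sum((ratings[a][c] - mean_a) * (ratings[b][c] - mean_b) for c in common)
    den_a = math.sqrt(sum((ratings[a][c] - mean_a) ** 2 for c in common))
    den_b = math.sqrt(sum((ratings[b][c] - mean_b) ** 2 for c in common))
    if den_a == 0.0 or den_b == 0.0:
        return 0.0  # assumption: no evidence of similarity
    return num / (den_a * den_b)


def predict(ratings, a, item, k):
    """Predicted score of user `a` for `item` from the k most similar raters."""
    raters = [b for b in ratings if b != a and item in ratings[b]]
    sims = {b: pearson_sim(ratings, a, b) for b in raters}
    neighbors = sorted(raters, key=lambda b: sims[b], reverse=True)[:k]
    num = sum((ratings[b][item] - mean_score(ratings, b)) * sims[b] for b in neighbors)
    den = sum(sims[b] for b in neighbors)
    if den == 0.0:
        return mean_score(ratings, a)  # assumption: fall back to the user's mean
    return mean_score(ratings, a) + num / den
```

Here `ratings` maps each user to a dictionary of the commodities that user has scored, so unscored entries of the sparse matrix are simply absent.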
Considering that each user has his or her own profiles, we make some improvements to the similarity formula and propose a collaborative filtering algorithm based on the user profiles (CFUP). $u_a$ and $u_b$ have n profiles $A_{u_a} = (a_{u_a,1}, a_{u_a,2}, \ldots, a_{u_a,n})$ and $A_{u_b} = (a_{u_b,1}, a_{u_b,2}, \ldots, a_{u_b,n})$. We use the Euclidean metric method to measure the profile difference of $u_a$ and $u_b$. $d(u_a, u_b)$ denotes the Euclidean distance of $u_a$ and $u_b$, while the calculation formula is shown below.
$$d(u_a, u_b) = \sqrt{\sum_{i=1}^{n} (a_{u_a,i} - a_{u_b,i})^2} \tag{13}$$
Then, the improved similarity formula appears as follows:
$$sim(u_a, u_b) = \mu \frac{\sum_{c_k \in C_a \cap C_b} (R_{a,k} - \bar{R}_a) \times (R_{b,k} - \bar{R}_b)}{\sqrt{\sum_{c_k \in C_a \cap C_b} (R_{a,k} - \bar{R}_a)^2}\,\sqrt{\sum_{c_k \in C_a \cap C_b} (R_{b,k} - \bar{R}_b)^2}} + (1 - \mu)\, d(u_a, u_b) \tag{14}$$
where $\mu \in [0, 1]$.
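A minimal Python sketch of Equations (13) and (14) is given below. It follows Equation (14) as printed and uses the raw Euclidean distance, because the text does not say whether the distance is scaled or converted into a similarity before mixing. The function names are hypothetical, and the Pearson term is passed in as a precomputed number.

```python
import math


def euclidean_distance(profile_a, profile_b):
    """Euclidean distance of two equal-length profile vectors, Equation (13)."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(profile_a, profile_b)))


def cfup_similarity(pearson, profile_a, profile_b, mu):
    """Mix of the Pearson term and the profile distance, Equation (14) as printed."""
    if not 0.0 <= mu <= 1.0:
        raise ValueError("mu must lie in [0, 1]")
    return mu * pearson + (1.0 - mu) * euclidean_distance(profile_a, profile_b)
```

With `mu = 1` the result reduces to the plain Pearson similarity, and with `mu = 0` it depends on the profile vectors alone.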
4. Evaluation and Analysis

In this section, we first introduce the data that we used. After that, we will conduct
experiments to validate the value of the cross‑platform recommender system and confirm
that promotional information in a social network could affect e‑business. Next, we com‑
pare the performance of the Microblogs Topic Discovery Model and standard LDA in user
profiling. Following that, users’ profiles will be built by our topic model. Finally, a comparison between CFUP and the traditional collaborative filtering algorithm will be presented.
4.1. Data Preparation

The social network data source we adopted in this paper is Sina Weibo, which is one of the most popular and influential social networks in China. It had 530 million monthly active users (MAU) in December of 2021. In order to realize our simulation, the original microblogs corpus dataset was collected from the Sina Weibo website using a crawler tool.
We collected this dataset by starting from a seed set of active Sina Weibo users (we call
these users shop owners) who have set foot both in social networks and in e‑business. When
these users market their commodities using microblogs (we call these microblogs promo‑
tional microblogs) in a social network, we trace the information in these microblogs and
collect data both on the social network platform and the e‑business platform. This process
is shown in Figure 7.
For the purpose of obtaining credible and complete user profiles, the collected data
from the social network platform contains four main types of information, which are shown
in Table 2 in detail.
Table 2. Four types of information collected from social network platform.

| No. | Information Type |
|---|---|
| 1 | All of the user’s microblogs since he/she registered on Weibo |
| 2 | User’s basic registration information |
| 3 | User’s tags, which were set by the user |
| 4 | User’s followees (the people that the user follows) |
Figure 7. The detailed process of collecting data.

The same collection method is used in the commodity information collection process. The six types of commodity information are shown in Table 3.

Table 3. Six types of information collected from e‑business platform.

| No. | Information Type |
|---|---|
| 1 | Commodity’s title |
| 2 | Commodity’s category |
| 3 | Detailed description of commodity |
| 4 | Sales records of commodity |
| 5 | Commodity’s reviews |
| 6 | Sales |
4.2. Effects of a Promotional Microblog on Commodity Sales

In this section, we study the following process: a shop owner who has a Weibo account in a social network and has a number of followers releases a promotional microblog, namely, an advertisement of his products, which are available on an e‑commerce platform. Would this microblog promote his product sales? Starting with this question, we do some experiments below.
Firstly, several special users on Weibo have been chosen as shop owners who have shops (selling food, cosmetics, living goods, etc.) on Taobao (a famous e‑commerce platform). When they release a promotional microblog for selling their own products, we track the microblog and collect the product’s sale information from Taobao. Later, we analyze these sales data. In the instance of the promotional microblog shown in Figure 8, one of the shop owners has a Weibo account (his homepage: http://weibo.com/wysr2007 (accessed on 7 March 2022)) and a shop on an e‑commerce site (shop link: https://shop70713800.taobao.com/ (accessed on 7 March 2022)). The number of his followers is more than one million nine hundred thousand. The promotional microblog in Figure 8 was released by this shop owner to announce that there would be a discount in his shop. In total, it received 1009 forwards, 368 comments and 433 praises.
Figure 8. An instance of the special users we study.

In this experiment, five different promotional microblogs are selected and analyzed, and detailed information about them is shown in Table 4.
Table 4. Detailed information on five different promotional microblogs.

| Shop Owner | Followers Number | Forward Number | Like Number | Comment Number | A Brief Overview of Microblog Contents | Commodity Types that the Shop Owner Sells on E‑Business Platform |
|---|---|---|---|---|---|---|
| Shop Owner 1 (Wu Yue San Ren) | 1,917,983 | 980 | 434 | 369 | 1. Discount on beauty products, women’s products and red wine. 2. New commodities. 3. Receive gifts for forwarding the microblog. | Food; Women’s products; Cosmetics |
| Shop Owner 2 (Emergency Female Superman) | 3,043,249 | 181 | 76 | 79 | 1. Festival discount on some commodities. 2. Chance to get a cash gift for forwarding this microblog. | Milk powder; Women’s products; Cosmetics |
| Shop Owner 3 (Wang Xiaoshan) | 1,968,905 | 69 | 104 | 57 | 1. Festival discount on red wine. | Red Wine |
| Shop Owner 4 (Barbecue) | 62,809 | 68,933 | 9660 | 15,402 | 1. Chance to win a gift if you forward this microblog. | Barbecue |
| Shop Owner 5 (Zhou Xiaoxiong) | 590,873 | 1018 | 285 | 1458 | 1. New products. 2. Chances to get cash or clothing gift for users who forward this microblog. | Clothes |
After these microblogs were released, we kept track of all of the commodities sales in the e‑business stores and collected their sales information. Figure 9 shows the curves of different shop owners’ commodities sales over time.
In these figures, the abscissa denotes the time and each unit of the abscissa represents one day, while the zero abscissa indicates the day that the shop owner released the promotional microblog, and the ordinate denotes the sales volume of the shop on the e‑commerce platform. To display the change clearly, we use a red line to indicate places where the commodities sales are larger than before.
Figure 9. The curves of different shop owners’ commodities sales.

From these curves, we can observe that, after these shop owners released their promotional microblogs, there was an obvious upward trend of their shops’ sales volumes. For shop owners 1, 4 and 5, their commodities sales appeared to peak after they released the promotional microblogs. For shop owners 2 and 3, there exists a continuously higher sales volume than in the days before. Although these sales curves move differently, their general tendency is to go up, which suggests that promotional microblogs have a facilitating effect on sales. In other words, a social network has the ability to play a role in creating economic value for e‑commerce, which gives meaning to our research field. In turn, if we know about one user’s profiles in a social network, meaning that we know what this user prefers, could we take a suitable commodity and recommend it to the corresponding user? This is the process of cross‑platform recommendation, which discovers a user’s interests with the help of social media and chooses products that fit that user’s interests. In the next section, we will make an evaluation of our method of obtaining user profiles.
4.3. Microblogs creating economical
Discovery value
Model and for e-commerce,
Standard LDA Model which gives meaning to our re-
search field. In turn, if we know about one user’s profiles in aevaluate
In order to test the efficiency of our model, we quantitatively social network,
the MTDMmeaning
that we know
compared withwhat this user
the standard prefers,
LDA model,could we takealla tweets
i.e., treating suitable
as commodity and recommend
a single document.
Thecorresponding
it to the above‑mentioned two This
user? models have
is the four parameters,
process and different
of cross-platform choices of pa‑ which
recommendation,
rameters have
discovers implications
a user’s forwith
interests the inference
the helpresults. In our
of social experiment,
media learning
and chooses from other
products fit for that
research and from our own experience, the number of topics T is
user interests. In the next section, we will make an evaluation of our method set as 5, α is 50/T,of is
β obtaining
0.1 and the iterations of Gibbs sampling that we set is 1000. In addition, some preparatory
user profiles.
work must be accomplished before using these two models, such as deleting stop words,
removing punctuation and segmenting words. However, we omit the description of this
4.3.
work.Microblogs Topicsamples
Table 5 shows Discovery Model
of the andobtained
results Standardusing
LDAMTDMModel and the standard LDA
modelIn(we
order
onlytolisttest the efficiency
six words of our
in each topic, andmodel, we quantitatively
we translate evaluate
them into English the MTDM
in brackets).
compared with the standard LDA model, i.e., treating all tweets as a single document.
The above-mentioned two models have four parameters, and different choices of pa-
rameters have implications for the inference results. In our experiment, learning from
other research and from our own experience, the number of topics T is set as 5, α is 50/T,
β is 0.1 and the iterations of Gibbs sampling that we set is 1000. In addition, some prepar-
atory work must be accomplished before using these two models, such as deleting stop
words, removing punctuation and segmenting words. However, we omit the description
of this work. Table 5 shows samples of the results obtained using MTDM and the standard
Table 5. The result samples of MTDM and Standard LDA.

| Topic | Microblogs Topic Discovery Model (MTDM): Words | Distribution | Standard LDA Model: Words | Distribution |
|---|---|---|---|---|
| Topic 1 | Zealer | 0.046426777 | ZEALER | 0.040501230700380406 |
| | 出品 (Manufacture) | 0.026692798 | 魅族 (Meizu) | 0.022600134258223315 |
| | 大会 (Convention) | 0.016652705 | MX3 | 0.015887223092414412 |
| | 评测 (Evaluation) | 0.010767135 | 测评 (Evaluation) | 0.015887223092414412 |
| | 讨论 (Discussion) | 0.008689874 | 视频 (Video) | 0.015887223092414412 |
| | 留言 (Leave a message) | 0.007997453 | 小米 (Xiaomi) | 0.015887223092414412 |
| Topic 2 | 评论 (Comment) | 0.013492634 | 想 (Think) | 0.01883788803396126 |
| | 系列 (Series) | 0.013108227 | 梦想 (Dream) | 0.016184664367206156 |
| | 品位 (Taste) | 0.012723822 | 做 (Do) | 0.016184664367206156 |
| | Smartisant1 | 0.0050357124 | 说 (Speak) | 0.013531440700451048 |
| | 老罗 (Mr. Luo) | 0.003113685 | 创业 (Startup business) | 0.013531440700451048 |
| | 锤子 (Smartisant) | 0.003113685 | 国产 (Domestic) | 0.013531440700451048 |
| Topic 3 | 创业 (Startup business) | 0.004170707 | 手机 (Cellphone) | 0.01888800212822559 |
| | 快乐 (Happiness) | 0.0037949677 | 买 (Buy) | 0.016227720138334664 |
| | 生活 (Live) | 0.0037949677 | 硬件 (Hardware) | 0.016227720138334664 |
| | 公司 (Company) | 0.0030434888 | 改变 (Change) | 0.016227720138334664 |
| | 事 (Affair) | 0.0030434888 | 产品 (Product) | 0.010907156158552806 |
| | 努力 (Strive) | 0.0026677493 | 希望 (Hope) | 0.010907156158552806 |
| Topic 4 | 直播 (Live broadcast) | 0.01684577 | 大会 (Convention) | 0.05386941426811513 |
| | 发布会 (New product release conference) | 0.0147453 | Zealer | 0.038190166871990144 |
| | 平台 (Platform) | 0.012224737 | WISE | 0.035950274386829434 |
| | 斗鱼 (Betta) | 0.010964454 | 转发 (Forward) | 0.035950274386829434 |
| | 苹果 (Apple) | 0.010544361 | 说 (Speak) | 0.021390973233284805 |
| | iPhone | 0.005503232 | 自如 (Someone’s name) | 0.020271026990704447 |
| Topic 5 | 发布会 (New product release conference) | 0.018411258 | 测评 (Evaluation) | 0.0318356288101151 |
| | 科技 (Technology) | 0.009562965 | 科技 (Technology) | 0.022804244750508015 |
| | 视频 (Video) | 0.009562965 | ZEALER | 0.013772860690900881 |
| | 挑战 (Challenge) | 0.0058194553 | 视频 (Video) | 0.013772860690900881 |
| | 产品 (Product) | 0.0058194553 | 中国 (China) | 0.013772860690900881 |
| | 评测 (Evaluation) | 0.005479136 | 产品 (Product) | 0.0058194553 |
We repeat the process 100 times with different data sets, so that we have 100 pairs of results. We select three human judges to make judgements regarding these results. The results are first mixed randomly and then sent to the judges. They assign a grade for each topic according to the original data. The grading rules are given below.
Grading rules:
• 1: meaningful and coherent
• 0.5: not very good; contains other topics or meaningless words
• 0: makes no sense
Then, we calculate the average grade for two models and list them in Table 6.
Table 6. Comparison results of MTDM and Standard LDA.

| Model | Average Grade |
|---|---|
| MTDM | 0.612 |
| Standard LDA Model | 0.531 |
The average grade of MTDM (0.612) is larger than that of Standard LDA (0.531), which indicates that MTDM has better performance in topic detection. In the next section, we use MTDM to detect user interest profiles.
4.4. The Construction of User Interest Profiles

This section will build user interest profiles based on the data we have collected from
social networks and test the effectiveness of the profiles we have built.
4.4.1. The Process of Building User Interest Profiles

Here, we choose one user as an example. Considering the fact that the user’s interests may change as time goes by, we separate the user’s microblogs M into five groups $(M_{\Delta t_0}, M_{\Delta t_1}, M_{\Delta t_2}, M_{\Delta t_3}, M_{\Delta t_4})$, so that each group contains microblogs that the user released during $\Delta t$ (six months). Then, we train MTDM using $M_{\Delta t_0}, M_{\Delta t_1}, M_{\Delta t_2}, M_{\Delta t_3}, M_{\Delta t_4}$, respectively. The user interest vector $P_{\Delta t_i}$ at different times is shown in Table 7.
Table 7. Examples of profiles building process.

| $M_{\Delta t_i}$ | Topics (Keyword in Topic) | $P_{\Delta t_i}$ |
|---|---|---|
| $M_{\Delta t_0}$ | Topic 1 (Mata 0.015); Topic 2 (Toyota 0.019); Topic 3 (Mobile 0.011); Topic 4 (Fuel tank 0.012); Topic 5 (Sichuan 0.026) | {(Soccer, 0.015), (Car, 0.019), (Mobile, 0.011), (Fuel tank, 0.012), (Sichuan, 0.026)} |
| $M_{\Delta t_1}$ | Topic 1 (Shopping 0.015); Topic 2 (Boom 0.010); Topic 3 (Mobile 0.069); Topic 4 (Technology 0.008); Topic 5 (Soccer 0.012) | {(Shopping, 0.015), (News, 0.010), (Mobile, 0.069), (Technology, 0.008), (Soccer, 0.012)} |
| $M_{\Delta t_2}$ | Topic 1 (Environment 0.011); Topic 2 (Mobile 0.018); Topic 3 (Technology 0.006); Topic 4 (Driver 0.013); Topic 5 (Huawei 0.022) | {(Environment, 0.011), (Mobile, 0.018), (Technology, 0.006), (Car, 0.013), (Huawei, 0.022)} |
| $M_{\Delta t_3}$ | Topic 1 (Mobile 0.017); Topic 2 (Environment 0.017); Topic 3 (Meizu 0.019); Topic 4 (Website 0.009); Topic 5 (Soccer fan 0.007) | {(Mobile, 0.017), (Environment, 0.017), (Meizu, 0.019), (Website, 0.009), (Soccer, 0.007)} |
| $M_{\Delta t_4}$ | Topic 1 (Japan 0.013); Topic 2 (Mobile 0.030); Topic 3 (Company 0.014); Topic 4 (World Cup Soccer 0.035); Topic 5 (Soccer fan 0.007) | {(Japan, 0.013), (Mobile, 0.030), (Company, 0.014), (Soccer, 0.035), (News, 0.007)} |
We can observe from Table 7 that the $P_{\Delta t_i}$ are different from each other. $P_{\Delta t_0}$ shows that, in the last six months, the user mainly focused on Car, Mobile and Soccer, while $P_{\Delta t_1}$ shows that the user was interested in Shopping, Mobile, Technology, Soccer and News. This phenomenon demonstrates that user interests as detected using microblogs, which reflect actual user preferences, would change over different time periods. This indirectly shows the necessity of distinguishing stable interest profiles and temporal interest profiles.
The interest profiles $P_{u_M}$ in a user u’s microblogs can be obtained by
$$f(t) = (1 + \lambda t)e^{-\lambda t} \tag{15}$$
$$P_{u_M} = \sum_{i=0}^{n} P_{\Delta t_i} f(t_i) \tag{16}$$
Here the value of $\lambda$ we choose is 1; therefore, $f(t_0) = f(0) = 1$, $f(t_1) = f(1) = 0.772$, $f(t_2) = f(2) = 0.434$, $f(t_3) = f(3) = 0.231$, $f(t_4) = f(4) = 0.107$.
The user’s 𝑃 that we obtain is {(Soccer, 0.0296), (Mobile, 0.0259), (Car, 0.0246),
The user’s Pu M𝑢𝑀that we obtain is {(Soccer, 0.0296), (Mobile, 0.0259), (Car, 0.0246), (Shop‑
ping, 0.0116),0.0116),
(Shopping, (Huawei, (Huawei, .009548),
0.009548), (Technology,
(Technology, 0.0088), 0.0088), (Environment,
(Environment, 0.0087), 0.0087),
(News,
(News, (Meizu,
0.0085), 0.0085), 0.0044),
(Meizu, (Website,
0.0044), (Website, 0.0021),0.0014),
0.0021), (Japan, (Japan,(Company,
0.0014), (Company,
0.0015)}. 0.0015)}.
The user has tags $T_u$ = (Soccer, News, Humor, Mobile, Java, Game, Post-80s), so
$P_{uT}$ = {(Soccer, 0.01), (News, 0.01), (Humor, 0.01), (Mobile, 0.01), (Java, 0.01),
(Game, 0.01), (Post-80s, 0.01)}, and the user profile $P_u$ can be obtained by Algorithm 1:
$P_u$ = {(Soccer, 0.0396), (Car, 0.0246), (Shopping, 0.0116), (News, 0.0185), (Humor, 0.01),
(Mobile, 0.0359), (Java, 0.01), (Game, 0.01), (Post-80s, 0.01), (Huawei, 0.009548),
(Technology, 0.0088), (Environment, 0.0087), (Meizu, 0.0044), (Website, 0.0021),
(Japan, 0.0014), (Company, 0.0015)}.
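The numbers above are consistent with a keyword-wise sum of the microblog profile and the tag profile (for example, Soccer: 0.0296 + 0.01 = 0.0396). The sketch below assumes that merging rule; the function name `merge_profiles` is hypothetical, and it should be read as an illustration of the worked example, not as a transcription of Algorithm 1.

```python
def merge_profiles(microblog_profile, tag_profile):
    """Combine two keyword -> weight profiles by summing weights of shared keywords."""
    merged = dict(microblog_profile)
    for keyword, weight in tag_profile.items():
        merged[keyword] = merged.get(keyword, 0.0) + weight
    return merged


# A subset of the microblog profile quoted in the text.
p_um = {"Soccer": 0.0296, "Mobile": 0.0259, "Car": 0.0246, "News": 0.0085}
# Every tag carries the weight 0.01, as in the text.
tags = ("Soccer", "News", "Humor", "Mobile", "Java", "Game", "Post-80s")
p_ut = {tag: 0.01 for tag in tags}

p_u = merge_profiles(p_um, p_ut)
```

Keywords present in only one of the two profiles keep their original weight, which matches entries such as (Car, 0.0246) and (Humor, 0.01) in the merged profile.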
4.4.2. Efficiency of the Profiles Used
Since we have been able to build user profiles using the process mentioned above, we now
proceed to test the effectiveness of the profiles that we obtain. The method we adopt is
an indirect one which relies on common sense to an extent. It is easy to understand that
users in a social network prefer to comment on microblogs when they have an interest in
the information that the microblog spreads. Therefore, we chose one promotional microblog
which aimed to sell a mobile phone and analyzed its 51 reviewers’ interest profiles,
obtained through the process above. If the user profiles we get are effective, the
reviewers’ interest profiles will be very likely to include an interest in ‘Mobile’.
We generate statistics according to keywords in the reviewers’ interest profiles, and
the statistical results are shown in Figure 10.
Figure 10. Statistical results of keywords in users’ interest profiles (vertical axis: number).
Figure 10 indicates that most of the reviewers are interested in the Mobile area, which
supports our assumption. From the figure, we can see that the reviewers of this
promotional microblog mostly have interests in mobile, technology, digital and some other
aspects related to mobile. This phenomenon indicates that users in social networks tend
to be selective about the information that they focus on and that our profile model can
describe user interest profiles.
4.5. Analysis of the Collaborative Filtering Algorithm Based on the User Profiles (CFUP)
Since there is no actual data set for a cross-platform recommender system, we choose
the SUSHI data set, which is similar to the actual cross-platform data (containing users’
profiles) that we want to use. It contains 5000 users, 100 different kinds of sushi and the
scores that different users give to different types of sushi. Each user has ten attributes. The
scores range from zero to four. In this experiment, 80% of the SUSHI data set is training
data, while the rest is test data. Mean absolute error (MAE) is adopted as the evaluation
criterion and is calculated by the formula below, where $p_{u,i}$ stands for the predicted
score, $r_{u,i}$ stands for the real score and T is the number of train data.

$$\mathrm{MAE} = \frac{\sum_{i=1}^{T} \left| p_{u,i} - r_{u,i} \right|}{T} \qquad (17)$$
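Equation (17) translates directly into a few lines of code. The sketch below is illustrative only: the function name `mean_absolute_error` and the four example scores are hypothetical, chosen within the zero-to-four score range described above.

```python
def mean_absolute_error(predicted, actual):
    """MAE of Equation (17): the mean of |p - r| over T (predicted, real) score pairs."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not predicted:
        raise ValueError("at least one score pair is required")
    return sum(abs(p - r) for p, r in zip(predicted, actual)) / len(predicted)


# Hypothetical predicted and real scores on the 0-4 scale.
predicted = [3.5, 2.0, 4.0, 1.0]
actual = [4, 2, 3, 0]
mae = mean_absolute_error(predicted, actual)
```

A lower MAE means that the predicted scores lie closer to the real scores on average.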
4.5.1. Impacts of the Parameter µ
µ is the parameter we use in the CFUP similarity formula, and different values of µ lead to
different predicted results. This experiment aims at finding the best µ for CFUP. Here the
number of neighbor users is 30.
As we can observe from Figure 11, the MAE decreases as µ increases over [0, 0.7), while
the MAE increases as µ increases over (0.7, 1]. The MAE reaches its lowest point when
µ = 0.7, so µ = 0.7 is selected as the best parameter value.
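A parameter sweep of this kind can be sketched as a simple grid search. In the sketch below, `select_best_mu` and `toy_error` are hypothetical names; `toy_error` is a made-up stand-in curve, not the measured MAE, and in a real experiment it would be replaced by a function that runs CFUP with the given µ and returns the resulting MAE.

```python
def select_best_mu(evaluate_mae, candidates):
    """Return the (mu, mae) pair with the lowest MAE among the candidate values."""
    best_mu, best_mae = None, float("inf")
    for mu in candidates:
        mae = evaluate_mae(mu)
        if mae < best_mae:
            best_mu, best_mae = mu, mae
    return best_mu, best_mae


def toy_error(mu):
    """Made-up error curve used only to exercise the search (not experimental data)."""
    return (mu - 0.3) ** 2


# Candidate values 0, 0.1, ..., 1, matching the grid on the horizontal axis of Figure 11.
candidates = [i / 10 for i in range(11)]
best_mu, best_mae = select_best_mu(toy_error, candidates)
```

Because the search only compares MAE values, the same routine applies to any grid of candidate values.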
Figure 11. The impact of parameter µ on MAE (horizontal axis: µ from 0 to 1; vertical axis: MAE).
4.5.2. Comparison Results with Collaborative Filtering Algorithm
This experiment attempts to show whether the collaborative filtering algorithm based
on user profiles (CFUP) has better performance in terms of prediction precision. In order
to answer this question, we make a comparison with the traditional collaborative filtering
algorithm (CF), and the result is shown in Figure 12. The abscissa is the number of
neighbors.
Figure 12. The comparison results of CF and CFUP (horizontal axis: neighbor number from 0 to 100; vertical axis: MAE; series: CF and CFUP).
It can be observed from Figure 12 that MAE decreases as the number of neighbor users
increases for both algorithms. It changes sharply at the beginning, and then more
slowly before finally seeming to stabilize. However, the overall recommendation
result of CFUP is better than that of the traditional CF.
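The role of the neighbor number can be illustrated with a common mean-centered, neighbor-based prediction rule. This is a sketch under stated assumptions, not the authors' code: the function name `predict_score`, the tuple layout and the example values are hypothetical, and the similarity values are taken as given, so the same routine could in principle be fed by either the traditional similarity or a profile-aware one.

```python
def predict_score(target_mean, neighbors, k):
    """Predict a score from the k most similar neighbors.

    neighbors is a list of (similarity, neighbor_score_for_item, neighbor_mean_score).
    The prediction is the target user's mean score plus the similarity-weighted
    average of each neighbor's deviation from their own mean.
    """
    top_k = sorted(neighbors, key=lambda n: n[0], reverse=True)[:k]
    norm = sum(abs(sim) for sim, _, _ in top_k)
    if norm == 0:
        # No usable neighbors: fall back to the target user's mean score.
        return target_mean
    offset = sum(sim * (score - mean) for sim, score, mean in top_k)
    return target_mean + offset / norm


# Hypothetical neighbors: (similarity, score for the item, mean score).
neighbors = [(0.9, 4, 3.0), (0.5, 2, 2.5), (0.1, 0, 2.0)]
prediction = predict_score(2.0, neighbors, k=2)
```

With k = 2 only the two most similar neighbors contribute; increasing k brings in less similar users, which is the quantity varied along the abscissa of Figure 12.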
5. Conclusions
This paper proposed a cross-platform recommender system, CPRec. By constructing
user profiles and commodity profiles, commodity recommendations are realized based
on the similarity of user and commodity. The experiments and analysis demonstrated that
social networks affect e-commerce, which will play an important role in creating economic
value for e-commerce. The simulation results showed that the Microblogs Topic Discovery
Model performs better than the LDA model, and we built users’ profiles more precisely
with the help of the proposed model. Moreover, we also improved the traditional
collaborative filtering algorithm and proposed a collaborative filtering algorithm
based on the user profiles (CFUP) by considering the similarity of users’ attributes. The
experiments with CFUP show that µ = 0.7 is the best parameter value for CFUP and that
CFUP obtains more accurate recommendation results than the traditional collaborative
filtering algorithm.
In future work, we will focus on studying the information spread path. We aim to
find the fastest and broadest path for information spreading to enhance the influence of
promotional information, because the greater the influence of cross-platform
information, the more economic value it may obtain.
Author Contributions: J.Z. and B.S. designed the proposed method and wrote the paper; J.Z. and
X.R. wrote the code and performed the experiments; J.Z. and B.S. analyzed the data; Z.C. modi‑
fied the paper and offered support. All authors have read and agreed to the published version of
the manuscript.
Funding: This research was funded by the 2020 Youth Fund Project of Fuzhou Polytechnic, grant
number FZYKJJQN202001. Additionally, the APC was funded by the 2020 Youth Fund Project
of Fuzhou Polytechnic (FZYKJJQN202001). This work is also supported by the National Natural
Science Foundation of China (62277010), the Fujian Natural Science Foundation (2021J011013 and
2020J01132452) and the Medical Innovation Project (2021CXA001).
Data Availability Statement: Not applicable.
Acknowledgments: The authors thank the 2020 Youth Fund Project of Fuzhou Polytechnic
(FZYKJJQN202001) for covering the costs to publish in open access and the costs incurred when
writing this study. In addition, the authors thank the anonymous reviewers for their insightful com‑
ments that helped improve the quality of this study.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Gao, C.; Huang, C.; Yu, D.; Fu, H.; Lin, T.; Jin, D.; Li, Y. Item Recommendation for Word‑of‑Mouth Scenario in Social E‑Commerce.
IEEE Trans. Knowl. Data Eng. 2022, 34, 2789–2809. [CrossRef]
2. Wigand, R.T.; Benjamin, R.I.; Birkland, J.L.H. Web 2.0 and beyond: Implications for Electronic Commerce. In Proceedings of the
10th International Conference on Electronic Commerce, Innsbruck, Austria, 19–22 August 2008; pp. 1–5.
3. Tajvidi, M.; Wang, Y.; Hajli, N.; Love, P.E. Brand Value Co‑creation in Social Commerce: The Role of Interactivity, Social Support,
and Relationship Quality. Comput. Hum. Behav. 2021, 115, 105238. [CrossRef]
4. Adam, I.O.; Alhassan, M.D. The Role of Social Media on the Diffusion of E‑Government and E‑Commerce. Inf. Resour. Manag. J.
2021, 34, 63–79. [CrossRef]
5. Kang, S.; Lee, D.; Kweon, W.; Yu, H. Personalized Knowledge Distillation for Recommender System. Knowl. ‑Based Syst. 2022,
239, 107958. [CrossRef]
6. Forestiero, A. Heuristic recommendation technique in Internet of Things featuring swarm intelligence approach. Expert Syst.
Appl. 2022, 187, 115904. [CrossRef]
7. Fijalkowski, D.; Zatoka, R. An Architecture of a Web Recommender System Using Social Network User Profiles for E‑Commerce.
In Proceedings of the 2011 Federated Conference on Computer Science and Information Systems (FedCSIS), Szczecin, Poland,
19–21 September 2011; pp. 287–290.
8. Ma, H.; Zhou, T.C.; Lyu, M.R.; King, I. Improving Recommender Systems by Incorporating Social Contextual Information. ACM
Trans. Inf. Syst. 2017, 29, 1–23. [CrossRef]
9. Zhao, W.X.; Li, S.; He, Y.; Wang, L.; Wen, J.R.; Li, X. Exploring demographic information in social media for product recommen‑
dation. Knowl. Inf. Syst. 2016, 49, 61–89. [CrossRef]
10. Zhao, W.X.; Li, S.; He, Y.; Chang, E.Y.; Wen, J.R.; Li, X. Connecting Social Media to E‑Commerce: Cold‑Start Product Recommen‑
dation Using Microblogging Information. IEEE Trans. Knowl. Data Eng. 2016, 28, 1147–1159. [CrossRef]
11. Pan, S.J.; Zhao, L.; Yang, Q. A unified framework of active transfer learning for cross‑system recommendation. Artif. Intell. 2017,
245, 38–55.
12. Xiang, D.; Zhang, Z. Cross‑border e‑commerce personalized recommendation based on fuzzy association specifications com‑
bined with complex preference model. Math. Probl. Eng. 2020, 2020, 8871126. [CrossRef]
13. Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent dirichlet allocation. J. Mach. Learn. Res. 2003, 3, 993–1022.
14. Zhou, X.; Chen, L. Migrating social event recommendation over microblogs. Proc. VLDB Endow. 2022, 15, 3213–3225. [CrossRef]
15. Djenouri, Y.; Belhadi, A.; Srivastava, G.; Lin, C.W. Toward a Cognitive‑Inspired Hashtag Recommendation for Twitter Data
Analysis. IEEE Trans. Comput. Soc. Syst. 2022, 9, 1748–1757. [CrossRef]
16. Tahmasebi, H.; Ravanmehr, R.; Mohamadrezaei, R. Social movie recommender system based on deep autoencoder network
using Twitter data. Neural Comput. Appl. 2021, 33, 1607–1623. [CrossRef]
17. Weng, J.; Lim, E.P.; Jiang, J.; He, Q. Twitterrank: Finding topic‑sensitive influential twitterers. In Proceedings of the 3rd ACM
International Conference on Web Search and Data Mining (WSDM 2010), New York, NY, USA, 4–6 February 2010; pp. 261–270.
18. Zhao, W.X.; Jiang, J.; Weng, J.; He, J.; Lim, E.P.; Yan, H.; Li, X. Comparing twitter and traditional media using topic models. In
Proceedings of the European Conference on Information Retrieval, Heidelberg, Berlin, 18–21 April 2011; pp. 338–349.
19. Li, L.; Zheng, L.; Yang, F.; Li, T. Modeling and broadening temporal user interest in personalized news recommendation. Expert
Syst. Appl. 2014, 41, 3168–3177. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual au‑
thor(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.
