Psychometric Properties of the Japanese Version Of

Vol. 67 No.
2 February 2024 Journal of Pain and Symptom Management 157
Original Article
Psychometric Properties of the Japanese Version of the

Edmonton Functional Assessment Tool 2
Yuto Zogo, RPT, MS, Keita Hoshino, RN, MS, Kensuke Serizawa, ROT, Akira Iwase, ROT, and
Tetsuya Tsuji, MD, PhD
Department of Rehabilitation Medicine/ Palliative Care Team (Y.Z., K.S., A.I.), Yokohama City Minato Red Cross Hospital, Kanagawa,
Japan; Department of Palliative Care Nursing/ Palliative Care Team (K.H.), Yokohama City Minato Red Cross Hospital, Kanagawa, Japan;
Department of Rehabilitation Medicine (T.T.) Keio University School of Medicine, Tokyo, Japan
Abstract
Context. Physical symptoms such as pain and cancer-related fatigue limit physical function and activities of daily living among
patients with terminal cancer, which can lead to a decline in quality of life. Therefore, comprehensive functional impairments
should be evaluated to determine the progression of the disease and the effectiveness of palliative treatment.
Objective. To validate the psychometric properties of the Japanese version of the Edmonton Functional Assessment Tool 2
(EFAT2-J).
Methods. We developed a Japanese version of the EFAT-2 in accordance with international guidelines. To verify the reliabil-
ity and validity of the EFAT2-J, patients were evaluated by a physiotherapist and a nurse separately, and correlations with existing
evaluation scales for physical function, physical symptoms, and quality of life were analyzed, respectively. The significance level
was set at 5%.
Results. Twenty patients participated in the reliability measurement. The average EFAT2-J scores were 7.95 § 4.12 for physi-
cal therapists and 7.20 § 4.23 for nurses, and the intraclass correlation coefficient was 0.95. The weighted kappa coefficient (k)
for each item was 0.57−1.00. Fifty-five patients participated in the validity measurement. The EFAT2-J showed significant corre-
lations with Eastern Cooperative Oncology Group Performance Status and the Karnofsky Performance Scale, Barthel Index,
and the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire-Core 15-Palliative Care
sub-item “physical function.”
Conclusion. These results indicate that the EFAT2-J has robust psychometric properties and is useful for evaluating physical
function in patients with terminal cancer, and thus may be an acceptable clinical instrument in research and practice. J Pain
Symptom Manage 2024;67:157−166. © 2023 The Authors. Published by Elsevier Inc. on behalf of American Academy of Hospice and Pallia-
tive Medicine. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/)
Key Words
Evaluation, palliative care, physical function, rehabilitation, terminal cancer
Key Message Introduction

This article describes a prospective observational study Cancer-related fatigue,1 respiratory distress, and
that aimed to validate the psychometric properties of the bone metastasis2 are known to limit physical function
Japanese version of the Edmonton Functional Assessment and activities of daily living (ADL) in patients with ter-
Tool 2 (EFAT2-J). The results indicate that the EFAT2-J minal cancer, resulting in decreased quality of life
has robust psychometric properties and is useful for eval- (QOL) and the need for appropriate rehabilitation
uating physical function in patients with terminal cancer. and responses by palliative care teams.
Address correspondence to: Tetsuya Tsuji, MD, PhD, Department Accepted for publication: 2 November 2023.
of Rehabilitation Medicine, Keio University School of Medi-
cine, 35 Shinanomachi, Shinjuku-ku, Tokyo 160-8582, Japan.
E-mail: t.tsuji@keio.jp
Ó 2023 The Authors. Published by Elsevier Inc. on behalf of 0885-3924/$ - see front matter
American Academy of Hospice and Palliative Medicine. This is an https://doi.org/10.1016/j.jpainsymman.2023.11.001
open access article under the CC BY license
(http://creativecommons.org/licenses/by/4.0/)
158 Zogo et al. Vol. 67 No. 2 February 2024
The aim of rehabilitation in patients with terminal Although the EFAT-2 is widely used in Europe and
cancer is to achieve the best possible QOL for patients the United States, the Japanese version has not yet
and their families.3 In such patients, rehabilitation been validated. The goals of this study were to validate
such as exercise therapy has been shown to improve the psychometric properties of the Japanese version of
physical function,4,5 QOL,5 and fatigue.4,5 the EFAT-2 (EFAT2-J) and to clarify its reliability and
To administer appropriate cancer rehabilitation to validity. The EFAT2-J is expected to be widely used as a
patients with terminal cancer, it is necessary to assess their physical function assessment tool for patients with ter-
actual physical function status and self-care ability. East- minal cancer, and to be an acceptable clinical instru-
ern Cooperative Oncology Group Performance Status ment in research and practice.
(ECOG-PS)6 and the Karnofsky Performance Scale
(KPS)7 are commonly used as assessment tools for physi-
cal function in patients with cancer. ECOG-PS is a five- Methods
point scale ranging from zero to four that is primarily
used to assess physical function during active treatment Study Design and Participants
phases such as chemotherapy. It is widely used in cancer This prospective observational study included all
care settings around the world and has been reported to patients who had been admitted to Yokohama City Min-
have high reliability and validity.8−11 The KPS is an assess- ato Red Cross Hospital (Yokohama-city, Kanagawa,
ment scale that provides scores from 0% to 100% on an Japan) from January 2021 to March 2022 and those for
11-point scale depending on the medical condition, whom the palliative care team had received a consulta-
work, and status regarding assistance in daily life, and its tion request from the attending physician. Within this
high reliability and validity have been verified.10−13 time period, patients were consecutively sampled by
However, for both ECOG-PS and KPS, when the type of psychometric property.
evaluation target is a patient with terminal cancer, the The eligibility criteria were as follows: 1) patients
score is clustered at the lower limit and a floor effect is with cancer whose palliative care team received a
likely to occur. In addition, slight changes in physical request for consultation from the attending physician,
function cannot be evaluated, making them insuffi- and 2) cases for which oral and written consent was
cient for determining the effect of cancer obtained for participation in this study. The exclusion
rehabilitation.14,15 Palliative Performance Status16 and criteria were: 1) cases estimated to have a prognosis of
the Edmonton Functional Assessment Tool 2 (EFAT- less than one month; 2) cases judged by the researcher
2)14 have been developed in the field of palliative care. to be deemed unsuitable for the safe conduct of the
The Palliative Performance Status guides the assess- study because of clinical signs of death22; 3) patients
ment of functional performance and provides a frame- receiving curative treatment for cancer at the time of
work for measuring progressive decline in physical participation; and 4) patients under the age of 18 years.
function. However, to our knowledge, no studies have Terminal cancer is defined in Dietz’s classification23
been conducted on rehabilitation using this tool. On as the period from the maintenance stage, when the
the other hand, the EFAT-2 is a physical function tumor is growing, recurrence and metastasis are pro-
assessment tool specialized for patients with terminal gressing, and functional impairment is progressing, to
cancer, and its validity and reliability have been the palliative care-oriented stage, when the patient is
verified.14,17,18 The EFAT-2 has the potential to detect no longer able to receive active treatment.
more clinically relevant changes and is an excellent In this study, patients who requested a consultation
tool for measuring and recording more rarefied gains with the palliative care team during their hospitaliza-
in function.14,19 As a result, the physical symptoms asso- tion and those who were not treated for curative pur-
ciated with cancer itself and its treatment can be poses during their hospitalization were included in the
assessed from multiple perspectives. best supportive care policy. Therefore, the stage of the
The EFAT-2 has the following features. It can com- recruited patients corresponds to terminal cancer.
prehensively assess functional impairments specific to Furthermore, age, gender, cancer type, cancer stage
patients with terminal cancer, including physical, men- (TNM classification),24 comorbidities, length of hospi-
tal, and social functions of daily living. The assessment tal stay, rehabilitation, primary reasons for hospitaliza-
items are a low load on patients, making it easy to tion, and patient outcomes were investigated from the
administer even to those with terminal cancer. The patients’ medical records.
floor effect is unlikely to occur and the scoring criteria
are clarified in a manual. The system is expected to be Ethical Considerations
useful as a tool for sharing information among multiple This study was approved by the Medical Ethics Com-
professions, selecting palliative treatment and rehabili- mittee of Yokohama City Minato Red Cross Hospital
tation content, and determining discharge destinations (approval no. 2020-32). The purpose and content of
and directions.20,21 the research and the use of data were fully explained to
Vol. 67 No. 2 February 2024 Japanese Edmonton Functional Assessment Psychometric 159
the patients in writing, and the research was conducted In addition, to summarize the patient’s PS, the scale
after obtaining consent. has one final item (range 0−3), which is not calculated
as part of the total score.
Palliative Care Team
The study site was an acute-care hospital but has a EFAT2-J Creation Procedure
palliative care team consisting of multidisciplinary The translation procedure followed the guidelines
professionals (e.g., doctors, nurses, pharmacists, clini- set forth by the International Society for Pharmacoeco-
cal psychologists, dental hygienists, medical social nomics and Outcomes Research (ISPOR) task force.27
workers, rehabilitation specialists) as a specialized First, as advance preparation, we contacted the original
palliative consultation service in which the attending authors and editorial company (SAGE Publications) to
physician requests a consultation with the palliative obtain permission to create a Japanese version of the
care team regarding treatment for the patient’s phys- EFAT-2. It was difficult to contact the original author,
ical and psychiatric symptoms. In response to a con- so permission was obtained from the co-author.17
sultation request, the palliative care team proposes, First, the EFAT-2 was translated by two physiothera-
for example, treatment with opioid analgesics to treat pists who were fluent in English and experts in cancer
the patient’s secondary pain caused by the disease. rehabilitation. The translated versions were then com-
Ward rounds are conducted five times a week by doc- pared and integrated to create a single version. Second,
tors, nurses, and pharmacists. During regular confer- a doctor who was an expert in cancer rehabilitation but
ences, the treatment plan, patient distress, life not involved in the translation confirmed the trans-
expectancy, and patient and family wishes are shared. lated version.
Rehabilitation specialists participate in the rounds In addition, we asked a science communications
and conferences once a month and provide consulta- company to reverse-translate the translated EFAT-2 so
tion on indications for rehabilitation, guidance on that the original author could confirm whether the
movement methods to avoid pain, positioning, and expression of the translated EFAT-2 had the same con-
breathing methods. cepts and meanings as the original version. Finally, we
asked the original authors to review the back-transla-
tions and compared those versions with the original to
Sample Size check whether they were equivalent.
To verify the reliability of the EFAT2-J, estimates and
confidence interval (CI) widths were set with reference Psychometric Properties
to previous studies, and sample sizes were calculated The reliability and validity of the EFAT2-J need to be
using the method reported by Bonett et al.25 As a evaluated to verify its accuracy in regard to the target
result, when the estimated value was 0.9 and the CI population28 in Japan. COnsensus-based Standards for
width was 0.1, the sample size was 15 patients, and the the selection of health Measurement INstruments
target number of patients was set to 20 in consideration (COSMIN)29 is an international standard for scale
of feasibility. To verify the validity of the EFAT2-J, the research, and clearly shows the points that researchers
estimated value was 0.8 and the CI width was 0.1, refer- should keep in mind when examining scale characteris-
ring to previous studies. The sample size was then tics. In the COSMIN checklist, scale characteristics are
calculated using the method reported by Moinester divided into three areas: reliability, validity, and respon-
et al.26 As a result, the sample size was calculated to be siveness.30 Although distinct from the scale characteris-
55 patients, and the target number of patients was set tics, interpretability is also included because it is an
at 55, also in light of feasibility. important concept in deciding whether scale scores
can be used in clinical settings. Therefore, the present
Tool study examined the reliability, validity, and interpret-
The original EFAT-214 was developed as a physical ability of the Japanese version of the EFAT-2.
function assessment tool for specialized for patients The reliability of the EFAT2-J was evaluated using
with terminal cancer. The EFAT-2 is a rating scale com- inter-rater reliability. A physiotherapist and a nurse
posed of 10 question items and PS designed to measure evaluated the same patient separately. The assessment
physical symptoms and physical function in patients period for each patient was set within three days after
with terminal cancer. The 10-question items evaluate the palliative care team received the consultation
the status of various main functions, including commu- request, referring to previous studies.18
nication, mental status, pain, dyspnea, balance, mobil- In addition, EFAT-2 evaluation manuals31 were dis-
ity, locomotion, fatigue, motivation, and ADL. Each tributed to each evaluator, and preuse education was
item is scored on a four-point scale from zero (func- provided. As preuse education, the evaluator was
tional) to three (severely impaired), with the total score instructed three times (about 30 minutes each) on the
ranging from zero to 30. EFAT-2 method and evaluation manual.
Validity was tested for content validity, criterion indicate a better QOL, whereas higher symptom scales
validity, and construct validity. Content validity is a con- indicate a lower QOL. All scores were transformed
cept that looks at the scale as a whole and reassesses from 0 to 100 by linear transformation according to the
whether it contains the necessary questions to assess EORTC QLQ-C15-PAL scoring manual.42 Because the
the concepts32 for which the EFAT2-J was designed. EORTC QLQ-C15-PAL is a subjective assessment, con-
The content validity of the EFAT-2 has been tested,14 struct validity predicted moderate correlations with
and it is expected to have a similar scale structure. In common sub-items (physical function, fatigue, pain,
terms of criterion validity, correlations between the and dyspnea). On the other hand, the noncommon
ECOG-PS and KPS, the gold standard assessment tools sub-items and global QOL predicted weak correlations.
for physical function in patients with cancer, were eval- All patients were evaluated simultaneously using the
uated. Prior to the revision of the EFAT-2, the EFAT33 BI, ESAS-r-J, EORTC QLQ-C15-PAL, and EFAT2-J.
made the same assessments. However, the EFAT-2 is With the validation of each construct concept valida-
considered to be a more useful tool for the assessment tion, screening of each assessment with the EFAT2-J is
of physical function in patients with cancer and is expected.
expected to correlate well with the ECOG-PS and KPS. Furthermore, the distribution of scores on the
In addition, construct validity was checked to deter- EFAT2-J and physical function assessments, ECOG-PS
mine whether the EFAT2-J reflects the ability to per- and KPS, as well as the “floor−ceiling effect,” were
form ADL, physical symptoms, and QOL. The ability to examined. The floor−ceiling effect affects the ability of
perform ADL was validated using the Barthel Index the measure to detect deterioration or improvements
(BI),34 which is scored on a scale from 0 to 100 points, in physical function, respectively. The ECOG-PS was
with higher scores indicating higher daily indepen- expected to have a higher percentage at the lower end
dence. Construct validity with the BI has been veri- of the scale because of its narrow rating scale of zero to
fied,18 and the same strong correlation is expected. For four. We predicted that although the KPS is an 11-point
physical symptoms, the Edmonton Symptom Assess- rating scale, this study focused on late-stage cancers, so
ment System Revised Japanese version (ESAS-r-J)31 was the percentages would be less concentrated on the
used. The ESAS-r is a validated, patient-reported, symp- lower limit compared with the ECOG-PS, but more
tom assessment tool that is widely used for patients with concentrated on the lower limit compared with the
terminal cancer.35−37 The ESAS-r assesses nine physical EFAT2-J. The floor−ceiling effects on the ECOG-PS
and psychological symptoms (pain, tiredness, drowsi- and KPS scores were used as reference values to exam-
ness, nausea, lack of appetite, shortness of breath, ine the EFAT2-J.
depression, anxiety, and well-being) on an 11-point Lik-
ert scale (0−10; 0 = no symptoms, 10 = worst possible Statistical Analysis
symptoms). The ESAS-r-J has been validated.38 We also Descriptive statistics were determined by calculating
evaluated the severity of all nine symptoms of the the total number, mean § standard deviation, and
ESAS-r on a four-point verbal rating scale (none, mild, median (1st to 3rd interquartile range) for each item
moderate, or severe), and an overall score for all nine related to patient characteristics. The statistical analysis
items was used.39 The ESAS-r was predicted to have a method for psychometric properties is as follows. The
moderate correlation with the EFAT2-J because it is a intraclass correlation coefficient (ICC) and weighted
subjective tool that assesses only physical symptoms. kappa (k) were used for inter-rater reliability. In this
For QOL, the European Organization for Research study, we used ICC (2.1), which shows absolute agree-
and Treatment of Cancer Quality of Life Question- ment. An ICC of 0.7 is commonly used as a threshold
naire-Core 15-Palliative Care (EORTC QLQ-C15- of acceptable reliability.43 Weighted k statistics are fre-
PAL)40 was used. The EORTC QLQ-C15-PAL has been quently used for scale items that include two or more
widely used in the palliative care field in recent years possible responses to reflect the degree of interob-
and has been validated in Japan.41 It is composed of 15 server disagreement more accurately. According to
questions and includes two functional scales (physical conventional cutoffs for interpreting weighted k statis-
and emotional function), two multi-item symptom tics, reliability has been interpreted as follows: almost
scales (fatigue and pain), and five single-item symptom perfect (0.81−1.00), substantial (0.61−0.80), moderate
scales (nausea and vomiting, dyspnea, insomnia, appe- (0.41−0.60), fair (0.21−0.40), or slight (0.00−0.20).44
tite loss, and constipation). Response options to these For internal consistency, we used Cronbach’s a coeffi-
14 items are scored on a four-point Likert scale: 1 (not cient. Cronbach’s a was calculated to determine the
at all), 2 (a little), 3 (quite a bit), and 4 (very much). degree of internal reliability of the 10 items of the
An additional item, the global health/quality-of-life EFAT2-J. If there are many items in the assessment that
scale, uses a seven-point numerical scale from one correspond to the same underlying dimension, the
(very poor) to seven (excellent). Higher scores on the score of one item should correlate with those of the
functional scale and global health/quality-of-life scale other items and the a value should be high. If
Table 1
Basic and Medical Information
Reliability (n = 20) Validity (n = 55)
Age (years) 74.7 § 8.93 73.9 § 10.8

Gender (n) Male 12 30
Female 8 25
Cancer type (n) Lung 8 12
Gastrointestinal 6 17
Breast 4 11
Genitourinary 2 6
Other 0 9
Cancer stage (TNM system; n) (I/ II/ III/IV) 0/0/2/18 0/0/6/49
Comorbidities Cardiac disease 0 2
Respiratory disease 2 1
Orthopedic disease 2 5
Cerebrovascular disease 1 2
Length of hospital stay (days) 16 (9.5−25.0) 14 (9.0−23.0)
Rehabilitation Physical therapy 6 24
Occupational therapy 1 6
Speech and language therapy 0 1
Primary reason for hospitalization Worsening of disease condition 8 30
Pain management 7 16
Palliative radiation therapy 3 7
Palliative chemotherapy 2 2
Outcome for patient (n) (Discharge/transfer to hospital/death) 18/1/1 37/8/10
Cronbach’s a is too high, however, it suggests a high and validity are shown in detail in Table 1. For reliabil-
level of item redundancy. Cronbach’s a should be ity and validity testing, the following patients were
above 0.70, but probably not higher than 0.90.45 excluded from the 44 and 93 recruited during the
Spearman’s rank correlation coefficient was used for study period, respectively: one and five patients without
validation. In this study, we validated the strength of cancer; five and eight patients who did not consent to
the correlation coefficients between the total score of the study; nine and 12 patients whose prognosis was
the EFAT2-J and each measure. Following established estimated to be less than one month; three and three
guidelines, the following categories were used for patients who had clinical signs of death and were
Spearman’s rank correlation coefficient s: ≥ 0.5, deemed by the researchers to be unfit to safely partici-
strong; ≥ 0.3 to < 0.5, moderate; and < 0.3, weak.46 pate in the study; and six and 10 patients who were
The ECOG-PS, KPS, BI, and ESAS-r-J were compared receiving radical cancer treatment at the time of the
based on total scores. For the EORTC QLQ-C15-PAL, study.
each variable was calculated according to the scoring
manual42 and compared using the total score of each Psychometric Properties
sub-item. Fig. 1 shows the average score for each item on the
Regarding interpretability, the score distribution of EFAT2-J in all 55 patients. The mean total score was
each physical function evaluation (EFAT2-J, ECOG-PS, 10.31 § 6.28. The mean score for each rater of the
KPS) was shown in a histogram, and the skewness was EFAT2-J was 7.95 § 4.12 for the physical therapists and
calculated. In addition, a floor−ceiling effect was rec- 7.20 § 4.23 for the nurses.
ognized when the ratio of nonscoring and full-scoring Cronbach’s a coefficient for internal consistency
individuals exceeded 15% of the total.47 across the EFAT2-J was 0.81 for physical therapists and
The significance level was set at two-sided P < 0.05. 0.82 for nurses. The inter-rater reliability (ICC 2.1) for
IBM SPSS Statistics 27.0 (Armonk, NY) was used for all the EFAT-2J total score was 0.95 (95% CI; 0.83−0.98),
statistical analyses. and the standard error of the mean was 1.03. Table 2
shows the results of the analysis of the reliability of
each assessment item on the EFAT2-J. The weighted k
Results coefficients for each item ranged from 0.57 to 1.00.
Table 3 shows the results of the analysis of the validity
Sample Characteristics of the EFAT2-J. A significant correlation was found
As this was a prospective observational study, we col- between the EFAT2-J and ECOG-PS, KPS, BI, ESAS-r-J,
lected data until the specified target number of sam- and EORTC QLQ-C15-PAL sub-item (physical func-
ples was reached. Therefore, reliability and validity tion, fatigue, pain) according to Spearman’s rank cor-
were based on 20 and 55 patients, respectively. The relation coefficient (P = 0.94, −0.95, −0.95, 0.30,
characteristics of each sample in regard to reliability −0.84, 0.49, and 0.28, respectively).
Fig. 1. The average score for each item on the EFAT2-J (n = 55). EFAT2-J, the Japanese version of the Edmonton Functional
Assessment Tool 2; ADL, activities of daily living.
Figs. 2−4. There were zero no-scorers and zero full-

Table 2 scorers on the EFAT2-J, indicating the absence of any
Reliability of the EFAT2-J Items (n = 20)
floor−ceiling effects. By contrast, regarding ECOG-PS,
Weighted k 95% confidence interval SEM
(lower, upper) there were no full-scorers or ceiling effect, but there
were 10 no-scorers and a floor effect (18.2%), and the
Communication 1.00 1.00, 1.00 0.00 scores tended to cluster in the lower end of the range.
Mental status 0.78 0.37, 1.19 0.21
Pain 0.75 0.57, 0.93 0.09 Regarding the KPS, there were zero no-scorers and
Dyspnea 0.96 0.90, 1.02 0.03 full-scorers, and no floor−ceiling effect was observed.
Balance 0.94 0.83, 1.06 0.06 The KPS histogram showed a tendency for scores to
Mobility 0.93 0.79, 1.07 0.07
Locomotion 0.64 0.43, 0.86 0.11 cluster in the middle.
Fatigue 0.66 0.40, 0.93 0.13
Motivation 0.57 0.25, 0.89 0.16
ADL 0.78 0.62, 0.95 0.08
Performance status 0.87 0.77, 0.97 0.05 Discussion
SEM: standard error of the mean, ADL: activities of daily living. The purpose of this study was to create and validate
the psychometric properties of a Japanese version
The results regarding the EFAT2-J, the scores of the EFAT-2, the reliability and validity of which
(mean, minimum, maximum) for each physical func- have been verified in other English-speaking
tion assessment, and the floor−ceiling effect are shown countries.14,17,18 As a result, the EFAT2-J developed in
in Table 4. Additionally, histograms are shown in this study in accordance with the guidelines of the
Table 3
Validity of the EFAT2-J Items
Variable (vs, variable EFAT2-J) Spearman’s rank 95% confidence interval P-value
(n = 55) correlation (lower, upper) (Prob>|r|)
ECOG-PS 0.94 0.90, 0.97 <.001a

KPS −0.95 −0.97, −0.91 <.001a
BI −0.95 −0.97, −0.92 <.001a
ESAS-r-J 0.30 0.03, 0.53 0.03a
EORTC QLQ-C15-PAL Global quality of life −0.01 −0.28, 0.26 0.93
Physical function −0.84 −0.91, −0.74 <.001a
Emotional function −0.18 −0.43, 0.10 0.20
Fatigue 0.49 0.26, 0.68 <.001a
Nausea and vomiting 0.24 −0.25, 0.30 0.86
Pain 0.28 0.01, 0.52 0.04a
Dyspnea 0.18 −0.10, 0.43 0.20
Insomnia 0.23 −0.04, 0.48 0.09
Appetite loss 0.23 −0.05, 0.47 0.09
Constipation 0.14 −0.14, 0.39 0.32
EFAT2-J: Japanese version of the Edmonton Functional Assessment Tool 2, ECOG-PS: Eastern Cooperative Oncology Group Performance Status, KPS: Karnofsky
Performance Scale, BI: Barthel Index, ESAS-r-J: Edmonton Symptom Assessment System Revised Japanese version, EORTC QLQ-C15-PAL: European Organization
for Research and Treatment of Cancer Quality of Life Questionnaire Core 15 Palliative Care.
r: Spearman’s rank correlation coefficient.
a
P < 0.05.
Table 4 The ISPOR Task Force has stated that inconsistent

Physical Function Evaluation Scores and Floor−Ceiling translation methodologies make comparisons between
Effects (n = 55) the same measures difficult, and that measures that
EFAT2-J ECOG-PS KPS
have not gone through the proper translation process
Score (min−max) 30−0 4−0 0−100 are inappropriate for international collaborative
Average score 10.3 § 6.28 2.45 § 1.09 54.2 § 20.9 research.26 Therefore, measures should be translated
Lowest score 26 4 10 according to the formal procedure laid out by the
Highest score 1 1 90
Floor effect (%) 0 (0.0%) 10 (18.2%)a 0 (0.0%) guidelines of the ISPOR Task Force. In this study, trans-
Ceiling effect (%) 0 (0.0%) 0 (0.0%) 0 (0.0%) lation was performed according to this formal proce-
EFAT2-J: Japanese version of the Edmonton Functional Assessment Tool 2, dure. As a next step, this study used the EFAT2-J to
ECOG-PS: Eastern cooperative oncology group performance status, KPS: Kar-
nofsky performance scale.
examine the psychometric properties of Japanese
a
Floor effect. patients.
Concerning the inter-rater reliability (ICC 2.1) of
ISPOR Task Force was shown to have excellent reliabil- the EFAT2-J total score, as expected, it was confirmed
ity and validity, and is therefore expected to be useful to have high reliability (0.95). In a previous study,18 the
as a physical function evaluation tool for patients with ICC (3.1) was 0.85, which was the same result as this
terminal cancer. To our knowledge, this is the first study. In addition, the evaluators in the present study
study to clarify the reliability and validity of a Japanese were physiotherapists and nurses who were shown to
version of the EFAT-2, which is important because eval- have high inter-rater reliability, even though they had
uation scales for patients with terminal cancer must be different occupations. As for the inter-rater reliability
able to evaluate a wide range of functions and abilities of each evaluation item, the weighted k coefficient was
from independent to almost bedridden patients.19 0.57−1.00, confirming that the reliability was almost
Fig. 2. Histogram of EFAT2-J scores (n = 55). EFAT2-J, Japanese version of the Edmonton Functional Assessment Tool 2.
Fig. 3. Histogram of ECOG-PS (n = 55). ECOG-PS: Eastern cooperative oncology group performance status.
Fig. 4. Histogram of KPS scores (n = 55). KPS: Karnofsky performance scale.
perfect, as expected. The EFAT-2 has been reported to oxygen dose and did not correlate with the subjective
have a k coefficient of 0.17−0.96,17 and the results of QOL evaluation. No other items were present in the
the present study were high overall. The reason for this EFAT2-J sub-items, so it is possible that no correlation
may be that the use of the evaluation manual and the was obtained. Global QOL may also have been subjective
provision of education before use were effective. and comprehensive and not correlated. Another factor
In terms of content validity, the overall internal con- was that the type of cancer was not identified. Different
sistency of the EFAT2-J was 0.81−0.82 (Cronbach’s a types of cancer have different effects on global QOL.48
−50
coefficient), the same result as in the previous study.14 This needs to be examined in a future study.
Cronbach’s a coefficient should exceed 0.70, but it is We also verified the floor−ceiling effect of the
desirable not to exceed 0.90.45 Therefore, it was con- EFAT2-J and each physical function evaluation
firmed that the results of this study were valid and that (Table 4). Consistent with expectations, the lowest
the items were consistent. ECOG-PS score was clustered at 15% or higher
Criterion validity was demonstrated by the strong (18.2%), and a floor effect was recognized. As previ-
positive correlations observed between ECOG-PS ously reported,14,15 the ECOG-PS score tends to cluster
(P = 0.94) and the EFAT2-J, and the strong negative at the lower end of the scale and can easily produce a
correlations between the KPS (P = −0.95) and the floor effect; thus, it was reconfirmed that the ECOG-PS
EFAT2-J. The results were as predicted. score is insufficient for determining the effectiveness of
In terms of construct validity, strong correlations were cancer rehabilitation without being able to evaluate
also observed with BI (P = −0.95), an ADL evaluation slight changes in physical function when the subject of
scale, and the EORTC QLQ-C15-PAL sub-item “physical the evaluation is a patient with terminal cancer. The
function” (P = −0.84), an evaluation scale for QOL. The EFAT2-J and KPS showed no floor−ceiling effect, as
concurrent validity of BI (r = −0.77) has been confirmed there were zero no-scorers and full-scorers. In addition,
in a previous study.18 To our knowledge, this is the first EFAT2-J scores were not clustered at the lower limit,
time a high correlation has been found between ECOG- which indicates that EFAT2-J may help determine the
PS, KPS, and the EORTC QLQ-C15-PAL sub-item “physi- effectiveness of physical function assessments in
cal function.” These results suggest that the EFAT2-J is a patients with terminal cancer. On the other hand, the
valid and useful scale for evaluating physical function, the KPS had no floor effect, which was not predicted. This
ability to carry out ADL, and the physical function suggests that the effect of excluding patients with a
domains of QOL among patients with terminal cancer. prognosis of less than one month and those in the
On the other hand, we found a weak correlation with near-death stage requires further investigation. The his-
the ESAS-r-J (P = 0.30), a moderate correlation with sub- tograms in Figs. 2−4 show that the KPS values are con-
item “fatigue” (P = 0.49) on the EORTC QLQ-C15-PAL, centrated in the center, while those for the EFAT2-J
and a weak correlation with sub-item “pain” (P = 0.28). are concentrated on the left side. Therefore, we consid-
Similar to the EORTC QLQ-C15-PAL, the EFAT2-J has ered that the EFAT2-J scores may be more dispersed
sub-items for “pain” and “fatigue”; therefore, we assumed than the KPS values when excluding patients with can-
that they were related. However, the correlation was mod- cer at the end of life.
erate or weak, suggesting that the ESAS-r-J and EORTC The results of these psychometric properties indi-
QLQ-C15−PAL should be evaluated independently of cate that the EFAT2-J has high reliability and validity,
the EFAT2-J. On the other hand, “dyspnea” also exists as and is therefore expected to be useful as a physical
a sub-item, but no correlation was observed. This suggests function assessment tool for patients with terminal can-
that the EFAT2-J sub-item “dyspnea” was related to the cer. The EFAT2-J also showed associations with existing
physical function assessment measures and other con- Sharon Watanabe for the translation of and permission
cepts, suggesting that it contains a variety of constructs. to use the EFAT-2; Dinesh Samuel for teaching me
Furthermore, compared with existing measures of about the EFAT-2 manual; Nobuya Ibara for helping
physical function among patients with cancer, our find- with the translation work; Naoki Miyazaki, Kengo Naga-
ings were less likely to produce a floor effect, suggest- shima, and Noriyuki Ishida for providing statistical
ing that the EFAT2-J is a valid instrument for assessing advice at the early stage of project development; and
physical function in patients with terminal cancer. Noriko Tamakoshi for providing copyright advice.
Therefore, the EFAT2-J is capable of assessing physical Moreover, we would like to thank FORTE Science
function in frail patients without increasing patient Communications (https://www.forte-science.co.jp/)
burden. Furthermore, it is expected that the EFAT2-J for English language editing. The authors confirm that
will increase its use in Japan and allow comparison of they do not have any conflicts of interest to disclose.
results both domestically and internationally. This research received no specific funding/grant from
As mentioned above, in the future, the EFAT2-J is any funding agency in the public, commercial, or not-
expected to be widely used as a physical function assess- for-profit sectors.
ment scale for patients with terminal cancer and to be
utilized internationally in rehabilitation medicine and
clinical research.
References
Limitations and Future Recommendations 1. Hudson P, Collins A, Bostanci A, et al. Toward a system-
atic approach to assessment and care planning in palliative
This study does have some limitations. First, the care: a practical review of clinical tools. Palliat Support Care
responsiveness of the EFAT2-J was not investigated. Sec- 2016;14:161–173.
ond, because the patients in this study had cancer and 2. Yoshioka H. Rehabilitation for the terminal cancer
had requested a consultation with the palliative care team patient. Am J Phys Med Rehabil 1994;73:199–206.
after admission to a general ward, the results may not 3. Santiago-Palma J, Payne R. Palliative care and rehabilita-
reflect those of patients with a prognosis of less than one tion. Cancer 2001;92:1049–1052.
month or those in a near-death period who would be 4. Oldervoll LM, Loge JH, Lydersen S, et al. Physical exer-
admitted to hospice care. Based on the above, as a pros- cise for cancer patients with advanced disease: a randomized
pect, changes in scores before and after rehabilitation controlled trial. Oncologist 2011;16:1649–1657.
need to be investigated, and the reactivity of the EFAT2-J 5. Segal RJ, Reid RD, Courneya KS, et al. Resistance exer-
needs to be verified on a larger scale. In addition, it will cise in men receiving androgen deprivation therapy for pros-
also be necessary to verify whether the EFAT2-J actually tate cancer. J Clin Oncol 2003;21:1653–1659.
reflects the effects of rehabilitation in practice. Further- 6. Oken MM, Creech RH, Tormey DC, et al. Toxicity and
more, whether the EFAT2-J is suitable for patients with response criteria of the eastern cooperative oncology group.
cancer at the end of life, such as those in multicenter and Am J Clin Oncol 1982;5:649–655.
palliative care units, needs to be confirmed. 7. Karnofsky DA, Walter H, Abelmann, et al. The use of the
nitrogen mustards in the palliative treatment of carcinoma.
With particular reference to bronchogenic carcinoma. Can-
cer 1948;1:634–656.
Conclusion
8. Conill C, Verger E, Salamero M. Performance sta-
The EFAT2-J has been appropriately translated tus assessment in cancer patients. Cancer 1990;65:1864–
under the ISPOR Task Force, and has been shown to 1866.
have appropriate inter-rater reliability and content 9. Sørensen JB, Klee M, Palshof T, Hansen HH. Perfor-
validity. Furthermore, it was highlighted that the mance status assessment in cancer patients. An inter-observer
EFAT2-J had high reliability, even if the raters had variability study. Br J Cancer 1993;7:773–775.
different occupations. In addition, it showed strong 10. Vigan
o A, Dorgan M, Buckingham J, Bruera E, Suarez-
correlations between the ECOG-PS, KPS, BI, and Almazor ME. Survival prediction in terminal cancer patients:
EORTC QLQ-C15-PAL sub-item “physical function,” a systematic review of the medical literature. Palliat Med
indicating good criterion and construct validity. 2000;14:363–374.
Additionally, floor effects are unlikely to occur even 11. Buccheri G, Ferrigno D, Tamburini M. Karnofsky and
in patients with terminal cancer, which may contrib- ECOG performance status scoring in lung cancer: a prospec-
tive, longitudinal study of 536 patients from a single institu-
ute to clinical research in such patients and in reha- tion. Eur J Cancer 1996;32:1135–1141.
bilitation medicine.
12. Yates JW, Chalmer B, McKegney FP. Evaluation of
patients with advanced cancer using the Karnofsky perfor-
mance status. Cancer 1980;45:2220–2224.
Disclosures and Acknowledgments
13. Schag CC, Heinrich RL, Ganzet PA. Karnofsky perfor-
The authors thank the following for their assistance mance status revisited: reliability, validity, and guidelines.
in various stages of the study: Eduardo Bruera and J Clin Oncol 1984;2:187–193.
14. Kaasa T, Wessel J. The Edmonton Functional Assessment 33. Kaasa T, Loomis J, Gillis K, Bruera E, Hanson J. The
tool: further development and validation for use in palliative Edmonton Functional Assessment Tool: preliminary develop-
care. J Palliat Care 2001;17:5–11. ment and evaluation for use in palliative care. J Pain Symp-
15. Miyata C, Tsuji T, Tanuma A, et al. Cancer functional tom Manage 1997;12:1–10.
assessment set: a new tool for functional evaluation in cancer. 34. Mahoney FI, Barthel DW. Functional evaluation: the Bar-
Am J Phys Med Rehabil 2014;93:656–664. thel Index. Md State Med J 1965;14:61–65.
16. Anderson F, Downing GM, Hill J, Casorso L, Lerch N. 35. Suren M, Dogru S, Onder Y, et al. [The evaluation of the
Palliative performance scale (PPS): a new tool. J Palliat Care symptom clusters in patients with the diagnosis of terminal
1996;12:5–11. stage cancer]. Agri 2015;27:12–17.
17. Kaasa T, Wessel J, Darrah J, Bruera E. Inter-rater reliabil- 36. McKenzie E, Zhang L, Chan S, et al. Symptom correlates of
ity of formally trained and self-trained raters using the dyspnea in advanced cancer patients using the Edmonton symp-
Edmonton Functional Assessment tool. Palliat Med tom assessment system. Support Care Cancer 2020;28:87–98.
2000;14:509–517. 37. Pautex S, Gamondi C, Philippin Y, et al. Advance direc-
18. Gardiner A, Samuel D. Evaluation of the Edmonton tives and end-of-life decisions in Switzerland: role of patients,
Functional Assessment tool (EFAT2) within palliative care: a relatives and health professionals. BMJ Support Palliat Care
pilot study. Aging Clin Exp Res 2014;26:293–298. 2018;8:475–484.
19. Looney F, Cobbe S, Ryan A, et al. The search for a func- 38. Yokomichi N, Morita T, Nitto A, et al. Validation of the
tional outcome measure for physical therapy in specialist palli- Japanese version of the Edmonton symptom assessment sys-
ative care: an ongoing journey. Rehabil Oncol 2020;38:22–29. tem-revised. J Pain Symptom Manage 2015;50:718–723.
20. Cobbe S, Kennedy N. Physical function in hospice 39. Hayashi E, Aoyama M, Fukano F, et al. Effects of bathing
patients and physiotherapy interventions: a profile of hospice in a tub on physical and psychological symptoms of end-of-
physiotherapy. J Palliat Med 2012;15:760–767. life cancer patients: an observational, controlled study. J
21. Cobbe S, Nugent K, Real S, Slattery S, Lynch M. A profile Hosp Palliat Nurs 2022;24:30–39.
of hospice-at-home physiotherapy for community-dwelling 40. Groenvold M, Petersen MA, Aaronson NK, et al. The
palliative care patients. Int J Palliat Nurs 2013;19:39–45. development of the EORTC QLQ-C15-PAL: a shortened
22. Hui D, dos Santos R, Chisholm G, et al. Clinical signs of questionnaire for cancer patients in palliative care. Eur J Can-
impending death in cancer patients. Oncologist 2014;19:681–687. cer 2006;42:55–64.
23. Dietz Jr JH. Rehabilitation oncology. New York: John 41. Miyashita M, Wada M, Morita T, et al. Independent vali-
Wiley & Sons; 1981. dation of the Japanese version of the EORTC QLQ-C15-PAL
for patients with advanced cancer. J Pain Symptom Manage
24. Brierley J, Gospodarowicz MK, Wittekind Ch. TNM classi- 2015;49:953–959.
fication of malignant tumours 2017.
42. Groenvold M, Petersen MA, Aaronson NK, et al. EORTC
25. Bonett DG. Sample size requirements for testing and esti- QLQ-C15-PAL: the new standard in the assessment of health-
mating coefficient alpha. J Educ Behav Stat 2002;27:335–340. related quality of life in advanced cancer? Palliat Med
26. Moinester M, Gottfried R. Sample size estimation for cor- 2006;20:59–61.
relations with pre-specified confidence interval. Quant 43. Hripcsak G, Heitjan DF. Measuring agreement in medi-
Method Psychol 2014;10:124–130. cal informatics reliability studies. J Biomed Inform 2002;35:
27. Wild D, Grove A, Martin M, et al. Principles of good prac- 99–110.
tice for the translation and cultural adaptation process for 44. Landis JR, Koch GG. The measurement of observer
patient-reported outcomes (PRO) measures: report of the agreement for categorical data. Biometrics 1977;33:159–174.
ISPOR Task Force for translation and cultural adaptation.
Value Health 2005;8:94–104. 45. Streiner DL, Norman GR, Cairney J. Health measure-
ment scales: a practical guide to their development and use.
28. Mokkink LB, Terwee CB, Patrick DL. The COSMIN 5th ed. Oxford: Oxford University Press; 2015.
checklist for assessing the methodological quality of studies
on measurement properties of health status measurement 46. Cohen J. Statistical power analysis for the behavioral sci-
instruments: an international Delphi study. Qual Life Res ences. 2nd ed. Hillsdale, NJ: L. Erlbaum Associates; 1988.
2010;19:539–549. 47. Terwee CB, Bot SDM, de Boer MR. Quality criteria were
29. Amsterdam Public Health. COnsensus-based Standards proposed for measurement properties of health status ques-
for the selection of health Measurement INstruments (COS- tionnaires. J Clin Epidemiol 2007;60:34–42.
MIN). 2019. Available at: http://www.cosmin.ni. Accessed 48. Maisey NR, Norman A, Watson M, et al. Baseline quality
December 12, 2021. of life predicts survival in patients with advanced colorectal
30. Terwee CB, Mokkink LB, Knol DL, et al. Rating the cancer. Eur J Cancer 2002;38:1351–1357.
methodological quality in systematic reviews of studies on 49. Kato J, Nagahara A, Iijima K, et al. Evaluation of EORTC
measurement properties: a scoring system for the COSMIN QLQ-C30 questionnaire in patients undergoing in-hospital
Checklist. Qual Life Res 2012;21:651–657. chemotherapy for gastrointestinal cancer in Japan. J Gastro-
31. Kaasa T, Gillis K. Edmonton Functional Assessment tool enterol Hepatol 2008;23:268–272.
(EFAT) for palliative care test manual. Canada 1993: 62–75. 50. Allal AS, Gervaz P, Gertsch P, et al. Assessment of quality
in press. of life in patients with rectal cancer treated by preoperative
32. Messick S. Validity: in educational measurement. 2nd ed. radiotherapy: a longitudinal prospective study. Int J Radiat
Phoenix, AZ: Oryx Press; 1993. Oncol Biol Phys 2005;61:1129–1135.

Psychometric Properties of the Japanese Version Of

Uploaded by

Copyright:

Available Formats

You might also like

Psychometric Properties of the Japanese Version Of

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Psychometric Properties of the Japanese Version Of

Uploaded by

Copyright:

Available Formats

Vol. 67 No.

2 February 2024 Journal of Pain and Symptom Management 157

Psychometric Properties of the Japanese Version of the

Key Message Introduction

Age (years) 74.7 § 8.93 73.9 § 10.8

Figs. 2−4. There were zero no-scorers and zero full-

ECOG-PS 0.94 0.90, 0.97 <.001a

Table 4 The ISPOR Task Force has stated that inconsistent

Fig. 4. Histogram of KPS scores (n = 55). KPS: Karnofsky performance scale.

You might also like