Professional Documents
Culture Documents
Validated Assessment Scales For Cellulite Dimples On The Buttocks and Thighs in Female Patients
Validated Assessment Scales For Cellulite Dimples On The Buttocks and Thighs in Female Patients
Validated Assessment Scales For Cellulite Dimples On The Buttocks and Thighs in Female Patients
BACKGROUND New treatment methods for cellulite require globally accepted scales for aesthetic research
and patient evaluation.
OBJECTIVE To develop a set of grading scales for objective assessment of cellulite dimples on female but-
tocks and thighs and assess their reliability and validity.
MATERIALS AND METHODS Two photonumeric grading scales were created and validated for dimples in the
buttocks in female patients: Cellulite Dimples—At Rest, and Cellulite Dimples—Dynamic. Sixteen aesthetic
experts rated photographs of 50 women in 2 validation sessions. Responses were analyzed to assess inter-rater
and intra-rater reliability.
RESULTS Overall inter-rater reliability and intra-rater reliability were both “almost perfect” ($0.81, intraclass
correlation efficient and weighted kappa) for the At Rest scale. For the Dynamic scale, inter-rater reliability and
intra-rater reliability were “substantial” (0.61–0.80). There was a high correlation between the cellulite scales
and body mass index, age, weight, and skin laxity assessments.
CONCLUSION Consistent outcomes between raters and by individual raters at 2 time points confirm the
reliability of the cellulite dimple grading scales for buttocks and thighs in female patients and suggest they will
be a valuable tool for use in research and clinical practice.
Supported by Merz Pharmaceuticals GmbH, Frankfurt, Germany. The authors received an honorarium for
participating in the scale rating meeting. The authors have indicated no significant interest with commercial
supporters.
*Brazilian Center for Studies in Dermatology, Porto Alegre, Brazil; †Cosmetic Laser Dermatology, San Diego,
California; ‡Rosenparkklinik, Darmstadt, Germany; xThe Aesthetics, Vienna, Austria; ║Clı́nica Vida, São Paulo,
Brazil; ¶CHAO Institute of Aesthetic Medicine, Taipei, Taiwan; **Clı́nica Dermatológica Joana Costa, Brası́lia,
Brazil; ††Omni Aesthetic, New York, New York; ‡‡Merz Pharmaceuticals GmbH, Frankfurt, Germany; xxEuropean
Medical Aesthetics Ltd, London, United Kingdom; ║║Le Prioldy, Bieuzy les Eaux, France; ¶¶Division of Cosmetic Sciences,
University of Hamburg, Hamburg, Germany; ***Z. Paul Lorenc Aesthetic Plastic Surgery, New York, New
York; †††Lupo Center for Aesthetic and General Dermatology, New Orleans, Louisiana; ‡‡‡AZ Klina, Brasschaat,
Belgium; xxxWaldorf Dermatology Aesthetics, Nanuet, New York; ║║║Department of Cosmetology, Pacific State Medical
University of Health Ministry of Russia, Moscow, Russia; ¶¶¶SkinCare Physicians, Chestnut Hill, Massachusetts
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. All rights reserved.
· ·
ISSN: 1076-0512 Dermatol Surg 2019;45:S2–S11 DOI: 10.1097/DSS.0000000000001993
S2
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
HEXSEL ET AL
Between these fibrous strands, fat is stored in large years with a body mass index (BMI) in the range 18 to
globular adipocytes. It is believed that increased 42 kg/m2, Fitzpatrick skin Types I to VI, and even
tension in the fibrous septae as a result of either cellulite contour irregularities on both sides. Individ-
expansion of the fat cells or shortening of the septae uals were excluded if they had any dermatosis, scar-
due to connective tissue changes, such as trauma, leads ring, or tattoos on the buttock or thigh area or if they
to retraction at their cutaneous insertion points had received any previous aesthetic treatments or
causing the typical cellulite dimples.5,6 The raised procedures in these areas. Subject demographic data
areas between the dimples represent the projection of were collected including age, ethnicity, body mass
underlying adipocytes.7 In men, altered fat index (BMI) class, smoking status, Fitzpatrick skin
distribution and a crisscross rather than perpendicular phototypes, and self-reported exposure to sunlight
organization of the septae make the development of (based on a 5-point rating scale where 0 = never and 4 =
cellulite much less likely.3 The likelihood of cellulite very often). All subjects were informed of the objec-
developing is increased by a number of factors tives and targets of the study and gave consent to their
including a predisposing genetic background, photographs being rated, analyzed, and used in pub-
hormonal changes or imbalances, impaired lications for scientific purposes.
microcirculation, medications that cause water
retention, a sedentary lifestyle, unhealthy eating All subjects were photographed by a professional
habits, and Caucasian ethnic background.8–10 photographer using a Nikon D800 camera/70- to
Cellulite appearance is also worsened by age- 200-mm lens (Nikon Corporation, Tokyo, Japan).
associated skin laxity.11–13 Photographs were standardized as to framing, light-
ing, and subject orientation. The angle of the lights
In recent years, a better understanding of the etiology and distances between the platform, lights, and
of cellulite has led to the development of new treat- camera were all standardized and confirmed for each
ment approaches that target the underlying cause of photography session. The area to be captured cov-
the condition.6,14,15 As new pharmacological and ered the buttocks and the upper thighs up to about 8
technological medical advances reach the market, to 10 cm below the gluteal crease (infragluteal sul-
reliable and specific methods of cellulite assessment cus). Images included both posterior and oblique (45
become necessary to identify subjects appropriate for angle) views of both sides and were taken at rest and
therapy and to measure treatment outcomes. Cur- with maximum contraction of the musculus gluteus
rently available scales do not meet this need16,17 maximus (dynamic state). A microrelief image was
because they are not specific for cellulite dimples and also obtained.
because they are time-consuming for use in daily
clinical practice. In this article, the authors present
Creation of the Cellulite Dimple Scales
the cellulite dimple grading scales for the objective
quantification of the severity of cellulite dimples in The process of scale creation followed the method-
both static (relaxed or “at rest”) and dynamic states, ology used for the creation of the other Merz Aes-
as well as the validity and reliability of these photo- thetic Scales.18–21 In brief, the subjects’ images were
numeric scales. screened, and one subject’s image was chosen as the
base image for scale creation. Additional images were
then selected from the photographic database to
Methods
superimpose varying degrees of cellulite dimple
severity onto the base image to create composite
Subject Selection and Photographic Imaging
computer-generated images for the cellulite dimple
A photographic database of the buttocks and thighs of scale. The software used to produce the super-
120 female subjects was established to provide repre- imposed images was Adobe Photoshop. Several ver-
sentative images across the complete spectrum of cel- sions were reviewed with aesthetic experts/physicians
lulite dimple severity. The women were aged 18 to 65 and improved stepwise until a final version was
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
ASSESSMENT SCALES FOR CELLULITE DIMPLES
S4 DERMATOLOGIC SURGERY
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
HEXSEL ET AL
Figure 1. (A) Final set of Cellulite Dimple—At Rest and (B) Cellulite Dimple—Dynamic scales.
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
ASSESSMENT SCALES FOR CELLULITE DIMPLES
Correlation of the cellulite dimple scales with the skin ranged between Grade 4 (9.0%) and Grade 1 (38.6%);
laxity severity scales27 was also determined. The cor- 11.6% had no dimples. Mean ratings were compara-
relation coefficients were calculated by validation ble between validation sessions 1 and 2 at 1.8 (SD:
session for each aesthetic expert and over all aesthetic 1.26) and 1.7 (SD: 1.15), respectively, indicating mild-
experts. In addition, the Spearman correlation coef- to-moderate cellulite dimples.
ficients with bias adjustment between the at rest and
dynamic outcome measures were calculated by vali- For the “Cellulite Dimples—Dynamic” scale, the
dation session over all aesthetic experts. grading of experts at validation session 1 ranged from
Grade 4 “moderate” (11.5% of women) to Grade 1
All analyses were written, validated, and performed “mild” (20.4% of women); 5.8% had no dimples. For
using SAS version 9.3. validation session 2, grading ranged between Grade 4
(12.5%) and Grade 1 (20.8%); 7.1% had no dimples.
Results Mean ratings were again comparable for validation
sessions 1 and 2 at 2.2 (SD: 1.07) and 2.2 (SD: 1.11),
For each validation session and each cellulite scale,
respectively, indicating moderate cellulite dimples.
there were 800 planned ratings (16 aesthetic experts ·
50 subjects rated). For most experts, there was a Inter-rater Reliability
duration of 3 to 4 weeks between the 2 validation
sessions. A few aesthetic experts did not provide a The ICC and weighted kappa values for overall inter-
rating for each subject, but missing data were few rater reliability of the 2 cellulite dimple scales are
(<1%) in both validation sessions. presented by validation session in Table 1. Weighted
kappa and ICC values for inter-rater reliability were
Subject Characteristics very similar and showed qualitatively the same results.
Overall inter-rater reliability was determined to be
All the subjects were women with a mean age of 33.2 6
almost perfect ($0.81) at both validation sessions for
12.3 years in the Cellulite Dimples—At Rest pop-
the Cellulite Dimples—At Rest scale and substantial
ulation and 34.0 6 13.9 years in the Cellulite Dim-
(0.61–0.80) at both validation sessions for the Cellu-
ples—Dynamic population. Mean BMI values were
lite Dimples—Dynamic scale. For both scales, inter-
23.5 6 4.6 kg/m2 and 23.1 6 4.4 kg/m2, respectively.
rater reliabilities were slightly higher in validation
All Fitzpatrick skin Types (I–VI) were represented, but
session 1 compared with session 2.
the most frequent was Fitzpatrick skin Type III.
Exposure to sunlight “seldom,” “seldom to some-
Intra-rater Reliability
times,” or “sometimes” was reported by 78% and
84% of women, respectively, and 22% and 24%, The ICC and weighted kappa values for intra-rater
respectively, were current smokers. reliability of the 2 cellulite dimple scales are presented
in Table 2. Overall intra-rater reliability was deter-
Expert Characteristics mined to be almost perfect ($0.81) for the Cellulite
Dimples—At Rest scale and substantial (0.61–0.80)
Of the 16 aesthetic experts (9 women and 7 men), 12
for the Cellulite Dimples—Dynamic scale. Intra-rater
were dermatologists, 3 were plastic surgeons, and 1
reliability of individual aesthetic experts for the At
was an ophthalmologist.
Rest and Dynamic scales ranged from 0.69 to 0.93 and
0.57 to 0.89, respectively. For Cellulite Dimples—At
Descriptive Statistics
Rest, intra-rater reliability was $0.70 for all experts.
For the “Cellulite Dimples—At Rest” scale, the grad- For Cellulite Dimples—Dynamic, intra-rater reliabil-
ing of aesthetic experts at validation session 1 covered ity was $0.70 in 87.5% of experts and $0.60 in
all severity scores from Grade 4 “very severe” (12.3% 93.8% of experts. With such large numbers of experts,
of women) to Grade 1 “mild” (34.4% of women); individual reliability comparisons are expected to
15% had no dimples. For validation session 2, grading sometimes vary by chance, but the majority of the
S6 DERMATOLOGIC SURGERY
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
HEXSEL ET AL
TABLE 1. Inter-rater Reliability Estimates by Validation Session for Cellulite Dimple At Rest and Dynamic
Rating Scales
reliability estimates indicated at least substantial high correlation between the 2 cellulite dimple scales
reliability. and the recently released skin laxity scales for the
buttock, thigh, and knee area, which are being pub-
A bubble plot for all experts pooled, illustrating the lished in an accompanying article in this issue27
frequency of rating combinations between the first and (Table 4).
second validation session for the Cellulite Dimples—
At Rest scale, is shown in Figure 2. There were 477 of Discussion
793 ratings with perfect agreement and 24 of 793 The results of this validation study demonstrate that
ratings with a difference of more than 1 grade. The the newly developed Merz Aesthetics cellulite dimple
location of the high-frequency ratings on the diagonal grading scales are a reliable and reproducible scoring
line of the bubble plot demonstrates the high intra- system for aesthetic evaluation of cellulite dimples on
rater reliability. The bubble plot for ratings of “Cel- the buttocks and thighs. The scales provide 5-point
lulite Dimples—Dynamic” shows 425 of 794 ratings photonumeric assessments with photo guides of cel-
with perfect agreement and 16 of 794 ratings with a lulite severity at rest and in a dynamic state.
difference of more than 1 grade (Figure 3).
Validity of the Scales For evaluation scales to be accurate, they must reflect
the target population assessed. The subjects included
Relevant Spearman correlations between the cellulite in this study represented the whole spectrum of cellu-
dimple scale ratings and subject demographic char- lite severity grades and covered a large age range, BMI
acteristics are shown in Table 3. For both scales, levels, as well as all Fitzpatrick skin types. The scales
positive Spearman correlation coefficients were were therefore evaluated across all cellulite severity
observed for BMI, age, and weight, and a negative grades in a heterogeneous population similar to what a
correlation was observed for height. There was also a physician might encounter in clinical practice. To
TABLE 2. Intra-rater Reliability Estimates for the Cellulite Dimples Grading Scales
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
ASSESSMENT SCALES FOR CELLULITE DIMPLES
S8 DERMATOLOGIC SURGERY
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
HEXSEL ET AL
TABLE 3. Correlation of Cellulite Dimple Scale Ratings With Subject Demographic Data by Validation
Session (Spearman Correlation Coefficient With Bias Adjustment and 95% Confidence Interval)
reliability) also showed almost perfect intra-rater treatments that target cellulite dimples. While they can
ICC values ($0.81) for the Cellulite Dimples—At also be used to give an overall impression of cellulite
Rest scale. For the Cellulite Dimples—Dynamic scale, severity, they cannot be generalized to all cellulite-
intra-rater reliability was substantial (0.61–0.80) related deformities. Cellulite can also be influenced by
overall. With such large numbers of experts, indi- skin laxity, particularly in older individuals,11,12 and a
vidual reliability comparisons are expected to some- separate publication in this issue details the develop-
times vary by chance, but the majority of the ment of a new skin laxity scale for the buttock and
reliability estimates indicated at least substantial thigh area that can be used in conjunction with the
reliability. dimple scale when assessing cellulite severity and
deciding on the best treatment options.27 The cellulite
Validity of the cellulite dimple scale scores was also dimple scales differ from other cellulite severity
explored by means of correlations with the scales scales16,17 available in the literature in their specificity
themselves and other variables that might be expected for cellulite dimples and in their simplicity. The
to influence cellulite severity. There was a high corre- Nürnberger and Müller16 classification was developed
lation between the 2 cellulite scales themselves, and in 1978 and has 4 severity grades. It is based on
with a separate scale assessing skin laxity in the but- observations both at rest and in a dynamic state.
tock and thigh region.27 Other factors with a high Hexsel and colleagues17 included the Nürnberger and
correlation with cellulite dimples were BMI, followed Müller classification in the Cellulite Severity Scale,
by age and weight, supporting the concept that while which also comprises the 4 most important clinical
not causal, cellulite may be worsened by aging and features of cellulite (number of evident depressions,
weight gain. Fitzpatrick skin type, sun exposure, and depth of depressions, morphological appearance of
smoking status were found to have no influence on skin surface alterations, and grade of skin laxity). The
cellulite severity. severity of each of the 5 scale items is graded from 0 to
3, allowing a final sum of scores that range numerically
The cellulite dimple scales have been specifically from 1 to 15. Based on the final numeric score, cellulite
developed as a tool to assist physicians offering is classified as mild, moderate, or severe.17 The Hexsel
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
ASSESSMENT SCALES FOR CELLULITE DIMPLES
TABLE 4. Correlation Between Cellulite Dimple and Skin Laxity Scales by Validation Session (Spearman
Correlation Coefficient With Bias Adjustment and 95% Confidence Interval)
for up to 3 years,15,31 the Cellulaze laser-based treat- 6. Hexsel DM, Abreu M, Rodrigues TC, Soirefmann M, et al. Side-by-side
comparison of areas with and without cellulite depressions using
ment for the release of fibrous septae,32 and the manual magnetic resonance imaging. Dermatol Surg 2009;35:1471–7.
subcision for cellulite, which is the basis of the above
7. Hexsel D, Siega C, Schilling-Souza J, Porto MD, et al. A comparative
cited technologies.33,34 study of the anatomy of adipose tissue in areas with and without raised
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
HEXSEL ET AL
lesions of cellulite using magnetic resonance imaging. Dermatol Surg 23. Fleiss JL, Cohen J, Everitt B. Large sample standard errors of kappa and
2013;39:1877–86. weighted kappa. Psychol Bull 1969;72:323.
8. de la Casa Almeida M, Suarez Serrano C, Rebollo Roldán J, Jiménez 24. Fleiss JL, Cohen L. The equivalence of weighted kappa and the
Rejano JJ. Cellulite’s aetiology: a review. J Eur Acad Dermatol Venereol intraclass correlation coefficient as measures of reliablilty. Educ Psychol
2013;27:273–8. Mea 1973;33:613–9.
9. Rossi AB, Vergnanini AL. Cellulite: a review. J Eur Acad Dermatol 25. Landis JR, Koch GG. The measurement of observer agreement for
Venereol 2000;14:251–62. categorical data. Biometrics 1977;33:159–74.
10. Leszko M. Cellulite in menopause. Prz Menopauzalny 2014;13:298–304. 26. Shrout PE. Measurement reliability and agreement in psychiatry. Stat
Methods Med Res 1998;7:301–17.
11. Rosenbaum M, Prieto V, Hellmer J, Boschmann M, et al. An
exploratory investigation of the morphology and biochemistry of 27. Kaminer MS, Casabona G, Sattler G, Bartsch R, et al. Validated
cellulite. Plast Reconstr Surg 1998;101:1934‒9. assessment scales for skin laxity on the posterior thighs, buttocks,
anterior thighs, and knees in female patients. Dermatol Surg 2019:45:
12. Stavroulaki A, Pramantiotis G. Cellulite, smoking and angiotensin- S12–21.
converting enzyme (ACE) gene insertion/deletion polymorphism. J Eur
Acad Dermatol Venereol 2011;25:1116‒7. 28. Donofrio L, Carruthers J, Hardas B, Murphy DK, et al. Development
and validation of a photonumeric scale for evaluation of infraorbital
13. Lorencini M, Camozzato F, Hexsel D. Skin aging and cellulite in women. hollows. Dermatol Surg 2016;42(Suppl 1):S251–8.
In: Farage MA, Miller KW, Maibach HI. editors. Textbook of Aging
Skin. Heidelberg: Springer-Verlag Berlin Heidelberg; 2016; pp. 1–9. 29. Sykes JM, Carruthers A, Hardas B, Murphy DK, et al. Development
and validation of a photonumeric scale for assessment of chin retrusion.
14. Green JB, Cohen JL, Kaufman J, Metelitsa AI, et al. Therapeutic Dermatol Surg 2016;42(Suppl 1):S211–8.
approaches to cellulite. Semin Cutan Med Surg 2015;34:140–3.
30. De La Casa Almeida M, Suarez Serrano C, Jiménez Rejano JJ, Chillón
15. Kaminer MS, Coleman WP III, Weiss RA, Robinson DM, et al. A Martı́nez R, et al. Intra- and inter-observer reliability of the application
multicenter pivotal study to evaluate tissue stabilized-guided subcision of the cellulite severity scale to a Spanish female population. J Eur Acad
using the Cellfina device for the treatment of cellulite with 3-year Dermatol Venereol 2013;27:694–8.
follow-up. Dermatol Surg 2017;43:1240–8.
31. Kaminer MS, Coleman WP III, Weiss RA, Robinson DM, et al.
16. Nürnberger F, Müller G. So-called cellulite: an invented disease. J Multicenter pivotal study of vacuum-assisted precise tissue release for
Dermatol Surg Oncol 1978;4:221–9. the treatment of cellulite. Dermatol Surg 2015;41:336–47.
17. Hexsel DM, Dal’forno T, Hexsel CL. A validated photonumeric 32. DiBernardo BE, Sasaki GH, Katz BE, Hunstad JP, et al. A multicenter
cellulite severity scale. J Eur Acad Dermatol Venereol 2009;23:523–8. study for cellulite treatment using a 1440-nm Nd:YAG wavelength laser
18. Flynn TC, Carruthers A, Carruthers J, Geister TL, et al. Validated with side-firing fiber. Aesthet Surg J 2016;36:335–43.
assessment scales for the upper face. Dermatol Surg 2012;38:309–19. 33. Hexsel DM, Mazzuco R. Subcision: a treatment for cellulite. Int J
19. Geister TL, Bleßmann-Gurk B, Rzany B, Harrington L, et al. Validated Dermatol 2000;39:539–44.
assessment scale for platysmal bands. Dermatol Surg 2013;39:1217–25. 34. Hexsel D, Dal Forno T, Hexsel C, Schilling-Souza J, et al. Magnetic
20. Landau M, Geister TL, Leibou L, Blessmann-Gurk B, et al. Validated resonance imaging of cellulite depressed lesions successfully treated by
assessment scales for décolleté wrinkling and pigmentation. Dermatol subcision. Dermatol Surg 2016;42:693–6.
Surg 2016;42:842–52.
21. Rzany B, Carruthers A, Carruthers J, Flynn TC, et al. Validated composite Address correspondence and reprint requests to: Doris
assessment scales for the global face. Dermatol Surg 2012;38:294–308.
Hexsel, Brazilian Center for Studies in Dermatology, 1592
22. Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater Dom Pedro II Street, Porto Alegre 90550-141, Rio Grande
reliability. Psychol Bull 1979;86:420–8. do Sul, Brazil, or e-mail: doris@hexsel.com.br
© 2019 by the American Society for Dermatologic Surgery, Inc. Published by Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.