Acupuncture Treatment of Adhesive Capsulitis of the Shoulder: A Randomized Controlled Pilot Trial
Article information
Abstract
Background
Adhesive capsulitis (AC) is a common condition that includes shoulder pain and limited movement. Despite more than 100 years of AC treatment, the most efficacious treatment remains unclear. The aim of this study was to evaluate the feasibility of a randomized controlled trial (RCT) using acupuncture for AC.
Methods
Thirty participants with AC were randomly assigned to acupuncture (A) or sham acupuncture (SA) groups. The participants received 15 acupuncture sessions over 6 weeks, and follow-up occurred for an additional 4 weeks thereafter. The primary clinical outcome was the numeric rating scale (NRS) for shoulder pain 6 weeks from the baseline. Secondary outcomes included range of motion (ROM) in the shoulder, the shoulder pain and disability index (SPADI), the EuroQol-5 dimensions (EQ-5D), the Pittsburgh sleep quality index (PSQI), and the patient global impression of change (PGIC).
Results
Thirty participants were enrolled out of 37 screened individuals. Recruitment was conducted between August 2014 until May 2015. A total of 28 participants (93%) completed the 6-week intervention, and 26 participants (87%) completed the study. NRS, ROM, SPADI, EQ-5D, PSQI, and PGIC scores improved in both the experimental group and the sham group after 6 weeks, but the difference between the groups was not statistically significant. Adverse events were reported by 12 participants, although these events were not associated with acupuncture.
Conclusions
A future RCT for AC may be feasible with some modifications to the recruitment plan and the secondary outcome measurement methods.
Introduction
Adhesive capsulitis (AC) is a common condition that causes shoulder pain and limited movement [1], and occurs in 3% to 5% of the general population [2]. Despite more than 100 years of AC treatment, the definition, diagnosis, pathology, and treatment options remain unclear [3]. The goals of AC treatment include relieving pain, restoring movement, and regaining shoulder function [1]. Western medical treatments for AC include physiotherapy, hydrodilatation, manipulation under anesthesia, arthroscopic capsular release, and steroid administration [3]. In contrast, Eastern medical treatments for AC include techniques such as acupuncture, herbal medication, moxibustion, and cupping [4].
Acupuncture is frequently used to treat various shoulder problems [5]. Furthermore, a previous review indicated that acupuncture may be helpful for treating shoulder pain however, this study also suggested that further studies are required before it can be recommended as a treatment for AC [6]. Another review of randomized controlled trials (RCTs) of acupuncture for shoulder pain found that 11 out of 16 RCTs demonstrated significant differences between the acupuncture and control (physiotherapy, surface needling, etc.) groups. Moreover, 3 out of 16 RCTs used non-penetrating sham controls, and AC was not the target condition in 1 of 3 RCTs [7]. Hence, the efficacy of acupuncture on AC was not accurately assessed.
Considering this, a randomized controlled pilot trial was designed to assess the feasibility of a RCT using acupuncture for AC.
Materials and Methods
Trial design
This study was a 2-arm, randomized, controlled, and parallel-designed pilot trial. The study period was 10 weeks, which included a 6-week intervention procedure and a 4-week follow-up examination. The trial design is shown as a flow diagram in Fig. 1.
Participants
Participants were recruited from Daejeon Oriental Hospital of Daejeon University, Daejeon, Korea. When a participant contacted the clinical trial center by telephone or by e-mail through the recruitment advertisement, the clinical research coordinator scheduled the participant’s visit to the clinical trial center. When the participant visited, the clinical research coordinator gave the participant a screening number. The researcher then provided the participant with sufficient information about this study and received written consent. After that, the participant was evaluated for eligibility criteria of the study.
Inclusion criteria
Inclusion criteria were as follows: (1) men and women aged 19 to 65 years, (2) patients who had experienced chronic shoulder pain in either 1 or both shoulders for a period of 3 to 12 months, (3) patients who experienced shoulder pain and had weekly numeric rating scale (NRS) scores of ≥ 4 during a screening examination, and (4) providing written informed consent.
Exclusion criteria
Exclusion criteria were as follows: (1) diagnosis of cervical radiculopathy or treatment for cervical radiculopathy within the previous 3 months, (2) injured shoulder or shoulder surgery, (3) diagnosis of another pathological condition by clinical or radiological shoulder examination, such as rotator cuff injury, shoulder impingement syndrome, rheumatoid arthritis, or osteoarthritis, (4) use of adrenal cortex hormone or steroids for another medical condition, (5) the patient received another treatment for shoulder pain in the 2 weeks prior to screening, such as medications, various Eastern medicine treatments, injections, or physical therapy, (6) hypersensitivity to acupuncture, (7) the patient was pregnant, breastfeeding, or planning to become pregnant during the duration of the trial period, (8) psychoneurotic disorders including epilepsy, depression, or panic disorder, (9) surgery or other treatments for shoulder pain during the study, and (10) the investigator considered the individual unsuitable for the clinical trial (e.g. considered to be unable to cooperate in the research process such as questionnaire response due to difficult communication).
Ethical considerations
This study was approved by the institutional review board of the Daejeon Oriental Hospital of Daejeon University (djomc-120). The study was registered at the Clinical Research Information Service (KCT0001471). All participants provided written informed consent. The information included the purpose of the study, participation period, number of participants, study procedure, benefits from participation, anticipated harms, withdrawal of consent to participation, rules to follow during the study period, compensation and treatment in case of damage related to research, confidentiality of personal information, reasons for participation being discontinued during the study, and the name and contact information of the researcher. All care was taken to protect the privacy of participants.
Sample size
The sample size was determined by the recommended minimum number per group for pilot studies (12 participants) [8]. A predicted 20% dropout rate necessitated 15 participants per group (30 participants for the study).
Randomization
Participants were randomly assigned to acupuncture (A) or sham acupuncture (SA) groups, with equal numbers in each group. An independent statistician generated the randomization sequence using a computer program (Strategic Applications Software (SAS), version 9.1.3; SAS Institute Inc., Cary, NC, USA). Blocked randomization with randomized block size was applied.
Allocation concealment
Each generated randomization sequence was sealed in sequentially numbered, opaque envelopes. The researcher opened the envelopes in sequential order as each participant met all eligibility criteria. Opened envelopes were stored in a double-locked cabinet.
Blinding
Participants and outcome assessors remained blind to group assignments until study completion.
Interventions
Participants in both groups received 15 intervention sessions for 6 weeks. In the A group, treatment points were LI10 (Susamni), LI15 (Gyeonu), LI16 (Geogol), LU2 (Unmun), LU5 (Cheoktaek), SI9 (Gyeonjeong), SI11 (Cheonjong), TE14 (Gyeollyo), TE15 (Cheollyo), and SP19 (Hyunghyang) on the affected side. The acupoints were chosen based on the theory of meridians in traditional Oriental medicine [7,9,10]. After inducing a de qi sensation, the needles were retained for 20 minutes [7]. In the SA group, non-penetrating Park sham placebo devices were used [11]. The treatment points were LI16, SI9, SI11, TE14, TE15, BL12 (Pungmun), BL13 (Pyesu), BL14 (Gworeumsu), BL15 (Simsu), and BL16 (Doksu) on the affected side [7,9,10]. To maintain blinding, acupuncture points in the back (BL12, BL13, BL14, BL15, and BL16) were used instead of LI10, LI15, LU2, LU5, and SP19, which were visible to participants during the procedure. The devices remained in position for 20 minutes without inducing a de qi sensation [7]. During the study period, medications and Oriental medicine treatments for AC were not allowed, but self-initiated exercise and the usual management for AC were allowed. Detailed information on interventions are shown in Appendix 1.
Outcomes
NRS
The primary clinical outcome was the NRS score for shoulder pain at Week 7. The NRS consisted of a horizontal straight line with a numerical scale ranging from 0 to 10. The far-left side of the scale (0) corresponded to “no pain,” while “worst pain ever experienced” (10) was represented at the far-right end of the scale [12]. Participants were asked to select a number that corresponded to their average shoulder pain over the past week. The NRS scores measured at weeks 4 and 11 were secondary outcomes.
Shoulder range of motion
A digital goniometer (iGaging, San Clemente, CA, USA) was used to assess range of motion (ROM) in the shoulder joint, using flexion, abduction, internal rotation, and external rotation movements. To reduce measurement errors, prior to the start of the study, the ROM measurers were trained via a one-day course. The measurements were always performed in the same order (flexion, abduction, internal rotation, and external rotation). The ROM was measured at baseline, Week 4, Week 7, and Week 11.
Shoulder pain and disability index
The shoulder pain and disability index (SPADI) comprised the pain (5 items) and disability (8 items) subscales. Each item was scored on an 11-point scale. The SPADI total score was calculated by averaging the 2 subscale scores [13]. The SPADI was measured at baseline, Week 7, and Week 11.
EuroQol-5 dimensions
Quality of life was measured using the EuroQol-5 dimensions (EQ-5D). The EQ-5D descriptive system was a standardized tool for measuring health-related quality of life and consists of 5 dimensions (mobility, self-care, usual activities, pain/discomfort, and anxiety/depression) [14]. The EQ-5D was measured at baseline, Week 7, and Week 11.
Pittsburgh sleep quality index
The Pittsburgh sleep quality index (PSQI) assessed sleep quality and identified sleep disturbances [15]. Seven items, including subjective sleep quality, sleep latency, sleep time, habitual sleep efficiency, sleep disturbance, sleeping pills, and daytime dysfunction, were measured at baseline, Week 7, and Week 11.
Patient global impression of change
The patient global impression of change (PGIC) assessed the degree of improvement experienced by participants following treatment. Participants were asked how much their shoulder pain improved compared to before the study and were asked to choose 1 of the following 7 answers: “very much improved,” “much improved,” “minimally improved,” “no change,” “minimally worse,” “much worse,” or “very much worse” [16]. The PGIC was measured at Weeks 7 and 11.
Blind test
Participants were asked to guess which treatment they received, by choosing 1 of the following 3 answers: “classical acupuncture,” “non-classical acupuncture,” or “don’t know.” The blind test was conducted after the first and last study sessions.
Harms
All adverse events (AEs) were recorded by practitioners, using an AE report form. The following data were recorded: the date of occurrence and disappearance, severity and outcomes of AEs, measures taken in relation to the study and their relationships with the treatment, medicines or treatments other than the study treatment, and details of any treatment for AEs. The severity of AEs was classified into 3 grades (1 = mild, 2 = moderate, and 3 = severe). The relationship between acupuncture and AEs was classified into 6 categories (1 = definitely related, 2 = probably related, 3 = possibly related, 4 = probably not related, 5 = definitely not related, and 6 = unknown).
Statistical analyses
Between-group comparisons were analyzed using analysis of covariance (ANCOVA) with the evaluation values of each outcome as a dependent variable, baseline value as a covariate, and group as a fixed factor. The analysis was performed with an α-value of 0.05 and power of 0.80. Additionally, a repeated measures analysis of variance (RM ANOVA) was performed to assess a potential interaction between timepoint and group. Other outcomes (ROM, SPADI, EQ-5D, and PSQI) were similarly analyzed. PGIC was analyzed using analysis of variance (ANOVA), since PGIC had no baseline value and was not included in the RM ANOVA. The blind test was analyzed using a Fisher’s exact test. Within-group comparisons were analyzed with either paired t test or Wilcoxon signed-rank test. For sociodemographic characteristics, numeric variables were presented as means and standard deviations, while categorical variables were presented as frequencies and percentages. Analyses were conducted on both a full analysis set (FAS) and a per-protocol set (PPS). A FAS was used to describe the set that was as complete as possible and as close as possible to the intention-to-treat (ITT) ideal of including all randomized participants. The PPS included participants who were most compliant with the study protocol [17]. The main analyses were conducted on the FAS. Missing data were imputed by multiple imputation. Statistical analyses were performed using the SAS program (version 9.4; SAS Institute Inc., Cary, NC, USA).
Results
Recruitment and feasibility
Recruitment was conducted from August 2014 to May 2015. Thirty participants were selected from 37 screened individuals (Fig. 1). In the A group, 218 acupuncture sessions (97%) were administered during the study period. In the SA group, 211 acupuncture sessions (94%) were administered during the study period. A total of 28 participants (93%) completed the 6-week intervention. A total of 4 participants (13%) dropped out or were lost to follow-up (Fig. 1). The reasons for the drop-out were withdrawal of consent (n = 2: 1 in the A group, due to work shift; the other in the SA group, not enough time) and the use of prohibited concomitant treatments on the shoulder during the study period (n = 2: 1 in the A group, injection therapy; the other in the SA group, acupuncture and moxibustion treatment).
Baseline characteristics
The sociodemographic and clinical characteristics of the participants are given in Table 1. No significant differences were observed between the groups regarding gender, age, body mass index, exercise, alcohol consumption, and medical treatments for AC.
Clinical outcomes
No significant differences occurred between the 2 groups for NRS, ROM (Weeks 4, 7, and 11), EQ-5D, SPADI, or PSQI (Weeks 7 and 11) scores. In the A group, significant improvements occurred on the NRS (Weeks 4, 7, and 11), EQ-5D (Week 11), SPADI (pain, disability, and total, at both Weeks 7 and 11), and PSQI (Week 11) compared to baseline. In the SA group, significant improvements occurred on the NRS (Weeks 4, 7, and 11), ROM (flexion at Week 11; abduction at Weeks 4, 7, and 11; and external rotation at Week 7), EQ-5D (Week 11), SPADI (pain and total, at Weeks 7 and 11; disability at Week 7), and PSQI (Week 11) compared to baseline (Tables 2–5).
There was no interaction between timepoint and group on the NRS (p = 0.4106), ROM (flexion, p = 0.3343; internal rotation, p = 0.4634; external rotation p = 0.5667), SPADI (pain, p = 0.5168; disability, p = 0.7545; total, p = 0.9456), EQ-5D (p = 0.4826), or PSQI (p = 0.9926). The only significant difference was for ROM of shoulder abduction (p = 0.0308). No significant differences occurred between the 2 groups for PGIC scores (Week 7, p = 0.8674; Week 11, p = 0.8702). At Week 7, 12 of 14 participants in the A group (86%) answered that they improved (minimally improved, n = 4; much improved, n = 5; very much improved, n = 3); 11 of 14 participants in the SA group (79%) reported that they improved (minimally improved, n = 2; much improved, n = 6; very much improved, n = 3). At Week 11, 11 of 13 participants in the A group (85%) answered that they improved (minimally improved, n = 3; much improved, n = 5; very much improved, n = 3); 11 of 13 participants in the SA group (85%) reported that they improved (minimally improved, n = 2; much improved, n = 6; and very much improved, n = 3).
In the blind test, 6 of 15 participants in the A group (40%) guessed that they belonged to the A group, 1 participant (7%) guessed that he belonged to the SA group, and 8 participants answered “don’t know” (53%) after the first intervention. In the SA group, 7 of 15 participants (47%) guessed that they belonged to the A group, 4 participants (27%) guessed that they belonged to the SA group, and 4 participants answered “don’t know” (27%) after the first intervention. After the last intervention, 9 of 14 participants in the A group (64%) guessed that they belonged to the A group, 1 participant (7%) guessed that they belonged to the SA group, and 4 participants answered “don’t know” (29%). In the SA group, 9 of 14 participants (64%) guessed that they belonged to the A group, 2 participants (14%) guessed that they belonged to the SA group, and 3 participants answered “don’t know” (21%) after the last intervention. No significant differences occurred between the 2 groups on the blind test (after first session, p = 0.2425; after last session, p = 0.9999).
Discussion
The feasibility of a RCT was evaluated using acupuncture for AC treatment. Research procedures (e.g., screening, randomization, intervention, and questionnaire responses) worked well overall. More than 90% of the participants completed the 6-week intervention. AEs occurred for 12 participants (40%) (3 in the A group and 9 in the SA group), but these events were not associated with the study treatments. Recruiting began in mid-August 2014 and ended at the end of May 2015, making the average recruitment rate approximately 3 patients per month. In future studies, improvements in the recruitment rate would be desirable.
Among the clinical outcomes, SPADI, ROM, EQ-5D, and PSQI were chosen considering the most frequently addressed concepts using the international classification of functioning, disability and health (ICF) in measures of shoulder pain [18]. On the other hand, NRS and PGIC were chosen considering 2 of the core chronic pain outcome domains (pain intensity and participant ratings of overall improvement) recommended by the initiative on methods, measurement, and pain assessment in clinical trials (IMMPACT) [19].
In a previously reported meta-analysis of sham-controlled randomized clinical trials, 5 trials with a total of 495 participants compared mean shoulder pain scores between the A and SA groups. The standardized mean difference was -0.63 (range = -0.91 to -0.36, p < 0.001), indicating that there was a moderate effect of the A group compared to the SA group [20]. The effects of the A group may have been masked by differences in design (such as target disease, intervention, and control device) between studies included in the meta-analysis and the present study. Insufficient power to detect differences in efficacy between groups would also explain discrepancies in the results. In addition, improvement responders were considered (not initially planned) at the following 2 levels: 30% responder (≥ 30% reduction from baseline NRS) and 50% responder (≥ 50% reduction from baseline NRS). Reporting outcomes as the percentage of patients responding or achieving a low pain state is preferable to the average changes in pain intensity because the population distribution of pain intensity tends not to be bell-shaped [21]. In the present study, the 30% responder after 6 weeks of intervention was 10 in each of the A and SA groups (66.7%). The 50% responder after 6 weeks of intervention was 9 in each of the A and SA groups (60%). In future studies, it may be necessary to reexamine the results obtained by including responder outcomes in the evaluation variables.
For the within-group comparison, the change from baseline NRS scores was -2.57 (an improvement of 42.8%) in the A group, and -2.86 (improvement of 47.7%) in the SA group after 6 weeks. A RCT for chronic shoulder pain conducted in 2007, found changes from baseline of -2.5 (improvement of 50%) in the A group, and -0.7 (improvement of 16.3%) in the SA group after 10 treatments over 5 weeks [22]. A reduction from baseline of 2 points (approximately 30%) in pain intensity on the NRS represents a clinically important difference [16]. In the present study, change from baseline in SPADI total scores was -21.35 (improvement of 40.8%) in the A group, and -17.67 (improvement of 35.7%) in the SA group after 6 weeks. A RCT for chronic shoulder pain conducted in 2009 found that the change from baseline SPADI total scores was -20.4 in the A group, and -6.5 in the SA group after 12 treatments over 6 weeks (baseline SPADI values not presented) [23]. When a patient is tested and then retested, a change of 18 points represents the minimum difference in the SPADI total score required to state (with 95% confidence), that a real change is responsible for the difference, rather than measurement error (i.e., 18 is the minimum score indicating change) [24].
In the present study, changes in the NRS and SPADI scores of the A group demonstrated a clinically significant improvement similar to previously reported effects; however, the SA group scores indicated more improvement than those reported previously. In a validity study of SPADI conducted in 2012, SPADI had a significant correlation with NRS (correlation coefficient = 0.946) and ROM (correlation coefficient = -0.927) at p < 0.01 level [13]. In this study, NRS and SPADI scores before and after intervention tended to change in the same direction in both groups. However, the results of ROM measurement in the A group showed no significant change after the intervention, which was not consistent with the result of the previous study [13]. The ROM measurers and the acupuncture practitioners each received 1-day study-related education before the study initiation, but while the practitioners were already very familiar with acupuncture treatment, the ROM measurers were the first to measure the ROM, so the measurement error between the ROM measurers might be relatively larger than the intervention error between the practitioners. This might have affected the ROM measurement results of this study, leading to differences compared with those from the previous study [13]. It was reported that patients with AC had significant incidence of sleep disturbance and had a significantly lower quality of life [25]. In this study, PSQI and EQ-5D tended to change in the opposite direction before and after intervention in accordance with the result of the previous study [25].
To evaluate the success of blinding, index values were calculated (not initially planned) using the blinding index (BI) that was reported in 2004 [26]. After the first intervention, the BI value of the A group was 0.33 (95% confidence interval, 0.03 to 0.64), and the BI value of the SA group was -0.20 (-0.62 to 0.22). After the last intervention, the BI value of the A group was 0.57 (0.25 to 0.90) and the value of the SA group was -0.50 (-0.88 to -0.12). It was reported that “random guess” and “wishful thinking” (e.g., participants tend to believe that they received treatment) are the most ideal blinding scenarios for incurring minimal bias [27]. Using a 0.2 BI threshold value [28], the results of the BI calculation in this study after the last intervention were close to “wishful thinking.” This may partly explain the unexpected excellent improvements for the SA group.
This study has the following strengths and weaknesses. In terms of strengths, this study achieved high compliance with the intervention and confirmed the safety of acupuncture for AC. However, in terms of limitations, SA appears to have some physiological effects and may not be a perfect placebo. In a systematic review of 61 clinical trials comparing the efficacy of SA with other placebos, it was suggested that SA interventions might be associated with greater efficacy than pharmacological or other physical placebos [29]. Second, some unmeasured factors, such as self-initiated exercise and expectation of the treatment, may have affected the outcomes. Self-initiated exercise for AC was not prohibited during the study period, and data was not collected on whether the exercise was performed or how much participants in each group may have exercised. Participants’ expectations of acupuncture were also not investigated. In future studies, these factors should be measured, and sensitivity analyses that take these factors into account should be performed.
Conclusions
The results from this study suggest that a future RCT for AC may be possible, with some modifications to the recruitment plan and the secondary outcome measurement methods.
Notes
The authors have no conflicts of interest to declare.
Acknowledgements
This study was supported by the Korea Institute of Oriental Medicine (K14011, K17122, and K18121). The funder had no role in the design of the study; the collection, analysis, and interpretation of data; the composition of the manuscript; or the decision to submit the manuscript for publication.