Systematic review of predictive performance of injury severity scoring tools

Tohira, Hideo; Jacobs, Ian; Mountain, David; Gibson, Nick; Yeo, Allen

doi:10.1186/1757-7241-20-63

Review
Open access
Published: 10 September 2012

Systematic review of predictive performance of injury severity scoring tools

Hideo Tohira¹,
Ian Jacobs¹,
David Mountain¹,
Nick Gibson¹ &
…
Allen Yeo²

Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine volume 20, Article number: 63 (2012) Cite this article

11k Accesses
109 Citations
Metrics details

Abstract

Many injury severity scoring tools have been developed over the past few decades. These tools include the Injury Severity Score (ISS), New ISS (NISS), Trauma and Injury Severity Score (TRISS) and International Classification of Diseases (ICD)-based Injury Severity Score (ICISS). Although many studies have endeavored to determine the ability of these tools to predict the mortality of injured patients, their results have been inconsistent. We conducted a systematic review to summarize the predictive performances of these tools and explore the heterogeneity among studies. We defined a relevant article as any research article that reported the area under the Receiver Operating Characteristic curve as a measure of predictive performance. We conducted an online search using MEDLINE and Embase. We evaluated the quality of each relevant article using a quality assessment questionnaire consisting of 10 questions. The total number of positive answers was reported as the quality score of the study. Meta-analysis was not performed due to the heterogeneity among studies. We identified 64 relevant articles with 157 AUROCs of the tools. The median number of positive answers to the questionnaire was 5, ranging from 2 to 8. Less than half of the relevant studies reported the version of the Abbreviated Injury Scale (AIS) and/or ICD (37.5%). The heterogeneity among the studies could be observed in a broad distribution of crude mortality rates of study data, ranging from 1% to 38%. The NISS was mostly reported to perform better than the ISS when predicting the mortality of blunt trauma patients. The relative performance of the ICSS against the AIS-based tools was inconclusive because of the scarcity of studies. The performance of the ICISS appeared to be unstable because the performance could be altered by the type of formula and survival risk ratios used. In conclusion, high-quality studies were limited. The NISS might perform better in the mortality prediction of blunt injuries than the ISS. Additional studies are required to standardize the derivation of the ICISS and determine the relative performance of the ICISS against the AIS-based tools.

Background

Many scoring systems to assess injury severity have been developed over the past few decades. The need to improve the quality of trauma care has led researchers to develop more accurate tools that allow physicians to predict the outcomes of injured patients. The Abbreviated Injury Scale (AIS) was the first comprehensive injury severity scoring system to describe injuries and to measure injury severity[1]. Because the AIS cannot measure the overall injury severity of a patient with multiple injuries, tools that can measure the overall severity of multiple injuries were developed using the AIS. These tools include the Injury Severity Score (ISS)[2], the New Injury Severity Score (NISS)[3] and the Trauma and Injury Severity Score (TRISS)[4].

AIS codes are not always available for all injured patients because of the limited resources available for maintaining a trauma registry. However, the International Classification of Diseases (ICD) codes are routinely collected in administrative databases, including morbidity or mortality databases. Osler et al. introduced the ICD-based Injury Severity Score (ICISS) to overcome the unavailability of AIS-based tools[5]. It was reported that the ICISS performed as well as the AIS-based tools in predicting trauma patient outcomes[6–17].

Many researchers have studied the predictive performances of injury severity scoring tools. Their results, however, were inconsistent. This disparity may be due to the differences in study populations and the differences between the formulas used to calculate severity. For example, the ICISS employs at least two types of formulas for its calculation: a product of multiple survival risk ratios (SRRs), referred to as the traditional ICISS, and the use of a single worst injury (SWI) risk ratio. In 2003, Kilgo et al. first reported that the SWI performed better than the traditional ICISS[18]. The superiority of the SWI remains uncertain because of conflicting results from another researcher[19].

The predictive performances of the TRISS and ICISS have rarely been compared. The TRISS utilizes a logistic regression model that incorporates the ISS as a predictor; therefore, the TRISS intuitively outperforms the ISS. In contrast to the TRISS, the ICISS is based on a multiplicative model that uses SRRs. Because the TRISS and ICISS are based on different mathematical models, the superiority of one tool over the other remains inconclusive.

Although several traditional narrative reviews have been conducted, these reviews typically focused on how each tool was established and how each score was derived[20–24]. A few reviews have addressed the predictive performances of injury severity scoring tools[21, 23]. However, the methodologies used for selecting studies were unclear, and the interpretation of the results was subjective. It is more appropriate to review studies in a systematic manner to best integrate all of the evidence. Currently, there is no systematic review that evaluates the predictive performances of injury severity scoring tools.

Aims

In this systematic review, we aimed to summarize the ability of the injury severity scoring tools that are currently in use to predict the mortality of injured patients. We also aimed to explore the potential sources of the heterogeneity among studies to better understand comparative studies of injury severity scoring tools.

Methods

Injury severity scoring tools

To investigate predictive performances, we chose the following injury severity scoring tools: the ISS[2], the NISS[3], the TRISS[4] and the ICISS[25]. These tools are hereafter referred to as the “target tools.” These target tools were selected because they were frequently found in injury research articles. We included the TRISS in the target tools to determine the superiority of the TRISS over the ICISS or vice versa.

We subdivided the TRISS and the ICISS further when reporting their predictive performance. We classified the TRISS into two types: the TRISS that used coefficients derived from the Major Trauma Outcome Study (MTOS TRISS) and one that used coefficients derived from non-MTOS populations (non-MTOS TRISS).

We categorized the ICISS into four subgroups based on the type of formula and SRR. There are two types of formula, as described previously. There also are two types of SRR: traditional SRR and independent SRR. An SRR is calculated by dividing the number of survivors with a given ICD code by the total number of patients with the ICD code. Traditional SRRs are calculated using not only single-trauma patients but also multiple-trauma patients, whereas independent SRRs are derived using cases with a single injury only. The independent SRRs are mathematically correct because the traditional SRRs violate the independence assumption of probability. Based on the two types of formula and SRR, we considered the following four subgroups: the traditional ICISS with traditional SRRs, traditional ICISS with independent SRRs, SWI with traditional SRRs and SWI with independent SWI.

Search strategies

We defined a relevant article as any research article that reported an outcome predictive performance of any of the target tools and that was published between 1990 and 2008. We considered mortality to be an outcome for this study. We set the starting year at 1990 because the AIS currently in use was launched in 1990. We excluded articles that investigated specific age cohorts (e.g., elderly populations) and those that were limited to patients with a specific anatomical injury (e.g., head trauma patients). We also excluded studies that used the AIS 85 (or earlier versions) for score calculation.

In this systematic review, we selected the area under the Receiver Operating Characteristic curve (AUROC) as a measure of predictive performance. The AUROC is equivalent to the probability that a randomly selected subject who experienced a given event has a higher predicted risk than a randomly selected person who did not experience the event[26]. Thus, a tool with a large AUROC can accurately select patients with specific injury severities and can, in turn, reduce the selection bias for a missing target cohort. The highest AUROC is 1.0, meaning that the tool can discriminate events and non-events completely. The lowest AUROC is 0.5, meaning that the tool predicts events just by chance.

We conducted an online database search in June 2009 using MEDLINE and Embase with predetermined search words (Additional file1). We set no language restrictions and sought a translation if required. We checked the references of relevant articles. Conference abstracts, letters and unpublished studies were not included.

Finding relevant articles

We carefully examined the titles and abstracts of all of the articles retrieved from the online databases. We read the entire article if the relevance was unclear. After this first screening, we carefully read the complete articles to identify additional relevant articles that fulfilled the predetermined criteria described above.

Information extraction

We extracted information relating to methodologies, study population, injury severity scoring tools and performance scores from relevant articles [see Additional file2, which includes the information that we sought].

Evaluating articles

We evaluated the quality of each relevant article using a quality assessment questionnaire. Because there is no widely used quality assessment tool for this type of systematic review, we developed a questionnaire to meet the needs of our study by referring to a systematic review of diagnostic tools and outcome prediction models[27–30]. Our questionnaire contains ten questions, which include two questions that were only applicable to the TRISS or the ICISS (Additional file3). The total number of positive answers to the eight questions that could be applied to all of the target tools was reported as the overall quality score of the study. We did not sum the weights of each question because there is no consensus on how to do so.

Statistical analysis

We did not conduct a meta-analysis in this review because we could not control the heterogeneity among studies by employing random effect models and performing subgroup analyses.

A protocol did not exist for this systematic review, and this review was not prospectively registered.

Results

Relevant articles

In total, we retrieved 5,608 potential articles from the online database search. After carefully reading all of the titles and abstracts and checking references using relevant journals, we finally identified 61 relevant articles written in English[3, 7–17, 19, 31–78] and three non-English articles[79–81]. The total number of relevant articles was 64, with 157 AUROCs of the target tools (Figure1, Table1).

Table 1 List of included studies and reported AUROCs

Full size table

In these articles, the ISS was the most frequently studied target tool (58%), followed by the TRISS (53%), the ICISS (31%) and the NISS (25%). The MTOS TRISS were more frequently reported than the non-MTOS TRISS (see Additional file2 for the details). Regarding the formulas used in ICISS calculation, 32 out of 39 AUROCs were derived from the traditional ICISS, whereas 7 AUROCs were derived from the SWI. There were 33 and 6 AUROCs of the ICISS using traditional and independent SRRs, respectively (Table1).

Of the 64 relevant studies, 26 studies were conducted in the U.S., and 26 studies used data from a single hospital. Only three studies included data from hospitals in multiple countries (see Additional file2).

Quality assessment

The results of the quality assessment are shown in Table2. The distribution of the number of positive answers was positively skewed, and the median was 5 out of 8, ranging from 2 (4 studies) to 8 (2 studies) (Figure2).

Table 2 Results of quality assessment

Full size table

Most studies described the selection criteria for the study subjects and the demographics of the subjects. In contrast, less than half of the studies reported the following items: the version of AIS and/or ICD used (37.5%); the quality assurance measure for collecting and measuring scores (37.5%); and the precision of the AUROCs (48.4%).

Regarding the two questions that were only relevant to the TRISS and ICISS, the majority of studies reported the origin of the coefficients of the TRISS or SRRs of the ICISS (41 out of 52 studies). The TRISS and ICISS that used newly derived coefficients or SRRs were internally or externally validated in 25 out of 28 studies.

Discussion

We identified 64 relevant articles with 157 AUROCs. The ISS was most frequently reported (48 AUROCs), followed by the TRISS (45 AUROCs), ICISS (40 AUROCs) and NISS (24 AUROCs). We could not pool the AUROCs because of the heterogeneity among the studies.

Study quality

There was a scarcity of high-quality studies that investigated the performance of the target tools. Specifically, the version of the injury code system and any quality assurance measure were poorly described.

Most studies described their selection criteria and reported the demographic data of the study population; however, key information that can influence the predictive performance was underreported. An AUROC can be affected by two types of factors: factors that influence the measurement of injury severity scores and those that affect the outcome[82]. The former include the version of the injury codes, type of formula and derivation of coefficients and/or SRRs; the latter includes the distribution of age, mechanism of injury and inclusion/exclusion of special cohorts (e.g., elderly patients with an isolated hip fracture, dead on hospital arrival). One of these factors (the version of the injury code system) was found to be underreported by the questionnaire. These factors may need to be fully described as much as possible to improve the quality of studies on injury severity scoring tools.

Source of heterogeneity

The sources of the heterogeneity among the relevant studies could be found in the different characteristics of their study populations. For instance, we found a wide range of crude mortality rates of the study populations, ranging from 1.1%[15] to 38%[81]. This wide distribution of the rates might be due to the difference in the type of database used between the ICISS and AIS-based tools. Studies that investigated the ICISS mostly used administrative databases, whereas studies that analyzed AIS-based tools generally used a trauma registry. Because the majority of studies of the ICISS used such a database without considering the mechanism of injury, severity of injury or, sometimes, age groups, the crude mortality rates of these studies were lower than those of the studies of AIS-based tools. Among 19 studies of the ICISS, only two studies reported that their crude mortality rates were more than 10%, whereas the rates of all of the other studies were less than 10%. In contrast, among 45 articles that did not study the ICISS, 22 studies reported more than 10% as the crude mortality rate. These high mortality rates of studies of the AIS-based tools may be explained by the fact that these studies used trauma registries that generally have inclusion and/or exclusion criteria that prevent many minor injuries from being registered.

ISS vs. NISS

We identified 16 studies that reported 24 pairs of AUROCs of the ISS and NISS[3, 7, 9, 10, 14, 15, 37, 39, 51, 53, 58, 66, 72, 74, 78, 79]. Among the 24 pairs of AUROCs, eight pairs demonstrated that the ISS had a greater AUROC than the NISS[9, 15, 39, 58, 74, 78], whereas the other 16 pairs showed greater AUROCs for the NISS than for the ISS. There were seven pairs of AUROCs that were derived using only blunt trauma patients[7, 37, 39, 51, 58, 74, 78]. Among these seven pairs, only one pair had a higher AUROC for the ISS than the NISS[39]. There were four pairs of AUROCs that were derived using penetrating trauma patients[51, 58, 74, 78]. Only one study had a higher AUROC for the NISS than the ISS[51]. Although further studies are required, the NISS might be better at predicting the outcomes of blunt trauma patients than the ISS, and vice versa for penetrating trauma patients. Because the mechanism of injury might affect the predictive performance of the ISS and the NISS, researchers should clearly describe the mechanism of injury of the study population and analyze blunt and penetrating trauma patients separately when investigating the predictive performance.

ICISS vs. AIS-based tools

We could not clearly determine the relative performance of the ICISS against the AIS-based tools because of the scarcity of comparative studies. We identified 11 studies that reported AUROCs of the ICISS and ISS and/or NISS[8–15, 17, 49, 69]. Most of these studies reported greater AUROCs for the ICISS than for the ISS/NISS, with one exception[17]. In contrast, the ICISS was rarely compared with the TRISS. We could find three studies that reported AUROCs of both the ICISS and the TRISS[8, 13, 16]. Among these studies, two studies showed that the TRISS performed better than the ICISS[8, 13], and one study demonstrated the opposite[16]. Based on these results, the ICISS is better at predicting outcomes than the ISS/NISS, but the superiority of the TRISS over the ICISS was inconclusive.

Instability of the ICISS

The ICISS appeared to be unstable in terms of its predictive performance for two reasons: the multiplicity of its formula and SRR and the dependency on the data source for the SRRs. We divided the AUROCs of the ICISS into four subgroups based on the formula and type of SRR. There was only one study that reported AUROCs of all four types of the ICISS[49]. According to this study, the SWI with traditional SRRs performed best (AUROC = 0.764), followed by the SWI with independent SRRs (0.754), the traditional ICISS with traditional SRRs (0.745) and the traditional ICISS with independent SRRs (0.744). Glance et al. supported the superiority of the SWI over the traditional ICISS[47], but Burd et al. reported a greater AUROC for the traditional ICISS than for the SWI[19]. Regarding the type of SRRs, there were three studies that compared the traditional SRRs with independent SRRs[17, 49, 61]. The results were inconclusive; one reported that independent SRRs were better than traditional SRRs, but the other two reported the opposite.

The predictive performance of the ICISS was also dependent on the data sources from which the SRRs were derived. Rutledge et al. reported AUROCs of traditional ICISSs using different sets of SRRs derived from four different databases[69]. One of the four AUROCs was greater than that of the ISS, but the other three AUROCs were the same as or less than that of the ISS. Kim et al. demonstrated another type of difference in the source data of SRRs. These authors showed that the traditional ICISS based on the ICD9CM performed better than the ISS but that the traditional ICISS using the ICD10 performed worse than the ISS. As a whole, the type of data used for SRR derivation appeared to be a crucial factor in determining the predictive performance of the ICISS.

Generalizability

It is difficult to draw broad generalizations from this study because 41% of the studies evaluated were conducted in the U.S., and 41% of the studies contained data from a single hospital (see Additional file2 for the details). In short, the results derived from narrowly recruited study populations cannot be readily applied to other populations. One can increase the generalizability of results with data from multiple hospitals and/or multiple countries. Trauma registries in which multiple countries take part have recently been developed[83, 84]. The use of such registries might constitute an alternative way to increase the generalizability of study results.

Potential biases

We searched relevant articles using two major online databases, MEDLINE and Embase. We set no language restrictions and checked the references of the relevant articles. These processes enabled us to identify as many relevant articles as possible and to reduce dissemination bias. We might have been able to reduce the bias further if we used other databases (e.g., CINAHL), although the effect of adding another database might have been minimal.

Limitations

We only focused on four injury severity scoring systems. We acknowledge that there are other tools, including A Severity Characterization of Trauma (ASCOT)[85], the Anatomic Profile Score (APS) and the modified Anatomic Profile (mAP)[86]. However, because these tools were not widely used when this study was conducted, we excluded these tools from this review.

Future research directions

Future studies might need to focus more on statistical models that incorporate an injury severity scoring tool with a risk adjustment. Such models could potentially yield a higher predictive performance than the tools in this review. Moore et al. reported on the Trauma Risk Adjustment Model (TRAM), which was superior to the TRISS with regard to both discrimination and calibration[63]. Such high performance predictive models play a key role in hospital performance rankings (e.g., the Trauma Quality Improvement Program)[87]. Furthermore, although systematic reviews studying predictive models for brain trauma injury have been conducted[28], a review that focuses on predictive models for general trauma populations, including the TRAM, has not yet been performed. Reviewing the statistical models used to predict the outcomes of injured patients would provide researchers with clues for important predictors and appropriate statistical techniques.

Conclusions

We could not pool reported evidence because of the heterogeneity among the relevant studies. The NISS appeared to be better at predicting the mortality of blunt trauma patients than the ISS. We could not determine the relative performance of the ICISS against the TRISS. The ICISS appeared to be less stable in its predictive performance than the AIS-based tools because of the many variations in its computational method. Additional studies are required to standardize the derivation of the ICISS and determine the relative performance of the ICISS against the AIS-based tools.

References

AMA: Rating the severity of tissue damage. I. The abbreviated scale. JAMA. 1971, 215: 277-280.
Article Google Scholar
Baker SP, O'Neill B, Haddon W, Long WB: The injury severity score: a method for describing patients with multiple injuries and evaluating emergency care. J Trauma. 1974, 14: 187-196. 10.1097/00005373-197403000-00001.
Article CAS PubMed Google Scholar
Osler T, Baker SP, Long W: A modification of the injury severity score that both improves accuracy and simplifies scoring. J Trauma. 1997, 43: 922-925. 10.1097/00005373-199712000-00009.
Article CAS PubMed Google Scholar
Champion HR, Sacco WJ, Hunt TK: Trauma severity scoring to predict mortality. World J Surg. 1983, 7: 4-11. 10.1007/BF01655906.
Article CAS PubMed Google Scholar
Osler T: ICISS: An International Classification of Disease-9 Based Injury Severity Score. J Trauma. 1996, 41: 380-10.1097/00005373-199609000-00002.
Article CAS PubMed Google Scholar
Al-Salamah MA, McDowell I, Stiell IG, Wells GA, Perry J, Al-Sultan M, Nesbitt L: Initial emergency department trauma scores from the OPALS study: the case for the motor score in blunt trauma. Acad Emerg Med. 2004, 11: 834-842. 10.1111/j.1553-2712.2004.tb00764.x.
Article PubMed Google Scholar
Hannan EL, Waller CH, Farrell LS, Cayten CG: A comparison among the abilities of various injury severity measures to predict mortality with and without accompanying physiologic information. J Trauma. 2005, 58: 244-251. 10.1097/01.TA.0000141995.44721.44.
Article PubMed Google Scholar
Kim Y, Jung KY, Kim CY, Kim YI, Shin Y: Validation of the International Classification of Diseases 10th Edition-based Injury Severity Score (ICISS). J Trauma. 2000, 48: 280-285. 10.1097/00005373-200002000-00014.
Article CAS PubMed Google Scholar
Meredith JW, Evans G, Kilgo PD, MacKenzie E, Osler T, McGwin G, Cohn S, Esposito T, Gennarelli T, Hawkins M: A comparison of the abilities of nine scoring algorithms in predicting mortality. J Trauma. 2002, 53: 621-628. 10.1097/00005373-200210000-00001.
Article PubMed Google Scholar
Moore L, Lavoie A, Le Sage N, Bergeron E, Emond M, Abdous B: Consensus or data-derived anatomic injury severity scoring?. J Trauma. 2008, 64: 420-426. 10.1097/01.ta.0000241201.34082.d4.
Article PubMed Google Scholar
Osler T, Rutledge R, Deis J, Bedrick E: ICISS: an international classification of disease-9 based injury severity score. J Trauma. 1996, 41: 380-386. 10.1097/00005373-199609000-00002.
Article CAS PubMed Google Scholar
Osler TM, Cohen M, Rogers FB, Camp L, Rutledge R, Shackford SR: Trauma registry injury coding is superfluous: a comparison of outcome prediction based on trauma registry International Classification of Diseases-Ninth Revision (ICD-9) and hospital information system ICD-9 codes. J Trauma. 1997, 43: 253-256. 10.1097/00005373-199708000-00008.
Article CAS PubMed Google Scholar
Rutledge R, Osler T, Emery S, Kromhout-Schiro S: The end of the Injury Severity Score (ISS) and the Trauma and Injury Severity Score (TRISS): ICISS, an International Classification of Diseases, ninth revision-based prediction tool, outperforms both ISS and TRISS as predictors of trauma patient survival, hospital charges, and hospital length of stay. J Trauma. 1998, 44: 41-49. 10.1097/00005373-199801000-00003.
Article CAS PubMed Google Scholar
Sacco WJ, MacKenzie EJ, Champion HR, Davis EG, Buckman RF: Comparison of alternative methods for assessing injury severity based on anatomic descriptors. J Trauma. 1999, 47: 441-446. 10.1097/00005373-199909000-00001.
Article CAS PubMed Google Scholar
Stephenson SCR, Langley JD, Civil ID: Comparing measures of injury severity for use with large databases. J Trauma. 2002, 53: 326-332. 10.1097/00005373-200208000-00023.
Article PubMed Google Scholar
West TA, Rivara FP, Cummings P, Jurkovich GJ, Maier RV: Harborview assessment for risk of mortality: an improved measure of injury severity on the basis of ICD-9-CM. J Trauma. 2000, 49: 530-540. 10.1097/00005373-200009000-00022.
Article CAS PubMed Google Scholar
Wong SSN, Leung GKK: Injury Severity Score (ISS) vs. ICD-derived Injury Severity Score (ICISS) in a patient population treated in a designated Hong Kong trauma centre. Mcgill J Med. 2008, 11: 9-13.
PubMed Central PubMed Google Scholar
Kilgo PD, Osler TM, Meredith W, Kilgo PD, Osler TM, Meredith W: The worst injury predicts mortality outcome the best: rethinking the role of multiple injuries in trauma outcome scoring. J Trauma. 2003, 55: 599-606. 10.1097/01.TA.0000085721.47738.BD.
Article PubMed Google Scholar
Burd RS, Ouyang M, Madigan D: Bayesian logistic injury severity score: a method for predicting mortality using international classification of disease-9 codes. Acad Emerg Med. 2008, 15: 466-475. 10.1111/j.1553-2712.2008.00105.x.
Article PubMed Google Scholar
Chawda MN, Hildebrand F, Pape HC, Giannoudis PV: Predicting outcome after multiple trauma: which scoring system?. Injury. 2004, 35: 347-358. 10.1016/S0020-1383(03)00140-2.
Article CAS PubMed Google Scholar
Lefering R: Trauma score systems for quality assessment. European Journal of Trauma. 2002, 28: 52-63. 10.1007/s00068-002-0170-y.
Article Google Scholar
Osler T: Injury severity scoring: perspectives in development and future directions. Am J Surg. 1993, 165: 43S-51S. 10.1016/S0002-9610(05)81206-1.
Article CAS PubMed Google Scholar
Osler T, Nelson LS, Bedrick EJ: Injury severity scoring. J Intensive Care Med. 1999, 14: 9-19.
Article Google Scholar
Sacco WJ, Jameson JW, Copes WS, Lawnick MM, Keast SL, Champion HR: Progress toward a new injury severity characterization: severity profiles. Comput Biol Med. 1988, 18: 419-429. 10.1016/0010-4825(88)90059-5.
Article CAS PubMed Google Scholar
Rutledge R, Fakhry S, Baker C, Oller D: Injury severity grading in trauma patients: a simplified technique based upon ICD-9 coding. J Trauma. 1993, 35: 497-506. 10.1097/00005373-199310000-00001. discussion 506–497
Article CAS PubMed Google Scholar
Hanley JA, McNeil BJ: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. 1982, 143: 29-36.
Article CAS PubMed Google Scholar
Whiting P, Rutjes AW, Reitsma JB, Bossuyt PM, Kleijnen J, Whiting P, Rutjes AWS, Reitsma JB, Bossuyt PMM, Kleijnen J: The development of QUADAS: a tool for the quality assessment of studies of diagnostic accuracy included in systematic reviews. BMC Med Res Methodol. 2003, 3: 25-10.1186/1471-2288-3-25.
Article PubMed Central PubMed Google Scholar
Perel P, Edwards P, Wentz R, Roberts I: Systematic review of prognostic models in traumatic brain injury. BMC Med Inf Decis Mak. 2006, 6: 38-10.1186/1472-6947-6-38.
Article Google Scholar
Counsell C, Dennis M: Systematic review of prognostic models in patients with acute stroke. Cerebrovasc Dis. 2001, 12: 159-170. 10.1159/000047699.
Article CAS PubMed Google Scholar
Hayden JA, Cote P, Bombardier C: Evaluation of the quality of prognosis studies in systematic reviews. Ann Intern Med. 2006, 144: 427-437.
Article PubMed Google Scholar
Barbieri S, Michieletto E, Feltracco P, Meroni M, Salvaterra F, Scalone A, Gasparetto M, Pengo G, Cacciani N, Lodi G, Giron GP: Prognostic systems in intensive care: TRISS, SAPS II, APACHE III. Minerva Anestesiol. 2001, 67: 519-538.
CAS PubMed Google Scholar
Becalick DC, Coats TJ: Comparison of artificial intelligence techniques with UKTRISS for estimating probability of survival after trauma. UK Trauma and Injury Severity Score. J Trauma. 2001, 51: 123-133. 10.1097/00005373-200107000-00020.
Article CAS PubMed Google Scholar
Bergeron E, Rossignol M, Osler T, Clas D, Lavoie A, Bergeron E, Rossignol M, Osler T, Clas D, Lavoie A: Improving the TRISS methodology by restructuring age categories and adding comorbidities. J Trauma. 2004, 56: 760-767. 10.1097/01.TA.0000119199.52226.C0.
Article PubMed Google Scholar
Bijlsma TS, Van Der Graaf Y, Leenen LPH, Van Der Werken C: The Hospital Trauma Index: Impact of equal injury severity grades in different organ systems. European Journal of Trauma. 2004, 30: 171-176.
Article Google Scholar
Bouamra O, Wrotchford A, Hollis S, Vail A, Woodford M, Lecky F, Bouamra O, Wrotchford A, Hollis S, Vail A: Outcome prediction in trauma. Injury. 2006, 37: 1092-1097. 10.1016/j.injury.2006.07.029.
Article PubMed Google Scholar
Bouillon B, Lefering R, Vorweg M, Tiling T, Neugebauer E, Troidl H: Trauma score systems: Cologne Validation Study. J Trauma. 1997, 42: 652-658. 10.1097/00005373-199704000-00012.
Article CAS PubMed Google Scholar
Brenneman FD, Boulanger BR, McLellan BA, Redelmeier DA: Measuring injury severity: time for a change?. J Trauma. 1998, 44: 580-582. 10.1097/00005373-199804000-00003.
Article CAS PubMed Google Scholar
Davie G, Cryer C, Langley J: Improving the predictive ability of the ICD-based Injury Severity Score. Inj Prev. 2008, 14: 250-255. 10.1136/ip.2007.017640.
Article CAS PubMed Google Scholar
Dillon B, Wang W, Bouamra O: A comparison study of the injury score models. European Journal of Trauma. 2006, 32: 538-547. 10.1007/s00068-006-5102-9.
Article Google Scholar
DiRusso SM, Sullivan T, Holly C, Cuff SN, Savino J: An artificial neural network as a model for prediction of survival in trauma patients: validation for a regional trauma area. J Trauma. 2000, 49: 212-220. 10.1097/00005373-200008000-00006.
Article CAS PubMed Google Scholar
Eftekhar B, Zarei MR, Ghodsi M, Moezardalan K, Zargar M, Ketabchi E: Comparing logistic models based on modified GCS motor component with other prognostic tools in prediction of mortality: results of study in 7226 trauma patients. Injury. 2005, 36: 900-904. 10.1016/j.injury.2004.12.067.
Article PubMed Google Scholar
Fischler L, Lelais F, Young J, Buchmann B, Pargger H, Kaufmann M: Assessment of three different mortality prediction models in four well-defined critical care patient groups at two points in time: a prospective cohort study. Eur J Anaesthesiol. 2007, 24: 676-683. 10.1017/S026502150700021X.
Article CAS PubMed Google Scholar
Frankema SP, Edwards MJR, Steyerberg EW, Van Vugt AB: Predicting survival after trauma: A comparison of TRISS and ASCOT in the Netherlands. European Journal of Trauma. 2002, 28: 355-364. 10.1007/s00068-002-1217-9.
Article Google Scholar
Frankema SP, Steyerberg EW, Edwards MJR, van Vugt AB: Comparison of current injury scales for survival chance estimation: an evaluation comparing the predictive performance of the ISS, NISS, and AP scores in a Dutch local trauma registration. J Trauma. 2005, 58: 596-604. 10.1097/01.TA.0000152551.39400.6F.
Article PubMed Google Scholar
Gabbe BJ, Cameron PA, Wolfe R, Simpson P, Smith KL, McNeil JJ: Predictors of mortality, length of stay and discharge destination in blunt trauma. ANZ J Surg. 2005, 75: 650-656. 10.1111/j.1445-2197.2005.03484.x.
Article PubMed Google Scholar
Glance LG, Osier TM, Mukamel DB, Meredith W, Dick AW: Expert consensus vs empirical estimation of injury severity effect on quality measurement in trauma. Arch Surg. 2009, 144: 326-332. 10.1001/archsurg.2009.8.
Article PubMed Google Scholar
Glance LG, Osler TM, Mukamel DB, Meredith W, Wagner J, Dick AW: TMPM-ICD9: a trauma mortality prediction model based on ICD-9-CM codes. Ann Surg. 2009, 249: 1032-1039. 10.1097/SLA.0b013e3181a38f28.
Article PubMed Google Scholar
Guzzo JL, Bochicchio GV, Napolitano LM, Malone DL, Meyer W, Scalea TM, Guzzo JL, Bochicchio GV, Napolitano LM, Malone DL: Prediction of outcomes in trauma: anatomic or physiologic parameters?. J Am Coll Surg. 2005, 201: 891-897. 10.1016/j.jamcollsurg.2005.07.013.
Article PubMed Google Scholar
Hannan EL, Farrell LS: Predicting trauma inpatient mortality in an administrative database: an investigation of survival risk ratios using New York data. J Trauma. 2007, 62: 964-968. 10.1097/01.ta.0000215375.07314.bd.
Article PubMed Google Scholar
Hannan EL, Farrell LS, Gorthy SF, Bessey PQ, Cayten CG, Cooper A, Mottley L: Predictors of mortality in adult patients with blunt injuries in New York State: a comparison of the Trauma and Injury Severity Score (TRISS) and the International Classification of Disease, Ninth Revision-based Injury Severity Score (ICISS). J Trauma. 1999, 47: 8-14. 10.1097/00005373-199907000-00003.
Article CAS PubMed Google Scholar
Harwood PJ, Giannoudis PV, Probst C, Van Griensven M, Krettek C, Pape HC, Harwood PJ, Giannoudis PV, Probst C, The Polytrauma Study Group of the German Trauma S: Which AIS based scoring system is the best predictor of outcome in orthopaedic blunt trauma patients?. J Trauma. 2006, 60: 334-340. 10.1097/01.ta.0000197148.86271.13.
Article PubMed Google Scholar
Hunter A, Kennedy L, Henry J, Ferguson I: Application of neural networks and sensitivity analysis to improved prediction of trauma survival. Comput Methods Programs Biomed. 2000, 62: 11-19. 10.1016/S0169-2607(99)00046-2.
Article CAS PubMed Google Scholar
Jamulitrat S, Sangkerd P, Thongpiyapoom S, Na Narong M: A comparison of mortality predictive abilities between NISS and ISS in trauma patients. J Med Assoc Thai. 2001, 84: 1416-1421.
CAS PubMed Google Scholar
Kilgo PD, Meredith JW, Osler TM, Kilgo PD, Meredith JW, Osler TM: Incorporating recent advances to make the TRISS approach universally available. J Trauma. 2006, 60: 1002-1008. 10.1097/01.ta.0000215827.54546.01. discussion 1008–1009
Article PubMed Google Scholar
Kroezen F, Bijlsma TS, Liem MSL, Meeuwis JD, Leenen LPH: Base deficit-based predictive modeling of outcome in trauma patients admitted to intensive care units in Dutch trauma centers. J Trauma. 2007, 63: 908-913. 10.1097/TA.0b013e318151ff22.
Article PubMed Google Scholar
Kuhls DA, Malone DL, McCarter RJ, Napolitano LM, Kuhls DA, Malone DL, McCarter RJ, Napolitano LM: Predictors of mortality in adult trauma patients: the physiologic trauma score is equivalent to the Trauma and Injury Severity Score. J Am Coll Surg. 2002, 194: 695-704. 10.1016/S1072-7515(02)01211-5.
Article PubMed Google Scholar
Lane PL, Doig G, Mikrogianakis A, Charyk ST, Stefanits T: An evaluation of Ontario trauma outcomes and the development of regional norms for Trauma and Injury Severity Score (TRISS) analysis. J Trauma. 1996, 41: 731-734. 10.1097/00005373-199610000-00023.
Article CAS PubMed Google Scholar
Lavoie A, Moore L, LeSage N, Liberman M, Sampalis JS: The New Injury Severity Score: a more accurate predictor of in-hospital mortality than the Injury Severity Score. J Trauma. 2004, 56: 1312-1320. 10.1097/01.TA.0000075342.36072.EF.
Article PubMed Google Scholar
MacLeod JBA, Kobusingye O, Frost C, Lett R, Kirya F, Shulman C: A Comparison of the Kampala Trauma Score (KTS) with the Revised Trauma Score (RTS), Injury Severity Score (ISS) and the TRISS Method in a Ugandan Trauma Registry: Is Equal Performance Achieved with Fewer Resources?. European Journal of Trauma. 2003, 29: 392-398. 10.1007/s00068-003-1277-5.
Article Google Scholar
Meredith JW, Kilgo PD, Osler T: A fresh set of survival risk ratios derived from incidents in the National Trauma Data Bank from which the ICISS may be calculated. J Trauma. 2003, 55: 924-932. 10.1097/01.TA.0000085645.62482.87.
Article PubMed Google Scholar
Meredith JW, Kilgo PD, Osler TM: Independently derived survival risk ratios yield better estimates of survival than traditional survival risk ratios when using the ICISS. J Trauma. 2003, 55: 933-938. 10.1097/01.TA.0000085646.71451.5F.
Article PubMed Google Scholar
Millham FH, LaMorte WW, Millham FH, LaMorte WW: Factors associated with mortality in trauma: re-evaluation of the TRISS method using the National Trauma Data Bank. J Trauma. 2004, 56: 1090-1096. 10.1097/01.TA.0000119689.81910.06.
Article PubMed Google Scholar
Moore L, Lavoie A, Turgeon AF, Abdous B, Le Sage N, Emond M, Liberman M, Bergeron E: The trauma risk adjustment model: a new model for evaluating trauma care. Ann Surg. 2009, 249: 1040-1046. 10.1097/SLA.0b013e3181a6cd97.
Article PubMed Google Scholar
Osler T, Glance L, Buzas JS, Mukamel D, Wagner J, Dick A: A trauma mortality prediction model based on the anatomic injury scale. Ann Surg. 2008, 247: 1041-1048. 10.1097/SLA.0b013e31816ffb3f.
Article PubMed Google Scholar
Rabbani A, Moini M, Rabbani A, Moini M: Application of "Trauma and Injury Severity Score" and "A Severity Characterization of Trauma" score to trauma patients in a setting different from "Major Trauma Outcome Study". Arch Iran Med. 2007, 10: 383-386.
PubMed Google Scholar
Raum MR, Nijsten MW, Vogelzang M, Schuring F, Lefering R, Bouillon B, Rixen D, Neugebauer EA, Ten Duis HJ, Polytrauma Study Group of the German Trauma S: Emergency trauma score: an instrument for early estimation of trauma severity. Crit Care Med. 2009, 37: 1972-1977. 10.1097/CCM.0b013e31819fe96a.
Article PubMed Google Scholar
Reiter A, Mauritz W, Jordan B, Lang T, Polzl A, Pelinka L, Metnitz PGH: Improving risk adjustment in critically ill trauma patients: the TRISS-SAPS Score. J Trauma. 2004, 57: 375-380. 10.1097/01.TA.0000104016.78539.94.
Article PubMed Google Scholar
Rhee KJ, Baxt WG, Mackenzie JR, Willits NH, Burney RE, O'Malley RJ, Reid N, Schwabe D, Storer DL, Weber R: APACHE II scoring in the injured patient. Crit Care Med. 1990, 18: 827-830. 10.1097/00003246-199008000-00006.
Article CAS PubMed Google Scholar
Rutledge R, Hoyt DB, Eastman AB, Sise MJ, Velky T, Canty T, Wachtel T, Osler TM: Comparison of the Injury Severity Score and ICD-9 diagnosis codes as predictors of outcome in injury: analysis of 44,032 patients. J Trauma. 1997, 42: 477-487. 10.1097/00005373-199703000-00016.
Article CAS PubMed Google Scholar
Rutledge R, Osler T: The ICD-9-based illness severity score: a new model that outperforms both DRG and APR-DRG as predictors of survival and resource utilization. J Trauma. 1998, 45: 791-799. 10.1097/00005373-199810000-00032.
Article CAS PubMed Google Scholar
Sammour T, Kahokehr A, Caldwell S, Hill AG: Venous glucose and arterial lactate as biochemical predictors of mortality in clinically severely injured trauma patients–A comparison with ISS and TRISS. Injury. 2009, 40: 104-108. 10.1016/j.injury.2008.07.032.
Article PubMed Google Scholar
Tamim H, Al Hazzouri AZ, Mahfoud Z, Atoui M, El-Chemaly S: The injury severity score or the new injury severity score for predicting mortality, intensive care unit admission and length of hospital stay: experience from a university hospital in a developing country. Injury. 2008, 39: 115-120. 10.1016/j.injury.2007.06.007.
Article PubMed Google Scholar
Suarez-Alvarez JR, Miquel J, Del Rio FJ, Ortega P: Epidemiologic aspects and results of applying the TRISS methodology in a Spanish trauma intensive care unit (TICU). Intensive Care Med. 1995, 21: 729-736. 10.1007/BF01704740.
Article CAS PubMed Google Scholar
Tay S-Y, Sloan EP, Zun L, Zaret P: Comparison of the New Injury Severity Score and the Injury Severity Score. J Trauma. 2004, 56: 162-164. 10.1097/01.TA.0000058311.67607.07.
Article PubMed Google Scholar
Ulvik A, Wentzel-Larsen T, Flaatten H: Trauma patients in the intensive care unit: short- and long-term survival and predictors of 30-day mortality. Acta Anaesthesiol Scand. 2007, 51: 171-177. 10.1111/j.1399-6576.2006.01207.x.
Article CAS PubMed Google Scholar
Vassar MJ, Lewis FR, Chambers JA, Mullins RJ, O'Brien PE, Weigelt JA, Hoang MT, Holcroft JW: Prediction of outcome in intensive care unit trauma patients: a multicenter study of Acute Physiology and Chronic Health Evaluation (APACHE), Trauma and Injury Severity Score (TRISS), and a 24-hour intensive care unit (ICU) point system. J Trauma. 1999, 47: 324-329. 10.1097/00005373-199908000-00017.
Article CAS PubMed Google Scholar
Wong DT, Barrow PM, Gomez M, McGuire GP: A comparison of the Acute Physiology and Chronic Health Evaluation (APACHE) II score and the Trauma-Injury Severity Score (TRISS) for outcome assessment in intensive care unit trauma patients. Crit Care Med. 1996, 24: 1642-1648. 10.1097/00003246-199610000-00007.
Article CAS PubMed Google Scholar
Zhao X-G, Ma Y-F, Zhang M, Gan J-X, Xu S-W, Jiang G-Y: Comparison of the new injury severity score and the injury severity score in multiple trauma patients. Chin J Traumatol. 2008, 11: 368-371.
Article PubMed Google Scholar
Aydin SA, Bulut M, Ozguc H, Ercan I, Turkmen N, Eren B, Esen M: Should the New Injury Severity Score replace the Injury Severity Score in the Trauma and Injury Severity Score?. Turkish Journal of Trauma & Emergency Surgery. 2008, 14: 308-312.
Google Scholar
Bonaventura J, Pestal M, Hajek R: The prognostic scoring of polytraumatized patients and the validity of ISS and TRISS for prediction of survival. Anesteziologie a Neodkladna Pece. 2001, 12: 81-86.
Google Scholar
Chytra I, Kasal E, Pelnar P, Lejcko J, Lavicka P, Machart S, Pradl R, Bilek M: The evaluation of accuracy of prediction of survival probability in trauma patients according to APACHE II, ISS and TRISS. Anesteziologie a Neodkladna Pece. 1999, 10: 36-39.
Google Scholar
Janes H, Pepe MS: Adjusting for covariates in studies of diagnostic, screening, or prognostic markers: an old concept in a new setting. Am J Epidemiol. 2008, 168: 89-97. 10.1093/aje/kwn099.
Article PubMed Google Scholar
National Trauma Registry Consortium (Australia & New Zealand): The National Trauma Registry (Australia & New Zealand) Report: 2005. Book The National Trauma Registry (Australia & New Zealand) Report: 2005. 2008
Google Scholar
Edwards A, Di Bartolomeo S, Chieregato A, Coats T, Della Corte F, Giannoudis P, Gomes E, Groenborg H, Lefering R, Leppaniemi A: A comparison of European Trauma Registries. The first report from the EuroTARN Group. Resuscitation. 2007, 75: 286-297.
Article PubMed Google Scholar
Champion HR, Copes WS, Sacco WJ, Lawnick MM, Bain LW, Gann DS, Gennarelli TA, Mackenzie E, Schwaitzberg S: A new characterization of injury severity. J Trauma. 1990, 30: 539-545. 10.1097/00005373-199005000-00003.
Article CAS PubMed Google Scholar
Copes WS, Champion HR, Sacco WJ, Lawnick MM, Gann DS, Gennarelli T, MacKenzie E, Schwaitzberg S: Progress in characterizing anatomic injury. J Trauma. 1990, 30: 1200-1207. 10.1097/00005373-199010000-00003.
Article CAS PubMed Google Scholar
Nathens AB, Xiong W, Shafi S: Ranking of trauma center performance: the bare essentials. J Trauma. 2008, 65: 628-635. 10.1097/TA.0b013e3181837994.
Article PubMed Google Scholar

Download references

Author information

Authors and Affiliations

School of Primary, Aboriginal and Rural Health Care, The University of Western Australia, M516 The University of Western Australia, 35 Stirling Highway, Crawley, WA, 6009, Australia
Hideo Tohira, Ian Jacobs, David Mountain & Nick Gibson
Department of Surgery, Sir Charles Gairdner Hospital, Hospital Avenue, Nedlands, WA, 6009, Australia
Allen Yeo

Authors

Hideo Tohira
View author publications
You can also search for this author in PubMed Google Scholar
Ian Jacobs
View author publications
You can also search for this author in PubMed Google Scholar
David Mountain
View author publications
You can also search for this author in PubMed Google Scholar
Nick Gibson
View author publications
You can also search for this author in PubMed Google Scholar
Allen Yeo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hideo Tohira.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

HT and IJ designed the study and directed its implementation. DM, NG and AY helped to design the statistical strategy of the study. IJ helped to extract data and assess the quality of the studies. HT analyzed the data and wrote the paper. All authors read and approved the final manuscript.

Electronic supplementary material

Additional file 1: MEDLINE and Embase search strategy using OvidSP version 3. (PPT 148 KB)

Additional file 2: Study characteristics of the relevant studies. Details of study characteristics. (DOC 168 KB)

Additional file 3: Quality assessment questionnaire of injury severity scoring tools. (PPT 252 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Tohira, H., Jacobs, I., Mountain, D. et al. Systematic review of predictive performance of injury severity scoring tools. Scand J Trauma Resusc Emerg Med 20, 63 (2012). https://doi.org/10.1186/1757-7241-20-63

Download citation

Received: 26 March 2012
Accepted: 28 August 2012
Published: 10 September 2012
DOI: https://doi.org/10.1186/1757-7241-20-63

Systematic review of predictive performance of injury severity scoring tools

Abstract

Background

Aims

Methods

Injury severity scoring tools

Search strategies

Finding relevant articles

Information extraction

Evaluating articles

Statistical analysis

Results

Relevant articles

Quality assessment

Discussion

Study quality

Source of heterogeneity

ISS vs. NISS

ICISS vs. AIS-based tools

Instability of the ICISS

Generalizability

Potential biases

Limitations

Future research directions

Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Electronic supplementary material

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Scandinavian Journal of Trauma, Resuscitation and Emergency Medicine

Contact us