TY - JOUR
T1 - Clustering Analysis of a Colombian Toxicological Database
AU - Isaza, Juan Pablo
AU - Muñoz, Felipe
AU - Avila, Alba
N1 - Funding Information:
The authors acknowledge the support from the Administrative Department of Science, Technology and Innovation, Colciencias, Scholarship 510 program “Joven Investigador.” The authors also acknowledge the help provided by the Consejo Colom-biano de Seguridad (CCS) and especially Diana Gil who gave us access to the TED databases and openly answered all questions. The authors also appreciate the comments from Andrea Parra.
PY - 2014/7
Y1 - 2014/7
N2 - The database analysis allows the return of experience needed to support decision-making processes in risk management. A Colombian toxicological database (TED) developed and maintained by the Center of Safety Information and Chemical Products (CISPROQUIM by its Spanish abbreviation in) is analyzed here using a demographic clustering technique. Data-quality processes were performed on the raw data (more than 170 variables) and as a result the database was reduced to 20 meaningful variables. The variables characterized by values with categories were selected for clustering analysis: gender, age, type of emergency, emergency location, means of poisoning, product use, and physical state of the toxic substance. Clustering analysis showed that there are three profiles that are prevalent in the TED database: Young Adult Suicidal Woman, Unsupervised Child, and Man at Work. These profiles could not be identified using traditional statistical analyses performed on the data collected by CISPROQUIM or defined a priori from the categorical variables. The identification of vulnerable populations and the cause of the toxicological events are critical in order to develop national prevention programs and policies. The analysis described provided a methodology for a critical analysis of toxicological databases that can be applied to other databases such as security databases.
AB - The database analysis allows the return of experience needed to support decision-making processes in risk management. A Colombian toxicological database (TED) developed and maintained by the Center of Safety Information and Chemical Products (CISPROQUIM by its Spanish abbreviation in) is analyzed here using a demographic clustering technique. Data-quality processes were performed on the raw data (more than 170 variables) and as a result the database was reduced to 20 meaningful variables. The variables characterized by values with categories were selected for clustering analysis: gender, age, type of emergency, emergency location, means of poisoning, product use, and physical state of the toxic substance. Clustering analysis showed that there are three profiles that are prevalent in the TED database: Young Adult Suicidal Woman, Unsupervised Child, and Man at Work. These profiles could not be identified using traditional statistical analyses performed on the data collected by CISPROQUIM or defined a priori from the categorical variables. The identification of vulnerable populations and the cause of the toxicological events are critical in order to develop national prevention programs and policies. The analysis described provided a methodology for a critical analysis of toxicological databases that can be applied to other databases such as security databases.
KW - clustering
KW - Colombia
KW - data mining
KW - database
KW - toxicological events
UR - http://www.scopus.com/inward/record.url?scp=84893900565&partnerID=8YFLogxK
U2 - 10.1080/10807039.2012.740653
DO - 10.1080/10807039.2012.740653
M3 - Artículo
AN - SCOPUS:84893900565
VL - 20
SP - 1058
EP - 1076
JO - Human and Ecological Risk Assessment
JF - Human and Ecological Risk Assessment
SN - 1080-7039
IS - 4
ER -