The aim of the study was to assess the capabilities of age determination (age group) at death using classification techniques by histomorphometric characteristics of osseous and cartilaginous tissue aging. Materials and Methods. The study material was a database containing the findings of morphometric researches of osseous and cartilaginous tissue histologic specimens from 294 categorized male corpses aged 10–93 years. For data analysis and classification we used modern machine learning methods: k-NN, SVM, logistic regression, CatBoost, SGD, naive Bayes, random forest, nonlinear dimensionality reduction methods (t-SNE and uMAP), and recursive feature elimination for feature selection. Results. The used techniques (algorithms) provided effective representation of a complex data set (76 histomorphometric features), allowing to reveal the cluster structure inside the low dimensional feature space, thus fitting the classifier becomes even more reasonable. During feature selection, we estimated their importance for age group classification and studied the relationship between classification quality and the number of features inside the feature space. Data pre-processing made it possible to get rid of noise and keep most informative features, thereby accelerating a learning process and improving the classification quality. Data projection showed more well-defined cluster structure in the space of selected features. The accuracy of establishing certain groups was equal to 90%. It proves high efficiency of machine learning techniques used for forensic age diagnostics based on histomorphometric findings.
Age Classification in Forensic Medicine Using Machine Learning Techniques
Cameriere R.
2022-01-01
Abstract
The aim of the study was to assess the capabilities of age determination (age group) at death using classification techniques by histomorphometric characteristics of osseous and cartilaginous tissue aging. Materials and Methods. The study material was a database containing the findings of morphometric researches of osseous and cartilaginous tissue histologic specimens from 294 categorized male corpses aged 10–93 years. For data analysis and classification we used modern machine learning methods: k-NN, SVM, logistic regression, CatBoost, SGD, naive Bayes, random forest, nonlinear dimensionality reduction methods (t-SNE and uMAP), and recursive feature elimination for feature selection. Results. The used techniques (algorithms) provided effective representation of a complex data set (76 histomorphometric features), allowing to reveal the cluster structure inside the low dimensional feature space, thus fitting the classifier becomes even more reasonable. During feature selection, we estimated their importance for age group classification and studied the relationship between classification quality and the number of features inside the feature space. Data pre-processing made it possible to get rid of noise and keep most informative features, thereby accelerating a learning process and improving the classification quality. Data projection showed more well-defined cluster structure in the space of selected features. The accuracy of establishing certain groups was equal to 90%. It proves high efficiency of machine learning techniques used for forensic age diagnostics based on histomorphometric findings.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.