A multi-stage method to predict carbon dioxide emissions using dimensionality reduction, clustering, and machine learning techniques