Francisco Herrera (SM’15) received his M.Sc. in Mathematics in 1988 and Ph.D. in Mathematics in 1991, both from the University of Granada, Spain. He is currently a Professor in the Department of Computer Science and Artificial Intelligence at the University of Granada. He has been the supervisor of 43 Ph.D. students. He has published more than 400 journal papers that have received more than 62000 citations (Scholar Google, H-index 12i). He is coauthor of the books “Genetic Fuzzy Systems” (World Scientific, 2001) and “Data Preprocessing in Data Mining” (Springer, 2015), “The 2-tuple Linguistic Model. Computing with Words in Decision Making” (Springer, 2015), “Multilabel Classification. Problem analysis, metrics and techniques” (Springer, 2016), “Multiple Instance Learning. Foundations and Algorithms” (Springer, 2016) and “Learning from Imbalanced DataSets” (Springer, 2018).

He currently acts as Editor in Chief of the international journals “Information Fusion” (Elsevier) and “Progress in Artificial Intelligence” (Springer). He acts as editorial member of a dozen of journals.

He received the following honors and awards: ECCAI Fellow 2009, IFSA Fellow 2013, 2010 Spanish National Award on Computer Science ARITMEL to the “Spanish Engineer on Computer Science”, International Cajastur “Mamdani” Prize for Soft Computing (Fourth Edition, 2010), IEEE Transactions on Fuzzy System Outstanding 2008 and 2012 Paper Award (bestowed in 2011 and 2015 respectively), 2011 Lotfi A. Zadeh Prize Best paper Award of the International Fuzzy Systems Association, 2013 AEPIA Award to a scientific career in Artificial Intelligence, and 2014 XV Andalucía Research Prize Maimónides (by the regional government of Andalucía), 2017 Security Forum I+D+I Prize, and 2017 Andalucía Medal (by the regional government of Andalucía), 2018 “Granada: Science and Innovation City”.

He has been selected as a Highly Cited Researcher (in the fields of Computer Science and Engineering, respectively, 2014 to present, Clarivate Analytics).

His current research interests include among others, soft computing (including fuzzy modeling, evolutionary algorithms and deep learning), computing with words, information fusion and decision making, and data science (including data preprocessing, prediction and big data).

Website: decsai.ugr.es/~herrera

Quality Data to drive Deep Learning applications

In the last years, deep learning methods and particularly Convolutional Neural Networks (CNNs) have exhibited excellent accuracies in many image and pattern classification problems, among others. To get quality data is the foundation for good data analytics in general, and it is also very important for getting a good deep learning model from an applied point of view.

Quality data requires a data preprocessing analysis to adapt the raw data to fulfill the input demands of each learning algorithm. Data preprocessing is an essential part of any data mining process.

In contrast to the classical classification models, the high abstraction capacity of CNNs allows them to work on the original high dimensional space, which reduces the need for manually preparing the input. However, a suitable preprocessing is still important to improve the quality of the result. One of the most used preprocessing techniques with CNNs is data augmentation for small image datasets, which increases the volume of the training dataset by applying several transformations to the original input. There are other guided preprocessing procedures for specific problems, like brightness and other images features.

In this talk we present the connection between deep learning and data guided preprocessing approaches throughout all families of methods used to improve the deep learning capabilities, together with some applications in different areas.