Mostrar el registro sencillo del ítem

resumen

Resumen
Predicting the risk of a disease in a pathosystem based on a set of climatic variables usually requires handling a high number of input variables, many of which are often irrelevant and/or redundant. Building linear predictive models entails not only dimensionality issues but also the negative impact of multicollinearity. Several feature selection methods have proved to be efficient in both linear and non-linear models, regardless of those issues. [ver mas...]
dc.contributor.authorSuarez, Franco
dc.contributor.authorBruno, Cecilia
dc.contributor.authorKurina Giannini, Franca
dc.contributor.authorGimenez, Maria
dc.contributor.authorRodriguez Pardina, Patricia
dc.contributor.authorBalzarini, Mónica Graciela
dc.date.accessioned2023-10-23T10:21:16Z
dc.date.available2023-10-23T10:21:16Z
dc.date.issued2023-10-11
dc.identifier.issn1161-0301
dc.identifier.otherhttps://doi.org/10.1016/j.eja.2023.126995
dc.identifier.urihttp://hdl.handle.net/20.500.12123/15634
dc.identifier.urihttps://www.sciencedirect.com/science/article/pii/S1161030123002630
dc.description.abstractPredicting the risk of a disease in a pathosystem based on a set of climatic variables usually requires handling a high number of input variables, many of which are often irrelevant and/or redundant. Building linear predictive models entails not only dimensionality issues but also the negative impact of multicollinearity. Several feature selection methods have proved to be efficient in both linear and non-linear models, regardless of those issues. However, in a machine learning (ML) context, it is necessary to evaluate these feature selection methods embedded into the model fitting algorithm to obtain the greatest accuracy. The aim of this work was to assess different combinations of variable selection methods with linear and non-linear predictors to fit climate-based models that predict the occurrence of a disease in a pathosystem. Four selection methods were compared: stepwise, which is frequently used in linear models, combined with VIF and p-value statistical criteria (Step+VIF+Pv), and other methods commonly used in ML: filter (F), genetic algorithm (GA), and Boruta (B). The disease risk predictors were constructed with a logistic linear regression model (LR) and the random forest (RF) algorithm, using all the available variables and the subgroups of variables selected by each feature selection method. Data from three pathosystems were processed: two involving Begomovirus –one in common bean (Phaseolus vulgaris L) and the other in soybean (Glycine max)– and the third one involving Mal de Rio Cuarto virus in maize (Zea mays L.). The data sets differed in sample size and number of variables. The accuracy of RF prediction did not vary among feature selection methods. Step+VIF+Pv was used to reduce the model outperformed the other feature selection methods in fitting LR. Our proposal suggests that the appropriate pairing of variable selection and prediction models would improve the modeling of plant disease risk.eng
dc.formatapplication/pdfes_AR
dc.language.isoenges_AR
dc.publisherElsevieres_AR
dc.rightsinfo:eu-repo/semantics/restrictedAccesses_AR
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/es_AR
dc.sourceEuropean Journal of Agronomy 151: 126995 (November 2023)es_AR
dc.subjectMulticollinearityeng
dc.subjectMulticolinearidades_AR
dc.subjectPlant Diseaseseng
dc.subjectEnfermedades de las Plantases_AR
dc.subject.otherLogistic Regressioneng
dc.subject.otherRandom Foresteng
dc.subject.otherFeature Selectioneng
dc.subject.otherPrediction Modelseng
dc.subject.otherPathosystemseng
dc.titleMarriage between variable selection and prediction methods to model plant disease riskes_AR
dc.typeinfo:ar-repo/semantics/artículoes_AR
dc.typeinfo:eu-repo/semantics/articlees_AR
dc.typeinfo:eu-repo/semantics/publishedVersiones_AR
dc.rights.licenseCreative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)es_AR
dc.description.origenInstituto de Patología Vegetales_AR
dc.description.filFil: Suarez, Franco. Universidad Nacional de Córdoba. Facultad de Ciencias Agropecuarias. Cátedra de Estadística y Biometría; Argentinaes_AR
dc.description.filFil: Suarez, Franco. Consejo Nacional de Investigaciones Científicas y Técnicas. Unidad de Fitopatología y Modelización Agrícola (UFyMA); Argentinaes_AR
dc.description.filFil: Suarez, Franco. Instituto Nacional de Tecnología Agropecuaria (INTA). Instituto de Patología Vegetal; Argentinaes_AR
dc.description.filFil: Bruno, Cecilia. Universidad Nacional de Córdoba. Facultad de Ciencias Agropecuarias. Cátedra de Estadística y Biometría; Argentinaes_AR
dc.description.filFil: Bruno, Cecilia. Consejo Nacional de Investigaciones Científicas y Técnicas. Unidad de Fitopatología y Modelización Agrícola (UFyMA); Argentinaes_AR
dc.description.filFil: Bruno, Cecilia. Instituto Nacional de Tecnología Agropecuaria (INTA). Instituto de Patología Vegetal; Argentinaes_AR
dc.description.filFil: Kurina Giannini, Franca. Aarhus Universitet. institut for agroøkologi. Jornær sektioner; Dinamarcaes_AR
dc.description.filFil: Gimenez, Maria De La Paz. Instituto Nacional de Tecnología Agropecuaria (INTA). Instituto de Patología Vegetal; Argentinaes_AR
dc.description.filFil: Gimenez, Maria De La Paz. Consejo Nacional de Investigaciones Científicas y Técnicas. Unidad de Fitopatología y Modelización Agrícola (UFyMA); Argentinaes_AR
dc.description.filFil: Rodriguez Pardina, Patricia. Instituto Nacional de Tecnología Agropecuaria (INTA). Instituto de Patología Vegetal; Argentinaes_AR
dc.description.filFil: Rodriguez Pardina, Patricia. Consejo Nacional de Investigaciones Científicas y Técnicas. Unidad de Fitopatología y Modelización Agrícola (UFyMA); Argentinaes_AR
dc.description.filFil: Balzarini, Mónica Graciela. Universidad Nacional de Córdoba. Facultad de Ciencias Agropecuarias. Cátedra de Estadística y Biometría; Argentinaes_AR
dc.description.filFil: Balzarini, Mónica Graciela. Consejo Nacional de Investigaciones Científicas y Técnicas. Unidad de Fitopatología y Modelización Agrícola (UFyMA); Argentinaes_AR
dc.description.filFil: Balzarini, Mónica Graciela. Instituto Nacional de Tecnología Agropecuaria (INTA). Instituto de Patología Vegetal; Argentinaes_AR
dc.subtypecientifico


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

common

Mostrar el registro sencillo del ítem

info:eu-repo/semantics/restrictedAccess
Excepto si se señala otra cosa, la licencia del ítem se describe como info:eu-repo/semantics/restrictedAccess