Show simple item record

dc.rights.license: https://creativecommons.org/licenses/by-sa/2.5/ar/ [es_AR]
dc.contributor.advisor: Merener, Martín
dc.contributor.author: Ruiz Moreno, Tobías [es_AR]
dc.coverage.spatial: Argentina [es_AR]
dc.date.accessioned: 2023-06-06T17:21:23Z
dc.date.available: 2023-06-06T17:21:23Z
dc.date.issued: 2019
dc.identifier.uri: https://repositorio.utdt.edu/handle/20.500.13098/11868
dc.description.abstract: World cereal production is set to grow by around 1% per year for the next decade, and because crop areas are not expanding, the major driver of production growth is expected to be yield improvement. Crop yields have commonly been modelled in two ways: process-based modelling (also known as crop simulation) and statistical modelling. Recently, machine learning has started to deliver interesting results, mainly because it can handle non-linear relationships between factors. Weather plays an important role in determining crop yields, and being able to simulate realistic weather conditions and predict crop yield has been an important topic in the industry. The objective of this work is to model crop yields using a Random Forest regressor and Long Short-Term Memory (LSTM) Neural Networks (NN) for 9 annual crops in Argentina: wheat, barley, maize, soybean, sunflower, sorghum, rice, cotton and peanut. Soil and weather data were collected and transformed for 80 counties in Argentina. Hyperparameters for the 2 models were optimized and accuracy metrics were compared. Weather information was simulated by estimating the distribution of the historical data with KDE (Kernel Density Estimator) and drawing random samples from it with Monte Carlo. Feature importance analysis made it possible to reduce the number of factors to as few as 7 without compromising model accuracy. Of the 9 crops studied, the soybean, maize, sunflower, sorghum, wheat and barley models returned reasonable accuracy metrics. Leaving aside wheat and barley, which are winter crops, the remaining 4 summer crops (soybean, maize, sorghum and sunflower) were forecast by simulating rainfall at different stages of the growing season, and the estimates had an error below 20% (MAPE) before harvest. Random Forest outperformed the classic MLR statistical model by more than 30% on average across all crops, but its overfitting was significantly high. LSTM did not perform as well as Random Forest: although LSTM did not overfit, its performance was only slightly better than the baseline, with large variations between crops. This work demonstrates that machine learning algorithms are a competitive alternative to statistical modelling for crop yield prediction, and that weather simulations can return reasonably accurate predictions before harvest. This allows the agricultural community to anticipate strategic decisions based on crop production forecasts. [es_AR]
dc.format.extent: 100 p. [es_AR]
dc.format.medium: application/pdf [es_AR]
dc.language: eng [es_AR]
dc.publisher: Universidad Torcuato Di Tella [es_AR]
dc.rights: info:eu-repo/semantics/openAccess [es_AR]
dc.subject: Technological forecasting [es_AR]
dc.subject: Statistical [es_AR]
dc.subject: Econometric model [es_AR]
dc.title: Crop yield prediction with ensemble algorithms and Artificial Neural Networks (ANN) [es_AR]
dc.type: info:eu-repo/semantics/masterThesis [es_AR]
dc.type: info:ar-repo/semantics/tesis de maestría [es_AR]
thesis.degree.name: Master in Management + Analytics
dc.subject.keyword: World Cereal production [es_AR]
dc.subject.keyword: Yield improvements [es_AR]
dc.subject.keyword: Machine Learning [es_AR]
dc.subject.keyword: Random Forest [es_AR]
dc.subject.keyword: Long Short-Term Memory (LSTM) Neural Networks [es_AR]
dc.subject.keyword: Monte Carlo simulation [es_AR]
dc.type.version: info:eu-repo/semantics/acceptedVersion [es_AR]
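
Note on the abstract: the weather-simulation step it describes (estimate the distribution of historical data with a KDE, then draw random samples by Monte Carlo) and the MAPE metric can be illustrated with a minimal sketch. This is not the thesis code. The rainfall values, the function names, the normal-reference bandwidth rule and the clipping of negative rainfall at zero are all assumptions made for illustration only.

```python
import random
import statistics


def rule_of_thumb_bandwidth(samples):
    """Normal-reference bandwidth for a 1-D Gaussian KDE (assumed choice)."""
    n = len(samples)
    return 1.06 * statistics.stdev(samples) * n ** (-1 / 5)


def sample_from_kde(samples, n_draws, rng, bandwidth=None):
    """Draw Monte Carlo samples from a Gaussian KDE fitted to `samples`.

    Sampling from a Gaussian KDE amounts to picking one historical
    observation at random and adding Gaussian noise whose standard
    deviation is the bandwidth.
    """
    h = rule_of_thumb_bandwidth(samples) if bandwidth is None else bandwidth
    draws = []
    for _ in range(n_draws):
        centre = rng.choice(samples)
        # Rainfall cannot be negative, so clip at zero (assumption of this sketch).
        draws.append(max(0.0, rng.gauss(centre, h)))
    return draws


def mape(actual, predicted):
    """Mean absolute percentage error, in percent."""
    errors = [abs((a - p) / a) for a, p in zip(actual, predicted)]
    return 100.0 * sum(errors) / len(errors)


# Hypothetical historical rainfall totals (mm) for one county and growth stage.
historical_rain_mm = [82.0, 110.5, 95.3, 60.1, 130.8, 101.2, 77.4, 118.9]

rng = random.Random(42)  # fixed seed so the simulation is repeatable
simulated_rain_mm = sample_from_kde(historical_rain_mm, n_draws=1000, rng=rng)
```

In a workflow like the one the abstract outlines, each simulated rainfall value would be fed to a fitted yield model, and the resulting forecasts compared with observed yields using `mape`.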

